Chaudhury 2024, ch. 3 (classifiers and vector calculus)

Notes from Chaudhury, et al. (2024), chapter 3

New highlights added June 19, 2024 at 4:30 PM

The answer is yes. A matrix can always be converted into a vector by a process called rasterization. (View Highlight)
- Note: The author is actually confused. Rasterization is the process of converting a vector to a bitmap (“raster”); vectorization is the process of converting a bitmap to a vector.
rasterization (View Highlight)
- Note: vectorization
rasterized vector (View Highlight)
- Note: vector
matrix (View Highlight)
- Note: raster
rasterization (View Highlight)
- Note: vectorization
In the special case of binary classifiers, the sign of the expression 휙  x; w, b representing  the decision boundary has a special significance. (View Highlight)
in binary classification, once we have estimated an optimal decision surface 휙  sign of 휙 x; w, b   x; w, b , given any input vector x, we can compute the  to predict the class. (View Highlight)
if we are describing a continuous (smooth) function in the neighborhood of a specific point, we use a Taylor series. A Taylor series allows us to describe a function in the neighborhood of a specific point in terms of the value of the function and its derivatives at that point. (View Highlight)
In equation 3.7, we express a function of one variable in a small neighborhood around a point in terms of the derivatives. Can we do a similar thing in higher dimensions? Yes. We simply need to replace the first derivative with the gradient. We replace the second derivative with its multidimensional counterpart: the Hessian matrix. (View Highlight)
(View Highlight)
- Note: Hessian matrix. Replaces the derivative in the multidimensional version of the Taylor series.
A convex surface (see figure 3.14) has a single optimum (maximum/minimum): the global one.2 (View Highlight)
, a set of points is said to be convex if and only if the straight line joining any pair of points in the set lies entirely within the set. (View Highlight)
a set of points is nonconvex if it contains at least one pair of points whose joining line contains a point not belonging to the set. (View Highlight)
we know that all points C on L can be expressed as a weighted average of the coordinates of A and B, with the sum of weights being 1. Thus, C ≡ (훼1x1 + 훼2x2, 훼1y1 + 훼2y2), where 훼1 + 훼2 = 1. (View Highlight)
if we drop a perpendicular to the X-axis from any point on the secant line joining a pair of points on the curve, that perpendicular will cut the curve at a lower point (that is, smaller in its Y-coordinate). (View Highlight)
(View Highlight)
- Note: First definition of convexity
convex curves always curl upward and/or rightward everywhere. This leads to another equivalent definition of convexity. (View Highlight)
?t=1718828586417 (View Highlight)
- Note: Second definition of convexity
t if the second derivative is negative everywhere or the Hessian is negative semi-definite, the curve or surface is said to be concave. (View Highlight)
The negative of a concave function is a convex function. But the negative of a nonconvex function is again nonconvex. (View Highlight)
(View Highlight)
- Note: Third definition of convexity

David's raw ML reference notes

Explorer

Chaudhury 2024, ch. 3 (classifiers and vector calculus)

New highlights added June 19, 2024 at 4:30 PM

Graph View

Backlinks