Transpose & Dot Product
Def: The transpose of an m × n matrix A is the n × m matrix AT whose
columns are the rows of A.
So: The columns of AT are the rows of A. The rows of AT are the columns
of A.
Example: If A = [ 1 2 3 ; 4 5 6 ] (a 2 × 3 matrix), then AT = [ 1 4 ; 2 5 ; 3 6 ] (a 3 × 2 matrix).
Convention: From now on, vectors v ∈ Rn will be regarded as “columns”
(i.e.: n × 1 matrices). Therefore, vT is a “row vector” (a 1 × n matrix).
Observation: Let v, w ∈ Rn . Then vT w = v · w. This is because:
vT w = [ v1 · · · vn ] [ w1 ; · · · ; wn ] = v1 w1 + · · · + vn wn = v · w.
Where theory is concerned, the key property of transposes is the following:
Prop 18.2: Let A be an m × n matrix. Then for x ∈ Rn and y ∈ Rm :
(Ax) · y = x · (AT y).
Here, · is the dot product of vectors.
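A quick numerical sanity check of Prop 18.2 (a minimal Python/NumPy sketch; the matrix and vectors below are arbitrary choices, not taken from the text):

    import numpy as np

    A = np.array([[1., 2., 3.],
                  [4., 5., 6.]])       # 2 x 3, so A : R^3 -> R^2
    x = np.array([1., -1., 2.])        # x in R^3
    y = np.array([3., 0.5])            # y in R^2

    # (Ax) . y should equal x . (A^T y)
    print(np.dot(A @ x, y), np.dot(x, A.T @ y))   # both print 20.5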
Extended Example
Let A be a 5 × 3 matrix, so A : R3 → R5 .
◦ N (A) is a subspace of R3
◦ C(A) is a subspace of R5
The transpose AT is a 3 × 5 matrix, so AT : R5 → R3 .
◦ C(AT ) is a subspace of R3
◦ N (AT ) is a subspace of R5
Observation: Both C(AT ) and N (A) are subspaces of R3 . Might there
be a geometric relationship between the two? (No, they’re not equal.) Hm...
Also: Both N (AT ) and C(A) are subspaces of R5 . Might there be a
geometric relationship between the two? (Again, they’re not equal.) Hm...
Orthogonal Complements
Def: Let V ⊂ Rn be a subspace. The orthogonal complement of V is the
set
V ⊥ = {x ∈ Rn | x · v = 0 for every v ∈ V }.
So, V ⊥ consists of the vectors which are orthogonal to every vector in V .
Fact: If V ⊂ Rn is a subspace, then V ⊥ ⊂ Rn is a subspace.
Examples in R3 :
◦ The orthogonal complement of V = {0} is V ⊥ = R3
◦ The orthogonal complement of V = {z-axis} is V ⊥ = {xy-plane}
◦ The orthogonal complement of V = {xy-plane} is V ⊥ = {z-axis}
◦ The orthogonal complement of V = R3 is V ⊥ = {0}
Examples in R4 :
◦ The orthogonal complement of V = {0} is V ⊥ = R4
◦ The orthogonal complement of V = {w-axis} is V ⊥ = {xyz-space}
◦ The orthogonal complement of V = {zw-plane} is V ⊥ = {xy-plane}
◦ The orthogonal complement of V = {xyz-space} is V ⊥ = {w-axis}
◦ The orthogonal complement of V = R4 is V ⊥ = {0}
Prop 19.3-19.4-19.5: Let V ⊂ Rn be a subspace. Then:
(a) dim(V ) + dim(V ⊥ ) = n
(b) (V ⊥ )⊥ = V
(c) V ∩ V ⊥ = {0}
(d) V + V ⊥ = Rn .
Part (d) means: “Every vector x ∈ Rn can be written as a sum x = v + w
where v ∈ V and w ∈ V ⊥ .”
Also, it turns out that the expression x = v + w is unique: that is, there
is only one way to write x as a sum of a vector in V and a vector in V ⊥ .
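A small numerical illustration of these facts (a Python/NumPy sketch; V is the span of two arbitrary vectors in R4, and V ⊥ is computed as the null space of the matrix whose rows are those vectors):

    import numpy as np

    # V = span of the rows of M, a subspace of R^4
    M = np.array([[1., 0., 2., -1.],
                  [0., 1., 1.,  3.]])

    # V-perp = {x : M x = 0}: read a basis off the SVD (right-singular
    # vectors belonging to zero singular values).
    _, s, Vt = np.linalg.svd(M)
    rank = np.sum(s > 1e-10)
    W = Vt[rank:]                        # rows of W: a basis of V-perp

    print(M.shape[1] == rank + W.shape[0])   # dim V + dim V-perp = n   -> True
    print(np.allclose(M @ W.T, 0))           # each basis vector of V-perp is
                                             # orthogonal to all of V   -> True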
Meaning of C(AT ) and N (AT )
Q: What does C(AT ) mean? Well, the columns of AT are the rows of A. So:
C(AT ) = column space of AT
= span of columns of AT
= span of rows of A.
For this reason: We call C(AT ) the row space of A.
Q: What does N (AT ) mean? Well:
x ∈ N (AT ) ⇐⇒ AT x = 0
⇐⇒ (AT x)T = 0T
⇐⇒ xT A = 0T .
So, for an m × n matrix A, we see that: N (AT ) = {x ∈ Rm | xT A = 0T }.
For this reason: We call N (AT ) the left null space of A.
Relationships among the Subspaces
Theorem: Let A be an m × n matrix. Then:
◦ C(AT ) = N (A)⊥
◦ N (AT ) = C(A)⊥
Corollary: Let A be an m × n matrix. Then:
◦ C(A) = N (AT )⊥
◦ N (A) = C(AT )⊥
Prop 18.3: Let A be an m × n matrix. Then rank(A) = rank(AT ).
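A quick check of the theorem and of Prop 18.3 on a small example (a Python/NumPy sketch; the matrix is an arbitrary rank-2 choice):

    import numpy as np

    A = np.array([[1., 2., 0.],
                  [2., 4., 1.],
                  [3., 6., 1.]])         # 3 x 3, rank 2

    print(np.linalg.matrix_rank(A) == np.linalg.matrix_rank(A.T))   # True

    # C(A^T) = N(A)-perp: every row of A is orthogonal to every vector in N(A).
    _, s, Vt = np.linalg.svd(A)
    nullA = Vt[np.sum(s > 1e-10):]       # rows: a basis of N(A)
    print(np.allclose(A @ nullA.T, 0))   # rows of A are orthogonal to N(A) -> True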
Motivating Questions for Reading
Problem 1: Let b ∈ C(A). So, the system of equations Ax = b does have
solutions, possibly infinitely many.
Q: What is the solution x of Ax = b with ‖x‖ smallest?
Problem 2: Let b ∉ C(A). So, the system of equations Ax = b does not
have any solutions. In other words, Ax − b ≠ 0.
Q: What is the vector x that minimizes the error ‖Ax − b‖? That is, what
is the vector x that comes closest to being a solution to Ax = b?
Orthogonal Projection
Def: Let V ⊂ Rn be a subspace. Then every vector x ∈ Rn can be written
uniquely as
x = v + w, where v ∈ V and w ∈ V ⊥ .
The orthogonal projection onto V is the function ProjV : Rn → Rn
given by: ProjV (x) = v. (Note that ProjV ⊥ (x) = w.)
Prop 20.1: Let V ⊂ Rn be a subspace. Then:
ProjV + ProjV ⊥ = In .
Of course, we already knew this: We have x = v+w = ProjV (x)+ProjV ⊥ (x).
Formula: Let {v1 , . . . , vk } be a basis of V ⊂ Rn . Let A be the n × k matrix
A = [ v1 · · · vk ].
Then:
ProjV = A(AT A)−1 AT . (∗)
Geometry Observations: Let V ⊂ Rn be a subspace, and x ∈ Rn a vector.
(1) The distance from x to V is: ‖ProjV ⊥ (x)‖ = ‖x − ProjV (x)‖.
(2) The vector in V that is closest to x is: ProjV (x).
Derivation of (∗): Notice ProjV (x) is a vector in V = span(v1 , . . . , vk ) = C(A) = Range(A), and
therefore ProjV (x) = Ay for some vector y ∈ Rk .
Now notice that x − ProjV (x) = x − Ay is a vector in V ⊥ = C(A)⊥ = N (AT ), which means
that AT (x − Ay) = 0, which means AT x = AT Ay.
Now, it turns out that our matrix AT A is invertible (proof in L20), so we get y = (AT A)−1 AT x.
Thus, ProjV (x) = Ay = A(AT A)−1 AT x. ♦
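A minimal Python/NumPy sketch of formula (∗) (the basis vectors below, spanning a plane V in R3, and the vector x are arbitrary choices):

    import numpy as np

    # Columns of A: a basis {v1, v2} of a plane V in R^3
    A = np.array([[1., 0.],
                  [1., 1.],
                  [0., 2.]])

    P = A @ np.linalg.inv(A.T @ A) @ A.T    # Proj_V = A (A^T A)^{-1} A^T

    x = np.array([3., -1., 4.])
    v = P @ x                               # Proj_V(x): the vector in V closest to x
    w = x - v                               # Proj_{V-perp}(x)

    print(np.allclose(A.T @ w, 0))          # w is orthogonal to V       -> True
    print(np.allclose(P @ P, P))            # projecting twice = once    -> True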
Minimum Magnitude Solution
Prop 19.6: Let b ∈ C(A) (so Ax = b has solutions). Then there exists
exactly one vector x0 ∈ C(AT ) with Ax0 = b.
And: Among all solutions of Ax = b, the vector x0 has the smallest length.
In other words: There is exactly one vector x0 in the row space of A which
solves Ax = b – and this vector is the solution of smallest length.
To Find x0 : Start with any solution x of Ax = b. Then
x0 = ProjC(AT ) (x).
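A small Python/NumPy sketch of this recipe (the matrix, b, and the starting solution are arbitrary choices): project a particular solution onto the row space C(AT ), then check that the result still solves the system, has smaller length, and agrees with NumPy's pseudoinverse (which returns the minimum-norm solution):

    import numpy as np

    A = np.array([[1., 1., 0.],
                  [0., 1., 1.]])           # 2 x 3, full row rank: Ax = b is solvable
    b = np.array([2., 3.])

    x = np.array([2., 0., 3.])             # one particular solution (A @ x == b)

    # Project x onto C(A^T) using formula (*) with the columns of A^T as basis.
    R = A.T
    P = R @ np.linalg.inv(R.T @ R) @ R.T
    x0 = P @ x                             # the minimum-length solution

    print(np.allclose(A @ x0, b))                    # still solves Ax = b   -> True
    print(np.linalg.norm(x0) < np.linalg.norm(x))    # shorter than x        -> True
    print(np.allclose(x0, np.linalg.pinv(A) @ b))    # matches pinv solution -> True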
Least Squares Approximation
Idea: Suppose b ∉ C(A). So, Ax = b has no solutions, so Ax − b ≠ 0.
We want to find the vector x∗ which minimizes the error ‖Ax∗ − b‖. That
is, we want the vector x∗ for which Ax∗ is the closest vector in C(A) to b.
In other words, we want the vector x∗ for which Ax∗ − b is orthogonal to
C(A). So, Ax∗ − b ∈ C(A)⊥ = N (AT ), meaning that AT (Ax∗ − b) = 0, i.e.:
AT Ax∗ = AT b.
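A minimal Python/NumPy sketch of the normal equations (the data are made up; the classic use case is fitting a line y ≈ c0 + c1 t to data points, which has no exact solution):

    import numpy as np

    t = np.array([0., 1., 2., 3.])
    y = np.array([1., 2.1, 2.9, 4.2])           # made-up data, not exactly on a line

    A = np.column_stack([np.ones_like(t), t])   # columns: 1 and t
    b = y

    x_star = np.linalg.solve(A.T @ A, A.T @ b)  # solve  A^T A x* = A^T b
    print(x_star)                               # [intercept, slope]

    # Same answer from NumPy's built-in least-squares routine:
    print(np.linalg.lstsq(A, b, rcond=None)[0])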
Quadratic Forms (Intro)
Given an m × n matrix A, we can regard it as a linear transformation
T : Rn → Rm . In the special case where the matrix A is a symmetric matrix,
we can also regard A as defining a “quadratic form”:
Def: Let A be a symmetric n × n matrix. The quadratic form associated
to A is the function QA : Rn → R given by:
QA (x) = x · Ax (· is the dot product)
= xT Ax = [ x1 · · · xn ] A [ x1 ; · · · ; xn ].
Notice that quadratic forms are not linear transformations!
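A tiny Python/NumPy sketch of evaluating a quadratic form (the symmetric matrix and the input vector are arbitrary examples):

    import numpy as np

    A = np.array([[2., 1.],
                  [1., 3.]])                # symmetric 2 x 2

    def Q_A(x):
        return x @ A @ x                    # x^T A x  (equivalently x . (Ax))

    x = np.array([1., -2.])
    print(Q_A(x))                           # 2(1)^2 + 2(1)(-2) + 3(-2)^2 = 10

    # Not linear: Q_A(2x) = 4 Q_A(x), not 2 Q_A(x).
    print(Q_A(2 * x), 4 * Q_A(x))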
Orthonormal Bases
Def: A basis {w1 , . . . , wk } for a subspace V is an orthonormal basis if:
(1) The basis vectors are mutually orthogonal: wi · wj = 0 (for i ≠ j);
(2) The basis vectors are unit vectors: wi · wi = 1 (i.e.: ‖wi ‖ = 1).
Orthonormal bases are nice for (at least) two reasons:
(a) It is much easier to find the B-coordinates [v]B of a vector when the
basis B is orthonormal;
(b) It is much easier to find the projection matrix onto a subspace V
when we have an orthonormal basis for V .
Prop: Let {w1 , . . . , wk } be an orthonormal basis for a subspace V ⊂ Rn .
(a) Every vector v ∈ V can be written
v = (v · w1 )w1 + · · · + (v · wk )wk .
(b) For all x ∈ Rn :
ProjV (x) = (x · w1 )w1 + · · · + (x · wk )wk .
(c) Let A be the matrix with columns {w1 , . . . , wk }. Then AT A = Ik , so:
ProjV = A(AT A)−1 AT = AAT .
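A quick Python/NumPy check of parts (b) and (c) (a sketch; the orthonormal basis below spans an arbitrary plane in R3):

    import numpy as np

    # An orthonormal basis {w1, w2} for a plane V in R^3
    w1 = np.array([1., 1., 0.]) / np.sqrt(2)
    w2 = np.array([0., 0., 1.])
    A = np.column_stack([w1, w2])

    print(np.allclose(A.T @ A, np.eye(2)))   # A^T A = I_2            -> True

    P = A @ A.T                              # Proj_V = A A^T
    x = np.array([1., 2., 3.])

    # Part (b): Proj_V(x) = (x . w1) w1 + (x . w2) w2
    print(np.allclose(P @ x, (x @ w1) * w1 + (x @ w2) * w2))          # True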
Orthogonal Matrices
Def: An orthogonal matrix is an invertible matrix C such that
C −1 = C T .
Example: Let {v1 , . . . , vn } be an orthonormal basis for Rn . Then the matrix
C = [ v1 · · · vn ]
is an orthogonal matrix.
In fact, every orthogonal matrix C looks like this: the columns of any
orthogonal matrix form an orthonormal basis of Rn .
Where theory is concerned, the key property of orthogonal matrices is:
Prop 22.4: Let C be an orthogonal matrix. Then for v, w ∈ Rn :
Cv · Cw = v · w.
Gram-Schmidt Process
Since orthonormal bases have so many nice properties, it would be great
if we had a way of actually manufacturing orthonormal bases. That is:
Goal: We are given a basis {v1 , . . . , vk } for a subspace V ⊂ Rn . We would
like an orthonormal basis {w1 , . . . , wk } for our subspace V .
Notation: We will let
V1 = span(v1 )
V2 = span(v1 , v2 )
..
.
Vk = span(v1 , . . . , vk ) = V.
Idea: Build an orthonormal basis for V1 , then for V2 , . . . , up to Vk = V .
Gram-Schmidt Algorithm: Let {v1 , . . . , vk } be a basis for V ⊂ Rn .
(1) Define w1 = v1 /‖v1 ‖.
(2) Having defined {w1 , . . . , wj }, let
yj+1 = vj+1 − ProjVj (vj+1 )
= vj+1 − (vj+1 · w1 )w1 − (vj+1 · w2 )w2 − · · · − (vj+1 · wj )wj ,
and define wj+1 = yj+1 /‖yj+1 ‖.
Then {w1 , . . . , wk } is an orthonormal basis for V .
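A short Python/NumPy implementation of the algorithm exactly as stated in steps (1)-(2) (a sketch; the input basis is an arbitrary example, and linear independence of the input is assumed):

    import numpy as np

    def gram_schmidt(vs):
        # Turn linearly independent vectors into an orthonormal basis of their span.
        ws = []
        for v in vs:
            y = v - sum((v @ w) * w for w in ws)   # y = v - Proj_{V_j}(v)
            ws.append(y / np.linalg.norm(y))       # normalize
        return ws

    vs = [np.array([1., 1., 0.]), np.array([1., 0., 1.])]
    W = np.column_stack(gram_schmidt(vs))
    print(np.allclose(W.T @ W, np.eye(2)))         # orthonormal -> True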
Definiteness
Def: Let Q : Rn → R be a quadratic form.
We say Q is positive definite if Q(x) > 0 for all x ≠ 0.
We say Q is negative definite if Q(x) < 0 for all x ≠ 0.
We say Q is indefinite if there are vectors x for which Q(x) > 0, and also
vectors x for which Q(x) < 0.
Def: Let A be a symmetric matrix.
We say A is positive definite if QA (x) = xT Ax > 0 for all x ≠ 0.
We say A is negative definite if QA (x) = xT Ax < 0 for all x ≠ 0.
We say A is indefinite if there are vectors x for which xT Ax > 0, and
also vectors x for which xT Ax < 0.
In other words:
◦ A is positive definite ⇐⇒ QA is positive definite.
◦ A is negative definite ⇐⇒ QA is negative definite.
◦ A is indefinite ⇐⇒ QA is indefinite.
The Hessian
Def: Let f : Rn → R be a function. Its Hessian at a ∈ Rn is the symmetric
matrix of second partials:
Hf (a) = [ fx1 x1 (a) · · · fx1 xn (a) ; . . . ; fxn x1 (a) · · · fxn xn (a) ].
Note that the Hessian is a symmetric matrix. Therefore, we can also
regard Hf (a) as a quadratic form:
QHf (a) (x) = xT Hf (a) x = [ x1 · · · xn ] [ fx1 x1 (a) · · · fx1 xn (a) ; . . . ; fxn x1 (a) · · · fxn xn (a) ] [ x1 ; · · · ; xn ].
In particular, it makes sense to ask whether the Hessian is positive definite,
negative definite, or indefinite.
Single-Variable Calculus Review
Recall: In calculus, you learned that for a function f : R → R, a critical
point is a point a ∈ R where f 0 (a) = 0 or f 0 (a) does not exist.
You learned that if f (x) has a local min/max at x = a, then x = a is a
critical point. Of course, the converse is false: critical points don’t have to
be local minima or local maxima (e.g., they could be inflection points.)
You also learned the “second derivative test.” If x = a is a critical point
for f (x), then f 00 (a) > 0 tells us that x = a is a local min, whereas f 00 (a) < 0
tells us that x = a is a local max.
It would be nice to have similar statements in higher dimensions:
Critical Points & Second Derivative Test
Def: A critical point of f : Rn → R is a point a ∈ Rn at which Df (a) = 0T
or Df (a) is undefined.
In other words, each partial derivative ∂f /∂xi (a) is zero or undefined.
Theorem: If f : Rn → R has a local max / local min at a ∈ Rn , then a is a
critical point of f .
N.B.: The converse of this theorem is false! Critical points do not have to
be a local max or local min – e.g., they could be saddle points.
Def: A saddle point of f : Rn → R is a critical point of f that is not a local
max or local min.
Second Derivative Test: Let f : Rn → R be a function, and a ∈ Rn be a
critical point of f .
(a) If Hf (a) is positive definite, then a is a local min of f .
(b) If Hf (a) is positive semi-definite, then a is a local min or a saddle point.
(c) If Hf (a) is negative definite, then a is a local max of f .
(d) If Hf (a) is negative semi-definite, then a is a local max or a saddle point.
(e) If Hf (a) is indefinite, then a is a saddle point of f .
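A small worked example in Python/NumPy (a sketch; the function f(x, y) = x3 − 3x + y2 is an arbitrary choice whose gradient and Hessian are easy to write by hand, and definiteness is tested via the eigenvalue criterion from the Corollary after the Spectral Theorem below):

    import numpy as np

    # f(x, y) = x^3 - 3x + y^2
    # grad f = (3x^2 - 3, 2y)  ->  critical points at (1, 0) and (-1, 0)
    def hessian(x, y):
        return np.array([[6. * x, 0.],
                         [0.,     2.]])

    for a in [(1., 0.), (-1., 0.)]:
        eigs = np.linalg.eigvalsh(hessian(*a))     # Hessian is symmetric
        if np.all(eigs > 0):
            verdict = "local min"                  # positive definite
        elif np.all(eigs < 0):
            verdict = "local max"                  # negative definite
        elif eigs.min() < 0 < eigs.max():
            verdict = "saddle point"               # indefinite
        else:
            verdict = "inconclusive (semi-definite)"
        print(a, eigs, verdict)                    # (1,0): local min; (-1,0): saddle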
Local Extrema vs Global Extrema
Finding Local Extrema: We want to find the local extrema of a function
f : Rn → R.
(i) Find the critical points of f .
(ii) Use the Second Derivative Test to decide if the critical points are local
maxima / minima / saddle points.
Theorem: Let f : Rn → R be a function. If R ⊂ Rn is a closed and bounded
region, then f has a global max and a global min on R.
Finding Global Extrema: We want to find the global extrema of a func-
tion f : Rn → R on a region R ⊂ Rn .
(1) Find the critical points of f on the interior of R.
(2) Find the extreme values of f on the boundary of R. (Lagrange mult.)
Then:
◦ The largest value from Steps (1)-(2) is a global max value.
◦ The smallest value from Steps (1)-(2) is a global min value.
Lagrange Multipliers (Constrained Optimization)
Notation: Let f : Rn → Rm be a function, and S ⊂ Rn be a subset.
The restricted function f |S : S → Rm is the same exact function as f , but
where the domain is restricted to S.
Theorem: Suppose we want to optimize a function f (x1 , . . . , xn ) constrained
to a level set S = {g(x1 , . . . , xn ) = c}.
If a is an extreme value of f |S on the level set S = {g(x1 , . . . , xn ) = c},
and if ∇g(a) ≠ 0, then
∇f (a) = λ∇g(a)
for some constant λ.
Reason: If a is an extreme value of f |S on the level set S, then Dv f (a) = 0
for all vectors v that are tangent to the level set S. Therefore, ∇f (a) · v = 0
for all vectors v that are tangent to S.
This means that ∇f (a) is orthogonal to the level set S, so ∇f (a) must be
a scalar multiple of the normal vector ∇g(a). That is, ∇f (a) = λ∇g(a).
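A tiny numerical check of the theorem (a Python/NumPy sketch; the example f(x, y) = x + y constrained to the circle g(x, y) = x2 + y2 = 1 is an arbitrary choice whose maximizer (1/√2, 1/√2) is known):

    import numpy as np

    # f(x, y) = x + y  on the circle  g(x, y) = x^2 + y^2 = 1
    grad_f = lambda x, y: np.array([1., 1.])
    grad_g = lambda x, y: np.array([2. * x, 2. * y])

    a = np.array([1., 1.]) / np.sqrt(2)        # maximizer of f on the circle

    gf, gg = grad_f(*a), grad_g(*a)
    lam = gf[0] / gg[0]                        # candidate multiplier
    print(np.allclose(gf, lam * gg))           # grad f(a) = lambda grad g(a) -> True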
Motivation for Eigenvalues & Eigenvectors
We want to understand a quadratic form QA (x), which might be ugly and
complicated.
Idea: Maybe there’s an orthonormal basis B = {w1 , . . . , wn } of Rn that
is somehow “best suited to A” – so that with respect to the basis B, the
quadratic form QA looks simple.
What do we mean by “basis suited to A”? And does such a basis always
exist? Well:
Spectral Theorem: Let A be a symmetric n × n matrix. Then there exists
an orthonormal basis B = {w1 , . . . , wn } of Rn such that each w1 , . . . , wn is
an eigenvector of A.
i.e.: There is an orthonormal basis of Rn consisting of eigenvectors of A.
Why is this good? Well, since B is a basis, every w ∈ Rn can be written
w = u1 w1 + · · · + un wn . (That is, the B-coordinates of w are (u1 , . . . , un ).)
It then turns out that:
QA (w) = QA (u1 w1 + · · · + un wn )
= (u1 w1 + · · · + un wn ) · A(u1 w1 + · · · + un wn )
= λ1 (u1 )2 + λ2 (u2 )2 + · · · + λn (un )2 . (yay!)
In other words: the quadratic form QA is in diagonal form with respect to
the basis B. We have made QA look as simple as possible!
Also: the coefficients λ1 , . . . , λn are exactly the eigenvalues of A.
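A Python/NumPy sketch of this diagonal form (the symmetric matrix and the coordinate vector u are arbitrary; np.linalg.eigh returns the eigenvalues of a symmetric matrix together with an orthonormal basis of eigenvectors):

    import numpy as np

    A = np.array([[2., 1.],
                  [1., 2.]])                   # symmetric

    lams, W = np.linalg.eigh(A)                # columns of W: orthonormal eigenvectors

    u = np.array([0.7, -1.3])                  # B-coordinates (u1, u2) of some vector w
    w = W @ u                                  # w = u1 w1 + u2 w2

    # Q_A(w) should equal lambda1 u1^2 + lambda2 u2^2
    print(np.isclose(w @ A @ w, np.sum(lams * u**2)))    # True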
Corollary: Let A be a symmetric n × n matrix, with eigenvalues λ1 , . . . , λn .
(a) A is positive-definite ⇐⇒ all of λ1 , . . . , λn are positive.
(b) A is negative-definite ⇐⇒ all of λ1 , . . . , λn are negative.
(c) A is indefinite ⇐⇒ there is a positive eigenvalue λi > 0 and a negative
eigenvalue λj < 0.
Useful Fact: Let A be any n × n matrix, with eigenvalues λ1 , . . . , λn . Then
det(A) = λ1 λ2 · · · λn .
Cor: If any one of the eigenvalues λ1 , . . . , λn is zero, then det(A) = 0.
What is a (Unit) Sphere?
◦ The 1-sphere (the “unit circle”) is S1 = {(x, y) ∈ R2 | x2 + y 2 = 1} ⊂ R2 .
◦ The 2-sphere (the “sphere”) is S2 = {(x, y, z) ∈ R3 | x2 +y 2 +z 2 = 1} ⊂ R3 .
◦ The 3-sphere is S3 = {(x, y, z, w) ∈ R4 | x2 + y 2 + z 2 + w2 = 1} ⊂ R4 .
Note that the 3-sphere is not the same as the unit ball {x2 + y 2 + z 2 ≤ 1}.
◦ The (n − 1)-sphere is the set
Sn−1 = {(x1 , . . . , xn ) ∈ Rn | (x1 )2 + · · · + (xn )2 = 1}
= {x ∈ Rn | ‖x‖2 = 1} ⊂ Rn .
In other words, Sn−1 consists of the unit vectors in Rn .
Optimizing Quadratic Forms on Spheres
Problem: Optimize a quadratic form QA : Rn → R on the sphere Sn−1 ⊂ Rn .
That is, what are the maxima and minima of QA (w) subject to the con-
straint that ‖w‖ = 1?
Solution: Let λmax and λmin be the largest and smallest eigenvalues of A.
◦ The maximum value of QA for unit vectors is λmax . Any unit vector wmax
which attains this maximum is an eigenvector of A with eigenvalue λmax .
◦ The minimum value of QA for unit vectors is λmin . Any unit vector wmin
which attains this minimum is an eigenvector of A with eigenvalue λmin .
Corollary: Let A be a symmetric n × n matrix.
(a) A is positive-definite ⇐⇒ the minimum value of QA restricted to unit
vector inputs is positive (i.e., iff λmin > 0).
(b) A is negative-definite ⇐⇒ the maximum value of QA restricted to
unit vector inputs is negative (i.e., iff λmax < 0).
(c) A is indefinite ⇐⇒ λmax > 0 and λmin < 0.
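A quick Python/NumPy check of this solution (a sketch; the symmetric matrix is an arbitrary example, and the unit vectors are random samples):

    import numpy as np

    A = np.array([[3., 1., 0.],
                  [1., 2., 0.],
                  [0., 0., -1.]])              # symmetric

    lams, W = np.linalg.eigh(A)                # eigenvalues in increasing order
    lam_min, lam_max = lams[0], lams[-1]

    rng = np.random.default_rng(0)
    vs = rng.standard_normal((1000, 3))
    vs /= np.linalg.norm(vs, axis=1, keepdims=True)      # random unit vectors
    Q = np.einsum('ij,jk,ik->i', vs, A, vs)              # Q_A(v) for each v

    print(lam_min <= Q.min() and Q.max() <= lam_max)     # True
    # The extreme values are attained at unit eigenvectors:
    print(np.isclose(W[:, -1] @ A @ W[:, -1], lam_max))  # True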
Directional First & Second Derivatives
Def: Let f : Rn → R be a function, a ∈ Rn be a point.
The directional derivative of f at a in the direction v is:
Dv f (a) = ∇f (a) · v.
The “directional second derivative” of f at a in the direction v is:
QHf (a) (v) = vT Hf (a)v.
That is: the quadratic form whose associated matrix is the Hessian Hf (a).
Q: What direction v increases the directional derivative the most? What
direction v decreases the directional derivative the most?
A: We’ve learned this: the gradient ∇f (a) is the direction of greatest in-
crease, whereas −∇f (a) is the direction of greatest decrease.
New Questions:
◦ What direction v increases the directional second derivative the most?
◦ What direction v decreases the directional second derivative the most?
Answer: The (unit) directions of minimum and maximum second derivative
are (unitized) eigenvectors of Hf (a), and so they are mutually orthogonal.
The max/min values of the directional second derivative are the max/min
eigenvalues of Hf (a).