1. List of concepts and definitions Back to the main page (home)
2. For Examples that Illustrate some of the concepts Click here (Linear Algebra Tool Kit)
3. Important RESULTS (THEOREMS)(See the Table Below):
Back to the MAIN PAGE of Ayman Badawi
Eiginvalues and eigenspaces result
(F)
If
has
linearly independent eigenvectors
with
then
| Have you ever heard the words eigenvalue and eigenvector? They are
derived from the German word "eigen" which means "proper" or
"characteristic." An eigenvalue of a square matrix is a scalar that is
usually represented by the Greek letter
Remark 27 Remember that, in general, the word scalar is not restricted to real numbers. We are only using real numbers as scalars in this book, but eigenvalues are often complex numbers.
Definition 9.1 Consider the square matrix A . We say that Let's look at an example of an eigenvalue and eigenvector. If you
were asked if
Therefore, Now that you have seen an eigenvalue and an eigenvector, let's talk a
little more about them. Why did we require that an eigenvector not be
zero? If the eigenvector was zero, the equation
Since any non-zero, scalar multiple of an eigenvector is also an
eigenvector,
Let's find both of the eigenvalues of the matrix
Let's find an eigenvector corresponding to
Let's find an eigenvector corresponding to
Let's find the eigenpairs for the matrix
Let's find an eigenvector corresponding to
Let's find an eigenvector corresponding to
We can find eigenpairs for larger systems using this method, but the characteristic equation gets impossible to solve directly when the system gets too large. We could use approximations that get close to solving the characteristic equation, but there are better ways to find eigenpairs that you will study in the future. However, these two methods give you an idea of how to find eigenpairs. Another matrix for which the power method will not work is the matrix
We said that eigenvalues are often complex numbers. However, if the matrix A is symmetric, then the eigenvalues will always be real numbers. As you can see from the problems that we worked, eigenvalues can also be real when the matrix is not symmetric, but keep in mind that they are not guaranteed to be real. Did you know that the determinant of a matrix is related to the
eigenvalues of the matrix? The product of the eigenvalues of a square
matrix is equal to the determinant of that matrix. Let's look at the two
matrices that we have been working with. For
For
Example Show that
has only one real eigenvalue. Find the eigenspace corresponding to that eigenvalue.
Solution The characteristic polynomial is
Since
The general solution is
and the eigenpair is
The eigenspace
Solution The characteristic polynomial
To solve
so that
Now we can factor the quadratic part of
and the eigenvalues are 2, 4, and 6. To find the eigenvectors, we substitute
The eigenpairs are
and the eigenspaces are
Example The matrices
both have eigenvalues 3 and
Solution For
so the eigenspaces are
For
so the eigenspaces for
span
For
a Remark 32 The eigenvalue of smallest magnitude of a matrix is the same as the inverse (reciprocal) of the dominant eigenvalue of the inverse of the matrix. Since most applications of eigenvalues need the eigenvalue of smallest magnitude, the inverse matrix is often solved for its dominant eigenvalue. This is why the dominant eigenvalue is so important. Also, a bridge in Manchester, England collapsed in 1831 because of conflicts between frequencies. However, this time, the natural frequency of the bridge was matched by the frequency caused by soldiers marching in step. Large oscillations occurred and the bridge collapsed. This is why soldiers break cadence when crossing a bridge. Frequencies are also used in electrical systems. When you tune your radio, you are changing the resonant frequency until it matches the frequency at which your station is broadcasting. Engineers used eigenvalues when they designed your radio. Frequencies are also vital in music performance. When instruments are tuned, their frequencies are matched. It is the frequency that determines what we hear as music. Although musicians do not study eigenvalues in order to play their instruments better, the study of eigenvalues can explain why certain sounds are pleasant to the ear while others sound "flat" or "sharp." When two people sing in harmony, the frequency of one voice is a constant multiple of the other. That is what makes the sounds pleasant. Eigenvalues can be used to explain many aspects of music from the initial design of the instrument to tuning and harmony during a performance. Even the concert halls are analyzed so that every seat in the theater receives a high quality sound. Car designers analyze eigenvalues in order to damp out the noise so that the occupants have a quiet ride. Eigenvalue analysis is also used in the design of car stereo systems so that the sounds are directed correctly for the listening pleasure of the passengers and driver. When you see a car that vibrates because of the loud booming music, think of eigenvalues. Eigenvalue analysis can indicate what needs to be changed to reduce the vibration of the car due to the music. Eigenvalues are not only used to explain natural occurrences, but
also to discover new and better designs for the future. Some of the
results are quite surprising. If you were asked to build the strongest
column that you could to support the weight of a roof using only a
specified amount of material, what shape would that column take? Most of
us would build a cylinder like most other columns that we have seen.
However, Steve Cox of Rice University and Michael Overton of New York
University proved, based on the work of J. Keller and I. Tadjbakhsh,
that the column would be stronger if it was largest at the top, middle,
and bottom. At the points
Does that surprise you? This new design was discovered through the study of the eigenvalues of the system involving the column and the weight from above. Note that this column would not be the strongest design if any significant pressure came from the side, but when a column supports a roof, the vast majority of the pressure comes directly from above. Eigenvalues can also be used to test for cracks or deformities in a solid. Can you imagine if every inch of every beam used in construction had to be tested? The problem is not as time consuming when eigenvalues are used. When a beam is struck, its natural frequencies (eigenvalues) can be heard. If the beam "rings," then it is not flawed. A dull sound will result from a flawed beam because the flaw causes the eigenvalues to change. Sensitive machines can be used to "see" and "hear" eigenvalues more precisely. Oil companies frequently use eigenvalue analysis to explore land for oil. Oil, dirt, and other substances all give rise to linear systems which have different eigenvalues, so eigenvalue analysis can give a good indication of where oil reserves are located. Oil companies place probes around a site to pick up the waves that result from a huge truck used to vibrate the ground. The waves are changed as they pass through the different substances in the ground. The analysis of these waves directs the oil companies to possible drilling sites. There are many more uses for eigenvalues, but we only wanted to give you a sampling of their uses. When you study science or engineering in college, you will become quite familiar with eigenvalues and their uses. There are also numerical difficulties that can arise when data from real-world problems are used. |
IMPORTANT RESULTS:
Fact 1 (about homogeneous system of equations):
A system of linear equation is called homogeneous if the right sides are equal to 0.
Example:
2x + 3y - 4z = 0
x - y + z = 0
x - y = 0
A homogeneous system of equations always has a solution (0,0,...,0). Every homogeneous system has either exactly one solution or infinitely many solutions. If a homogeneous system has more unknowns than equations, then it has infinitely many solutions.
Fact 2( about nonsingular (invertible) matrices):
If A is an n by n square matrix, then the following statements are equivalent.
Fact 3 ( about elementary matrices):
If the elementary matrix E is obtained by performing a row operation on the identity matrix Im and if A is an m by n matrix, then the product EA is the matrix obtained from A by applying the same row operation.
Every elementary matrix is invertible and the inverse is again an elementary matrix. If an elementary matrix E is obtained from I by using a certain row-operation q then E-1 is obtained from I by the "inverse" operation q-1 defined as follows:
· If q is the adding operation (add x times row j to row i) then q-1 is also an adding operation (add -x times row j to row i).
· If q is the multiplication of a row by x then q-1 is the multiplication of the same row by x-1.
· If q is a swapping operation then q-1=q.
Fact 4( about symmetric matrices):
Fact 5(about triangular matrices):
Fact 6(about skew-symmetric matrices):
Fact 7(about Determinant of a matrix):
Let A be an n by n matrix. Then the following conditions hold.
Fact 8(about basis, dependent, independent)
Let V be a vector space. The following properties of bases of V hold:
|
[ f1(x) |
f2(x) |
.... |
fn(x) ] |
|
[ f'2(x) |
f'1(x) |
.... |
f'n(x) ] |
|
[ f''2(x) |
f''1(x) |
.... |
f''n(x) ] |
|
..................................... |
|||
|
[ f2n-1(x) |
f1n-1(x) |
.... |
fnn-1(x) ] |
is called the Wronskian of this set of functions.
If the Wronskian of this set of functions is not identically zero then the set of functions is linearly independent (The CONVERSE is not TRUE!!!).
Fact 9(about Linear Transformations):
1. If T is a linear transformation from V to W then T(0)=0.
2. If T is a linear transformation from V to W and S is a linear transformation from W to Y (V, W, Y are vector spaces) then the product (composition) ST is a linear transformation from V to Y.
3. If T and S are linear transformations from V to W (V and W are vector spaces) then the sum T+S which takes every vector A in V to the sum T(A)+S(A) in W is again a linear transformation from V to W.
4. If T is a linear transformation from V to W and k is a scalar then the map kT which takes every vector A in V to k times T(A) is again a linear transformation from V to W.
5. If T is a linear transformation from Rm to Rn then dim(range(T))+dim(kernel(T))=m.
6. If [T]B,K is the matrix of a linear Transformation
T from a vector space V into W, then for every vector v in
V we have:
[T(v)]K = [T]B,K[v]B.
Fact 10 (dim(column space of A ) + dim(Nul(A) = number of columns of A):
2. The column rank and the row rank of every matrix coincide and they are equal to the (row) rank of the reduced row echelon form of this matrix, and are equal to the number of leading 1's in this reduced row echelon form.
Fact 11(about orthogonal vectors):
1. Every set of pairwise orthogonal non-zero vectors is linearly independent.
Fact 12 (about Eigenvalues, Eigenspaces, and Diagolization )
(e) If
and
are similar, then the characteristic polynomial of A =
the characteristic polynomial of B, and hence the set of eigenvalues of
is equal to the set of eigenvalues of
.
(f) If
and
are similar , i.e.
and
is an eigenpair of
,
then
is an eigenpair of
.
(g) (Diagonalization):
If
is diagonalizable, then
has
linearly independent eigenvectors. Also, in the equation
is a matrix whose columns are eigenvectors, and the diagonal entries of
are the eigenvalues corresponding column by column to their respective
eigenvectors.
|
To diagonalize an n by n matrix A:
A = U D U-1. |
If a square matrix is not diagonable, it is still possible, and useful, to convert it with a similarity transformation into upper triangular echelon form. In a course on linear algebra you will study this in more depth. Here we will just give an example showing what can happen:
Example 5. Let
[ 0 1 1 ]
A := [12 -2 -8 ]
[0 2 2 ]
This matrix has a hidden special property. If you calculate its cube, you find, somewhat surprisingly, that
3 [ 0 0 0 ]
A = [ 0 0 0 ]
[ 0 0 0 ]
Now, what could the eigenvalues of such a nilpotent matrix be? If Av = v, then
A3 v = 3v
and we conclude that the only possible eigenvalue is 0. But if a basis of three eigenvectors all have eigenvalue zero, then the whole matrix A must be the zero matrix. It isn't.
Let's illustrate why diagonalizing is useful with another
Model problem. Let
[ 3 -5]
A = [-5 3].
Find A10.
Solution. Don't just multiply! Instead, first diagonalize:
A = U-1D U,
where by following the algorithm we find:
1 = -2, with eigenvector
[ 1]
v = [ 1].
1
and
1
= 8, with eigenvector
[ 1]
v = [-1].
2
Therefore,
[1/2 1/2] [2 0] [ 1 1] -1
A = [1/2 -1/2] [0 8] [ 1 -1] =: U D U .
Now, we notice that the product of A with itself p times collapses:
Ap = U D U-1 U D U-1 ... U D U-1
= U D D ...D U-1,
because U-1 U = I_n. Therefore the answer is:
6 [1/2 1/2] [26 0] [ 1 1] [131104 -131040]
A = [1/2 -1/2] [0 86] [ 1 -1] = [-131040 131104].
Now we state and prove the theorem which the last example illustrates.
|
An n by n matrix A can be diagonalized if any of the following conditions can be verified:
|
Definitions in Linear Algebra: (for examples click here)
A system of linear equations
is any sequence of linear equations. A
solution of a system
of linear equations is any common solution of these equations. A
system is called consistent if it has a solution.
A
general solution of a system of linear
equations is a formula which gives all solutions for different values of
parameters.
Examples. 1. Consider the system:
x + y = 7
2x + 4y = 18
This system has just one solution: x=5, y=2.
This is a general solution of the system.
2. Consider the system:
x + y + z = 7
2x + 4y + z = 18.
This system has infinitely many solutions given by this formula:
x = 5 - 3s/2
y = 2 + s/2
z = s
This is a general solution of our system.
Two matrices are said to be row-equivalent if one can be obtained from the other by a sequence (order may vary) of elementary row operations given below.
* Interchanging two rows
* Multiplying a row by a non zero constant
* Adding a multiple of a row to another row
Row-Echelon Form and Reduced Row-Echelon Form
A matrix in row-echelon form has the following properties.
1. All rows consisting entirely of zeros occur at the
bottom of the matrix
2. For each row that does not entirely consist of zeroes, the first non zero
entry is 1 – called a leading 1.
3. For two successive non zero rows, the leading 1 in the higher row is
farther to the left that the leading 1 in the lower row.
A matrix in row-echelon form is in reduced row-echelon form if every column that has a leading 1 has zeros in every position above and below its leading 1.
A matrix derived from a system of linear equations is called the augmented matrix of the system.
Example Consider the system of equations:
x1 + 2x2 + x4 = 6
x3 + 6x4 = 7
x5=1
Its augmented matrix is
| [ 1 | 2 | 0 | 1 | 0 | 6 ] |
| [ 0 | 0 | 1 | 6 | 0 | 7 ] |
| [ 0 | 0 | 0 | 0 | 1 | 1 ] |
The matrix has the reduced row echelon form. The leading unknowns are x1, x3 and x5; the free unknowns are x2 and x4. So the general solution is:
x1= 6-2t-s
x2= s
x3= 7-6t
x4= t
x5= 1
If the augmented matrix does not have the reduced row echelon form but has
the (ordinary) row echelon form then the general solution also can be easily
found.
The method of finding the solution is called the back-substitution.
First we solve each of the equations for the leading unknowns The last
non-zero equation gives us the expression for the last leading unknown in terms
of the free unknowns. Then we substitute this leading unknown in all other
equations by this expression. After that we are able to find an expression for
the next to the last leading unknown, replace this unknown everywhere by this
expression, etc. until we get expressions for all leading unknowns. The
expressions for leading unknowns that we find in this process form the general
solution of our system of equations.
Example. Consider the following system of equations.
x1-3x2+ x3-x4 = 2
x2+2x3-x4 = 3
x3+x4 = 1
Its augmented matrix
| [ 1 | -3 | 1 | -1 | 2 ] |
| [ 0 | 1 | 2 | -1 | 3 ] |
| [ 0 | 0 | 1 | 1 | 1 ] |
is in the row echelon form.
The leading unknowns are x1, x2, x3; the free unknown is x4.
Solving each equation for the leading unknown we get:
x1=2+3x2-x3+x4
x2=3-2x3+x4
x3=1-x4
The last equation gives us an expression for x3: x3=1-x4.
Substituting this into the first and the second equations gives:
x1=2+3x2-1+x4+x4=1+3x2+2x4
x2=3-2(1-x4)+x4=1+3x4
x3=1-x4
Now substituting x2=1+3x4 into the first equation,
we get
x1=1+3(1+3x4)+2x4=4+11 x4
x2=1+3x4
x3=1-x4
Now we can write the general solution:
x1=4+11 s
x2=1+ 3 s
x3=1- s
x4= s
Let us check if we made any arithmetic mistakes. Take x4=1 and
compute x1=15, x2=4, x3=0, x4=1.
Substitute it into the original system of equations:
15 - 3 * 4 + 0 - 1 = 2
4 + 2 * 0 - 1 = 3
0 + 1 = 1
OK, it seems that our solution is correct.
A system of linear equations which has a solution.
A homogeneous system of linear equations is of the form:
a11x1
+ a12x2
+ . . . + a1nxn=0
a21x1
+ a22x2
+ . . . + a2nxn=0
am1x1
+ am2x2
+ . . . + amnxn=0
That is, a homogeneous system of linear equations has all constant terms equal to zero.
The Gauss-Jordan elimination procedure
There exists a standard procedure to obtain a reduced row echelon matrix from
a given matrix by using the row operations.
This procedure consists of the following steps.
Two matrices A and B are row equivalent if A is a product of elementary matrices times B, or equivalently if A can be obtained from B by a finite sequence of elementary row operations.
In order to multiply a matrix
by a scalar, one has to multiply all entries of the matrix by this
scalar.
Example:
| 3 * |
|
= |
|
The product of a row-vector v of size (1, n) and a column vector u of size (n,1) is the sum of products of corresponding entries:
uv=u(1)v(1)+u(2)v(2)+...+u(n)v(n)
Example:
3
(1, 2, 3) * 4 =1*3 + 2*4 + 3*1 = 3+8+3=14
1
Example:
x
(2, 4, 3) * y = 2x + 4y + 3z
z
As you see, we can represent the left side of a linear equation as a product
of two matrices. The product of arbitrary two matrices which we shall define
next will allow us to represent the left side of any system of equations as a
product of two matrices.
Let A be a matrix of size (m,n), let B be a matrix of
size (n,k) (that is the number of columns in A is equal to
the number of rows in B. We can subdivide A into a column of m
row-vectors of size (1,n). We can also subdivide B into a row of
k column-vectors of size (n,1):
r1
r2
A =... B=[c1 c2 ... ck]
rm
Then the product of A and B is the matrix C
of size (m,k) such that
(C(i,j) is the product of the row-vector ri and the
column-vector cj).
Matrices A and B such that the number of columns of A
is not equal to the number of rows of B cannot be multiplied.
Example:
|
* |
|
= |
|
Example:
|
* |
|
= |
|
You see: we can represent the left part of a system of linear equations as a
product of a matrix and a column-vector. The whole system of linear equations
can thus be written in the following form:
AX = b
Let In denote the identity matrix of order n that is a square matrix of order n with 1s on the main diagonal and zeroes everywhere else. Then for every m by n matrix A the product of A and In is A and the product of Im and A is A.
Nonsingular (invertible) matrices:
A square matrix of size n A is called invertible (nonsingular) if there exists a square matrix B of the same size such that AB = BA = In, the identity matrix of size n. In this case B is called the inverse of A and B is denoted by A-1.
FINDING THE INVERSE OF A MATRIX
Let A be square matrix order n.
1. Write a matrix that consists of the given matrix A on the left and the identity matrix n x n next to A on the right, separating the matrices A and I by a (vertical) dotted line to obtain [A:I].
2. If possible, row reduce A to I using elementary row
operations on the matrix [A:I]. The result will be the matrix [I:A-1].
Note that for some matrices an inverse does not exist. If it is not possible to
obtain [I:A-1], then the matrix A is not invertible.
3. Check your work by multiplying to see that AA-1 = I = A-1 A.
An elementary matrix is a matrix obtained from an identity matrix by one of the row-operations.
Example.
|
, |
|
, |
|
, |
|
The first two matrices are obtained by adding a multiple of one row to another row. The third matrix is obtained by swapping two rows. The last matrix is obtained by multiplying row by a number.
As we see, elementary matrices usually have lots of zeroes.
Elementary matrices which are obtained by adding rows contain only one non-diagonal non-zero entry.
Elementary matrices which are obtained by multiplying a row by a number contain exactly 1 non-unit entry on the diagonal and no non-zero entries outside the diagonal.
Elementary matrices which are obtained by swapping consist of 0s and 1s and contain exactly 2 non-diagonal entries.
The converse statements are true also (for example every matrix with 1s on the diagonal and exactly one non-zero entry outside the diagonal) is an elementary matrix.
The main result about elementary matrices is that every invertible matrix is a product of elementary matrices. These are in some sense the smallest particles in the world of invertible matrices.
If A is any m by n matrix then the transpose
of A, denoted by AT, is defined to be the n by
m matrix obtained by interchanging the rows and columns of A, that
is the first column of AT is the first row of A, the
second column of AT is the second row of A, etc.
Example.
The transpose of
|
is |
|
If A is a square matrix of size n then the sum of the entries on the main diagonal of A is called the trace of A and is denoted by tr(A).
Example.
The trace of the matrix
| [ 1 | 2 | 3 ] |
| [ 4 | 5 | 6 ] |
| [ 7 | 8 | 9 ] |
is equal to 1+5+9=15
A square matrix A is called symmetric if AT=A, that is if A(i,j)=A(j,i) for every i and j. Thus A is symmetric if and only if it is symmetric with respect to the main diagonal. Here is an example of a symmetric matrix:
|
[ 1 |
2 |
3 ] |
|
[ 2 |
4 |
5 ] |
|
[ 3 |
5 |
6 ] |
An important subclass of symmetric matrices is formed by diagonal matrices, i.e. matrices which have zeroes everywhere outside the main diagonal. For example, the identity matrix is a diagonal matrix.
A square matrix A is called upper triangular (resp. lower triangular) if all its entries below (resp. above) the main diagonal are zeroes, that is A(i,j)=0 whenever i is greater than j (resp. A(i,j)=0 whenever i is less than j).
Example.
|
, |
|
The first of these matrices is upper triangular, the second is lower triangular.
A square matrix A is called skew-symmetric if A^T = -A, that is A(i,j)=-A(j,i) for every i and j. In particular, if i=j then A(i,i)=0, that is the diagonal entries of a skew-symmetric matrix are equal to 0.
Example:
|
[ 0 |
2 |
3 ] |
|
[ -2 |
0 |
4 ] |
|
[ -3 |
-4 |
0 ] |
Let Aij is the cofactor of entry a_ij that is equal to (-1)i+j multiplied by the determinant of the matrix obtained from A by deleting the i-th row and the j-th column (the determinant of this smaller matrix has a special name also; it is called the minor of entry A(i,j)).
For every i=1,...,n we have det(A)=a_i1 Ai1 + a_i2 Ai2 +...+ a_in Ain This is called the cofactor expansion of det(A) along the i-th row (a similar statement holds for columns).
The determinant of a triangular matrix is the product of the diagonal entries.
Definition. If A is any n by n matrix and Aij is the cofactor of a_ij then the matrix
|
[ A11 |
A21 |
... |
An1 ] |
|
[ A12 |
A22 |
... |
An2 ] |
|
............................ |
|||