Systems of Linear Equations
Nonlinear systems of equations are in general hard to solve, while linear systems are quite easy to study. There are numerical techniques which approximate a nonlinear system by a linear one, in the hope that the solutions of the linear system are close enough to the solutions of the nonlinear system. We will not discuss this here; instead, we will focus our attention on linear systems.
For the sake of simplicity, we will restrict ourselves to three, at most four, unknowns. The reader interested in the case of more unknowns may easily extend the following ideas.
Definition. The equation
a x + b y + c z + d w = h
where a, b, c, d, and h are known numbers, while x, y, z, and w are unknown numbers, is called a linear equation. If h = 0, the linear equation is said to be homogeneous. A linear system is a set of linear equations and a homogeneous linear system is a set of homogeneous linear equations.
For example,
and
are linear systems, while
is a nonlinear system (because of the term y^2). The system
is a homogeneous linear system.
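To make these notions concrete, here is an illustrative pair (chosen only for illustration, not taken from the examples above): a linear system in two unknowns, and the homogeneous system obtained from it by setting every right-hand side to zero.

```latex
% An illustrative (nonhomogeneous) linear system
\begin{cases} 2x + 3y = 5 \\ x - y = 1 \end{cases}
\qquad\text{and its homogeneous counterpart}\qquad
\begin{cases} 2x + 3y = 0 \\ x - y = 0 \end{cases}
```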
Matrix Representation of a Linear System
Matrices are helpful in rewriting a linear system in a very simple form. The algebraic properties of matrices may then be used to solve systems. First, consider the linear system
Set the matrices
Using matrix multiplications, we can rewrite the linear system above as the matrix equation
As you can see, this is far nicer than writing out the equations. But sometimes it is worthwhile to solve the system directly without going through the matrix form. The matrix A is called the coefficient matrix of the linear system. The matrix C is called the nonhomogeneous term. When C = 0, the linear system is homogeneous. The matrix X is the unknown matrix; its entries are the unknowns of the linear system. The augmented matrix associated with the system is the matrix [A|C], where
In general, if the linear system has n equations with m unknowns, then the coefficient matrix is an n x m matrix and the augmented matrix an n x (m+1) matrix. Now we turn our attention to the solutions of a system.
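To see this bookkeeping in code, here is a small NumPy sketch; the coefficients are illustrative stand-ins, not the system discussed above. It builds the coefficient matrix A, the nonhomogeneous term C, the augmented matrix [A|C], and solves the matrix equation AX = C.

```python
import numpy as np

# Illustrative 3x3 system (stand-in coefficients):
#    x + 2y +  z = 4
#   2x -  y + 3z = 7
#    x +  y -  z = 0
A = np.array([[1.0,  2.0,  1.0],
              [2.0, -1.0,  3.0],
              [1.0,  1.0, -1.0]])     # coefficient matrix
C = np.array([[4.0], [7.0], [0.0]])   # nonhomogeneous term (right-hand side)

augmented = np.hstack([A, C])         # the augmented matrix [A|C]
X = np.linalg.solve(A, C)             # solves A X = C when A is invertible

print(augmented)
print(X)                              # entries of the unknown matrix X
```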
Definition. Two linear systems with n unknowns are said to be equivalent if and only if they have the same set of solutions.
This definition is important since the idea behind solving a system is to find an equivalent system which is easy to solve. You may wonder how we come up with such a system. We do it through elementary operations. Indeed, it is clear that if we interchange two equations, the new system is still equivalent to the old one. If we multiply an equation by a nonzero number, we obtain a new system still equivalent to the old one. And finally, if we replace one equation by the sum of itself and a multiple of another equation, we again obtain an equivalent system. These operations are called elementary operations on systems. Let us see how it works in a particular case.
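These elementary operations act directly on the rows of the augmented matrix, one row per equation. A minimal sketch (the helper names and the sample matrix are illustrative choices, not part of the text above):

```python
import numpy as np

def swap_rows(M, i, j):
    """Interchange rows i and j."""
    M[[i, j]] = M[[j, i]]

def scale_row(M, i, c):
    """Multiply row i by a nonzero number c."""
    assert c != 0, "scaling by zero would not give an equivalent system"
    M[i] = c * M[i]

def add_multiple_of_row(M, i, j, c):
    """Replace row i by row i plus c times row j."""
    M[i] = M[i] + c * M[j]

# Illustrative augmented matrix [A|C] for a 3x3 system
M = np.array([[1.0,  2.0,  1.0, 4.0],
              [2.0, -1.0,  3.0, 7.0],
              [1.0,  1.0, -1.0, 0.0]])
add_multiple_of_row(M, 1, 0, -2.0)   # kill the x-term in the second equation
add_multiple_of_row(M, 2, 0, -1.0)   # kill the x-term in the third equation
print(M)
```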
Example. Consider the linear system
The idea is to keep the first equation and work on the last two. In doing that, we will try to kill one of the unknowns and solve for the other two. For example, if we keep the first and second equation, and subtract the first one from the last one, we get the equivalent system
Next we keep the first and the last equation, and we subtract the first from the second. We get the equivalent system
Now we focus on the second and the third equation. We repeat the same procedure. Try to kill one of the two unknowns (y or z). Indeed, we keep the first and second equation, and we add the second to the third after multiplying it by 3. We get
This obviously implies z = -2. From the second equation, we get y = -2, and finally from the first equation we get x = 4. Therefore the linear system has one solution: x = 4, y = -2, z = -2.
Going from the last equation to the first while solving for the unknowns is called backsolving.
Keep in mind that linear systems for which the coefficient matrix is upper triangular are easy to solve. This is particularly true if the matrix is in echelon form. So the trick is to perform elementary operations to transform the initial linear system into another one for which the coefficient matrix is in echelon form.
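This is why triangular form is the goal: once the coefficient matrix is upper triangular with nonzero diagonal entries, backsolving alone finishes the job. A small sketch of backsolving, assuming no zero pivots:

```python
import numpy as np

def back_substitute(U, c):
    """Solve U x = c for an upper-triangular U with nonzero diagonal,
    going from the last equation up to the first."""
    n = len(c)
    x = np.zeros(n)
    for i in range(n - 1, -1, -1):
        x[i] = (c[i] - U[i, i + 1:] @ x[i + 1:]) / U[i, i]
    return x

# Illustrative triangular system:  x + y + z = 1,  2y - z = 4,  3z = -6
U = np.array([[1.0, 1.0,  1.0],
              [0.0, 2.0, -1.0],
              [0.0, 0.0,  3.0]])
c = np.array([1.0, 4.0, -6.0])
print(back_substitute(U, c))   # expected: x = 2, y = 1, z = -2
```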
Using our knowledge about matrices, is there any way we can rewrite what we did above in matrix form, so as to make our notation (or representation) easier? Indeed, consider the augmented matrix
Let us perform some elementary row operations on this matrix. Indeed, if we keep the first and second row, and subtract the first one from the last one we get
Next we keep the first and the last rows, and we subtract the first from the second. We get
Then we keep the first and second row, and we add the second to the third after multiplying it by 3 to get
This is a triangular matrix, though not in echelon form. The linear system for which this is the augmented matrix is
As you can see, we obtained the same system as before. In fact, we followed the same elementary operations performed above. At every step the new matrix was exactly the augmented matrix associated with the new system. This shows that instead of writing the systems over and over again, it is easier to work with the elementary row operations and, once we obtain a triangular matrix, write down the associated linear system and then solve it. This is known as Gaussian elimination. Let us summarize the procedure:
Gaussian Elimination. Consider a linear system.
1. Construct the augmented matrix for the system;
2. Use elementary row operations to transform the augmented matrix into a triangular one;
3. Write down the new linear system for which the triangular matrix is the associated augmented matrix;
4. Solve the new system. You may need to assign parametric values to some unknowns, and then apply the method of back substitution to solve the new system (a code sketch of the whole procedure follows this list).
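Here is a compact sketch of the whole procedure, under the simplifying assumptions that the system is square and that no zero pivot is encountered (a more careful implementation would also interchange rows when needed):

```python
import numpy as np

def gaussian_elimination(A, c):
    """Solve A x = c by forward elimination on the augmented matrix
    followed by back substitution.  Assumes A is square and that no
    zero pivot appears (no row interchanges are performed)."""
    A = np.asarray(A, dtype=float)
    M = np.hstack([A, np.asarray(c, dtype=float).reshape(-1, 1)])  # [A|c]
    n = len(M)

    # Step 2: reduce the augmented matrix to triangular form.
    for k in range(n):
        for i in range(k + 1, n):
            factor = M[i, k] / M[k, k]
            M[i, k:] -= factor * M[k, k:]

    # Steps 3-4: backsolve the triangular system.
    x = np.zeros(n)
    for i in range(n - 1, -1, -1):
        x[i] = (M[i, n] - M[i, i + 1:n] @ x[i + 1:]) / M[i, i]
    return x

A = [[1, 1, 1], [1, 2, 2], [2, 1, 3]]   # illustrative coefficients
c = [6, 9, 11]
print(gaussian_elimination(A, c))       # should agree with np.linalg.solve(A, c)
```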
Example. Solve the following system via Gaussian elimination
The augmented matrix is
We use elementary row operations to transform this matrix into a triangular one. We keep the first row and use it to produce all zeros elsewhere in the first column. We have
Next we keep the first and second row and try to have zeros in the second column. We get
Next we keep the first three rows. We add the last one to the third to get
This is a triangular matrix. Its associated system is
Clearly we have v = 1. Set z = s and w = t; then we have
The first equation implies
x = 2 + y + z - w - v.
Using algebraic manipulations, we get
x = - - s - t.
Putting everything together, we have
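When a system has more unknowns than independent equations, as in this example, some unknowns remain free and the solution is expressed in terms of parameters such as s and t. A sketch using SymPy on an illustrative underdetermined system (not the one above):

```python
from sympy import symbols, linsolve

x, y, z, w = symbols('x y z w')

# Illustrative system with two equations and four unknowns:
#   x + y + z + w = 2
#   y - z + 2w    = 1
solutions = linsolve([x + y + z + w - 2, y - z + 2*w - 1], (x, y, z, w))
print(solutions)
# The free unknowns (here z and w) play the role of the parameters s and t:
# every choice of their values gives one solution of the system.
```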
Example. Use Gaussian elimination to solve the linear system
The associated augmented matrix is
We keep the first row and subtract the first row multiplied by 2 from the second row. We get
This is a triangular matrix. The associated system is
Clearly the second equation cannot be satisfied, so this linear system has no solution.
Definition. A linear system is called inconsistent if it does not have a solution; in other words, the set of solutions is empty. Otherwise the linear system is called consistent.
Following the example above, we see that if we perform elementary row operations on the augmented matrix of the system and get a matrix with one of its rows equal to (0 0 ... 0 | c), where c is not zero, then the system is inconsistent.
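In code, this test amounts to scanning the row-reduced augmented matrix for a row whose coefficient entries are all zero while its last entry is not. A minimal sketch (with a small tolerance, since floating-point elimination rarely produces exact zeros):

```python
import numpy as np

def is_inconsistent(augmented, tol=1e-12):
    """Return True if some row of the (row-reduced) augmented matrix has all
    coefficient entries (approximately) zero but a nonzero right-hand side,
    i.e. a row of the form (0 0 ... 0 | c) with c != 0."""
    coeffs, rhs = augmented[:, :-1], augmented[:, -1]
    zero_rows = np.all(np.abs(coeffs) < tol, axis=1)
    return bool(np.any(zero_rows & (np.abs(rhs) > tol)))

# Illustrative reduced augmented matrix whose last row reads 0 = 3:
M = np.array([[1.0, -2.0, 5.0],
              [0.0,  0.0, 3.0]])
print(is_inconsistent(M))   # True: the system has no solution
```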