Solving Over- and Under-Determined Systems
In linear algebra, many real-world problems involve systems of equations where the number of equations does not match the number of unknowns. These systems are either over-determined (more equations than unknowns) or under-determined (more unknowns than equations). This article explores the methods for solving such systems, discussing techniques like Least Squares solutions, regularization, and providing practical examples in data science and engineering.
1. Introduction to Over- and Under-Determined Systems
1.1 What is an Over-Determined System?
An over-determined system is a system of linear equations in which there are more equations than unknowns. Mathematically, for a system , if the matrix is with , the system is over-determined.
Example: Consider the system of equations:
This system has 3 equations and 2 unknowns, making it over-determined. In general, over-determined systems may not have a solution that satisfies all equations exactly.
1.2 What is an Under-Determined System?
An under-determined system is a system where there are fewer equations than unknowns. For a system , if the matrix is with , the system is under-determined.
Example: Consider the system:
This system has 2 equations and 3 unknowns, making it under-determined. Such systems typically have infinitely many solutions or special solutions depending on the constraints.
2. Solving Over-Determined Systems
2.1 Least Squares Solution
The Least Squares method is the most common approach to solving over-determined systems. It finds an approximate solution that minimizes the sum of the squared residuals, where the residual is the difference between the observed value and the value predicted by the model.
Mathematically: Given , where is with , the Least Squares solution minimizes:
This leads to the Normal Equations:
Example: Consider the following over-determined system:
To solve using Least Squares:
- Compute and :
- Solve the normal equations :
- The solution is:
2.2 Solving Over-Determined Systems with QR Decomposition
QR Decomposition can also be used to solve over-determined systems. If can be decomposed as , where is an orthogonal matrix and is upper triangular, the problem reduces to solving:
Example: Using the same from above, decompose it into and solve the reduced system for .
2.3 Using Singular Value Decomposition (SVD)
Singular Value Decomposition (SVD) provides a robust method for solving over-determined systems, particularly when is ill-conditioned. The system can be solved by expressing as:
The solution is then:
This approach is particularly useful when the matrix is close to being singular or has large condition numbers.
3. Solving Under-Determined Systems
3.1 General Solution Approach
For under-determined systems, there are typically infinitely many solutions. The general solution can be expressed as:
Where:
- is a particular solution.
- is the general solution to the homogeneous system , which lies in the null space of .
3.2 Minimum Norm Solution
In many cases, especially in optimization, we are interested in the minimum norm solution, which is the solution with the smallest L2 norm. This solution can be found using SVD:
Where is the pseudo-inverse of .
Example: Consider the under-determined system:
To find the minimum norm solution using SVD:
- Decompose into .
- Compute the pseudo-inverse of .
- Calculate .
3.3 Regularization Techniques
In practice, under-determined systems are often solved using regularization techniques to impose additional constraints that lead to a unique and stable solution.
- L2 Regularization (Tikhonov Regularization): Adds a penalty term proportional to the L2 norm of the solution:
Where is a regularization parameter that controls the trade-off between fitting the data and minimizing the solution norm.
- L1 Regularization (Lasso): Adds a penalty proportional to the L1 norm of the solution, encouraging sparsity:
These techniques are widely used in machine learning and signal processing to obtain stable solutions in the presence of noise or incomplete data.
4. Applications and Practical Examples
4.1 Data Fitting in Regression Analysis
In regression analysis, over-determined systems are common when fitting a model to data. The Least Squares method is used to minimize the difference between the observed data and the model predictions. Regularization is often applied to prevent overfitting, especially when the model is complex.
Example: In a linear regression problem with many data points (more than the number of parameters), the system is over-determined, and the Least Squares solution provides the best-fit line.
4.2 Inverse Problems in Physics and Engineering
Inverse problems, such as recovering a signal or an image from indirect measurements, often lead to under-determined systems. Regularization techniques are crucial in obtaining meaningful solutions.
Example: In medical imaging (e.g., MRI), the measured data is incomplete, and solving the inverse problem to reconstruct the image involves solving an under-determined system with regularization.
4.3 Optimization Problems
Optimization problems frequently involve under-determined systems, particularly in cases where the number of constraints is less than the number of variables. The minimum norm solution or regularization methods help in finding the most "reasonable" solution.
Example: In portfolio optimization, where the number of assets (variables) exceeds the number of constraints (such as target return), the system is under-determined, and regularization can be used to find an optimal portfolio with minimum risk.
5. Conclusion
Solving over- and under-determined systems is a fundamental task in linear algebra with wide-ranging applications in data science, engineering, and beyond. The methods discussed—Least Squares, QR Decomposition, SVD, and regularization techniques—provide robust tools for tackling these systems, even in the presence of noise, ill-conditioning, or incomplete data. By mastering these techniques, practitioners can develop more accurate and stable solutions to complex real-world problems.