
Linear Algebra: Systems & Matrices
Document information
Author | Katrina Glaeser |
University | University of California, Davis |
Subject | Linear Algebra |
Document type | Textbook |
City | Davis |
Language | English |
Size | 5.62 MB |
Summary
I. Linear Functions and Vectors
This section introduces the fundamental concepts of linear algebra, focusing on linear functions and vectors. It emphasizes that in linear algebra, functions operate on vectors as both inputs and outputs, unlike the scalar inputs and outputs of functions in calculus. The core characteristic of a linear function is its adherence to the principles of additivity and homogeneity, which are collectively termed linearity. The section uses examples to illustrate how various problems can be rephrased in terms of vector functions, laying the groundwork for subsequent exploration of matrix operations.
1. Linear Functions in Linear Algebra vs. Calculus
The section begins by contrasting the approach to functions in linear algebra with that in calculus. In calculus, the focus is on rates of change. Linear algebra, however, centers on a specific type of function: linear functions. The text introduces the concept of a function as a 'machine' that processes input to produce output. In precalculus, this is often presented with real numbers as both input and output. The key distinction in linear algebra is that linear functions operate on vectors as both inputs and outputs, a significant departure from the scalar inputs and outputs common in earlier mathematical study. The generality of vectors, which can be added and scalar-multiplied, is highlighted, with the caveat that the functions studied in linear algebra will initially seem novel. The section previews this new type of function by rephrasing five different questions in terms of functions of vectors, paving the way for a deeper dive into the properties and behavior of these functions.
2. Defining Linearity: Additivity and Homogeneity
This subsection defines the essence of a linear function: linearity. Linearity is the combination of two properties: additivity and homogeneity. Additivity means the function respects vector addition: applying the function before or after adding two vectors gives the same result, L(u + v) = L(u) + L(v). Homogeneity (or scaling) means the function behaves consistently with scalar multiplication: scalar multiplication can be performed either before or after the function is applied without changing the outcome, L(cu) = cL(u), where c is a scalar. When a function of vectors satisfies both additivity and homogeneity, it is classified as linear, a fundamental concept in linear algebra. The section further explores the intuitive understanding of these properties, relating them to operations like multiplication and division, setting the stage for the introduction of matrices.
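A minimal sketch (not from the text, assuming Python with NumPy) checking both properties for the map L(u) = Mu, with the matrix M, vectors u and v, and scalar c all chosen arbitrarily:

```python
import numpy as np

M = np.array([[1.0, 2.0],
              [3.0, 4.0]])

def L(u):
    """A linear function: multiplication by a fixed matrix M."""
    return M @ u

u = np.array([1.0, -2.0])
v = np.array([0.5, 3.0])
c = 7.0

# Additivity: L(u + v) = L(u) + L(v)
assert np.allclose(L(u + v), L(u) + L(v))
# Homogeneity: L(c u) = c L(u)
assert np.allclose(L(c * u), c * L(u))
```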
3. Linear Functions and Matrix Notation
The section then connects linear functions to matrix notation. It explains how the output of a matrix acting on a vector can be expressed as a sum of the matrix's columns, each scaled by the corresponding entry of the input vector. This representation is critical for understanding the concept of column space, defined as the set of all possible outputs of multiplying a matrix by a vector; the column space is also the image of the linear function defined by the matrix. This connection between linear functions and matrices is a cornerstone of linear algebra. By expressing the output in this form, the text establishes the fundamental relationship between linear functions and matrices, giving the reader a clearer understanding of how matrices represent and operate on linear functions. This lays the conceptual foundation for the forthcoming discussion of matrix operations and their role in solving systems of equations. An example using three linear equations and their components is presented, demonstrating how substitution methods might be used in solving such systems.
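This 'column picture' can be verified numerically; a small illustrative sketch, assuming NumPy, with an arbitrary matrix and vector:

```python
import numpy as np

M = np.array([[1.0, 2.0, 0.0],
              [0.0, 1.0, 3.0]])
x = np.array([2.0, -1.0, 4.0])

# M x equals the sum of M's columns, each scaled by the matching entry of x
column_sum = sum(x[j] * M[:, j] for j in range(M.shape[1]))
assert np.allclose(M @ x, column_sum)
```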
4. Vectors as Functions and Alternative Representations
This subsection provides alternative perspectives on vectors. It presents two equivalent ways of understanding n-vectors: as ordered lists of n numbers, or as functions whose domain is the set {1, ..., n}. This dual representation helps to broaden the understanding of vectors and their flexibility. The section introduces the idea of representing vectors as functions, further solidifying the connection between vectors and functions, a viewpoint which will become increasingly relevant as the subject moves into more abstract vector spaces. By describing vectors in these two ways, the text helps in building a solid understanding of their nature and providing a basis for further exploring vector operations and manipulations. Additionally, the text touches on the concept of vector projection onto another vector within a plane.
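A short illustration of the two equivalent viewpoints (the particular vector below is invented for illustration):

```python
# An n-vector as an ordered list of n numbers ...
v_list = [3.0, -1.0, 2.0]

# ... or, equivalently, as a function whose domain is {1, ..., n}
v_func = {1: 3.0, 2: -1.0, 3: 2.0}.get

# The two representations carry exactly the same information:
assert all(v_func(i) == v_list[i - 1] for i in (1, 2, 3))
```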
II. Matrices and Systems of Linear Equations
This section delves into matrices and their applications in solving systems of linear equations. It explains how matrices provide a concise way to represent and manipulate such systems. Key concepts include Gaussian elimination, a pivotal algorithm for solving these systems; the concepts of row echelon form (REF) and reduced row echelon form (RREF); and the identification of consistent and inconsistent systems. The section also touches upon the use of elementary row operations (EROs) to simplify systems of equations without altering their solutions. The importance of pivot variables in determining the nature of solutions (unique, multiple, or no solutions) is highlighted.
1. Elementary Row Operations and System Simplification
This section introduces elementary row operations (EROs) as tools for simplifying systems of equations without altering their solutions. The three fundamental EROs are: row swap (exchanging two rows), scalar multiplication (multiplying a row by a non-zero constant), and row addition (adding one row to another). These operations are crucial for transforming a system of equations into a more manageable form, facilitating the solution process. The text emphasizes that applying these operations does not change the solution set of the corresponding system, making them invaluable tools in solving linear systems. The concept is further illustrated by showing how these operations can be applied to an augmented matrix, particularly when the first entry in the first row is non-zero, demonstrating a step-by-step approach to simplification. The significance of these operations in the context of solving linear systems cannot be overstated; they form the backbone of many solution techniques.
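A sketch of the three EROs as NumPy helpers, applied to a hypothetical augmented matrix (the function names are invented here, not taken from the text):

```python
import numpy as np

def row_swap(A, i, j):
    """Exchange rows i and j."""
    A[[i, j]] = A[[j, i]]

def row_scale(A, i, c):
    """Multiply row i by a non-zero constant c."""
    A[i] = c * A[i]

def row_add(A, i, j, c=1.0):
    """Add c times row j to row i."""
    A[i] = A[i] + c * A[j]

# Augmented matrix for: x + 2y = 5, 3x + 4y = 6
A = np.array([[1.0, 2.0, 5.0],
              [3.0, 4.0, 6.0]])
row_add(A, 1, 0, -3.0)   # clear the first entry of the second row
print(A)                 # second row is now [0, -2, -9]
```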
2. Gaussian Elimination and Row Echelon Forms
The core of this subsection is Gaussian elimination, a systematic method for solving systems of linear equations using EROs. The goal is to transform the augmented matrix into row echelon form (REF) or, preferably, reduced row echelon form (RREF). REF requires that all entries below the pivot (the leading non-zero entry) in each row are zero. RREF additionally sets all pivots to one and all other entries in each pivot column to zero. The text highlights the importance of being able to convert RREF back into a system of equations to read off solutions directly. A key point is that a row of the RREF whose only non-zero entry lies in the augmented column (an equation of the form 0 = c with c ≠ 0) indicates an inconsistent system (no solutions). The section describes both a standard approach and a non-standard approach to extracting solutions from an RREF augmented matrix, stressing that while different approaches might lead to differing descriptions of the solution set, the approaches are equally valid as long as the solution sets themselves are identical. The terms 'pivot variables' and 'free variables' are introduced to clarify the solution process.
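A small illustration using SymPy's exact rref routine (the system below is a made-up example, not one from the text):

```python
from sympy import Matrix

# Augmented matrix [M | b] for a consistent system with one free variable
aug = Matrix([[1, 2, 1, 7],
              [0, 1, 1, 4],
              [1, 3, 2, 11]])

rref, pivot_cols = aug.rref()
print(rref)        # [[1, 0, -1, -1], [0, 1, 1, 4], [0, 0, 0, 0]]
print(pivot_cols)  # (0, 1): columns of the pivot variables; x3 is free
```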
3. Particular and Homogeneous Solutions
This part differentiates between particular and homogeneous solutions of linear systems. A particular solution is any single solution to the matrix equation, while homogeneous solutions are solutions to the associated homogeneous equation (where the constant terms are zero). The text notes that adding a homogeneous solution to a particular solution always yields another solution to the original equation. This concept is illustrated with an example using a differential equation, demonstrating how the general solution can be constructed as a sum of a particular solution and a linear combination of homogeneous solutions. This crucial idea—the ability to obtain additional solutions by adding homogeneous solutions to a particular solution—is applicable to various types of linear equations and systems, highlighting the significance of understanding both types of solutions in the context of linear algebra. The principles discussed here are foundational and will reappear throughout more complex linear algebra problems.
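A numerical sketch of this principle, assuming NumPy; the matrix and solutions below are invented for illustration:

```python
import numpy as np

M = np.array([[1.0, 2.0],
              [2.0, 4.0]])          # rank 1, so M x = 0 has non-trivial solutions
b = np.array([3.0, 6.0])

x_p = np.array([3.0, 0.0])          # a particular solution: M x_p = b
x_h = np.array([-2.0, 1.0])         # a homogeneous solution: M x_h = 0

assert np.allclose(M @ x_p, b)
assert np.allclose(M @ x_h, 0)
# Any x_p + c * x_h also solves M x = b:
for c in (0.0, 1.0, -5.0):
    assert np.allclose(M @ (x_p + c * x_h), b)
```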
4. Row Echelon Form (REF) and LU Decomposition
The section then discusses alternative methods for solving linear systems, such as using row operations to reach row echelon form (REF). REF is less restrictive than RREF; pivots need not be 1, and only the entries below each pivot need to be zero (entries above pivots may remain non-zero). The text points out that REF, unlike RREF, is not unique. The concept of LU decomposition is introduced. This method factorizes a matrix into a lower triangular matrix (L) and an upper triangular matrix (U), offering advantages for large computations in science and engineering. The process of Gaussian elimination is presented as a method for achieving LU decomposition, highlighting that multiple LU decompositions exist for a given matrix, although a unique decomposition is obtained when L is a lower unit triangular matrix (with ones on the diagonal). The text briefly addresses LU decomposition for non-square matrices, noting the changes in dimensions for L and U while maintaining a similar decomposition process. It provides the foundation for understanding more advanced matrix factorization techniques.
III. Matrix Decompositions and LU Factorization
This part focuses on matrix decompositions, specifically LU decomposition. LU decomposition factorizes a matrix into the product of a lower triangular matrix (L) and an upper triangular matrix (U). This factorization simplifies complex calculations and is widely used in scientific and engineering computations. The section also addresses the uniqueness of the LU decomposition when L is a lower unit triangular matrix (with ones on the diagonal). The concept extends to non-square matrices, outlining how the process remains similar while adjusting dimensions.
1. LU Decomposition: The Basics
This section introduces LU decomposition as a method to factorize a matrix into the product of a lower triangular matrix (L) and an upper triangular matrix (U): M = LU. This factorization is particularly useful for simplifying complex calculations, especially in scientific and engineering applications where large matrices are frequently encountered. Gaussian elimination is used to clear the entries below the diagonal, producing the upper triangular factor U. The elementary matrices used in this elimination process are lower triangular, as are their inverses; combining the lower and upper triangular pieces yields the LU factorization. The text highlights the efficiency of this approach for large-scale computations, emphasizing its practical importance in various fields. The section lays the groundwork for a deeper understanding of LU decomposition, particularly in relation to the solution of linear systems.
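A minimal sketch of the idea, assuming Python with NumPy and assuming no zero pivots arise (so no row swaps are needed); this is an illustration, not the text's algorithm verbatim:

```python
import numpy as np

def lu_decompose(M):
    """Doolittle-style LU factorization without pivoting: M = L @ U,
    with L lower unit triangular (ones on the diagonal) and U upper
    triangular. Assumes no zero pivots are encountered."""
    m = M.shape[0]
    L = np.eye(m)
    U = M.astype(float).copy()
    for j in range(m - 1):
        for i in range(j + 1, m):
            L[i, j] = U[i, j] / U[j, j]   # the elimination multiplier
            U[i] -= L[i, j] * U[j]        # clear the entry below the pivot
    return L, U

M = np.array([[2.0, 1.0],
              [6.0, 8.0]])
L, U = lu_decompose(M)
assert np.allclose(L @ U, M)
```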
2. Uniqueness of LU Decomposition and Lower Unit Triangular Matrices
The discussion moves on to explore the uniqueness of LU decomposition. While multiple LU decompositions might exist for a given matrix, a unique decomposition is achieved when the lower triangular matrix (L) has ones along its diagonal. In this specific case, L is referred to as a lower unit triangular matrix. This uniqueness property is significant for consistent results and simplifies calculations. The section emphasizes the importance of understanding this nuance—that while LU decomposition itself is not inherently unique, there is a particular type of LU decomposition that is indeed unique. This understanding is crucial for applying the technique accurately and consistently, ensuring reliable and unambiguous results in practical applications. This forms a basis for further exploring the properties and applications of LU decomposition, particularly in computational contexts.
3. LU Decomposition for Non-Square Matrices
The concept of LU decomposition is extended to non-square matrices. For an m x n matrix M, the decomposition takes the form M = LU, where L is a square (m x m) lower unit triangular matrix, and U is a rectangular (m x n) matrix. The process of obtaining the decomposition remains essentially the same as for square matrices; the key difference lies in the dimensions of the resulting matrices L and U. This demonstrates that the underlying principle of LU decomposition is applicable even when dealing with non-square matrices, broadening the scope of the technique. The adaptation to non-square matrices underscores the versatility and applicability of LU decomposition across various types of matrices, underscoring its widespread use in different computational problems and mathematical contexts. The text emphasizes that despite the dimension change, the core methodology for creating the LU factorization remains largely consistent with the approach used for square matrices.
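The same elimination sketch adapts directly to a rectangular matrix; an illustrative version (same assumption as before: no zero pivots are met):

```python
import numpy as np

def lu_rectangular(M):
    """LU for an m x n matrix: L is square (m x m) lower unit
    triangular, U is m x n. Assumes no zero pivots are met."""
    m, n = M.shape
    L = np.eye(m)
    U = M.astype(float).copy()
    for j in range(min(m, n)):
        for i in range(j + 1, m):
            L[i, j] = U[i, j] / U[j, j]
            U[i] -= L[i, j] * U[j]
    return L, U

M = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0]])      # 2 x 3, non-square
L, U = lu_rectangular(M)
assert L.shape == (2, 2) and U.shape == (2, 3)
assert np.allclose(L @ U, M)
```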
IV. Linear Programming and the Simplex Algorithm
This section introduces linear programming, a field focusing on optimizing linear functions subject to linear constraints. It describes Dantzig's simplex algorithm, a computationally efficient method for solving linear programming problems, particularly those with numerous variables and constraints. The algorithm iteratively improves solutions until an optimal solution is found. The section uses a practical example (optimizing sugar intake in a school lunch) to illustrate the algorithm's steps and its effectiveness in handling complex scenarios. The key idea is optimizing a linear objective function subject to constraints, finding optimal solutions often at corner points of a feasible region.
1. Linear Programming: An Overview
This section introduces the field of linear programming, focusing on problems involving the optimization of linear functions subject to a set of linear constraints. These problems are ubiquitous in various fields, ranging from military applications to modern science and industry. The text highlights the need for efficient computational methods to solve such problems, especially when dealing with numerous variables and constraints. Simple graphical methods might suffice for straightforward scenarios, but for more complex situations with potentially millions of variables, an algorithmic approach is necessary. This sets the stage for the introduction of the simplex algorithm, emphasizing its importance in handling large-scale linear programming problems. The section also uses a practical example involving a nutritionist, Pablo, optimizing the sugar intake of schoolchildren consuming apples and oranges, to illustrate the core concept and complexity of this type of optimization problem. The example helps to make the abstract concepts of linear programming more concrete and relevant.
2. Dantzig's Simplex Algorithm: A Computational Approach
The section introduces Dantzig's simplex algorithm as a computational method for solving linear programming problems. The algorithm is designed to handle problems with many variables and constraints, an area where graphical methods become impractical. The simplex algorithm starts with a standard problem format, aiming to optimize a linear objective function subject to constraints. The process involves iteratively improving the solution by selecting appropriate row operations that move towards the optimal solution. The text demonstrates the simplex algorithm by walking through a step-by-step example of solving Pablo’s problem, highlighting the decision-making process involved in selecting row operations to minimize the objective function (sugar intake). Specific steps in the algorithm are described, showing how row operations are used to zero out negative coefficients and make adjustments to optimize the objective function. The algorithm is presented as superior to simply checking all vertices for larger problems, underscoring its efficiency in handling complex scenarios.
3. Simplex Algorithm and Computational Efficiency
This section concludes the discussion on linear programming and the simplex algorithm by emphasizing the computational advantage of the algorithm. While the algorithm may seem complex when performed manually, its strength lies in its suitability for computer implementation. The text highlights that for problems with numerous variables, the simplex algorithm provides a significantly faster solution compared to methods like checking all vertices, which quickly becomes infeasible with an increasing number of variables. This computational efficiency makes the simplex algorithm an indispensable tool for solving large-scale linear programming problems encountered in various fields. The discussion underscores the practical relevance of this algorithm in real-world applications, emphasizing its ability to handle problems of substantial complexity and scale, which are typically intractable using manual methods. The final example solidifies the algorithm's value as a powerful tool for computer-aided linear optimization.
V. Vector Spaces and Linear Transformations
This section expands on the concept of vector spaces, moving beyond the familiar R^n spaces to more general definitions. It introduces the notion of a basis for a vector space and defines its dimension. Linear transformations (linear maps) are presented as functions between vector spaces that preserve the operations of vector addition and scalar multiplication. The importance of choosing appropriate bases for simplifying computations is highlighted, especially in the context of linear transformations. The concepts of homomorphism and the properties of linear operators are explored. Understanding vector spaces is essential to grasp the broader implications of linear algebra.
1. Beyond R^n: A General Definition of Vector Spaces
This section introduces a generalized definition of vector spaces, expanding beyond the familiar R^n spaces. The text emphasizes that the R^n spaces are not the only vector spaces; it presents a broader definition encompassing R^n for all n, R^S for all sets S, and more. This generalized framework is crucial for applying linear algebra to a much wider array of real-world problems. The concept of a basis for a vector space is introduced as a minimal set of linearly independent vectors that can generate the entire space. The importance of the basis in understanding vector spaces is highlighted, because it reveals their inherent structure. The section emphasizes that although many bases can exist within a vector space, they will all contain the same number of vectors, thereby defining the dimension of the vector space. The text uses various examples to help make these abstract concepts clearer and more accessible to the reader, and notes the use of the phrase 'let V be a vector space' as shorthand notation.
2. Properties of Vector Spaces and Examples
This subsection delves into the properties that define a vector space. These include closure under addition and scalar multiplication, the existence of a zero vector, additive inverses, associativity and commutativity of addition, and distributivity and associativity of scalar multiplication. The section illustrates these properties with examples. One example focuses on the vector space of functions that map natural numbers to real numbers, showing how addition and scalar multiplication are defined for these functions. This example showcases how vector space properties apply in contexts beyond the usual R^n setting. Another example considers the vector space of differentiable functions, showing how the vector space properties naturally extend to this type of function. The text emphasizes the inheritance of vector space properties from the underlying field (e.g., real numbers), showing how the properties of the underlying field influence the properties of the vector space built upon it.
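A sketch of the function-space example in Python, with the operations defined pointwise (the particular functions f and g are arbitrary):

```python
# Vectors in the space of functions from the natural numbers to R,
# with pointwise addition and scalar multiplication:
f = lambda n: n ** 2
g = lambda n: 3 * n + 1

def add(f, g):
    return lambda n: f(n) + g(n)     # (f + g)(n) = f(n) + g(n)

def scale(c, f):
    return lambda n: c * f(n)        # (c f)(n) = c * f(n)

h = add(f, scale(2, g))
assert h(4) == 4 ** 2 + 2 * (3 * 4 + 1)   # 16 + 26 = 42
```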
3. Linear Operators and the Consequence of Linearity
This section explores linear operators, which are special functions acting on vector spaces. The text highlights that a general real function of one variable requires an infinite amount of information to specify fully (one output for each input). In contrast, linear operators have a simpler structure due to the linearity property. The text explores the implications of this linearity, drawing a comparison between the requirements to specify a general function versus a linear operator, emphasizing how linearity dramatically reduces the amount of information needed for full specification. The concept of dimension as the number of independent directions in a vector space is introduced as a way to measure the simplicity of a vector space's structure. The concept of a basis, a minimal set of independent vectors, is revisited as a key element for determining this dimension. The text emphasizes that while multiple bases can exist for a given vector space, the number of vectors in each basis (the dimension) remains consistent, further reinforcing the value and significance of this concept.
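A short illustration of how linearity collapses the information needed: on R^2, four numbers (the values of a linear operator on a basis) determine it everywhere. The specific values below are invented:

```python
import numpy as np

# By linearity, L is fully specified by its values on a basis:
L_e1 = np.array([2.0, 0.0])          # assumed value of L(e1)
L_e2 = np.array([1.0, 3.0])          # assumed value of L(e2)
M = np.column_stack([L_e1, L_e2])    # matrix whose columns are L(e_i)

v = np.array([4.0, -1.0])            # v = 4 e1 - 1 e2
# L(v) = 4 L(e1) - 1 L(e2): four numbers specified L completely
assert np.allclose(M @ v, 4 * L_e1 - 1 * L_e2)
```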
4. Linear Transformations and Matrix Representations
This subsection introduces linear transformations (linear maps) as functions between vector spaces that respect vector addition and scalar multiplication. The text notes that while linear functions are often denoted by 'L', other notations (like 'f') can be used. The section connects linear transformations to matrix representations, emphasizing that the choice of basis significantly affects the resulting matrix. The simplicity of computations can be enhanced by choosing a basis that simplifies the matrix representation of a linear transformation. The text contrasts the standard basis (applicable to R^n) with other potential bases for more complex vector spaces, such as spaces defined by solutions to differential equations. This section reinforces the idea that matrices are tools that enable a concise representation of linear transformations, and that this representation is intrinsically tied to the choice of basis. The choice of basis therefore plays a critical role in determining the efficacy and simplicity of computational methods associated with linear transformations.
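An illustrative change-of-basis computation, assuming NumPy; the map M and basis B below are chosen (not from the text) so that the new basis happens to diagonalize the matrix:

```python
import numpy as np

M = np.array([[2.0, 1.0],
              [1.0, 2.0]])           # a linear map in the standard basis

# Columns of B form a different basis for R^2
B = np.array([[1.0, 1.0],
              [1.0, -1.0]])

# The same map expressed in the new basis:
M_new = np.linalg.inv(B) @ M @ B
print(M_new)                         # diagonal here: a simplifying basis
```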
VI. Matrices and Applications
This section explores further applications of matrices, such as their use in computer graphics (e.g., .gif image files), and cryptography (using invertible bit matrices for substitution ciphers). It demonstrates the versatility of matrices in various fields and highlights the significance of invertible matrices in decryption processes. It also touches upon the concept of the trace of a matrix as a way to extract essential information.
1. Matrices as Information Storage: An Example in Computer Graphics
This section highlights the role of matrices in storing information, using the example of .gif image files in computer graphics. A .gif file is essentially a matrix where the dimensions of the matrix specify the image size, and each matrix entry represents the color of a corresponding pixel. This demonstrates matrices' ability to efficiently store and represent large amounts of data in a structured manner. The example helps to illustrate the practical applications of matrices beyond the abstract mathematical concepts discussed earlier. The concise nature of matrix representation makes it highly efficient for storing and manipulating large datasets such as those in image processing, underscoring the practical utility of matrices in various computational domains.
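A toy illustration of the idea (the pixel values are made up, and real .gif files also involve color palettes and compression):

```python
import numpy as np

# A tiny 2x3 grayscale "image": the matrix dimensions give the image
# size, and each entry is the shade of one pixel (0 = black, 255 = white)
img = np.array([[  0, 128, 255],
                [ 64, 192,  32]], dtype=np.uint8)
print(img.shape)     # (2, 3): two rows of three pixels
```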
2. Matrix Properties and Information Extraction
The section then discusses the properties of matrices and the importance of efficiently extracting essential information. It notes that large matrices can sometimes contain redundant information, which might indicate an inefficient setup of the underlying problem. A careful choice of basis can often simplify the matrix associated with a linear transformation, revealing underlying structure and simplifying computations. This makes techniques for extracting essential matrix information particularly useful. The concept of the trace of a matrix is introduced as one such method of information extraction, providing a single scalar value summarizing some aspect of the matrix. The section emphasizes the importance of finding ways to reduce complexity and extract relevant information from matrices, especially when dealing with large-scale problems, to improve efficiency and comprehension. This lays the groundwork for a discussion on more advanced matrix properties and operations.
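A quick numerical check, assuming NumPy, that the trace survives a change of basis and thus captures basis-independent information (the matrices are chosen arbitrarily):

```python
import numpy as np

M = np.array([[4.0, 1.0],
              [2.0, 3.0]])
B = np.array([[1.0, 2.0],
              [0.0, 1.0]])           # an invertible change of basis

# The trace is unchanged by a change of basis, so it extracts
# information about the linear map itself, not the basis chosen:
assert np.isclose(np.trace(M), np.trace(np.linalg.inv(B) @ M @ B))
```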
3. Matrices in Cryptography: Substitution Ciphers
This subsection explores the application of matrices in cryptography, specifically in substitution ciphers. Substitution ciphers involve permuting an alphabet to systematically replace letters in a message, thereby encoding it. The section mentions the simple ROT-13 cipher as an example and contrasts it with more secure methods like the one-time pad. The more secure systems utilize different substitutions for each letter in the message, provided that no substitution is reused for a different message. The section highlights the representation of English characters using ASCII format (as eight-bit vectors) and suggests a substitution cipher method using 8x8 invertible bit matrices to encrypt and decrypt messages. This example demonstrates how invertible matrices are essential for creating secure and reversible encryption algorithms, emphasizing the connection between matrix properties and cryptographic security. It showcases how the principles of linear algebra are applicable to creating and analyzing cryptographic systems.
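A sketch of the scheme under stated assumptions: the key matrix, message, and helper names below are all invented for illustration; characters are treated as 8-bit vectors and the key is an invertible 8x8 bit matrix, with all arithmetic over GF(2):

```python
import numpy as np

def gf2_inv(K):
    """Invert a bit matrix over GF(2) by Gauss-Jordan elimination."""
    n = len(K)
    A = np.concatenate([K % 2, np.eye(n, dtype=int)], axis=1)
    for col in range(n):
        pivot = next(r for r in range(col, n) if A[r, col])  # assumes K invertible
        A[[col, pivot]] = A[[pivot, col]]
        for r in range(n):
            if r != col and A[r, col]:
                A[r] = (A[r] + A[col]) % 2
    return A[:, n:]

def bits(byte):
    """An ASCII byte as an 8-bit vector."""
    return np.array([(byte >> i) & 1 for i in range(8)])

def unbits(v):
    return sum(int(b) << i for i, b in enumerate(v))

# A hypothetical invertible key: unit lower triangular bit matrices
# are always invertible over GF(2)
K = np.tril(np.ones((8, 8), dtype=int))

message = b"VECTOR"
cipher = [tuple((K @ bits(ch)) % 2) for ch in message]       # encrypt
K_inv = gf2_inv(K)
decoded = bytes(unbits((K_inv @ np.array(c)) % 2) for c in cipher)
assert decoded == message                                    # decrypt
```

Because K is invertible, every encrypted bit vector decodes uniquely, which is exactly why invertibility is the essential requirement for a reversible cipher of this kind.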