
Analysis of the properties of matrix rank and the relationship between matrix rank and matrix operations


Introduction

The modern concept of the matrix took shape gradually during the 19th century. In 1801, the German mathematician Gauss treated all the coefficients of a linear transformation as a single object. In 1844, the German mathematician Eisenstein discussed 'transformations' (matrices) and their products. In 1850, the British mathematician Sylvester first used the term 'matrix'. In 1858, the British mathematician Cayley published 'A Memoir on the Theory of Matrices'. He was the first to study matrices as independent mathematical objects and the first to publish a series of articles on this subject, and he is therefore regarded as the founder of matrix theory. He gave a series of definitions that are still in common use, such as the equality of two matrices, the zero matrix, the sum of two matrices, the product of a number and a matrix, the product of two matrices, the inverse of a matrix and the transposed matrix [1]. Cayley also noticed that matrix multiplication is associative but in general not commutative, and that an m × n matrix can only be multiplied by an n × k matrix. In 1854, the French mathematician Hermite used the term 'orthogonal matrix', but the formal definition was not published until 1878, by the German mathematician Frobenius. In 1879, Frobenius introduced the concept of matrix rank. The matrix is an important basic concept in mathematics; it is a principal object of study in algebra and an important tool in applied mathematics.

Properties of matrix rank

The maximum order r of a non-zero minor of a matrix A is called the rank of the matrix; if the matrix has no non-zero minors at all, its rank is defined to be zero. The rank is written rank(A) = r.
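To make the definition concrete, the following short Python sketch (our own illustration, not part of the original text) computes the rank of a small matrix directly from the minor definition and compares the result with NumPy's built-in rank routine; the helper name rank_by_minors is ours.

```python
# A minimal sketch of the minor definition of rank (our own, not from the paper).
from itertools import combinations
import numpy as np

def rank_by_minors(A, tol=1e-10):
    """Largest order r such that A has a non-zero r x r minor (0 if none)."""
    A = np.asarray(A, dtype=float)
    m, n = A.shape
    for r in range(min(m, n), 0, -1):            # try the largest order first
        for rows in combinations(range(m), r):
            for cols in combinations(range(n), r):
                if abs(np.linalg.det(A[np.ix_(rows, cols)])) > tol:
                    return r                      # found a non-zero r-order minor
    return 0                                      # every minor vanishes

A = np.array([[1, 2, 3],
              [2, 4, 6],
              [1, 0, 1]])
print(rank_by_minors(A), np.linalg.matrix_rank(A))   # both print 2
```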

Invariance of the rank of a matrix

The rank of a matrix equals the rank of its transpose, that is, rank(A^T) = rank(A). Elementary transformations do not change the rank of a matrix.

Proof 1

Suppose that rows i and j of a matrix A = (a_{ij})_{m×n} are exchanged to obtain the matrix B:
$$A = \begin{pmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{i1} & \cdots & a_{in} \\ \vdots & & \vdots \\ a_{j1} & \cdots & a_{jn} \\ \vdots & & \vdots \\ a_{m1} & \cdots & a_{mn} \end{pmatrix},\qquad B = \begin{pmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{j1} & \cdots & a_{jn} \\ \vdots & & \vdots \\ a_{i1} & \cdots & a_{in} \\ \vdots & & \vdots \\ a_{m1} & \cdots & a_{mn} \end{pmatrix}$$
Let the rank of A be r; we show that the rank of B is also r. Let D be an s-order minor of B with s > r. If D does not contain elements of rows i and j simultaneously, then D is (up to the order of its rows) an s-order minor of A, and hence D = 0. If D contains elements of both row i and row j, then
$$D = \begin{vmatrix} a_{lt_1} & \cdots & a_{lt_s} \\ \vdots & & \vdots \\ a_{jt_1} & \cdots & a_{jt_s} \\ \vdots & & \vdots \\ a_{it_1} & \cdots & a_{it_s} \\ \vdots & & \vdots \\ a_{kt_1} & \cdots & a_{kt_s} \end{vmatrix} = -\begin{vmatrix} a_{lt_1} & \cdots & a_{lt_s} \\ \vdots & & \vdots \\ a_{it_1} & \cdots & a_{it_s} \\ \vdots & & \vdots \\ a_{jt_1} & \cdots & a_{jt_s} \\ \vdots & & \vdots \\ a_{kt_1} & \cdots & a_{kt_s} \end{vmatrix} = 0,$$
since the last determinant is an s-order minor of A. This shows that every minor of B of order greater than r vanishes, so rank(A) ≥ rank(B). Exchanging rows i and j of B in turn recovers A, so rank(A) ≤ rank(B). Hence rank(A) = rank(B), which proves that the first type of elementary transformation does not change the rank of the matrix.

Suppose that row i of matrix A is multiplied by a number k ≠ 0 to obtain matrix B, and let D be an s-order minor of B with s > r. If D does not contain elements of row i, then D is an s-order minor of A and hence D = 0. If D contains elements of row i, then
$$D = \begin{vmatrix} a_{lt_1} & \cdots & a_{lt_s} \\ \vdots & & \vdots \\ ka_{it_1} & \cdots & ka_{it_s} \\ \vdots & & \vdots \\ a_{kt_1} & \cdots & a_{kt_s} \end{vmatrix} = k\begin{vmatrix} a_{lt_1} & \cdots & a_{lt_s} \\ \vdots & & \vdots \\ a_{it_1} & \cdots & a_{it_s} \\ \vdots & & \vdots \\ a_{kt_1} & \cdots & a_{kt_s} \end{vmatrix} = 0.$$
So rank(A) ≥ rank(B). Multiplying row i of B by 1/k recovers A, so rank(A) ≤ rank(B). Hence rank(A) = rank(B), which proves that the second type of elementary transformation does not change the rank of the matrix.

Now multiply row j of a matrix A by the number k and add it to row i to obtain the matrix B:
$$A = \begin{pmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{i1} & \cdots & a_{in} \\ \vdots & & \vdots \\ a_{j1} & \cdots & a_{jn} \\ \vdots & & \vdots \\ a_{m1} & \cdots & a_{mn} \end{pmatrix},\qquad B = \begin{pmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{i1}+ka_{j1} & \cdots & a_{in}+ka_{jn} \\ \vdots & & \vdots \\ a_{j1} & \cdots & a_{jn} \\ \vdots & & \vdots \\ a_{m1} & \cdots & a_{mn} \end{pmatrix}$$
Let the rank of A be r; we have to prove that the rank of B is also r. We first prove that the rank of B cannot exceed r. If B has no minors of order greater than r, then of course it has no non-zero minors of order greater than r, and its rank cannot exceed r. So let D be an s-order minor of B with s > r. There are three possible situations.

If D does not contain elements of row i, then D is also a minor of A; since the rank of A is r and s > r, D = 0.

If D contains elements of row i and elements of row j, then
$$D = \begin{vmatrix} a_{ht_1} & \cdots & a_{ht_s} \\ \vdots & & \vdots \\ a_{it_1}+ka_{jt_1} & \cdots & a_{it_s}+ka_{jt_s} \\ \vdots & & \vdots \\ a_{jt_1} & \cdots & a_{jt_s} \\ \vdots & & \vdots \\ a_{lt_1} & \cdots & a_{lt_s} \end{vmatrix} = \begin{vmatrix} a_{ht_1} & \cdots & a_{ht_s} \\ \vdots & & \vdots \\ a_{it_1} & \cdots & a_{it_s} \\ \vdots & & \vdots \\ a_{jt_1} & \cdots & a_{jt_s} \\ \vdots & & \vdots \\ a_{lt_1} & \cdots & a_{lt_s} \end{vmatrix} = 0,$$
since subtracting k times the row of a_{jt} entries from the modified row leaves an s-order minor of A.

If D contains elements of row i but not of row j, then
$$D = \begin{vmatrix} a_{ht_1} & \cdots & a_{ht_s} \\ \vdots & & \vdots \\ a_{it_1}+ka_{jt_1} & \cdots & a_{it_s}+ka_{jt_s} \\ \vdots & & \vdots \\ a_{lt_1} & \cdots & a_{lt_s} \end{vmatrix} = \begin{vmatrix} a_{ht_1} & \cdots & a_{ht_s} \\ \vdots & & \vdots \\ a_{it_1} & \cdots & a_{it_s} \\ \vdots & & \vdots \\ a_{lt_1} & \cdots & a_{lt_s} \end{vmatrix} + k\begin{vmatrix} a_{ht_1} & \cdots & a_{ht_s} \\ \vdots & & \vdots \\ a_{jt_1} & \cdots & a_{jt_s} \\ \vdots & & \vdots \\ a_{lt_1} & \cdots & a_{lt_s} \end{vmatrix} = D_1 + kD_2
$$

Here D_1 is an s-order minor of A, and D_2 differs from an s-order minor of A at most by the order of its rows, so D_1 = 0 and D_2 = 0. Thus D = 0.

From the above three cases we see that every minor of B of order greater than r is zero. Therefore the rank of B is not greater than r, that is, rank(A) ≥ rank(B). Similarly, we can perform the inverse elementary transformation on B to obtain A, giving rank(A) ≤ rank(B). In this way we prove that rank(A) = rank(B), i.e. the third type of elementary transformation does not change the rank of the matrix. The three points above show that elementary transformations do not change the rank of a matrix. This completes the proof [2, 3].
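As a quick numerical illustration of these invariance properties (our own addition, not from the paper), the sketch below checks that the rank of a randomly chosen matrix is unchanged by transposition and by each of the three types of elementary row operations.

```python
# Numerical check: rank is invariant under transposition and elementary row operations.
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 6))
r = np.linalg.matrix_rank(A)

B = A.copy(); B[[0, 2]] = B[[2, 0]]     # type 1: swap rows 0 and 2
C = A.copy(); C[1] *= 5.0               # type 2: scale row 1 by k = 5 (k != 0)
D = A.copy(); D[3] += 2.0 * D[0]        # type 3: add 2 * row 0 to row 3

assert r == np.linalg.matrix_rank(A.T)
assert r == np.linalg.matrix_rank(B) == np.linalg.matrix_rank(C) == np.linalg.matrix_rank(D)
print("rank preserved:", r)
```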

In fact, performing an elementary row or column transformation is equivalent to multiplying the matrix on the left or right by an invertible matrix. Let A be an m × n matrix:
$$A = \begin{pmatrix} a_{11} & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & & \vdots \\ a_{m1} & a_{m2} & \cdots & a_{mn} \end{pmatrix}$$
A can be transformed, by elementary row transformations together with column interchanges (elementary column transformations of the first type), into the echelon form
$$J = \begin{pmatrix} 1 & * & * & \cdots & * & * & \cdots & * \\ 0 & 1 & * & \cdots & * & * & \cdots & * \\ \vdots & \vdots & \vdots & & \vdots & \vdots & & \vdots \\ 0 & 0 & 0 & \cdots & 1 & * & \cdots & * \\ 0 & 0 & 0 & \cdots & 0 & 0 & \cdots & 0 \\ \vdots & \vdots & \vdots & & \vdots & \vdots & & \vdots \\ 0 & 0 & 0 & \cdots & 0 & 0 & \cdots & 0 \end{pmatrix}$$
and further into
$$\begin{pmatrix} 1 & 0 & \cdots & 0 & c_{1,r+1} & \cdots & c_{1n} \\ 0 & 1 & \cdots & 0 & c_{2,r+1} & \cdots & c_{2n} \\ \vdots & \vdots & & \vdots & \vdots & & \vdots \\ 0 & 0 & \cdots & 1 & c_{r,r+1} & \cdots & c_{rn} \\ 0 & 0 & \cdots & 0 & 0 & \cdots & 0 \\ \vdots & \vdots & & \vdots & \vdots & & \vdots \\ 0 & 0 & \cdots & 0 & 0 & \cdots & 0 \end{pmatrix} = \begin{pmatrix} I_r & C_{r\times(n-r)} \\ 0 & 0 \end{pmatrix}$$

Proof

If all the elements a_{ij} of A are zero, then A already has the form of J. Otherwise, suppose some element a_{ij} is not zero. Swapping rows and columns if necessary, this element can be brought to the top left corner of the matrix. Multiplying the first row by 1/a_{ij} and then subtracting suitable multiples of the first row from the remaining rows, the matrix A becomes
$$B = \begin{pmatrix} 1 & * & \cdots & * \\ 0 & * & \cdots & * \\ \vdots & \vdots & & \vdots \\ 0 & * & \cdots & * \end{pmatrix}$$
If, apart from the first row, all other rows of B are zero, then B has the form of J. Otherwise there is an element b in the last m − 1 rows of B that is not zero. Move b to the position in the second row and second column and, by the same procedure as above, convert B to
$$\begin{pmatrix} 1 & * & * & \cdots & * \\ 0 & 1 & * & \cdots & * \\ 0 & 0 & * & \cdots & * \\ \vdots & \vdots & \vdots & & \vdots \\ 0 & 0 & * & \cdots & * \end{pmatrix}$$
Continuing in this way, we finally obtain a matrix of the form J [4, 5].

To pass from J to the final form, we only need to subtract suitable multiples of row r from rows 1, 2, …, r − 1, then subtract suitable multiples of row r − 1 from rows 1, 2, …, r − 2, and so on, finally obtaining a matrix of the form $\begin{pmatrix} I_r & C_{r,n-r} \\ 0 & 0 \end{pmatrix}$.

In fact, when a matrix is transformed into echelon form by elementary transformations, the number of non-zero rows of the echelon matrix is exactly the rank of the matrix. This yields another, equivalent definition of matrix rank:

The number of non-zero rows of the echelon form obtained from the matrix A = (a_{ij})_{m×n} by elementary transformations is called the rank of the matrix. If the rank of A is r, we write R(A) = r. In particular, the rank of the zero matrix is R(O) = 0.
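The equivalent definition suggests a simple computational procedure. The following sketch (our own, with the helper name rank_by_echelon assumed) reduces a matrix to row echelon form by elementary row operations and counts the non-zero rows, which should agree with NumPy's rank routine.

```python
# Rank via Gaussian elimination: count non-zero rows of the echelon form.
import numpy as np

def rank_by_echelon(A, tol=1e-10):
    """Return the number of non-zero rows after Gaussian elimination."""
    M = np.array(A, dtype=float)
    m, n = M.shape
    row = 0
    for col in range(n):
        if row == m:
            break
        pivot = row + np.argmax(np.abs(M[row:, col]))     # partial pivoting
        if abs(M[pivot, col]) < tol:
            continue                                       # no pivot in this column
        M[[row, pivot]] = M[[pivot, row]]                  # type-1 operation: swap rows
        M[row] /= M[row, col]                              # type-2 operation: scale the pivot row
        M[row + 1:] -= np.outer(M[row + 1:, col], M[row])  # type-3 operation: eliminate below
        row += 1
    return row                                             # rows used = non-zero rows

A = [[1, 2, 3, 4],
     [2, 4, 6, 8],
     [0, 1, 1, 1]]
print(rank_by_echelon(A), np.linalg.matrix_rank(np.array(A)))   # both print 2
```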

Elementary transformations can be used to transform the matrix A into the equivalent standard form $I = \begin{pmatrix} E_r & 0 \\ 0 & 0 \end{pmatrix}$. Since elementary transformations do not change the rank of the matrix, rank(A) = r. From this, the following theorem is obtained:

Any matrix A can be reduced to the form $I = \begin{pmatrix} E_r & 0 \\ 0 & 0 \end{pmatrix}$, where I is called the equivalent standard form of A and rank(A) = r. Similar matrices have the same rank, and matrices of the same size with the same rank have the same equivalent standard form and are therefore equivalent (though not necessarily similar); in particular, A ∼ B implies rank(A) = rank(B).

Proof

If A and B are similar, then by definition there is an invertible matrix T such that B = T^{-1}AT. Since multiplication by an invertible matrix does not change the rank (this is proved below), rank(A) = rank(T^{-1}A) = rank(T^{-1}AT) = rank(B).

Conversely, if rank(A) = rank(B), let I_A be the equivalent standard form of A, so that A is equivalent to I_A, and let I_B be the equivalent standard form of B, so that B is equivalent to I_B. Since rank(A) = rank(B), we have I_A = I_B, and by the transitivity of equivalence A and B are equivalent. This completes the proof. Furthermore, if the matrices A and B have the same Jordan canonical form J, then rank(A) = rank(B).

Proof

Let
$$J = \begin{pmatrix} J_1 & & \\ & \ddots & \\ & & J_s \end{pmatrix},\qquad J_i = \begin{pmatrix} \lambda_i & 1 & & \\ & \lambda_i & \ddots & \\ & & \ddots & 1 \\ & & & \lambda_i \end{pmatrix}_{n_i\times n_i},\quad i = 1, 2, \ldots, s.$$
Then the elementary divisors of A and of B are both $(\lambda-\lambda_1)^{n_1}, (\lambda-\lambda_2)^{n_2}, \cdots, (\lambda-\lambda_s)^{n_s}$, so A ∼ B. By the theorem above, rank(A) = rank(B). From this proof we obtain the following:

If the rank of matrix A is r, the rank of its Jordan canonical form is also r; that is, the rank is unchanged when a matrix is passed to any of its three canonical forms.
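A brief numerical check (our own illustration) that a matrix and any matrix similar to it have the same rank:

```python
# Similarity preserves rank: rank(T^{-1} A T) == rank(A).
import numpy as np

rng = np.random.default_rng(1)
A = np.diag([3.0, 2.0, 0.0, 0.0])            # rank 2 by construction
T = rng.standard_normal((4, 4))
while abs(np.linalg.det(T)) < 1e-6:           # make sure T is invertible
    T = rng.standard_normal((4, 4))

B = np.linalg.inv(T) @ A @ T                  # B is similar to A
print(np.linalg.matrix_rank(A), np.linalg.matrix_rank(B))   # 2 2
```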

Congruent matrices have the same rank. That is, if A and B are congruent, then rank(A) = rank(B).

Proof

If A and B are congruent, then there is an invertible matrix C such that C^T AC = B. By property 1.1.1, rank(C^T) = rank(C), so C^T is also invertible. By 1.3.4,
$$rank(C^T A C) = rank(C^T A) = rank(A).$$
Moreover, if a block matrix A is transformed into a matrix B by finitely many elementary transformations of block matrices, the rank of the matrix is unchanged.

Some basic properties of the rank of a matrix

(1) 0 ≤ rank(A_{m×n}) ≤ min{m, n}. (2) rank(A) = rank(A^T). (3) If a matrix B consists of some of the rows (columns) of A, then rank(A) ≥ rank(B). (4) Let A be a square matrix of order n (n ≥ 2) and A* its adjugate; then
$$rank(A^*) = \begin{cases} n, & rank(A) = n \\ 1, & rank(A) = n-1 \\ 0, & rank(A) < n-1 \end{cases}$$
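The adjugate rank formula in property (4) can be checked numerically. The sketch below is our own; the cofactor-based helper adjugate is an assumption of this illustration, not a NumPy routine.

```python
# Check rank(A*) for rank(A) = n, n-1 and < n-1, using a cofactor-based adjugate.
import numpy as np

def adjugate(A):
    n = A.shape[0]
    C = np.zeros_like(A, dtype=float)
    for i in range(n):
        for j in range(n):
            minor = np.delete(np.delete(A, i, axis=0), j, axis=1)
            C[i, j] = (-1) ** (i + j) * np.linalg.det(minor)   # cofactor C_ij
    return C.T                                                  # adjugate = transposed cofactor matrix

full = np.array([[2., 0., 0.], [0., 3., 0.], [0., 0., 5.]])     # rank 3 = n
one_less = np.array([[1., 0., 0.], [0., 1., 0.], [0., 0., 0.]]) # rank 2 = n - 1
low = np.array([[1., 2., 3.], [2., 4., 6.], [3., 6., 9.]])      # rank 1 < n - 1

for M in (full, one_less, low):
    print(np.linalg.matrix_rank(M), np.linalg.matrix_rank(adjugate(M)))
# prints: 3 3, then 2 1, then 1 0
```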

Matrix rank and matrix operations

$$rank(kA) = \begin{cases} rank(A), & k \neq 0 \\ 0, & k = 0 \end{cases}\qquad rank\begin{pmatrix} A & 0 \\ 0 & B \end{pmatrix} = rank(A) + rank(B),\qquad rank\begin{pmatrix} A & 0 \\ C & B \end{pmatrix} \ge rank(A) + rank(B).$$
Moreover, rank(AB) ≤ min{rank(A), rank(B)}; in particular, if A is invertible, then rank(AB) = rank(B).

Proof

Let A be an m × n matrix, B an n × p matrix, and rank(A) = r. Let the equivalent standard form of A be
$$I_A = \begin{pmatrix} E_r & 0_{r,n-r} \\ 0_{m-r,r} & 0_{m-r,n-r} \end{pmatrix}$$
In other words, there are elementary matrices E_1, E_2, ⋯, E_p of order m and elementary matrices E_{p+1}, E_{p+2}, ⋯, E_q of order n such that
$$E_1 \cdots E_p A E_{p+1} \cdots E_q = I_A$$

Therefore,
$$E_1 \cdots E_p AB = E_1 \cdots E_p A E_{p+1} \cdots E_q \, E_q^{-1} \cdots E_{p+1}^{-1} B = I_A E_q^{-1} \cdots E_{p+1}^{-1} B = I_A B_1,$$
where $B_1 = E_q^{-1} \cdots E_{p+1}^{-1} B$. In I_A B_1 every row except possibly the first r rows is zero, so rank(I_A B_1) ≤ r. Since E_1 ⋯ E_p AB is obtained from AB by elementary transformations, the two have the same rank, which proves that rank(AB) ≤ rank(A). Similarly, rank(AB) ≤ rank(B). If one of A, B is invertible, say A, then on the one hand rank(AB) ≤ rank(B) by the above, and on the other hand B = A^{-1}(AB) gives rank(B) ≤ rank(AB). Therefore rank(AB) = rank(B). This completes the proof.
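As a quick sanity check of this property (our own illustration), the following sketch verifies rank(AB) ≤ min{rank(A), rank(B)} on randomly generated low-rank factors, and that multiplying by an invertible matrix preserves the rank.

```python
# rank(AB) <= min{rank(A), rank(B)}; equality with B when the left factor is invertible.
import numpy as np

rng = np.random.default_rng(2)
A = rng.standard_normal((5, 3)) @ rng.standard_normal((3, 4))   # 5 x 4, rank 3 (generically)
B = rng.standard_normal((4, 2)) @ rng.standard_normal((2, 6))   # 4 x 6, rank 2 (generically)
rA, rB, rAB = map(np.linalg.matrix_rank, (A, B, A @ B))
print(rA, rB, rAB)                          # 3 2 2, and rAB <= min(rA, rB)

P = rng.standard_normal((5, 5))             # generically invertible
print(np.linalg.matrix_rank(P @ A) == rA)   # True: an invertible factor preserves rank
```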

This property extends to products of any number of matrices: the rank of a product of m matrices is not greater than the rank of any factor. If the matrices A and B have the same size, then rank(A ± B) ≤ rank(A) + rank(B).

Proof

We first prove rank(A + B) ≤ rank(A) + rank(B). Since
$$\begin{pmatrix} A & B \\ 0 & B \end{pmatrix}\begin{pmatrix} E_n & 0 \\ E_n & 0 \end{pmatrix} = \begin{pmatrix} A+B & 0 \\ B & 0 \end{pmatrix},$$
we have
$$rank(A+B) \le rank\begin{pmatrix} A+B & 0 \\ B & 0 \end{pmatrix} = rank\left(\begin{pmatrix} A & B \\ 0 & B \end{pmatrix}\begin{pmatrix} E_n & 0 \\ E_n & 0 \end{pmatrix}\right) \le rank\begin{pmatrix} A & B \\ 0 & B \end{pmatrix} \le rank(A) + rank(B),$$
where the last step follows because subtracting the second block row from the first turns $\begin{pmatrix} A & B \\ 0 & B \end{pmatrix}$ into $\begin{pmatrix} A & 0 \\ 0 & B \end{pmatrix}$, whose rank is rank(A) + rank(B). For the difference, rank(A − B) = rank(A + (−B)) ≤ rank(A) + rank(−B) = rank(A) + rank(B). Hence rank(A ± B) ≤ rank(A) + rank(B).
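A small numerical check of the subadditivity just proved (our own illustration, using rank-one matrices so the bound is easy to read):

```python
# rank(A + B) <= rank(A) + rank(B).
import numpy as np

rng = np.random.default_rng(3)
A = np.outer(rng.standard_normal(5), rng.standard_normal(5))   # rank 1
B = np.outer(rng.standard_normal(5), rng.standard_normal(5))   # rank 1
rA, rB, rS = map(np.linalg.matrix_rank, (A, B, A + B))
print(rA, rB, rS, rS <= rA + rB)   # typically: 1 1 2 True
```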

Some inequalities on the rank of matrices and their applications

(Sylvester inequality) Let A be an s × n matrix and B an n × m matrix; then rank(AB) ≥ rank(A) + rank(B) − n.

Proof

(Proof using block matrices.) Since
$$\begin{pmatrix} AB & 0 \\ 0 & E_n \end{pmatrix} \xrightarrow{\;r_1 + A\, r_2\;} \begin{pmatrix} AB & A \\ 0 & E_n \end{pmatrix} \xrightarrow{\;c_1 - c_2 B\;} \begin{pmatrix} 0 & A \\ -B & E_n \end{pmatrix},$$
and elementary block transformations do not change the rank, we have
$$rank(AB) + n = rank\begin{pmatrix} AB & 0 \\ 0 & E_n \end{pmatrix} = rank\begin{pmatrix} 0 & A \\ -B & E_n \end{pmatrix} \ge rank(A) + rank(-B) = rank(A) + rank(B),$$
which gives rank(AB) ≥ rank(A) + rank(B) − n. This completes the proof [6, 7].
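A numerical spot-check of the Sylvester inequality (our own illustration with generically chosen low-rank matrices):

```python
# rank(AB) >= rank(A) + rank(B) - n, for A of size s x n and B of size n x m.
import numpy as np

rng = np.random.default_rng(4)
s, n, m = 5, 4, 6
A = rng.standard_normal((s, 3)) @ rng.standard_normal((3, n))   # s x n, rank 3 (generically)
B = rng.standard_normal((n, 3)) @ rng.standard_normal((3, m))   # n x m, rank 3 (generically)
rA, rB, rAB = map(np.linalg.matrix_rank, (A, B, A @ B))
print(rAB, ">=", rA + rB - n, ":", rAB >= rA + rB - n)          # e.g. 3 >= 2 : True
```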

If the matrices A and B are n × n matrices and AB = 0, then rank(A) + rank(B) ≤ n. (Frobenius inequality) Let A, B and C be m × n, n × s and s × t matrices respectively; then rank(ABC) ≥ rank(AB) + rank(BC) − rank(B).

Proof: since
$$\begin{pmatrix} AB & ABC \\ B & 0 \end{pmatrix}\begin{pmatrix} I_s & C \\ 0 & -I_t \end{pmatrix} = \begin{pmatrix} AB & 0 \\ B & BC \end{pmatrix},$$
property 1.1.3 gives
$$rank(AB) + rank(BC) \le rank\begin{pmatrix} AB & 0 \\ B & BC \end{pmatrix} = rank\begin{pmatrix} AB & ABC \\ B & 0 \end{pmatrix} = rank\begin{pmatrix} 0 & ABC \\ B & 0 \end{pmatrix} = rank(ABC) + rank(B),$$
where the last equality of block matrices follows by subtracting A times the second block row from the first. Hence rank(ABC) ≥ rank(AB) + rank(BC) − rank(B). This completes the proof. Next, let A and B be matrices of order n; then rank(AB − I_n) ≤ rank(A − I_n) + rank(B − I_n).

Because
$$\begin{pmatrix} A - I_n & B - I_n \\ 0 & B - I_n \end{pmatrix}\begin{pmatrix} B & 0 \\ I_n & 0 \end{pmatrix} = \begin{pmatrix} AB - I_n & 0 \\ B - I_n & 0 \end{pmatrix},$$
properties 1.3.2 and 1.3.4 give
$$rank(AB - I_n) \le rank\begin{pmatrix} AB - I_n & 0 \\ B - I_n & 0 \end{pmatrix} \le rank\begin{pmatrix} A - I_n & B - I_n \\ 0 & B - I_n \end{pmatrix} = rank(A - I_n) + rank(B - I_n),$$
the last equality following by subtracting the second block row from the first. Hence rank(AB − I_n) ≤ rank(A − I_n) + rank(B − I_n).
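A quick numerical check of this last inequality (our own illustration; A and B are built as rank-one perturbations of the identity so that the right-hand side is small):

```python
# rank(AB - I) <= rank(A - I) + rank(B - I) for square A and B.
import numpy as np

rng = np.random.default_rng(5)
n = 5
I = np.eye(n)
A = I + np.outer(rng.standard_normal(n), rng.standard_normal(n))   # rank(A - I) = 1
B = I + np.outer(rng.standard_normal(n), rng.standard_normal(n))   # rank(B - I) = 1
lhs = np.linalg.matrix_rank(A @ B - I)
rhs = np.linalg.matrix_rank(A - I) + np.linalg.matrix_rank(B - I)
print(lhs, "<=", rhs, ":", lhs <= rhs)    # typically: 2 <= 2 : True
```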

If A and B are matrices of order n, then rank(AB + A + B) ≤ rank(A) + rank(B) [8, 9].

Because
$$\begin{pmatrix} A & B \\ 0 & B \end{pmatrix}\begin{pmatrix} B + I_n & 0 \\ I_n & 0 \end{pmatrix} = \begin{pmatrix} AB + A + B & 0 \\ B & 0 \end{pmatrix},$$
we have
$$rank(AB + A + B) \le rank\begin{pmatrix} AB + A + B & 0 \\ B & 0 \end{pmatrix} \le rank\begin{pmatrix} A & B \\ 0 & B \end{pmatrix} = rank(A) + rank(B).$$
Now assume A ∈ P^{n×n} and f(x), g(x) ∈ P[x]; then rank(f(A)) + rank(g(A)) = rank(d(A)) + rank(m(A)),

where d(x) = (f(x), g(x)) is the greatest common divisor and m(x) is the least common multiple of f(x) and g(x).

If one of f(x), g(x) is the zero polynomial, the theorem obviously holds, so we may assume that f(x) and g(x) are non-zero polynomials. Write f(x) = d(x)f_1(x), g(x) = d(x)g_1(x) and d(x) = μ(x)f(x) + ν(x)g(x), where f_1(x), g_1(x), μ(x), ν(x) ∈ P[x]; then d(A) = μ(A)f(A) + ν(A)g(A). Apply elementary block transformations to the block matrix $\begin{pmatrix} f(A) & 0 \\ 0 & g(A) \end{pmatrix}$:
$$\begin{aligned}
&\begin{pmatrix} E & 0 \\ -g_1(A) & E \end{pmatrix}\begin{pmatrix} E & \nu(A) \\ 0 & E \end{pmatrix}\begin{pmatrix} f(A) & 0 \\ 0 & g(A) \end{pmatrix}\begin{pmatrix} E & \mu(A) \\ 0 & E \end{pmatrix}\begin{pmatrix} E & 0 \\ -f_1(A) & E \end{pmatrix}\begin{pmatrix} 0 & -E \\ E & 0 \end{pmatrix}\\
&= \begin{pmatrix} E & 0 \\ -g_1(A) & E \end{pmatrix}\begin{pmatrix} f(A) & \mu(A)f(A) + \nu(A)g(A) \\ 0 & g(A) \end{pmatrix}\begin{pmatrix} E & 0 \\ -f_1(A) & E \end{pmatrix}\begin{pmatrix} 0 & -E \\ E & 0 \end{pmatrix}\\
&= \begin{pmatrix} E & 0 \\ -g_1(A) & E \end{pmatrix}\begin{pmatrix} f(A) & d(A) \\ 0 & g(A) \end{pmatrix}\begin{pmatrix} E & 0 \\ -f_1(A) & E \end{pmatrix}\begin{pmatrix} 0 & -E \\ E & 0 \end{pmatrix}\\
&= \begin{pmatrix} f(A) & d(A) \\ -g_1(A)f(A) & 0 \end{pmatrix}\begin{pmatrix} E & 0 \\ -f_1(A) & E \end{pmatrix}\begin{pmatrix} 0 & -E \\ E & 0 \end{pmatrix}\\
&= \begin{pmatrix} d(A) & 0 \\ 0 & g_1(A)f(A) \end{pmatrix} = \begin{pmatrix} d(A) & 0 \\ 0 & g_1(A)f_1(A)d(A) \end{pmatrix} = \begin{pmatrix} d(A) & 0 \\ 0 & m(A) \end{pmatrix}
\end{aligned}$$
Therefore
$$rank\begin{pmatrix} f(A) & 0 \\ 0 & g(A) \end{pmatrix} = rank\begin{pmatrix} d(A) & 0 \\ 0 & m(A) \end{pmatrix} = rank(d(A)) + rank(m(A)),$$
which proves the theorem [10].

This yields conditions under which the Sylvester and Frobenius inequalities hold with equality:

Assume f(x), g(x) ∈ F[x] with (f(x), g(x)) = 1 and A ∈ F^{n×n}; then rank(f(A)) + rank(g(A)) = rank(f(A)g(A)) + n.

Proof

Since (f(x), g(x)) = 1, we have d(x) = 1 and m(x) = f(x)g(x), so rank(d(A)) = rank(E) = n and rank(m(A)) = rank(f(A)g(A)).

By Theorem 1.4.4, rank(f(A)) + rank(g(A)) = rank(f(A)g(A)) + n. This completes the proof.

Assume A ∈ F^{n×n}, f(x), g(x) ∈ P[x] and (f(x), g(x)) = 1; then rank(f(A)) + rank(g(A)) = n if and only if f(A)g(A) = 0.

Proof

If rank(f(A)) + rank(g(A)) = n, then by Theorem 1.4.4 rank(f(A)g(A)) = 0, and so f(A)g(A) = 0.

Conversely, if f(A)g(A) = 0, then rank(f(A)g(A)) = 0, and hence rank(f(A)) + rank(g(A)) = n. This completes the proof.

Assume A ∈ F^{n×n} and f(x), g(x), h(x) ∈ F[x] with (f(x), h(x)) = 1; then
$$rank(f(A)g(A)) + rank(g(A)h(A)) = rank(f(A)g(A)h(A)) + rank(g(A)).$$

Proof

Since (f(x), h(x)) = 1, we have (f(x)g(x), g(x)h(x)) = g(x) and the least common multiple is m(x) = f(x)g(x)h(x). From Theorem 1.4.3 we get
$$rank(f(A)g(A)) + rank(g(A)h(A)) = rank(f(A)g(A)h(A)) + rank(g(A)).$$
This completes the proof.

Assume A ∈ F^{n×n}, f_i(x), g_j(x) ∈ P[x], and (f_i(x), g_j(x)) = 1 for 1 ≤ i ≤ m, 1 ≤ j ≤ t; then
$$rank\left(\prod_{i = 1}^{m} f_i(A)\right) + rank\left(\prod_{j = 1}^{t} g_j(A)\right) = n + rank\left(\prod_{i = 1}^{m} f_i(A)\prod_{j = 1}^{t} g_j(A)\right).$$

Proof

Since (f_i(x), g_j(x)) = 1 for all i and j, we have $\left(\prod_{i=1}^{m} f_i(x), \prod_{j=1}^{t} g_j(x)\right) = 1$, so d(A) = E and $m(A) = \prod_{i = 1}^{m} f_i(A)\prod_{j = 1}^{t} g_j(A)$. From Theorem 1.4.3 we get
$$rank\left(\prod_{i = 1}^{m} f_i(A)\right) + rank\left(\prod_{j = 1}^{t} g_j(A)\right) = n + rank\left(\prod_{i = 1}^{m} f_i(A)\prod_{j = 1}^{t} g_j(A)\right).$$

Analysis of typical examples:

Example 1

Let A be a matrix of order n with A^2 = A. Prove that rank(A) + rank(A − E) = n, where E is the identity matrix of order n.

Proof

Let f(x) = x and g(x) = x − 1; then (f(x), g(x)) = 1, and since A^2 = A, we have A^2 − A = 0. Applying the theorem, we get
$$rank(A) + rank(A - E) = rank(f(A)) + rank(g(A)) = n + rank(f(A)g(A)) = n + rank(A(A - E)) = n + rank(A^2 - A) = n + 0 = n.$$
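A concrete check of Example 1 (our own illustration) with an idempotent projection matrix:

```python
# If A^2 = A, then rank(A) + rank(A - E) = n.
import numpy as np

n = 4
A = np.diag([1.0, 1.0, 0.0, 0.0])       # a projection: A @ A == A
E = np.eye(n)
assert np.allclose(A @ A, A)
print(np.linalg.matrix_rank(A) + np.linalg.matrix_rank(A - E))   # prints 4 = n
```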

Example 2

Let A be a matrix of order n with A^2 = E. Prove that rank(A + E) + rank(A − E) = n.

Proof

Let f(x) = x + 1 and g(x) = x − 1; then (f(x), g(x)) = 1. Applying the theorem, we get
$$rank(A + E) + rank(A - E) = n + rank((A + E)(A - E)) = n + rank(A^2 - E) = n + rank(0) = n + 0 = n.$$

Example 3

Let A ∈ F^{n×n} and let m be a positive integer. Then for any positive integers l, k:

rank(A^l) + rank((A^m − E)^k) = n, if A^{m+1} = A;

rank((A − E)^l) + rank((A^{m−1} + A^{m−2} + ⋯ + A + E)^k) = n, if A^m = E.

Proof

(1) Let f(x) = x^l and g(x) = (x^m − 1)^k; then (f(x), g(x)) = 1. Applying the theorem and using A^{m+1} − A = 0,
$$\begin{aligned} rank(A^l) + rank\big((A^m - E)^k\big) &= rank(f(A)) + rank(g(A)) = n + rank(f(A)g(A)) = n + rank\big(A^l(A^m - E)^k\big)\\ &= n + rank\big(A^{l-1}(A^{m+1} - A)(A^m - E)^{k-1}\big) = n + rank(0) = n. \end{aligned}$$

(2) Let f(x) = (x − 1)^l and g(x) = (x^{m−1} + x^{m−2} + ⋯ + x + 1)^k; then (f(x), g(x)) = 1. Applying Theorem 1.4.4 and using A^m − E = 0,
$$\begin{aligned} &rank\big((A - E)^l\big) + rank\big((A^{m-1} + A^{m-2} + \cdots + A + E)^k\big) = rank(f(A)) + rank(g(A)) = n + rank(f(A)g(A))\\ &= n + rank\big((A - E)^l(A^{m-1} + A^{m-2} + \cdots + A + E)^k\big)\\ &= n + rank\big((A - E)^{l-1}\big((A^m + A^{m-1} + \cdots + A^2 + A) - (A^{m-1} + A^{m-2} + \cdots + A + E)\big)(A^{m-1} + A^{m-2} + \cdots + A + E)^{k-1}\big)\\ &= n + rank\big((A - E)^{l-1}(A^m - E)(A^{m-1} + A^{m-2} + \cdots + A + E)^{k-1}\big) = n + rank(0) = n. \end{aligned}$$

The above examples are cumbersome to solve using annihilating polynomials alone, but become simple and easy to understand using the Sylvester inequality. Rank inequalities for matrices have useful applications in problem solving, which this article does not enumerate one by one.

The rank and invertibility of matrices

For any square matrix A of order n, the following three statements are equivalent: (1) the matrix A is invertible; (2) rank(A) = n; (3) det A ≠ 0. The row rank, column rank and rank of a matrix are all equal. Let A be an m × n matrix, P an invertible matrix of order m and Q an invertible matrix of order n; then rank(PAQ) = rank(AQ) = rank(PA) = rank(A).
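A brief numerical illustration of the equivalence between invertibility, full rank and a non-zero determinant (our own sketch):

```python
# A is invertible iff rank(A) = n iff det(A) != 0.
import numpy as np

A = np.array([[2., 1.], [4., 2.]])     # second row = 2 * first row: singular
B = np.array([[2., 1.], [0., 3.]])     # invertible
for M in (A, B):
    n = M.shape[0]
    print(np.linalg.matrix_rank(M) == n, abs(np.linalg.det(M)) > 1e-12)
# prints: False False (singular A), then True True (invertible B)
```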

Application of matrix rank in linear algebra

Consider a system of linear equations AX = B in n unknowns, where
$$A = \begin{pmatrix} a_{11} & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & & \vdots \\ a_{m1} & a_{m2} & \cdots & a_{mn} \end{pmatrix},\qquad B = \begin{pmatrix} b_1 \\ b_2 \\ \vdots \\ b_m \end{pmatrix}$$
and let the augmented matrix be
$$\overline{A} = \begin{pmatrix} a_{11} & a_{12} & \cdots & a_{1n} & b_1 \\ a_{21} & a_{22} & \cdots & a_{2n} & b_2 \\ \vdots & \vdots & & \vdots & \vdots \\ a_{m1} & a_{m2} & \cdots & a_{mn} & b_m \end{pmatrix}$$
Then: (1) the system AX = B has no solution if and only if rank(A) < rank(Ā); (2) the system AX = B has a unique solution if and only if rank(A) = rank(Ā) = n; (3) the system AX = B has infinitely many solutions if and only if rank(A) = rank(Ā) < n.

Proof

Use the elementary transformations described in Lemma 1.1.1 above to turn A and Ā into
$$B = \begin{pmatrix} 1 & 0 & \cdots & 0 & c_{1,r+1} & \cdots & c_{1n} \\ 0 & 1 & \cdots & 0 & c_{2,r+1} & \cdots & c_{2n} \\ \vdots & \vdots & & \vdots & \vdots & & \vdots \\ 0 & 0 & \cdots & 1 & c_{r,r+1} & \cdots & c_{rn} \\ 0 & 0 & \cdots & 0 & 0 & \cdots & 0 \\ \vdots & \vdots & & \vdots & \vdots & & \vdots \\ 0 & 0 & \cdots & 0 & 0 & \cdots & 0 \end{pmatrix},\qquad \overline{B} = \begin{pmatrix} 1 & 0 & \cdots & 0 & c_{1,r+1} & \cdots & c_{1n} & d_1 \\ 0 & 1 & \cdots & 0 & c_{2,r+1} & \cdots & c_{2n} & d_2 \\ \vdots & \vdots & & \vdots & \vdots & & \vdots & \vdots \\ 0 & 0 & \cdots & 1 & c_{r,r+1} & \cdots & c_{rn} & d_r \\ 0 & 0 & \cdots & 0 & 0 & \cdots & 0 & d_{r+1} \\ \vdots & \vdots & & \vdots & \vdots & & \vdots & \vdots \\ 0 & 0 & \cdots & 0 & 0 & \cdots & 0 & d_m \end{pmatrix}$$
Since elementary transformations do not change the rank of a matrix,
$$rank(A) = rank(B) = r,\qquad rank(\overline{A}) = rank(\overline{B}),\qquad rank(A) \le rank(\overline{A}).$$
Now suppose the system AX = B has a solution. Then d_{r+1} = d_{r+2} = ⋯ = d_m = 0 (whether r < m or r = m), so rank(Ā) = rank(B̄) = r, and hence rank(A) = rank(Ā) = r. If the system has exactly one solution, the number of free unknowns is zero, so r = n.
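As a worked illustration of the solvability criterion (our own sketch), the following compares rank(A) with the rank of the augmented matrix for an inconsistent and a consistent right-hand side:

```python
# Solvability of A X = b via rank(A) versus rank([A | b]).
import numpy as np

A = np.array([[1., 2.],
              [2., 4.]])                  # rank 1
b_bad  = np.array([[1.], [3.]])           # inconsistent right-hand side
b_good = np.array([[1.], [2.]])           # consistent right-hand side

for b in (b_bad, b_good):
    rA = np.linalg.matrix_rank(A)
    rAug = np.linalg.matrix_rank(np.hstack([A, b]))
    print("solvable" if rA == rAug else "no solution", rA, rAug)
# prints: "no solution 1 2", then "solvable 1 1" (infinitely many solutions, since rank < n = 2)
```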

Conclusion

The rank of a matrix is a basic concept and one of the most important quantitative characteristics of a matrix. It is an invariant under elementary transformation. The rank of a matrix is an important concept that reflects the inherent characteristics of a matrix. It has an important role in linear algebra, analytic geometry and even probability theory.
