Many scientific computational problems in various areas of application involve vectors and matrices. Programming languages such as C provide the capabilities for working with the individual elements but not directly with the arrays. Modern Fortran and higher-level languages such as Octave or Matlab and R allow direct manipulation of objects that represent vectors and matrices. The vectors and matrices are arrays of floating-point numbers.

The distinction between the set of real numbers, \(\mathrm{I\!R}\), and the set of floating-point numbers, \(\mathrm{I\!F}\), that we use in the computer has important implications for numerical computations. As we discussed in Sect. 10.2, beginning on page 483, an element x of a vector or matrix is approximated by a computer number \([x]_{\mathrm{c}}\), and a mathematical operation ∘ is simulated by a computer operation \([\circ]_{\mathrm{c}}\). The familiar laws of algebra for the field of the reals do not hold in \(\mathrm{I\!F}\), especially if uncontrolled parallel operations are allowed. These distinctions, of course, carry over to arrays of floating-point numbers that represent real numbers, and the properties of vectors and matrices that we discussed in earlier chapters may not hold for their computer counterparts. For example, the dot product of a nonzero vector with itself is positive (see page 24), but \(\langle x_{\mathrm{c}},x_{\mathrm{c}}\rangle _{\mathrm{c}} = 0\) does not imply \(x_{\mathrm{c}} = 0\).

A good general reference on the topic of numerical linear algebra is Čížková and Čížek (2012).

1 Computer Storage of Vectors and Matrices

The elements of vectors and matrices are represented as ordinary numeric data, as we described in Sect. 10.1, in either fixed-point or floating-point representation.

1.1 Storage Modes

The elements of vectors and matrices are generally stored in a logically contiguous area of the computer’s memory. What is logically contiguous may not be physically contiguous, however.

Accessing data from memory in a single pipeline may take more computer time than the computations themselves. For this reason, computer memory may be organized into separate modules, or banks, with separate paths to the central processing unit. Logical memory is interleaved through the banks; that is, two consecutive logical memory locations are in separate banks. In order to take maximum advantage of the computing power, it may be necessary to be aware of how many interleaved banks the computer system has.

There are no convenient mappings of computer memory that would allow matrices to be stored in a logical rectangular grid, so matrices are usually stored either as columns strung end-to-end (a “column-major” storage) or as rows strung end-to-end (a “row-major” storage). In using a computer language or a software package, sometimes it is necessary to know which way the matrix is stored. The type of matrix computation to be performed may determine whether a vectorized processor should operate on rows or on columns.

For some software to deal with matrices of varying sizes, the user must specify the length of one dimension of the array containing the matrix. (In general, the user must specify the lengths of all dimensions of the array except one.) In Fortran subroutines, it is common to have an argument specifying the leading dimension (number of rows), and in C functions it is common to have an argument specifying the column dimension. (See the examples in Fig. 12.2 on page 563 and Fig. 12.3 on page 564 for illustrations of the leading dimension argument.)
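To illustrate, the following minimal C sketch accesses the elements of a matrix stored column-major in a one-dimensional array, with an explicit leading-dimension argument in the style of the Fortran and BLAS/LAPACK interfaces; the function name colmajor_get is merely illustrative.

  #include <stdio.h>

  /* Element (i,j) (zero-based) of an n x m matrix stored column-major in the
     one-dimensional array a, where lda >= n is the leading dimension (the
     allocated number of rows), as in Fortran and BLAS/LAPACK routines. */
  double colmajor_get(const double *a, int lda, int i, int j)
  {
      return a[i + j * lda];
  }

  int main(void)
  {
      /* A 2 x 3 matrix stored in an array allocated with lda = 4 rows, so
         only the first two entries of each column are used. */
      double a[4 * 3] = {1.0, 2.0, 0.0, 0.0,    /* column 1 */
                         3.0, 4.0, 0.0, 0.0,    /* column 2 */
                         5.0, 6.0, 0.0, 0.0};   /* column 3 */
      printf("element (1,2) is %g\n", colmajor_get(a, 4, 1, 2));  /* prints 6 */
      return 0;
  }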

1.2 Strides

Sometimes in accessing a partition of a given matrix, the elements occur at fixed distances from each other. If the storage is row-major for an n × m matrix, for example, the elements of a given column occur at a fixed distance of m from each other. This distance is called the “stride”, and it is often more efficient to access elements that occur with a fixed stride than it is to access elements randomly scattered.
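For example, in a row-major n × m array, the elements of column j lie at a fixed stride of m; the following minimal C sketch (with an illustrative function name) accesses such a column.

  #include <stdio.h>

  /* Sum of column j of an n x m matrix stored row-major in a: successive
     elements of the column are a fixed stride of m apart in memory. */
  double column_sum(const double *a, int n, int m, int j)
  {
      double s = 0.0;
      for (int i = 0; i < n; i++)
          s += a[i * m + j];       /* stride of m between accesses */
      return s;
  }

  int main(void)
  {
      double a[2 * 3] = {1.0, 2.0, 3.0,     /* row 1 */
                         4.0, 5.0, 6.0};    /* row 2 */
      printf("sum of column 1: %g\n", column_sum(a, 2, 3, 1));  /* 2 + 5 = 7 */
      return 0;
  }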

Just accessing data from the computer’s memory contributes significantly to the time it takes to perform computations. A stride that is not a multiple of the number of banks in an interleaved bank memory organization can measurably increase the computational time in high-performance computing.

1.3 Sparsity

If a matrix has many elements that are zeros, and if the positions of those zeros are easily identified, many operations on the matrix can be sped up. Matrices with many zero elements are called sparse matrices. They occur often in certain types of problems; for example, in the numerical solution of differential equations and in statistical designs of experiments. The first consideration is how to represent and store the matrix, that is, the nonzero values together with their location information. Different software systems may use different schemes to store sparse matrices. The method used in the IMSL Libraries, for example, is described on page 550. An important consideration is how to preserve the sparsity during intermediate computations.
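As a minimal illustration of one such scheme (the coordinate, or “triplet”, format; this is not the IMSL scheme referred to above), only the nonzero values are stored, together with their row and column indices, and a matrix-vector product then requires work proportional only to the number of nonzeros:

  #include <stdio.h>

  /* Coordinate ("triplet") storage of a sparse n x m matrix: the nnz nonzero
     values val[k] are stored along with their locations (row[k], col[k]). */
  typedef struct {
      int n, m, nnz;
      const int *row, *col;
      const double *val;
  } sparse_coo;

  /* y = A x, touching only the stored nonzeros: O(nnz) work. */
  void coo_matvec(const sparse_coo *A, const double *x, double *y)
  {
      for (int i = 0; i < A->n; i++) y[i] = 0.0;
      for (int k = 0; k < A->nnz; k++)
          y[A->row[k]] += A->val[k] * x[A->col[k]];
  }

  int main(void)
  {
      /* The 3 x 3 matrix with nonzero elements a_11 = 2, a_23 = 1, a_32 = 3. */
      int row[] = {0, 1, 2}, col[] = {0, 2, 1};
      double val[] = {2.0, 1.0, 3.0};
      sparse_coo A = {3, 3, 3, row, col, val};
      double x[] = {1.0, 1.0, 1.0}, y[3];
      coo_matvec(&A, x, y);
      printf("y = (%g, %g, %g)\n", y[0], y[1], y[2]);   /* (2, 1, 3) */
      return 0;
  }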

2 General Computational Considerations for Vectors and Matrices

All of the computational methods discussed in Chap. 10 apply to vectors and matrices, but there are some additional general considerations for vectors and matrices.

2.1 Relative Magnitudes of Operands

One common situation that gives rise to numerical errors in computer operations is when a quantity x is transformed to t(x) but the value computed is unchanged:

$$\displaystyle{ [t(x)]_{\mathrm{c}} = [x]_{\mathrm{c}}; }$$
(11.1)

that is, the operation actually accomplishes nothing. A type of transformation that has this problem is

$$\displaystyle{ t(x) = x+\epsilon, }$$
(11.2)

where | ε | is much smaller than | x |. If all we wish to compute is x + ε, the fact that [x + ε]c = [x]c is probably not important. Usually, of course, this simple computation is part of some larger set of computations in which ε was computed. This, therefore, is the situation we want to anticipate and avoid.

Another type of problem is the addition to x of a computed quantity y that overwhelms x in magnitude. In this case, we may have

$$\displaystyle{ [x + y]_{\mathrm{c}} = [y]_{\mathrm{c}}. }$$
(11.3)

Again, this is a situation we want to anticipate and avoid.
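Both situations are easy to reproduce in IEEE double-precision arithmetic; the following minimal C fragment demonstrates them.

  #include <stdio.h>

  int main(void)
  {
      /* Equation (11.2): |eps| is far below x times the machine epsilon
         (about 2.2e-16 in double precision), so [x + eps]_c = [x]_c. */
      double x = 1.0, eps = 1.0e-20;
      printf("x + eps == x ?  %s\n", (x + eps == x) ? "yes" : "no");  /* yes */

      /* Equation (11.3): a computed y overwhelms x, so [x + y]_c = [y]_c. */
      double y = 1.0e20;
      printf("x + y == y ?  %s\n", (x + y == y) ? "yes" : "no");      /* yes */
      return 0;
  }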

2.1.1 Condition

A measure of the worst-case numerical error in numerical computation involving a given mathematical entity is the “condition” of that entity for the particular computations. The condition number of a matrix is the most generally useful such measure. For the matrix A, we denote the condition number as κ(A). We discussed the condition number in Sect. 6.1 and illustrated it in the toy example of equation (6.1). The condition number provides a bound on the relative norms of a “correct” solution to a linear system and a solution to a nearby problem. A specific condition number therefore depends on the norm, and we defined \(\kappa _{1}\), \(\kappa _{2}\), and \(\kappa _{\infty }\) condition numbers (and saw that they are generally roughly of the same magnitude). We saw in equation (6.10) that the L2 condition number, \(\kappa _{2}(A)\), is the ratio of the magnitudes of the two extreme singular values of A (for a symmetric matrix, the two extreme eigenvalues).

The condition of data depends on the particular computations to be performed. The relative magnitudes of other eigenvalues (or singular values) may be more relevant for some types of computations. Also, we saw in Sect. 10.3.2 that the “stiffness” measure in equation (10.3.2.7) is a more appropriate measure of the extent of the numerical error to be expected in computing variances.

2.1.2 Pivoting

Pivoting, discussed on page 277, is a method for avoiding a situation like that in equation (11.3). In Gaussian elimination, for example, we do an addition, x + y, where y is the result of having divided some element of the matrix by some other element and x is some other element of the matrix. If the divisor is very small in magnitude, y is large and may overwhelm x as in equation (11.3).

2.1.3 “Modified” and “Classical” Gram-Schmidt Transformations

Another example of how to avoid a situation similar to that in equation (11.1) is the use of the correct form of the Gram-Schmidt transformations.

The orthogonalizing transformations shown in equations (2.56) on page 38 are the basis for Gram-Schmidt transformations of matrices. These transformations in turn are the basis for other computations, such as the QR factorization. (Exercise 5.10 required you to apply Gram-Schmidt transformations to develop a QR factorization.)

As mentioned on page 38, there are two ways we can extend equations (2.56) to more than two vectors, and the method given in Algorithm 2.1 is the correct way to do it. At the kth stage of the Gram-Schmidt method, the vector \(x_{k}^{(k)}\) is taken as \(x_{k}^{(k-1)}\), and the vectors \(x_{k+1}^{(k)},x_{k+2}^{(k)},\ldots,x_{m}^{(k)}\) are all made orthogonal to \(x_{k}^{(k)}\). After the first stage, all vectors have been transformed. This method is sometimes called “modified Gram-Schmidt” because some people have performed the basic transformations in a different way, so that at the kth iteration, starting at k = 2, the first k − 1 vectors are unchanged (i.e., \(x_{i}^{(k)} = x_{i}^{(k-1)}\) for i = 1, 2, …, k − 1), and \(x_{k}^{(k)}\) is made orthogonal to the k − 1 previously orthogonalized vectors \(x_{1}^{(k)},x_{2}^{(k)},\ldots,x_{k-1}^{(k)}\). This method is called “classical Gram-Schmidt” for no particular reason. The “classical” method is not as stable, and it should not be used; see Rice (1966) and Björck (1967) for discussions. In this book, “Gram-Schmidt” is the same as what is sometimes called “modified Gram-Schmidt”. In Exercise 11.1, you are asked to experiment with the relative numerical accuracy of the “classical Gram-Schmidt” and the correct Gram-Schmidt. The problems with the former method show up with the simple set of vectors \(x_{1} = (1,\epsilon,\epsilon )\), \(x_{2} = (1,\epsilon,0)\), and \(x_{3} = (1,0,\epsilon )\), with ε small enough that

$$\displaystyle{[1 +\epsilon ^{2}]_{\mathrm{ c}} = 1.}$$
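A full implementation is the subject of Exercise 11.1, but the following self-contained C sketch of the two orderings, applied to the three vectors above with \(\epsilon = 10^{-8}\) (small enough that \([1+\epsilon ^{2}]_{\mathrm{c}} = 1\) in double precision), already shows the difference; the function and variable names are merely illustrative.

  #include <stdio.h>
  #include <math.h>

  #define N 3   /* length of each vector and number of vectors */

  static double dot(const double *x, const double *y)
  {
      double s = 0.0;
      for (int i = 0; i < N; i++) s += x[i] * y[i];
      return s;
  }

  static void normalize(double *x)
  {
      double nrm = sqrt(dot(x, x));
      for (int i = 0; i < N; i++) x[i] /= nrm;
  }

  /* "Classical" Gram-Schmidt: x[k] is orthogonalized against the previously
     formed vectors, with every projection coefficient computed from the
     original x[k] (saved in xk0). */
  static void cgs(double x[N][N])
  {
      for (int k = 0; k < N; k++) {
          double xk0[N];
          for (int i = 0; i < N; i++) xk0[i] = x[k][i];
          for (int j = 0; j < k; j++) {
              double r = dot(x[j], xk0);
              for (int i = 0; i < N; i++) x[k][i] -= r * x[j][i];
          }
          normalize(x[k]);
      }
  }

  /* The correct (modified) Gram-Schmidt: as soon as x[k] is normalized, all
     of the remaining vectors are orthogonalized against it. */
  static void mgs(double x[N][N])
  {
      for (int k = 0; k < N; k++) {
          normalize(x[k]);
          for (int j = k + 1; j < N; j++) {
              double r = dot(x[k], x[j]);
              for (int i = 0; i < N; i++) x[j][i] -= r * x[k][i];
          }
      }
  }

  static void report(const char *name, double x[N][N])
  {
      printf("%s: <q1,q2> = %9.2e  <q1,q3> = %9.2e  <q2,q3> = %9.2e\n",
             name, dot(x[0], x[1]), dot(x[0], x[2]), dot(x[1], x[2]));
  }

  int main(void)
  {
      double eps = 1.0e-8;         /* [1 + eps^2]_c = 1 in double precision */
      double a[N][N] = {{1.0, eps, eps}, {1.0, eps, 0.0}, {1.0, 0.0, eps}};
      double b[N][N];
      for (int k = 0; k < N; k++)
          for (int i = 0; i < N; i++) b[k][i] = a[k][i];

      cgs(a);   /* <q2,q3> comes out near 0.7: far from orthogonal       */
      mgs(b);   /* all inner products come out of order eps or smaller   */
      report("classical", a);
      report("modified ", b);
      return 0;
  }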

2.2 Iterative Methods

As we saw in Chap. 6, we often have a choice between direct methods (that is, methods that compute a closed-form solution) and iterative methods. Iterative methods are usually to be favored for large, sparse systems.

Iterative methods are based on a sequence of approximations that (it is hoped) converge to the correct solution. The fundamental trade-off in iterative methods is between the amount of work expended in getting a good approximation at each step and the number of steps required for convergence.

2.2.1 Preconditioning

In order to achieve acceptable rates of convergence for iterative algorithms, it is often necessary to precondition the system; that is, to replace the system Ax = b by the system

$$\displaystyle{M^{-1}Ax = M^{-1}b}$$

for some suitable matrix M. As we indicated in Chaps. 6 and 7, the choice of M involves some art, and we will not consider any of the results here. Benzi (2002) provides a useful survey of the general problem and work up to that time, but this is an area of active research.

2.2.2 Restarting and Rescaling

In many iterative methods, not all components of the computations are updated in each iteration. An approximation to a given matrix or vector may be adequate during some sequence of computations without change, but then at some point the approximation is no longer close enough, and a new approximation must be computed. An example of this is in the use of quasi-Newton methods in optimization in which an approximate Hessian is updated, as indicated in equation (4.28) on page 202. We may, for example, just compute an approximation to the Hessian every few iterations, perhaps using second differences, and then use that approximate matrix for a few subsequent iterations.

Another example of the need to restart or to rescale is in the use of fast Givens rotations. As we mentioned on page 241 when we described the fast Givens rotations, the diagonal elements in the accumulated C matrices in the fast Givens rotations can become widely different in absolute values, so to avoid excessive loss of accuracy, it is usually necessary to rescale the elements periodically. Anda and Park (1994, 1996) describe methods of doing the rescaling dynamically. Their methods involve adjusting the first diagonal element by multiplication by the square of the cosine and adjusting the second diagonal element by division by the square of the cosine. Bindel et al. (2002) discuss in detail techniques for performing Givens rotations efficiently while still maintaining accuracy. (The BLAS routines (see Sect. 12.2.1) rotmg and rotm, respectively, set up and apply fast Givens rotations.)

2.2.3 Preservation of Sparsity

In computations involving large sparse systems, we may want to preserve the sparsity, even if that requires using approximations, as discussed in Sect. 5.10.2. Fill-in (when a zero position in a sparse matrix becomes nonzero) would cause loss of the computational and storage efficiencies of software for sparse matrices.

In forming a preconditioner for a sparse matrix A, for example, we may choose a matrix \(M =\widetilde{ L}\widetilde{U}\), where \(\widetilde{L}\) and \(\widetilde{U}\) are approximations to the matrices in an LU decomposition of A, as in equation (5.51). These matrices are constructed as indicated in equation (5.52) so as to have zeros everywhere A has zeros, and \(A \approx \widetilde{ L}\widetilde{U}\). This is called incomplete factorization; such an approximate factorization is often more useful than an exact factorization because of its computational efficiency.

2.2.4 Iterative Refinement

Even if we are using a direct method, it may be useful to refine the solution by one step computed in extended precision. A method for iterative refinement of a solution of a linear system is given in Algorithm 6.3.
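A minimal C sketch of the idea (not a reproduction of Algorithm 6.3): solve the system, compute the residual with the accumulation carried out in extended (long double) precision, solve for a correction, and update. The solver here is naive Gaussian elimination without pivoting, purely to keep the sketch self-contained; in practice the factorization from the first solve would be saved and reused.

  #include <stdio.h>

  #define N 3

  /* Naive Gaussian elimination without pivoting (illustration only).
     Overwrites a and b; the solution is returned in x. */
  static void solve(double a[N][N], double b[N], double x[N])
  {
      for (int k = 0; k < N - 1; k++)
          for (int i = k + 1; i < N; i++) {
              double m = a[i][k] / a[k][k];
              for (int j = k; j < N; j++) a[i][j] -= m * a[k][j];
              b[i] -= m * b[k];
          }
      for (int i = N - 1; i >= 0; i--) {
          double s = b[i];
          for (int j = i + 1; j < N; j++) s -= a[i][j] * x[j];
          x[i] = s / a[i][i];
      }
  }

  static void copy_system(const double A[N][N], const double b[N],
                          double Aw[N][N], double bw[N])
  {
      for (int i = 0; i < N; i++) {
          bw[i] = b[i];
          for (int j = 0; j < N; j++) Aw[i][j] = A[i][j];
      }
  }

  int main(void)
  {
      /* A mildly ill-conditioned system with exact solution (1, 1, 1). */
      double A[N][N] = {{1.0, 1.0,    1.0},
                        {1.0, 1.0001, 1.0},
                        {1.0, 1.0,    1.0001}};
      double b[N], x[N], d[N], r[N], Aw[N][N], bw[N];
      for (int i = 0; i < N; i++) {
          b[i] = 0.0;
          for (int j = 0; j < N; j++) b[i] += A[i][j];
      }

      copy_system(A, b, Aw, bw);
      solve(Aw, bw, x);                      /* initial computed solution */

      /* One refinement step: residual accumulated in extended precision. */
      for (int i = 0; i < N; i++) {
          long double s = b[i];
          for (int j = 0; j < N; j++) s -= (long double)A[i][j] * x[j];
          r[i] = (double)s;
      }
      copy_system(A, r, Aw, bw);
      solve(Aw, bw, d);                      /* correction: A d = r */
      for (int i = 0; i < N; i++) x[i] += d[i];

      printf("refined solution: %.15f %.15f %.15f\n", x[0], x[1], x[2]);
      return 0;
  }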

2.3 Assessing Computational Errors

As we discuss in Sect. 10.2.2 on page 485, we measure error by a scalar quantity, either as absolute error, \(\vert \tilde{r} - r\vert\), where r is the true value and \(\tilde{r}\) is the computed or rounded value, or as relative error, \(\vert \tilde{r} - r\vert /\vert r\vert\) (as long as r ≠ 0). We discuss general ways of reducing these errors in Sect. 10.3.2.

2.3.1 Errors in Vectors and Matrices

The errors in vectors or matrices are generally expressed in terms of norms; for example, the relative error in the representation of the vector v, or as a result of computing v, may be expressed as \(\|\tilde{v} - v\|/\|v\|\) (as long as ∥v∥ ≠ 0), where \(\tilde{v}\) is the computed vector. We often use the notation \(\tilde{v} = v +\delta v\), and so ∥δv∥∕∥v∥ is the relative error. The choice of which vector norm to use may depend on practical considerations about the errors in the individual elements. The \(L_{\infty }\) norm, for example, gives weight only to the element with the largest single error, while the \(L_{1}\) norm gives weight to all of the error magnitudes equally.

2.3.2 Assessing Errors in Given Computations

In real-life applications, the correct solution is not known, but we would still like to have some way of assessing the accuracy using the data themselves. Sometimes a convenient way to do this in a given problem is to perform internal consistency tests. An internal consistency test may be an assessment of the agreement of various parts of the output. Relationships among the output are exploited to ensure that the individually computed quantities satisfy these relationships. Other internal consistency tests may be performed by comparing the results of the solutions of two problems with a known relationship.

The solution to the linear system Ax = b has a simple relationship to the solution to the linear system \(Ax = b + ca_{j}\), where \(a_{j}\) is the jth column of A and c is a constant: the solution to the modified system is the original solution with c added to its jth element. A useful check on the accuracy of a computed solution to Ax = b is therefore to compare it with a computed solution to the modified system. Of course, if the expected relationship does not hold, we do not know which solution is incorrect, but it is probably not a good idea to trust either. A similar check on the accuracy of computed regression coefficients for regressing y on \(x_{1},\ldots,x_{m}\) is to compare them with the computed regression coefficients for regressing \(y + dx_{j}\) on \(x_{1},\ldots,x_{m}\). If the expected relationships do not obtain, the analyst has strong reason to doubt the accuracy of the computations.

Another simple modification of the problem of solving a linear system with a known exact effect is the permutation of the rows or columns. Although this perturbation of the problem does not change the solution, it does sometimes result in a change in the computations, and hence it may result in a different computed solution. This obviously would alert the user to problems in the computations.

A simple internal consistency test that is applicable to many problems is to use two levels of precision in some of the computations. In using this test, one must be careful to make sure that the input data are the same. Rounding of the input data may cause incorrect output to result, but that is not the fault of the computational algorithm.

Internal consistency tests cannot confirm that the results are correct; they can only give an indication that the results are incorrect.

3 Multiplication of Vectors and Matrices

Arithmetic on vectors and matrices involves arithmetic on the individual elements. The arithmetic on the individual elements is performed as we have discussed in Sect. 10.2.

The way the storage of the individual elements is organized is very important for the efficiency of computations. Also, the way the computer memory is organized and the nature of the numerical processors affect the efficiency and may be an important consideration in the design of algorithms for working with vectors and matrices.

The best methods for performing operations on vectors and matrices in the computer may not be the methods that are suggested by the definitions of the operations.

In most numerical computations with vectors and matrices, there is more than one way of performing the operations on the scalar elements. Consider the problem of evaluating the matrix times vector product, c = Ab, where A is n × m. There are two obvious ways of doing this:

  • compute each of the n elements of c, one at a time, as an inner product of m-vectors, \(c_{i} = a_{i}^{\mathrm{T}}b =\sum _{j}a_{ij}b_{j}\), or

  • update the computation of all of the elements of c simultaneously as

    1. For i = 1, …, n, let \(c_{i}^{(0)} = 0\).

    2. For j = 1, …, m,

       {

           for i = 1, …, n,

           {

              let \(c_{i}^{(j)} = c_{i}^{(j-1)} + a_{ij}b_{j}\).

           }

       }
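In C, with A stored row-major, the two approaches above differ only in which loop is outermost; the following minimal sketch (with illustrative function names) implements both.

  #include <stdio.h>

  /* c = A b for an n x m matrix A stored row-major. */

  /* Method 1: each c[i] is computed separately as the inner product a_i^T b. */
  void matvec_inner(int n, int m, const double *A, const double *b, double *c)
  {
      for (int i = 0; i < n; i++) {
          double s = 0.0;
          for (int j = 0; j < m; j++) s += A[i * m + j] * b[j];
          c[i] = s;
      }
  }

  /* Method 2: all elements of c are updated together, one column of A at a
     time scaled by b[j] -- an axpy-style update. */
  void matvec_axpy(int n, int m, const double *A, const double *b, double *c)
  {
      for (int i = 0; i < n; i++) c[i] = 0.0;
      for (int j = 0; j < m; j++)
          for (int i = 0; i < n; i++)
              c[i] += A[i * m + j] * b[j];
  }

  int main(void)
  {
      double A[2 * 3] = {1.0, 2.0, 3.0,
                         4.0, 5.0, 6.0};
      double b[3] = {1.0, 1.0, 1.0}, c1[2], c2[2];
      matvec_inner(2, 3, A, b, c1);
      matvec_axpy(2, 3, A, b, c2);
      printf("inner: (%g, %g)   axpy: (%g, %g)\n", c1[0], c1[1], c2[0], c2[1]);
      return 0;
  }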

If there are p processors available for parallel processing, we could use a fan-in algorithm (see page 487) to evaluate Ab as a set of inner products.

The order of the computations is \(\mathrm{O}(nm)\) (or \(\mathrm{O}(n^{2})\) if m and n are approximately equal).

Multiplying two matrices A and B can be considered as a problem of multiplying several vectors \(b_{i}\) by a matrix A, as described above. In the following we will assume A is n × m and B is m × p, and we will use the notation \(a_{i}\) to represent the ith column of A, \(a_{i}^{\mathrm{T}}\) to represent the ith row of A, \(b_{i}\) to represent the ith column of B, \(c_{i}\) to represent the ith column of C = AB, and so on. (This notation is somewhat confusing because here we are not using \(a_{i}^{\mathrm{T}}\) to represent the transpose of \(a_{i}\) as we normally do. The notation should be clear in context, however.) Using the inner product method above, the first step of the matrix multiplication forms the (1, 1) element of the product as the inner product \(c_{11} = a_{1}^{\mathrm{T}}b_{1}\).

Using the second method above, in which the elements of the product vector are updated all at once, the first step of the matrix multiplication forms a partial sum for the entire first column of the product, \(c_{1}^{(1)} = b_{11}a_{1}\).

The next and each successive step in this method are axpy operations:

$$\displaystyle{c_{1}^{(k+1)} = b_{(k+1),1}a_{k+1} + c_{1}^{(k)},}$$

for k = 1, …, m − 1.

Another method for matrix multiplication is to perform axpy operations using all of the elements of \(b_{1}^{\mathrm{T}}\) before completing the computations for any of the columns of C. In this method, the elements of the product are built as the sum of the outer products \(a_{i}b_{i}^{\mathrm{T}}\). In the notation used above for the other methods, the first step forms

$$\displaystyle{C^{(1)} = a_{1}b_{1}^{\mathrm{T}},}$$

and the update is

$$\displaystyle{C^{(k+1)} = a_{k+1}b_{k+1}^{\mathrm{T}} + C^{(k)},}$$

for k = 1, …, m − 1.
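A minimal C sketch of this outer-product formulation, with all matrices stored row-major (the function name is merely illustrative):

  #include <stdio.h>

  /* C = A B accumulated as a sum of outer products, C <- a_k b_k^T + C,
     where a_k is the k-th column of A and b_k^T is the k-th row of B.
     A is n x m, B is m x p, and all matrices are stored row-major. */
  void matmul_outer(int n, int m, int p,
                    const double *A, const double *B, double *C)
  {
      for (int i = 0; i < n * p; i++) C[i] = 0.0;
      for (int k = 0; k < m; k++)                 /* one outer product per k */
          for (int i = 0; i < n; i++)
              for (int j = 0; j < p; j++)
                  C[i * p + j] += A[i * m + k] * B[k * p + j];
  }

  int main(void)
  {
      double A[2 * 2] = {1.0, 2.0,
                         3.0, 4.0};
      double B[2 * 2] = {5.0, 6.0,
                         7.0, 8.0};
      double C[2 * 2];
      matmul_outer(2, 2, 2, A, B, C);
      printf("%g %g\n%g %g\n", C[0], C[1], C[2], C[3]);   /* 19 22 / 43 50 */
      return 0;
  }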

The order of computations for any of these methods is O(nmp), or just \(\mathrm{O}(n^{3})\) if the dimensions are all approximately the same. Strassen’s method, discussed next, reduces the order of the computations.

3.1 Strassen’s Algorithm

Another method for multiplying matrices that can be faster for large matrices is the so-called Strassen algorithm (from Strassen 1969). Suppose A and B are square matrices with equal and even dimensions. Partition them into submatrices of equal size, and consider the block representation of the product,

$$\displaystyle{\left [\begin{array}{cc} C_{11} & C_{12} \\ C_{21} & C_{22}\\ \end{array} \right ] = \left [\begin{array}{cc} A_{11} & A_{12} \\ A_{21} & A_{22}\\ \end{array} \right ]\left [\begin{array}{cc} B_{11} & B_{12} \\ B_{21} & B_{22}\\ \end{array} \right ],}$$

where all blocks are of equal size. Form

$$\displaystyle\begin{array}{rcl} P_{1}& =& (A_{11} + A_{22})(B_{11} + B_{22}), {}\\ P_{2}& =& (A_{21} + A_{22})B_{11}, {}\\ P_{3}& =& A_{11}(B_{12} - B_{22}), {}\\ P_{4}& =& A_{22}(B_{21} - B_{11}), {}\\ P_{5}& =& (A_{11} + A_{12})B_{22}, {}\\ P_{6}& =& (A_{21} - A_{11})(B_{11} + B_{12}), {}\\ P_{7}& =& (A_{12} - A_{22})(B_{21} + B_{22}). {}\\ \end{array}$$

Then we have (see the discussion on partitioned matrices in Sect. 3.1)

$$\displaystyle\begin{array}{rcl} C_{11}& =& P_{1} + P_{4} - P_{5} + P_{7}, {}\\ C_{12}& =& P_{3} + P_{5}, {}\\ C_{21}& =& P_{2} + P_{4}, {}\\ C_{22}& =& P_{1} + P_{3} - P_{2} + P_{6}. {}\\ \end{array}$$

Notice that the total number of multiplications is 7 instead of the 8 it would be in forming

$$\displaystyle{\left [\begin{array}{cc} A_{11} & A_{12} \\ A_{21} & A_{22}\\ \end{array} \right ]\left [\begin{array}{cc} B_{11} & B_{12} \\ B_{21} & B_{22}\\ \end{array} \right ]}$$

directly. Whether the blocks are matrices or scalars, the same analysis holds. Of course, in either case there are more additions. The addition of two k × k matrices is O(k 2), so for a large enough value of n the total number of operations using the Strassen algorithm is less than the number required for performing the multiplication in the usual way.

The partitioning of the matrix factors can also be used recursively; that is, in the formation of the P matrices. If the dimension, n, contains a factor \(2^{e}\), the algorithm can be used directly e times, and then conventional matrix multiplication can be used on any submatrix of dimension \(\leq n/2^{e}\). If the dimension of the matrices is not even, or if the matrices are not square, it may be worthwhile to pad the matrices with zeros, and then use the Strassen algorithm recursively.

The order of computations of the Strassen algorithm is \(\mathrm{O}(n^{\log _{2}7})\), instead of \(\mathrm{O}(n^{3})\) as in the ordinary method (\(\log _{2}7 \approx 2.81\)). The algorithm can be implemented in parallel (see Bailey et al. 1990), and this algorithm is actually used in some software systems.
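To make the bookkeeping concrete, the following C sketch applies one level of the formulas above to 2 × 2 matrices, so that each “block” is a scalar; a practical implementation would apply the same formulas recursively to submatrix blocks, as described above.

  #include <stdio.h>

  /* One level of Strassen's algorithm for a 2 x 2 product: the seven
     multiplications P1,...,P7 and the combinations that give C. */
  void strassen2x2(const double A[2][2], const double B[2][2], double C[2][2])
  {
      double P1 = (A[0][0] + A[1][1]) * (B[0][0] + B[1][1]);
      double P2 = (A[1][0] + A[1][1]) * B[0][0];
      double P3 = A[0][0] * (B[0][1] - B[1][1]);
      double P4 = A[1][1] * (B[1][0] - B[0][0]);
      double P5 = (A[0][0] + A[0][1]) * B[1][1];
      double P6 = (A[1][0] - A[0][0]) * (B[0][0] + B[0][1]);
      double P7 = (A[0][1] - A[1][1]) * (B[1][0] + B[1][1]);

      C[0][0] = P1 + P4 - P5 + P7;
      C[0][1] = P3 + P5;
      C[1][0] = P2 + P4;
      C[1][1] = P1 + P3 - P2 + P6;
  }

  int main(void)
  {
      double A[2][2] = {{1.0, 2.0}, {3.0, 4.0}};
      double B[2][2] = {{5.0, 6.0}, {7.0, 8.0}};
      double C[2][2];
      strassen2x2(A, B, C);
      /* Conventional multiplication gives [19 22; 43 50]. */
      printf("%g %g\n%g %g\n", C[0][0], C[0][1], C[1][0], C[1][1]);
      return 0;
  }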

Several algorithms have been developed that use ideas similar to those in Strassen’s algorithm and are asymptotically faster; that is, their order of computations is \(\mathrm{O}(n^{k})\) with \(k <\log _{2}7\). (Notice that k must be at least 2 because there are \(n^{2}\) elements.) None of the algorithms that are asymptotically faster than Strassen’s is competitive in practice, however, because they all have much larger start-up costs.

3.2 Matrix Multiplication Using MapReduce

While methods such as Strassen’s algorithm achieve speedup by decreasing the total number of computations, other methods increase the overall speed by performing computations in parallel. Although not all computations can be performed in parallel and there is some overhead in additional computations for setting up the job, when multiple processors are available, the total number of computations may not be very important. One of the major tasks in parallel processing is just keeping track of the individual computations. MapReduce (see page 515) can sometimes be used in coordinating these operations.

For the matrix multiplication AB, viewed as a set of inner products, with i running over the indexes of the rows of A and j running over the indexes of the columns of B, we merely access the ith row of A and the jth column of B and form their inner product as the (i, j)th element of the product AB. In the language of relational databases, in which the two matrices are sets of data with row and column identifiers, this amounts to accessing the rows of A and the columns of B one by one, matching the elements of a row and a column so that the column designator of the row element matches the row designator of the column element, summing the products of the A row elements and the B column elements, and then grouping the sums of the products (that is, the inner products) by the A row designators and the B column designators. In SQL, it is

  SELECT A.row, B.col, SUM(A.value*B.value)
    FROM A, B
    WHERE A.col = B.row
    GROUP BY A.row, B.col;

In a distributed computing environment, MapReduce could be used to perform these operations. However the matrices are stored, possibly each distributed over multiple environments, MapReduce would first map the matrix elements using their respective row and column indices as keys. It would then make the appropriate associations of the row elements from A with the column elements from B and perform the multiplications and the sums. Finally, the sums of the products (that is, the inner products) would be associated with the appropriate keys for the output. This process is described in many elementary treatments of Hadoop, such as Leskovec, Rajaraman, and Ullman (2014, Chapter 2).

4 Other Matrix Computations

Many other matrix computations depend on a matrix factorization. The most useful factorization is the QR factorization. It can be computed stably using either Householder reflections, Givens rotations, or the Gram-Schmidt procedure, as described respectively in Sects. 5.8.8, 5.8.9, and 5.8.10 (beginning on page 252). This is one time when the computational methods can follow the mathematical descriptions rather closely. Iterations using the QR factorization are used in a variety of matrix computations; for example, they are used in the most common method for evaluating eigenvalues, as described in Sect. 7.4, beginning on page 318.

Another very useful factorization is the singular value decomposition (SVD). The computations for the SVD, described in Sect. 7.7 beginning on page 322, are efficient and preserve numerical accuracy. A major difference between the QR factorization and the SVD is that the computations for the SVD are necessarily iterative (recall the remarks at the beginning of Chap. 7).

4.1 Rank Determination

It is often easy to determine that a matrix is of full rank. If the matrix is not of full rank, however, or if it is very ill-conditioned, it is often difficult to determine its rank. This is because the computations to determine the rank must eventually decide whether certain computed quantities are effectively 0. It is difficult to approximate 0; the relative error (if defined) would be either 0 or infinite. The rank-revealing QR factorization (equation (5.43), page 251) is the preferred method for estimating the rank. (Although I refer to this as “estimation”, it more properly should be called “approximation”. “Estimation” and the related term “testing”, as used in statistical applications, apply to an unknown object, as in estimating or testing the rank of a model matrix as discussed in Sect. 9.5.5, beginning on page 433.) When this decomposition is used to estimate the rank, it is recommended that complete pivoting be used in computing the decomposition. The LDU decomposition, described on page 242, can be modified in the same way to estimate the rank of a matrix. Again, it is recommended that complete pivoting be used in computing the decomposition.

The singular value decomposition (SVD) shown in equation (3.276) on page 161 also provides an indication of the rank of the matrix. For the n × m matrix A, the SVD is

$$\displaystyle{A = UDV ^{\mathrm{T}},}$$

where U is an n × n orthogonal matrix, V is an m × m orthogonal matrix, and D is a diagonal matrix of the singular values. The number of nonzero singular values is the rank of the matrix. Of course, again, the question is whether or not the singular values are zero. It is unlikely that the values computed are exactly zero.
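In practice, therefore, a tolerance is needed to decide which computed singular values to treat as zero. The following minimal C sketch uses the LAPACKE interface to LAPACK (assumed to be available); the tolerance \(\max (n,m)\,\epsilon \,\sigma _{1}\) is a common convention, not a prescription from the text.

  #include <stdio.h>
  #include <float.h>
  #include <lapacke.h>

  int main(void)
  {
      /* A 3 x 3 matrix of rank 2: the third row is the sum of the first two. */
      double a[3 * 3] = {1.0, 2.0, 3.0,
                         4.0, 5.0, 6.0,
                         5.0, 7.0, 9.0};
      double s[3], u[1], vt[1], superb[2];
      lapack_int info = LAPACKE_dgesvd(LAPACK_ROW_MAJOR, 'N', 'N', 3, 3,
                                       a, 3, s, u, 1, vt, 1, superb);
      if (info != 0) { printf("SVD failed\n"); return 1; }

      /* Count the singular values that exceed a tolerance relative to the
         largest one; the rest are treated as zero. */
      double tol = 3 * DBL_EPSILON * s[0];
      int rank = 0;
      for (int i = 0; i < 3; i++)
          if (s[i] > tol) rank++;
      printf("singular values: %g %g %g   estimated rank: %d\n",
             s[0], s[1], s[2], rank);
      return 0;
  }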

A problem related to rank determination is to approximate the matrix A with a matrix A r of rank r ≤ rank(A). The singular value decomposition provides an easy way to do this,

$$\displaystyle{A_{r} = UD_{r}V ^{\mathrm{T}},}$$

where D r is the same as D, except with zeros replacing all but the r largest singular values. A result of Eckart and Young (1936) guarantees A r is the rank r matrix closest to A as measured by the Frobenius norm,

$$\displaystyle{\|A - A_{r}\|_{\mathrm{F}},}$$

(see Sect. 3.10). This kind of matrix approximation is the basis for dimension reduction by principal components.

4.2 Computing the Determinant

The determinant of a square matrix can be obtained easily as the product of the diagonal elements of the triangular matrix in any factorization that yields an orthogonal matrix times a triangular matrix. As we have stated before, however, it is not often that the determinant need be computed.

One application in statistics is in optimal experimental designs. The D-optimal criterion, for example, chooses the design matrix, X, such that \(\vert X^{\mathrm{T}}X\vert\) is maximized (see Sect. 9.3.2).
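With an orthogonal-triangular factorization, the sign of the determinant also depends on the determinant (±1) of the orthogonal factor. The following minimal C sketch instead uses an LU factorization with partial pivoting (via LAPACKE, assumed to be available), in which the pivot record makes the sign easy to recover: the determinant is the product of the diagonal of U, with a sign change for each row interchange.

  #include <stdio.h>
  #include <lapacke.h>

  int main(void)
  {
      /* Determinant of a 3 x 3 matrix from its LU factorization PA = LU. */
      double a[3 * 3] = {2.0, 1.0, 1.0,
                         4.0, 3.0, 3.0,
                         8.0, 7.0, 9.0};     /* determinant is 4 */
      lapack_int ipiv[3];
      lapack_int info = LAPACKE_dgetrf(LAPACK_ROW_MAJOR, 3, 3, a, 3, ipiv);
      if (info < 0) { printf("argument error\n"); return 1; }

      double det = (info > 0) ? 0.0 : 1.0;   /* info > 0: U is exactly singular */
      if (info == 0)
          for (int i = 0; i < 3; i++) {
              det *= a[i * 3 + i];               /* diagonal element of U */
              if (ipiv[i] != i + 1) det = -det;  /* ipiv is 1-based; each row
                                                    interchange flips the sign */
          }
      printf("det = %g\n", det);
      return 0;
  }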

4.3 Computing the Condition Number

The computation of a condition number of a matrix can be quite involved. Clearly, we would not want to use the definition, \(\kappa (A) =\| A\|\,\| A^{-1}\|\), directly, since computing \(A^{-1}\) requires roughly as much work as solving the original linear system itself. Although the choice of the norm affects the condition number, the various condition numbers are generally of roughly the same magnitude (recall the discussion in Sect. 6.1), so we choose whichever condition number is easiest to compute or estimate.

Various methods have been proposed to estimate the condition number using relatively simple computations. Cline et al. (1979) suggest a method that is easy to perform and is widely used. For a given matrix A and some vector v, solve

$$\displaystyle{A^{\mathrm{T}}x = v}$$

and then

$$\displaystyle{Ay = x.}$$

By tracking the computations in the solution of these systems, Cline et al. conclude that

$$\displaystyle{\frac{\|y\|} {\|x\|}}$$

is approximately equal to, but less than, \(\| A^{-1}\|\). This estimate is used with respect to the L1 norm in the LINPACK software library (see page 558 and Dongarra et al. 1979), but the approximation is valid for any norm. Solving the two systems above probably does not require much additional work, because the original problem was likely that of solving Ax = b, and the factorization computed for that solution can be reused for the additional right-hand sides. The approximation is better if v is chosen so that ∥x∥ is as large as possible relative to ∥v∥.

Stewart (1980) and Cline and Rew (1983) investigated the validity of the approximation. The LINPACK estimator can underestimate the true condition number considerably, although generally not by an order of magnitude. Cline et al. (1982) give a method of estimating the L2 condition number of a matrix that is a modification of the L1 condition number used in LINPACK. This estimate generally performs better than the L1 estimate, but the Cline/Conn/Van Loan estimator still can have problems (see Bischof 1990).

Hager (1984) gives another method for an L1 condition number. Higham (1988) provides an improvement of Hager’s method, given as Algorithm 11.1 below, which is used in the LAPACK software library (Anderson et al. 2000).

Algorithm 11.1

The Hager/Higham LAPACK condition number estimator γ of the n × n matrix A
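The steps of the estimator are not reproduced here, but the following C sketch conveys the basic Hager (1984)/Higham (1988) iteration on which it is based: starting from x = (1/n, …, 1/n), alternately solve with A and with \(A^{\mathrm{T}}\) (here by reusing an LU factorization computed with LAPACKE, which is assumed to be available) to build up an estimate γ of \(\| A^{-1}\|_{1}\), and take \(\| A\|_{1}\,\gamma\) as the estimate of \(\kappa _{1}(A)\). This is an illustrative sketch only; the LAPACK estimator adds further safeguards.

  #include <stdio.h>
  #include <stdlib.h>
  #include <math.h>
  #include <lapacke.h>

  static double norm1(const double *x, int n)
  {
      double s = 0.0;
      for (int i = 0; i < n; i++) s += fabs(x[i]);
      return s;
  }

  /* Estimate gamma ~= ||inv(A)||_1 by the basic Hager iteration, using the LU
     factors (lu, ipiv) from dgetrf for the solves with A and A^T. */
  static double hager_inv_norm1(int n, const double *lu, const lapack_int *ipiv)
  {
      double *x = malloc(n * sizeof *x);
      double *y = malloc(n * sizeof *y);
      double *z = malloc(n * sizeof *z);
      double est = 0.0;
      for (int i = 0; i < n; i++) x[i] = 1.0 / n;
      for (int iter = 0; iter < 5; iter++) {      /* a few iterations suffice */
          for (int i = 0; i < n; i++) y[i] = x[i];
          LAPACKE_dgetrs(LAPACK_ROW_MAJOR, 'N', n, 1, lu, n, ipiv, y, 1);
          est = norm1(y, n);                      /* y = inv(A) x             */
          for (int i = 0; i < n; i++) z[i] = (y[i] >= 0.0) ? 1.0 : -1.0;
          LAPACKE_dgetrs(LAPACK_ROW_MAJOR, 'T', n, 1, lu, n, ipiv, z, 1);
          int j = 0;                              /* z = inv(A)^T sign(y)     */
          double ztx = 0.0;
          for (int i = 0; i < n; i++) {
              ztx += z[i] * x[i];
              if (fabs(z[i]) > fabs(z[j])) j = i;
          }
          if (fabs(z[j]) <= ztx) break;           /* estimate cannot improve  */
          for (int i = 0; i < n; i++) x[i] = 0.0; /* restart from x = e_j     */
          x[j] = 1.0;
      }
      free(x); free(y); free(z);
      return est;
  }

  int main(void)
  {
      int n = 3;
      double a[] = {4.0, 1.0, 0.0,
                    1.0, 4.0, 1.0,
                    0.0, 1.0, 4.0};
      double anorm = 0.0;                         /* ||A||_1: max column sum  */
      for (int j = 0; j < n; j++) {
          double s = 0.0;
          for (int i = 0; i < n; i++) s += fabs(a[i * n + j]);
          if (s > anorm) anorm = s;
      }
      lapack_int ipiv[3];
      if (LAPACKE_dgetrf(LAPACK_ROW_MAJOR, n, n, a, n, ipiv) != 0) return 1;
      printf("estimated kappa_1 = %g\n", anorm * hager_inv_norm1(n, a, ipiv));
      /* For this matrix the estimate agrees with the exact value, 18/7. */
      return 0;
  }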

Higham (1987) compares Hager’s condition number estimator with that of Cline et al. (1979) and finds that the Hager LAPACK estimator is generally more useful. Higham (1990) gives a survey and comparison of the various ways of estimating and computing condition numbers. You are asked to study the performance of the LAPACK estimate using Monte Carlo methods in Exercise 11.5 on page 538.

Exercises

  11.1.

    Gram-Schmidt orthonormalization.

    a)

      Write a program module (in Fortran, C, R, Octave or Matlab, or whatever language you choose) to implement Gram-Schmidt orthonormalization using Algorithm 2.1. Your program should be for an arbitrary order and for an arbitrary set of linearly independent vectors.

    b)

      Write a program module to implement Gram-Schmidt orthonormalization using equations (2.56) and (2.57).

    c)

      Experiment with your programs. Do they usually give the same results? Try them on a linearly independent set of vectors all of which point “almost” in the same direction. Do you see any difference in the accuracy? Think of some systematic way of forming a set of vectors that point in almost the same direction. One way of doing this would be, for a given x, to form \(x +\epsilon e_{i}\) for i = 1, …, n − 1, where \(e_{i}\) is the ith unit vector and ε is a small positive number. The difference can even be seen in hand computations for n = 3. Take \(x_{1} = (1,10^{-6},10^{-6})\), \(x_{2} = (1,10^{-6},0)\), and \(x_{3} = (1,0,10^{-6})\).

  11.2.

    Given the n × k matrix A and the k-vector b (where n and k are large), consider the problem of evaluating c = Ab. As we have mentioned, there are two obvious ways of doing this: (1) compute each element of c, one at a time, as an inner product \(c_{i} = a_{i}^{\mathrm{T}}b =\sum _{j}a_{ij}b_{j}\), or (2) update the computation of all of the elements of c in the inner loop.

    a)

      What is the order of computation of the two algorithms?

    b)

      Why would the relative efficiencies of these two algorithms be different for different programming languages, such as Fortran and C?

    c)

      Suppose there are p processors available and the fan-in algorithm on page 530 is used to evaluate Ax as a set of inner products. What is the order of time of the algorithm?

    d)

      Give a heuristic explanation of why the computation of the inner products by a fan-in algorithm is likely to have less roundoff error than computing the inner products by a standard serial algorithm. (This does not have anything to do with the parallelism.)

    e)

      Describe how the following approach could be parallelized. (This is the second general algorithm mentioned above.)

      $$\displaystyle{\begin{array}{l} \mathrm{for}\;i = 1,\ldots,n\\ \{ \\ \ \ c_{i} = 0 \\ \ \ \mathrm{for}\;j = 1,\ldots,k\\ \ \ \{ \\ \ \ c_{i} = c_{i} + a_{ij}b_{j}\\ \ \ \}\\ \}\\ \end{array} }$$
    f)

      What is the order of time of the algorithms you described?

  11.3.

    Consider the problem of evaluating C = AB, where A is n × m and B is m × q. Notice that this multiplication can be viewed as a set of matrix/vector multiplications, so either of the algorithms in Exercise 11.2d above would be applicable. There is, however, another way of performing this multiplication, in which all of the elements of C could be evaluated simultaneously.

    a)

      Write pseudocode for an algorithm in which the nq elements of C could be evaluated simultaneously. Do not be concerned with the parallelization in this part of the question.

    b)

      Now suppose there are nmq processors available. Describe how the matrix multiplication could be accomplished in O(m) steps (where a step may be a multiplication and an addition).

      Hint: Use a fan-in algorithm.

  11.4.

    Write a Fortran or C program to compute an estimate of the L1 LAPACK condition number γ using Algorithm 11.1 on page 536.

  11.5.

    Design and conduct a Monte Carlo study to assess the performance of the LAPACK estimator of the L1 condition number using your program from Exercise 11.4. Consider a few different sizes of matrices, say 5 × 5, 10 × 10, and 20 × 20, and consider a range of condition numbers, say 10, \(10^{4}\), and \(10^{8}\). In order to assess the accuracy of the condition number estimator, the random matrices in your study must have known condition numbers. It is easy to construct a diagonal matrix with a given condition number. The condition number of the diagonal matrix D, with nonzero elements \(d_{1},\ldots,d_{n}\), is \(\max \vert d_{i}\vert /\min \vert d_{i}\vert\). It is not so clear how to construct a general (square) matrix with a given condition number. The L2 condition number of the matrix UDV, where U and V are orthogonal matrices, is the same as the L2 condition number of D. We can therefore construct a wide range of matrices with given L2 condition numbers. In your Monte Carlo study, use matrices with known L2 condition numbers. The next question is what kind of random matrices to generate. Again, make a choice of convenience. Generate random diagonal matrices D, subject to fixed \(\kappa (D) =\max \vert d_{i}\vert /\min \vert d_{i}\vert\). Then generate random orthogonal matrices as described in Exercise 4.10 on page 223. Any conclusions made on the basis of a Monte Carlo study, of course, must be restricted to the domain of the sampling of the study. (See Stewart, 1980, for a Monte Carlo study of the performance of the LINPACK condition number estimator.)