Let A = (a_ij) ∈ ℂ^{m×n}. Assume B := A*A has eigenvalues μ_1, μ_2, …, μ_n.
Proof.
Let x = (x_1, …, x_n)^T ∈ ℂ^n be a non-zero vector.
(a) We have

    ||Ax||_∞ = max_i |∑_j a_ij x_j| ≤ (max_i ∑_j |a_ij|) ||x||_∞.

The inequality becomes an equality if we choose x_j = |a_{i0,j}| / a_{i0,j}, where i0 is the index that achieves the maximum in max_i ∑_j |a_ij|. This proves

    N_∞(A) = max_i ∑_j |a_ij|.
(b) Again, we have

    ||Ax||_1 = ∑_i |∑_j a_ij x_j| ≤ ∑_i ∑_j |a_ij| |x_j|        (1)
             = ∑_j |x_j| ∑_i |a_ij| ≤ (max_j ∑_i |a_ij|) ||x||_1.  (2)

Suppose j0 is the index that maximizes the expression max_j ∑_i |a_ij|, and we choose x = e_{j0} (the unit vector with 1 in the j0-th position). Then inequality (1) becomes an equality because each inner sum reduces to a single term, while inequality (2) becomes an equality because of our choice of j0. This proves

    N_1(A) = max_j ∑_i |a_ij|.
(c)
For any μ and x ≠ 0, if Bx = μx then μ x*x = x*Bx = (x*A*)(Ax) = ||Ax||_2^2. Hence

    μ = ||Ax||_2^2 / ||x||_2^2 ≥ 0.

This shows that the eigenvalues μ_i of B are real and non-negative.
It is well-known that for any Hermitian matrix B (i.e., B* = B), there exists a unitary matrix T (i.e., T*T = I) such that B = T*DT where D is the diagonal matrix Diag(μ_1, …, μ_n).
Without loss of generality, assume μ_1 is the largest eigenvalue.
For any x with ||x||_2 = 1, we have

    ||Ax||_2^2 = x*A*Ax = x*T*DTx = y*Dy        (where y := Tx)
               = ∑_i μ_i |y_i|^2
               ≤ μ_1 ∑_i |y_i|^2
               = μ_1        (since ||y||_2 = ||x||_2 = 1).

The final inequality is an equality if y = e_1 (the unit vector with 1 in the first component). This proves

    N_2(A) = √(max_i μ_i).
Q.E.D.
We read (a) and (b) as saying:
N_∞(A) is the maximum of the 1-norms of the rows of A;
N_1(A) is the maximum of the 1-norms of the columns of A.
To remember whether it is rows or columns, use this device: the symbol "∞" has a dominant horizontal direction, suggesting rows; likewise, the symbol "1" suggests columns.
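As a quick illustration of (a) and (b), the two norms can be computed directly from these descriptions; the matrix below is hypothetical sample data, not from the text:

```python
# N_inf(A): maximum over rows of the 1-norm of the row.
# N_1(A):   maximum over columns of the 1-norm of the column.

def n_inf(A):
    """Maximum row sum of absolute entries."""
    return max(sum(abs(x) for x in row) for row in A)

def n_1(A):
    """Maximum column sum of absolute entries."""
    return max(sum(abs(row[j]) for row in A) for j in range(len(A[0])))

A = [[1, -2, 3],
     [-4, 5, -6]]

print(n_inf(A))  # row sums 6 and 15 -> 15
print(n_1(A))    # column sums 5, 7, 9 -> 9
```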
It is not easy to compute N_2(A). Hence, the following connection between N_2(A) and the Frobenius norm ||A||_F of A is useful.
Lemma 3

    √n · N_2(A) ≥ ||A||_F ≥ N_2(A).
Proof.
Note that

    ||A||_F^2 = ∑_i ∑_j |a_ij|^2 = Tr(A*A).

Writing B = A*A = T*DT as in the above proof, we see

    Tr(B) = Tr(T*DT) = Tr(D) = ∑_i μ_i.
Therefore
    n · max_i μ_i ≥ Tr(B) ≥ max_i μ_i.
The result follows by taking square-roots.
Q.E.D.
These bounds on N_2(A) can be computed in O(n^2) arithmetic operations.
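A numerical spot-check of Lemma 3 on a random matrix; NumPy availability is assumed here, and the matrix is arbitrary test data:

```python
import numpy as np

# Spot-check of Lemma 3: sqrt(n) * N2(A) >= ||A||_F >= N2(A).
# N2(A) is the largest singular value (expensive in general);
# the Frobenius norm needs only O(n^2) work, giving cheap two-sided bounds.

rng = np.random.default_rng(0)
n = 5
A = rng.standard_normal((n, n))

n2 = np.linalg.norm(A, 2)        # N2(A), the spectral norm
fro = np.linalg.norm(A, 'fro')   # ||A||_F, computable in O(n^2)

assert fro >= n2 - 1e-12
assert np.sqrt(n) * n2 >= fro - 1e-12
print(n2, fro)
```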
2 Elements of Error Analysis
Let u be the unit roundoff for the floating point number system under discussion (see the chapter on floating point numbers). In the floating point system F(b, t), u = b^{1-t}/2. Recall that the standard error model says that the approximate value x̃ of any floating point arithmetic operation is related to the exact value x by the relation

    x̃ = x(1 + δ),    |δ| ≤ u.
We develop some basic tools for the error analysis within this model.
The following is a slight extension of Higham [5, p. 69]. In the following, we assume that n is small enough so that 2nu < 1. For any n ≥ 1 and real ε, define

    γ_n(ε) := nε / (1 - nε).

When ε = u, we simply write γ_n := γ_n(u). Note that

    γ_n = nu / (1 - nu).
Lemma 4
(i) γ_n(ε) is a strictly increasing function for ε < 1/n.
(ii) If 1 > 2nu then |γ_n(ε)| ≤ γ_n iff ε ∈ [-u/(1-2nu), u].
(iii) Assume 1 > 2nu. For any real θ, we have |θ| ≤ γ_n iff θ = γ_n(ε) for some ε ∈ [-u/(1-2nu), u].
Proof.
(i) If ε < ε′ < 1/n, then we verify that 1 + γ_n(ε) = 1/(1-nε) < 1/(1-nε′) = 1 + γ_n(ε′). Note that ε, ε′ can be negative.
(ii) Since 1 > 2nu, we have -u/(1-2nu) < 0 < u. We check that γ_n(-u/(1-2nu)) = -γ_n(u) = -γ_n. By part (i), it follows that for all ε ∈ [-u/(1-2nu), u], we have -γ_n ≤ γ_n(ε) ≤ γ_n.
(iii) This is a consequence of (ii).
Q.E.D.
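The claims of Lemma 4 are easy to probe numerically; a small sketch for IEEE double precision (u = 2^-53 is assumed for the illustration):

```python
# gamma_n(eps) = n*eps/(1 - n*eps); gamma_n := gamma_n(u).
# Check monotonicity (Lemma 4(i)) and the interval endpoints (Lemma 4(ii)).

u = 2.0 ** -53

def gamma(n, eps):
    return n * eps / (1.0 - n * eps)

n = 10
lo = -u / (1 - 2 * n * u)       # left endpoint of the interval in (ii)
gn = gamma(n, u)

# monotone on sample points below 1/n = 0.1
samples = [-1e-3, 0.0, 1e-3, 1e-2]
values = [gamma(n, e) for e in samples]
assert values == sorted(values)

# gamma_n(-u/(1-2nu)) = -gamma_n and gamma_n(u) = gamma_n
assert abs(gamma(n, lo) + gn) < 1e-18
assert gamma(n, u) == gn
print(gn)
```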
Lemma 5
Let |δ_i| ≤ u, ρ_i = ±1 and 1 > 2nu. Then

    ∏_{i=1}^{n} (1 + δ_i)^{ρ_i} = 1 + γ_n(ε)

for some ε such that |γ_n(ε)| ≤ γ_n.
Proof.
Suppose n = 1. Then

    (1 + δ_1)^{ρ_1} = 1 + γ_1(ε)

where ε = δ_1/(1+δ_1) if ρ_1 = 1, and ε = -δ_1 if ρ_1 = -1 (since 1/(1+δ_1) = 1/(1-(-δ_1)) = 1 + γ_1(-δ_1)). In either case ε ∈ [-u/(1-2u), u]. Thus |γ_1(ε)| ≤ γ_1 by Lemma 4(ii), as required.
Let n ≥ 2. By induction,

    ∏_{i=1}^{n-1} (1 + δ_i)^{ρ_i} = 1 + γ_{n-1}(ε)

and

    (1 + δ_n)^{ρ_n} = 1 + γ_1(ε′)

where |γ_{n-1}(ε)| ≤ γ_{n-1} and |γ_1(ε′)| ≤ γ_1. It suffices to show that

    1 + γ_n(ε″) = (1 + γ_{n-1}(ε))(1 + γ_1(ε′))    (6)

for some ε″ satisfying |γ_n(ε″)| ≤ γ_n.
Alternatively, we must show that ε″ lies in the range [-u/(1-2nu), u], given that ε ∈ [-u/(1-2(n-1)u), u] and ε′ ∈ [-u/(1-2u), u]. It follows from the definition of γ_n that (6) amounts to showing

    ε″ = ((n-1)/n) · ε(1-ε′) + (1/n) · ε′.

Thus ε″ is a convex combination of ε(1-ε′) and ε′.
There are three cases to consider:
Case A.
If ε′ ≥ 0, then both ε(1-ε′) and ε′ lie in the range [-u/(1-2nu), u]. So their convex combination ε″ lies in the range [-u/(1-2nu), u].
Case B.
If ε′ < 0 and ε ≥ 0 then

    ε″ ≤ ((n-1)/n) · ε(1-ε′) ≤ ((n-1)/n) · u · (1 + u/(1-2u)) ≤ u

and

    ε″ ≥ (1/n) · ε′ ≥ -u/(n(1-2u)) ≥ -u/(1-2nu).
Case C.
Finally, assume ε′ < 0 and ε < 0. Then ε″ < 0 and we need to show that ε″ ≥ -u/(1-2nu). Since ε″ is a convex combination of ε(1-ε′) and ε′, it suffices to show that both ε(1-ε′) and ε′ are at least -u/(1-2nu). Clearly, ε′ ≥ -u/(1-2u) ≥ -u/(1-2nu). Also,

    ε(1-ε′) ≥ (-u/(1-2(n-1)u)) · (1 + u/(1-2u)) ≥ -u/(1-2nu)

where the last inequality may be verified directly.
Q.E.D.
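Lemma 5 can be verified in exact rational arithmetic (so no new rounding errors obscure the bound); a sketch with randomly drawn δ_i, assuming double-precision u = 2^-53:

```python
import random
from fractions import Fraction

# Lemma 5 check: prod_{i=1}^n (1+d_i)^{r_i} = 1 + theta with
# |theta| <= gamma_n, for |d_i| <= u, r_i = +/-1, and 2nu < 1.
# Fractions keep the product exact, so the assertion tests the lemma itself.

u = Fraction(1, 2**53)
n = 20
gamma_n = n * u / (1 - n * u)

random.seed(1)
for _ in range(100):
    prod = Fraction(1)
    for _ in range(n):
        # random d with |d| <= u, exactly representable as a rational
        d = Fraction(random.randint(-2**53, 2**53), 2**106)
        if random.random() < 0.5:
            prod *= 1 + d          # r_i = +1
        else:
            prod /= 1 + d          # r_i = -1
    assert abs(prod - 1) <= gamma_n
print(float(gamma_n))
```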
On Evaluation of Expressions.
We consider expressions E in the basic arithmetic operations, generalized to allow summation (∑_{i=1}^n) or multi-product (∏_{i=1}^n). E.g.,

    E = 3x_2 + x_3 - (x_1 x_2 x_3) + ∑_{i=1}^{5} x_i.
An expression gives a mathematical value, but we are also interested in its evaluation on a computer. In this case, the order of evaluating the expression is important. This order is unique for a fully parenthesized expression. But in general, for any E, we define the natural evaluation of E to be the direct left-to-right evaluation of the expression, respecting any parentheses when present. Each summation or multi-product notation is linearized in the obvious way, by increasing indices, and evaluated left-to-right. E.g., ∑_{i=1}^n x_i is simply a shorthand for x_1 + x_2 + … + x_n, whose natural evaluation order is

    ((…((x_1 + x_2) + x_3) + … + x_{n-1}) + x_n).
|
Suppose E′ is any equivalent expression, i.e., obtained from E by applying any valid laws of arithmetic (associativity, commutativity, distributivity, etc.). Because of finite precision arithmetic, the evaluation of E′ and E need not produce the same result. The simplest example uses the law of associativity:

    E = (x+y)+z,    E′ = x+(y+z).    (7)
Another example is the following, using the law of distributivity:

    E = (x+y)(v+w),    E′ = xv + xw + yv + yw.    (8)
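In IEEE double precision even a single re-association changes the result; a standard concrete instance:

```python
# Two natural evaluations of mathematically equal sums that differ
# in IEEE double precision arithmetic.

x, y, z = 0.1, 0.2, 0.3
e1 = (x + y) + z   # left-to-right
e2 = x + (y + z)   # re-associated

print(e1, e2)      # 0.6000000000000001 vs 0.6
assert e1 != e2
```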
In numerical analysis, we tend not to exploit distributivity because it can render E′ very different from E. Comparing the errors in the natural evaluations of E and E′ can be complicated. One hint of this is the fact that the number of operations in E and E′ can be exponentially different:
    E = ∏_i (1 + x_i),    E′ = 1 + ∑_i x_i + ∑_{i<j} x_i x_j + … + ∏_i x_i.
Therefore, we focus on one class of mathematically equivalent expressions, called re-ordering equivalent. Two expressions are re-ordering equivalent if they can be obtained from each other by applying the laws of associativity and commutativity only. If E is re-ordering equivalent to E′, then their natural evaluations incur the same number of arithmetic operations. An approximate value Ẽ for E is called a natural value (resp. re-ordering value) if it is obtained by the natural evaluation (resp. a re-ordering evaluation) of E.
Lemma 6
Let E = ∑_{i=1}^n x_i y_i.
(i) The natural value Ẽ of E satisfies Ẽ = ∑_{i=1}^n x_i y_i (1+θ_i) where |θ_1| ≤ γ_n and for i = 2, …, n, |θ_i| ≤ γ_{n-i+2}.
(ii) If E′ is any re-ordering equivalent expression, its natural value Ẽ′ satisfies Ẽ′ = ∑_{i=1}^n x_i y_i (1+θ_i) where each |θ_i| ≤ γ_n.
Proof.
Let E_j = ∑_{i=1}^j x_i y_i for j = 1, …, n.
(i) If n = 1, Ẽ = x_1 y_1 (1+θ_1), |θ_1| ≤ γ_1. This has the required form. For n ≥ 2, we inductively have

    Ẽ_{n-1} = ∑_{i=1}^{n-1} x_i y_i (1 + θ′_i)

where |θ′_1| ≤ γ_{n-1} and for i = 2, …, n-1, |θ′_i| ≤ γ_{n-i+1}. From

    Ẽ = (Ẽ_{n-1} + x_n y_n (1+θ′_n))(1+δ)
      = ∑_{i=1}^{n} x_i y_i (1 + θ′_i)(1+δ),

where |θ′_n| ≤ γ_1 accounts for the multiplication x_n y_n and |δ| ≤ u for the final addition, we obtain θ_i with 1+θ_i = (1+θ′_i)(1+δ), where |θ_i| satisfies the lemma.
(ii) This is immediate from part (i).
Q.E.D.
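A forward-error consequence of Lemma 6 is |Ẽ - E| ≤ γ_n ∑ |x_i y_i|, which is easy to check against exact rational arithmetic; the data below is random and hypothetical:

```python
import random
from fractions import Fraction

# Check the forward bound implied by Lemma 6:
#   |E~ - E| <= gamma_n * sum |x_i y_i|,
# comparing the natural float evaluation with an exact rational sum.

u = 2.0 ** -53
random.seed(2)
n = 50
x = [random.uniform(-1, 1) for _ in range(n)]
y = [random.uniform(-1, 1) for _ in range(n)]

e_float = 0.0                      # natural left-to-right evaluation
for xi, yi in zip(x, y):
    e_float += xi * yi

e_exact = sum(Fraction(xi) * Fraction(yi) for xi, yi in zip(x, y))
gamma_n = n * u / (1 - n * u)
bound = gamma_n * sum(abs(xi * yi) for xi, yi in zip(x, y))

err = abs(float(Fraction(e_float) - e_exact))
print(err, bound)
assert err <= bound
```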
Lemma 7
Let E = (z - ∑_{i=1}^{n-1} x_i y_i)/y_n. If E′ is any re-ordering equivalent expression, its natural value Ẽ′ satisfies

    Ẽ′ (1+θ_n) = (z - ∑_{i=1}^{n-1} x_i y_i (1+θ_i)) / y_n

where each |θ_i| ≤ γ_n.
Proof.
Similar to the proof of Lemma 6, with the final division contributing the factor (1+θ_n); we leave the details to the reader.
Q.E.D.
3 Conditioning
Let f: X® Y be a continuous function on the normed
spaces X, Y. The condition number kf(x)
of an input instance x Î X for problem f
measures the sensitivity of f(x) to perturbations in x.
In particular, ``ill or well conditioning'' refers to the
relative largeness or smallness of kf(x).
We can work with an absolute or a relative notion of
conditioning. We follow [8] in the
following definitions.
There is an absolute and a relative concept of conditioning.
The relative notion is more important, but since the
absolute notion is simpler, we also will present it.
The absolute conditioning function of f is defined by

    κ_f^A(x) := lim_{δ→0} sup_{||δx|| ≤ δ}  ||δf|| / ||δx||.    (9)

Here, we write δf for f(x+δx) - f(x).
If f is differentiable, write J(x) = J_f(x) for the Jacobian of f at x. The (i,j)-th entry of J(x) is ∂f_i/∂x_j (x). Then the quantities δx and δf become (respectively) the infinitesimals dx and J(x)dx. Thus

    κ_f^A(x) = ||J_f(x)||    (10)

where ||J_f(x)|| is the derived norm for the norms on X and Y.
The relative conditioning function of f is defined as

    κ_f^R(x) := lim_{δ→0} sup_{||δx|| ≤ δ}  ( ||δf|| / ||f(x)|| ) / ( ||δx|| / ||x|| ).    (11)

Again when f is differentiable, we get

    κ_f^R(x) = ||J_f(x)|| · ||x|| / ||f(x)||.    (12)

In the following, we simply write κ_f(x) to refer to the relative conditioning function.
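For a differentiable scalar function, (12) reduces to κ_f(x) = |f′(x)| |x| / |f(x)|; a minimal sketch for f(x) = √x, whose relative condition number is 1/2 for every x > 0:

```python
import math

# Relative condition number via (12) for scalar differentiable f:
#   kappa_f(x) = |f'(x)| * |x| / |f(x)|.
# For f(x) = sqrt(x): f'(x) = 1/(2 sqrt(x)), so kappa = 1/2 for all x > 0.

def rel_cond(f, df, x):
    return abs(df(x)) * abs(x) / abs(f(x))

k = rel_cond(math.sqrt, lambda t: 0.5 / math.sqrt(t), 7.0)
print(k)
assert abs(k - 0.5) < 1e-12
```

The square root is thus very well conditioned, even though evaluating it near 0 looks delicate at first glance.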
Condition Number of a Matrix.
Let f: ℂ^n → ℂ^m where f(x) = Ax and A is an m×n matrix. Applying the definitions directly, we obtain

    κ_A(x) = lim_{δ→0} sup_{||δx|| ≤ δ} ( ||δf|| / ||f(x)|| ) / ( ||δx|| / ||x|| )
           = lim_{δ→0} sup_{||δx|| ≤ δ} ( ||A(x+δx) - Ax|| / ||Ax|| ) / ( ||δx|| / ||x|| )
           = lim_{δ→0} sup_{||δx|| ≤ δ} ( ||A δx|| · ||x|| ) / ( ||Ax|| · ||δx|| )
           = ||A|| · ||x|| / ||Ax||.
If A is square and A^{-1} exists, then we have ||x|| = ||A^{-1}Ax|| ≤ ||A^{-1}|| · ||Ax||. Plugging this bound for ||x|| into the above condition number, we obtain

    κ_A(x) ≤ ||A|| · ||A^{-1}||.
|
The nice thing about the right-hand side is that it is independent of x. This quantity is so useful that it deserves its own name and notation:

    κ(A) := ||A|| · ||A^{-1}||

is the condition number of the square matrix A.
This number κ(A) is also relevant when we want to solve the linear equation Ax = b where b is given and we need to compute x. This is just the original problem with A^{-1} in place of A, but because of the way we define the condition number of a matrix (note κ(A^{-1}) = κ(A)), κ(A) can serve to bound the conditioning of this problem as well. For the componentwise analysis below, we write |A| for the matrix [|a_ij|] of absolute entries.
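As a numerical illustration (NumPy assumed), the Hilbert matrix is a classic ill-conditioned example; already at n = 4 its 2-norm condition number is about 1.5e4:

```python
import numpy as np

# kappa(A) = ||A|| * ||A^-1||, here in the 2-norm, for the 4x4 Hilbert
# matrix H with H[i][j] = 1/(i+j+1).

n = 4
H = np.array([[1.0 / (i + j + 1) for j in range(n)] for i in range(n)])
kappa = np.linalg.norm(H, 2) * np.linalg.norm(np.linalg.inv(H), 2)
print(kappa)

assert kappa > 1e4                     # ill-conditioned even at n = 4
# agrees with NumPy's built-in condition number
assert abs(kappa - np.linalg.cond(H, 2)) / kappa < 1e-6
```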
Distance to a Singularity.
The numerical stability of a numerical problem is influenced by its distance to the nearest singularity. For linear systems, this distance can be defined as follows: for a nonsingular matrix A and any matrix norm N, the closest distance from A to a singular matrix is defined as

    d(A) := inf_S N(A - S)

where S ranges over all singular matrices. We next characterize this distance. This result, attributed to Turing, was known to Banach in the 1920s. Its generalization to arbitrary norms is from Gastinel (1966).
First, we note a simple fact:
Lemma 8
For any square matrix A and any derived norm N, if N(A) < 1 then I + A is invertible.
Proof.
This is equivalent to showing that (I+A)x = 0 has the unique solution x = 0. If (I+A)x = 0 then x = -Ax and hence N(x) = N(Ax) ≤ N(A)N(x). We claim that N(x) = 0, since otherwise N(A) < 1 implies N(x) ≤ N(A)N(x) < N(x), a contradiction. Thus x = 0.
Q.E.D.
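A quick sanity check of Lemma 8 in the 2-norm (NumPy assumed; the stronger quantitative fact σ_min(I+A) ≥ 1 - ||A||_2 follows from Weyl's inequality for singular values):

```python
import numpy as np

# Lemma 8 spot-check: if ||A||_2 < 1 then I + A is invertible; indeed the
# smallest singular value of I + A is at least 1 - ||A||_2 > 0.

rng = np.random.default_rng(5)
A = rng.standard_normal((5, 5))
A *= 0.9 / np.linalg.norm(A, 2)      # rescale so ||A||_2 = 0.9 < 1

smin = np.linalg.svd(np.eye(5) + A, compute_uv=False)[-1]
print(smin)
assert smin >= 1 - 0.9 - 1e-9        # hence I + A is invertible
```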
Theorem 9 [Turing-Gastinel]
For non-singular A and derived norm N,

    d(A) = inf_S { N(A - S) } = 1 / N(A^{-1})

where S ranges over all singular matrices.
Proof.
We first prove that d(A) ≥ 1/N(A^{-1}). Let S be any matrix such that N(S-A) < 1/N(A^{-1}). It suffices to show that S is non-singular. Writing D for S-A, we have N(A^{-1}D) ≤ N(A^{-1})N(D) < 1. Thus (I + A^{-1}D) is invertible and we see that

    S = A + D = A(I + A^{-1}D).

As a product of two non-singular matrices, S is non-singular.
To show that d(A) ≤ 1/N(A^{-1}), we construct a singular matrix S such that N(S-A) = 1/N(A^{-1}). We pick two vectors x, y such that

    y* A^{-1} x = N(A^{-1}),    N(xy*) = 1.
We check that

    (A - xy*/N(A^{-1})) A^{-1}x = x - x (y* A^{-1} x)/N(A^{-1}) = 0.

This proves that S = A - xy*/N(A^{-1}) is singular. Moreover, N(A-S) = N(xy*)/N(A^{-1}) = 1/N(A^{-1}).
Q.E.D.
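In the 2-norm this theorem is concrete: d(A) = 1/||A^{-1}||_2 is the smallest singular value of A, and zeroing the smallest singular piece of the SVD yields a singular matrix at exactly that distance. A sketch, assuming NumPy:

```python
import numpy as np

# Turing-Gastinel in the 2-norm: d(A) = 1/||A^-1||_2 = smallest singular
# value of A; subtracting the smallest rank-one SVD piece makes A singular.

rng = np.random.default_rng(3)
A = rng.standard_normal((6, 6))

U, s, Vt = np.linalg.svd(A)
d1 = s[-1]                                      # smallest singular value
d2 = 1.0 / np.linalg.norm(np.linalg.inv(A), 2)  # 1 / ||A^-1||_2
assert abs(d1 - d2) <= 1e-8 * d1 + 1e-14

S = A - s[-1] * np.outer(U[:, -1], Vt[-1])      # rank-one perturbation
# S is (numerically) singular, at distance d(A) from A
assert np.linalg.svd(S, compute_uv=False)[-1] <= 1e-12 * s[0]
assert abs(np.linalg.norm(A - S, 2) - d1) <= 1e-8 * d1 + 1e-14
```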
Component-wise Distance.
For a non-singular square matrix A, define (see [3])

    d_T(A) := inf_S { N(S-A) / N(A) }    (13)

where S ranges over all singular matrices. Thus d_T(A) is the relative distance from A to the nearest singular matrix S. The subscript refers to Turing. Rewrite the definition of d_T(A) in (13) as

    d_T(A) = inf_S { α : N(S-A) = α N(A) }

where S ranges over all singular matrices.
We can further rewrite it as

    d_T(A) = inf_{S,α} { α : N(S-A) ≤ α N(A) }.

The componentwise analogue of this definition is therefore

    d_C(A) := inf_{S,α} { α : |S-A| ≤ α |A| }

where again S ranges over all singular matrices. A characterization of d_C(A) is given by Rohn (1990) (see [3, p. 66]).
4 Convex Hulls and Partial Compilation of Expressions
Sometimes, some of the arguments of an expression are fixed while others vary. We can exploit this by constructing "partially compiled expressions". To see an example of this phenomenon, we look at the Beneath-Beyond Algorithm for convex hulls. See [Page 147ff in Edelsbrunner]. Let S be a set of points in d dimensions. For simplicity, assume the points are in general position. We sort the points by their first coordinate, and add them in this order. The method (called Beneath-Beyond) begins with a simplex on the first d+1 points. At each step, we add a new point p to the current hull H:
Prove the 5 properties stated for the norm N′ derived from N_m, N_n.
Let A be a square matrix and N a matrix norm.
(i) Show that N(A) ≥ |μ| for any eigenvalue μ of A.
(ii) Let us show that the above inequality cannot be improved to an equality. Let

    A = ( 1  1 )
        ( 0  1 ).

Then the eigenvalues of A satisfy |μ_1| = |μ_2| = 1. Prove that for any norm N, we have N(A) > 1. Hint: Write A = I + E where I is the identity matrix. What is A^k for any k ∈ ℕ? Show that if N(A) ≤ 1 then N(E) = 0, a contradiction.
(iii) (Fritz John)
Show that for all square matrices A and all ε > 0, there exists a norm N such that N(A) ≤ ε + max_μ |μ| where μ ranges over the eigenvalues of A.
(a) Give specific numerical values for the expressions E and E′ in (7) so that their natural evaluations result in different answers. Assume IEEE arithmetic.
(b) Do the same for (8).
Improve Lemma 5 in the case ρ_i = 1 for all i.
(Trefethen)
Determine the absolute and relative conditioning functions of the following functions.
(i) f(x) = x/2 for x ∈ ℂ.
(ii) f(x) = √x for x > 0.
(iii) f(x,y) = x - y for (x,y) ∈ ℂ^2. Use the ∞-norm on ℂ^2.
5 OLD
REFERENCE:
Certified Computation of the Sign of a Matrix Determinant, Dario A. Bini, Victor Y. Pan and Yanqiang Yu. 1998, abstract.
An approach to computing the sign of a determinant D of an n×n matrix M is to numerically compute an approximate value A for D, estimate the roundoff error E, and certify the computed sign of A in case |A| > E. Unfortunately, |D| varies from 0 to ||M||^n, and the known techniques for error analysis give error bounds that are large even when |D| is small. A predecessor to this paper by Pan, Yu and Stewart uses this approach. The present paper avoids estimating the roundoff error E, and instead solves the simpler task of estimating the minimum distance 1/||M^{-1}|| from M to a singular matrix. The method uses O(n^3) single precision flops. Based on this estimate, in case the sign is not certified, a decision can be made to increase the precision of the numerical computation or to fall back on guaranteed symbolic methods. Thus, this algorithm can be used as a kind of filter.
Other work:
Clarkson proposed a general algorithm based on Gram-Schmidt orthogonalization; the certification method is rather complicated to program. For n ≤ 4, Avnaim et al. proposed an efficient "single-precision" method. Modular methods have been proposed by Brönnimann et al.
6 Roundoff Errors in Gaussian Elimination
Let A be an n×n matrix, with A_k its k-th column and a_{i,j} its (i,j)-th entry. The following matrix operator norms are used: the 2-norm ||A||_2, the row norm ||A||_∞ = max_i ∑_j |a_{i,j}|, and the column norm ||A||_1 = max_j ∑_i |a_{i,j}|. Also, let |A| denote the matrix with (i,j)-th entry |a_{i,j}|. A triangular matrix is unit (resp. proper) if its diagonal entries are all 1 (resp. 0).
Theorem 10
(Higham, Theorem 9.3, p. 175): Let

    A = P A′ Q,    Ã = A′ + E = L̃ Ũ

where the matrices are all n×n, P and Q are permutation matrices, L̃ is unit lower triangular and Ũ is upper triangular. If L̃ and Ũ are computed numerically from A by Gaussian elimination with complete pivoting, then

    |E| ≤ γ_n |L̃| · |Ũ|

where γ_n = nu/(1-nu) and u is the unit roundoff.
Since det(A) = det(P) det(A′) det(Q), and det(P) = ±1 and det(Q) = ±1 are known, it is enough to determine the sign of det(A′).
Corollary 11
For any operator norm,

    ||E|| ≤ e := || |L̃| · |Ũ| || · γ_n.

Computing the column and row norms uses O(n^2) arithmetic operations.
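The elementwise bound of Theorem 10 can be observed directly; below is a minimal sketch of Gaussian elimination with complete pivoting (an illustration, not an efficient implementation; NumPy is assumed):

```python
import numpy as np

# Minimal GE with complete pivoting, checking the elementwise bound
#   |L~ U~ - P A Q| <= gamma_n |L~| |U~|   (Theorem 10).

def lu_complete(A):
    n = A.shape[0]
    A = A.copy()
    P = np.arange(n)
    Q = np.arange(n)
    for k in range(n - 1):
        # pick the largest remaining entry of the trailing block as pivot
        i, j = divmod(int(np.argmax(np.abs(A[k:, k:]))), n - k)
        i += k; j += k
        A[[k, i], :] = A[[i, k], :]; P[[k, i]] = P[[i, k]]
        A[:, [k, j]] = A[:, [j, k]]; Q[[k, j]] = Q[[j, k]]
        A[k+1:, k] /= A[k, k]                       # multipliers
        A[k+1:, k+1:] -= np.outer(A[k+1:, k], A[k, k+1:])
    return P, Q, np.tril(A, -1) + np.eye(n), np.triu(A)

rng = np.random.default_rng(4)
n = 8
A = rng.standard_normal((n, n))
P, Q, L, U = lu_complete(A)
Ap = A[np.ix_(P, Q)]                 # the permuted matrix P A Q

u_round = 2.0 ** -53
gamma_n = n * u_round / (1 - n * u_round)
E = L @ U - Ap
ok = np.all(np.abs(E) <= gamma_n * (np.abs(L) @ np.abs(U)) + 1e-300)
print(ok)
assert ok
```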
References
[1] F. Avnaim, J.-D. Boissonnat, O. Devillers, F. Preparata, and M. Yvinec. Evaluating signs of determinants using single-precision arithmetic. Algorithmica, 17(2):111-132, 1997.
[2] H. Brönnimann and M. Yvinec. Efficient exact evaluation of signs of determinants. Algorithmica, 27:21-56, 2000.
[3] F. Chaitin-Chatelin and V. Frayssé. Lectures on Finite Precision Computations. Society for Industrial and Applied Mathematics, Philadelphia, 1996.
[4] K. L. Clarkson. Safe and effective determinant evaluation. IEEE Foundations of Computer Science, 33:387-395, 1992.
[5] N. J. Higham. Accuracy and Stability of Numerical Algorithms. Society for Industrial and Applied Mathematics, Philadelphia, 1996.
[6] F. John. Lectures on Advanced Numerical Analysis. Gordon and Breach, New York, 1967.
[7] V. Y. Pan and Y. Yu. Certified computation of the sign of a matrix determinant. Proc. 10th ACM-SIAM Symp. on Discrete Algorithms (SODA99), pages 715-724, 1999. Accepted for Algorithmica.
[8] L. N. Trefethen and D. Bau, III. Numerical Linear Algebra. Society for Industrial and Applied Mathematics, Philadelphia, 1997.
Footnotes:
1. We rely on the fact that these properties fit familiar "mathematical patterns" such as the triangle inequality.
File translated from
TEX
by
TTH,
version 2.78. On 26 Apr 2001, 13:46.