The Resultant, Episode 4
This episode has a single purpose: to show that the two formulas for the resultant are equivalent. The next episode, the finale, will tie up some loose ends.
The formulas:
$$\det(S) \;=\; a_m^n\, b_n^m \prod_{i=1}^{m} \prod_{j=1}^{n} (u_i - v_j) \tag{1}$$

$$\det(S) \;=\; a_m^n \prod_{i=1}^{m} F(u_i) \;=\; (-1)^{mn}\, b_n^m \prod_{j=1}^{n} E(v_j) \tag{2}$$

where

$$E(x) = a_m x^m + \cdots + a_0 = a_m (x - u_1) \cdots (x - u_m) \tag{3a}$$

$$F(x) = b_n x^n + \cdots + b_0 = b_n (x - v_1) \cdots (x - v_n) \tag{3b}$$
Eq. (2) follows immediately from (1) once we expand either $F(u_i)$ or $E(v_j)$ using the right hand sides of (3). We assume that $m, n > 0$.
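(A quick way to keep ourselves honest: sympy can check these identities symbolically for small degrees. Here's a minimal sketch for $m = n = 2$, assuming the usual Sylvester layout with $n$ rows of $E$'s coefficients followed by $m$ rows of $F$'s; the names mirror the text, and nothing here is load-bearing for the proof.)

```python
# Sanity check of eqs. (1) and (2) for m = n = 2, with symbolic roots
# and leading coefficients.  Assumes the usual Sylvester layout.
from sympy import symbols, expand, Matrix

x, a2, b2 = symbols('x a2 b2')
u1, u2, v1, v2 = symbols('u1 u2 v1 v2')
m = n = 2

E = expand(a2*(x - u1)*(x - u2))            # eq. (3a)
F = expand(b2*(x - v1)*(x - v2))            # eq. (3b)
a = [E.coeff(x, i) for i in range(m + 1)]   # a_0, a_1, a_2
b = [F.coeff(x, j) for j in range(n + 1)]   # b_0, b_1, b_2

S = Matrix([[a[2], a[1], a[0], 0],          # n rows of E's coefficients,
            [0,    a[2], a[1], a[0]],
            [b[2], b[1], b[0], 0],          # then m rows of F's
            [0,    b[2], b[1], b[0]]])
detS = S.det()

rhs1 = a2**n * b2**m * (u1-v1)*(u1-v2)*(u2-v1)*(u2-v2)   # eq. (1)
rhs2 = a2**n * F.subs(x, u1) * F.subs(x, u2)             # eq. (2)
print(expand(detS - rhs1) == 0)   # True
print(expand(detS - rhs2) == 0)   # True
```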
We learned in Episodes 2 and 3 that the equation
$$P E + Q F = \det(S) \tag{4}$$
always has a solution with $\deg P < n$, $\deg Q < m$, $P$ and $Q$ nonzero, coefficients in $R$. (Succinctly: $P \in R_n[x]$, $Q \in R_m[x]$. We had one proof in the singular case, another for nonsingular $S$.) Eq. (4) provides a crucial ingredient.
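To make (4) concrete, here's a small sympy experiment that finds such a $P$ and $Q$ by undetermined coefficients. The sample $E$ and $F$ are my own, not from the earlier episodes:

```python
# Solving P*E + Q*F = det(S) for P, Q by undetermined coefficients,
# with deg P < n and deg Q < m.  E and F are arbitrary samples.
from sympy import symbols, Matrix, Poly, expand, solve

x = symbols('x')
E = x**2 + 3*x + 1                 # m = 2
F = x**3 - 2*x + 5                 # n = 3

S = Matrix([[1, 3,  1,  0, 0],     # n = 3 rows of E's coefficients
            [0, 1,  3,  1, 0],
            [0, 0,  1,  3, 1],
            [1, 0, -2,  5, 0],     # m = 2 rows of F's
            [0, 1,  0, -2, 5]])
d = S.det()

p0, p1, p2, q0, q1 = symbols('p0 p1 p2 q0 q1')
P = p2*x**2 + p1*x + p0            # deg P < n
Q = q1*x + q0                      # deg Q < m

# Match coefficients of P*E + Q*F - det(S), a degree-4 polynomial in x.
sol = solve(Poly(expand(P*E + Q*F) - d, x).all_coeffs(),
            [p0, p1, p2, q0, q1])
print(d, sol)
print(expand((P*E + Q*F).subs(sol) - d) == 0)   # True: eq. (4) holds
```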
Here are the bones of the proof of eq. (1); flesh on the bones to follow. For some pair $u_i$, $v_j$, set $u_i = v_j$. If $x = u_i = v_j$, then $E = F = 0$ by the right hand sides of (3). Then (4) tells us that $\det(S) = 0$. From the factor theorem we conclude that $(u_i - v_j)$ is a factor of $\det(S)$. Since $u_i$ and $v_j$ were arbitrarily chosen, every factor $(u_i - v_j)$ of the double product divides $\det(S)$, and so does the product itself. Comparing degrees, we can show that $\det(S)$ equals the product.
Now let's dot some i's and cross some t's (switching to a less visceral metaphor). We can cast the argument in a concrete 19th century style, or take a more modern structural approach. We'll do both together. Start with a special case, where the leading coefficients $a_m$ and $b_n$ are both 1. First piece of business: treat the $u_i$'s and $v_j$'s as formal symbols. Expanding out the right hand sides of eq. (3), we get expressions for the $a_i$'s and $b_j$'s as polynomials in them, the so-called elementary symmetric polynomials (up to sign):
$$a_i \;=\; (-1)^{m-i} \sum_{1 \le j_1 < \cdots < j_i \le m} u_1 \cdots \widehat{u_{j_1}} \cdots \widehat{u_{j_i}} \cdots u_m \tag{5a}$$
where the hat means "omit this factor", and likewise for the $b_j$'s (eq. (5b), which I won't bother to write out). Plugging these $a_i$'s and $b_j$'s into $\det(S)$, we obtain a big, elaborate polynomial in the $u_i$'s and $v_j$'s with integer coefficients.
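Here's (5a) checked mechanically in sympy for $m = 3$, with the subset sums spelled out via itertools:

```python
# Checking eq. (5a) for m = 3: the coefficient a_i of the monic product
# is (-1)^(m-i) times the sum over (m-i)-element subsets of the roots.
from itertools import combinations
from sympy import symbols, expand, Mul

x = symbols('x')
u = symbols('u1:4')                       # (u1, u2, u3), so m = 3
m = len(u)

E = expand(Mul(*(x - ui for ui in u)))    # monic case of eq. (3a)

for i in range(m + 1):
    # products of all the u's with i of them omitted (the "hats")
    e = sum(Mul(*s) for s in combinations(u, m - i))
    assert expand(E.coeff(x, i) - (-1)**(m - i)*e) == 0
print("eq. (5a) checks out for m = 3")
```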
Structurally, we're working in the ring $R = k[u_1, \ldots, u_m, v_1, \ldots, v_n]$, where the $u_i$'s and $v_j$'s are variables. The $a_i$'s, the $b_j$'s, and $\det(S)$ are all elements of $R$.
What happens to this $\det(S)$ if, say, you replace $u_1$ everywhere with $v_1$? We have to rewrite eq. (5a), but (5b) doesn't change. Likewise for (3a) and (3b). Next, let's substitute $v_1$ for $x$. The right hand side of (3b) becomes identically 0, implying that $F(v_1)$, fully expanded, is identically 0. The modified right hand side of (3a) (with $v_1$ in place of $u_1$) also is identically 0, so $E(v_1)$ is identically 0. And finally, from (4) we conclude that if you replace $u_1$ with $v_1$ everywhere in the polynomial $\det(S)$, the result is identically 0.
Structurally, we pull out an old trick from our toolbox: we regard $R = k[u_1, \ldots, u_m, v_1, \ldots, v_n]$ as $k[u_2, \ldots, u_m, v_1, \ldots, v_n][u_1]$. Write $R_1$ for $k[u_2, \ldots, u_m, v_1, \ldots, v_n]$, so $R = R_1[u_1]$. Replacing $u_1$ with $v_1$ just means evaluating the polynomials of $R_1[u_1]$ at the element $v_1$ of $R_1$. Previously we've called this an evaluation homomorphism $R_1[u_1] \to R_1$. It extends canonically to a homomorphism from $R[x]$ to $R_1[x]$. Under this map, $E(x)$ goes to the polynomial $(x - v_1)(x - u_2) \cdots (x - u_m)$. We also have an evaluation homomorphism from $R_1[x]$ to $R_1$, where we set $x = v_1$. That sends both $E$ and $F$ to 0, so the image of $\det(S)$ is 0, by eq. (4). But $\det(S)$ has no $x$'s in it, so this is the same as applying the homomorphism $R_1[u_1] \to R_1$ to $\det(S)$. The upshot: the polynomial $\det(S)$ of $R = R_1[u_1]$ has the root $v_1$ in $R_1$.
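The whole chain of homomorphisms collapses to a single substitution in sympy. A concrete check, for $m = n = 2$ in the monic case:

```python
# Substituting v1 for u1 in det(S) (the evaluation R1[u1] -> R1)
# kills the determinant; m = n = 2, monic E and F.
from sympy import symbols, expand, Matrix

x, u1, u2, v1, v2 = symbols('x u1 u2 v1 v2')
E = expand((x - u1)*(x - u2))
F = expand((x - v1)*(x - v2))
a = [E.coeff(x, k) for k in range(3)]
b = [F.coeff(x, k) for k in range(3)]
S = Matrix([[a[2], a[1], a[0], 0],
            [0,    a[2], a[1], a[0]],
            [b[2], b[1], b[0], 0],
            [0,    b[2], b[1], b[0]]])
detS = expand(S.det())

print(expand(detS.subs(u1, v1)))   # 0: so u1 - v1 divides det(S)
```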
Now we can appeal to the factor theorem and conclude that $\det(S)$ is divisible by $(u_1 - v_1)$ in $R$. I should mention that the factor theorem still holds for polynomials over a ring, not just a field. The proof amounts to long division by a linear polynomial of the form $x - a$ (or $u_1 - v_1$ in our present circumstances); since the leading coefficient is 1, we never need inverses in the ring. This explicit long division serves as the concrete argument.
Because $u_1$ and $v_1$ were arbitrary, the same rigamarole shows that $\det(S)$ is divisible by $(u_i - v_j)$ for any $i$ and $j$. Indeed, $\det(S)$ is divisible by the product of all these factors; you can show this by induction, but perhaps a slicker approach is to note that $R$ is a UFD, and that all the $(u_i - v_j)$'s are irreducible and no two are associates.
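Continuing the $m = n = 2$ toy case in sympy: remainders confirm that each linear factor divides, and factoring $\det(S)$ exhibits the whole product at once:

```python
# Each (u_i - v_j) divides det(S); same m = n = 2 monic setup as above.
from sympy import symbols, expand, factor, rem, Matrix

x, u1, u2, v1, v2 = symbols('x u1 u2 v1 v2')
E = expand((x - u1)*(x - u2))
F = expand((x - v1)*(x - v2))
a = [E.coeff(x, k) for k in range(3)]
b = [F.coeff(x, k) for k in range(3)]
S = Matrix([[a[2], a[1], a[0], 0],
            [0,    a[2], a[1], a[0]],
            [b[2], b[1], b[0], 0],
            [0,    b[2], b[1], b[0]]])
detS = expand(S.det())

for lin, gen in [(u1 - v1, u1), (u1 - v2, u1),
                 (u2 - v1, u2), (u2 - v2, u2)]:
    assert rem(detS, lin, gen) == 0   # zero remainder: lin divides det(S)
print(factor(detS))   # the product of the four factors (u_i - v_j)
```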
OK, now we have $\det(S) = h \prod_{i=1}^{m} \prod_{j=1}^{n} (u_i - v_j)$, with $h \in R$. Next we compare degrees. The product consists of $mn$ factors, each of degree 1 in the variables (the $u_i$'s and $v_j$'s), so when expanded, it's homogeneous of degree $mn$. If we can show the same for $\det(S)$, then it will follow that $h$ is a constant.
As it happens, Kendig provides the argument we need on p. 66. Let's say we replace each $u_i$ and $v_j$ with $t u_i$ and $t v_j$. If we can show that $\det(S)$ turns into $t^{mn} \det(S)$, it will follow that $\det(S)$ is homogeneous of degree $mn$. Looking at the elementary symmetric polynomials (5a), we see that $a_0$ is homogeneous of degree $m$, $a_1$ is homogeneous of degree $m-1$, and in general $a_i$ is homogeneous of degree $m-i$. Likewise $b_j$ is homogeneous of degree $n-j$. The Sylvester matrix (for $m=2$ and $n=3$, with leading coefficients $a_2 = b_3 = 1$) looks like:

$$S = \begin{pmatrix} 1 & a_1 & a_0 & 0 & 0 \\ 0 & 1 & a_1 & a_0 & 0 \\ 0 & 0 & 1 & a_1 & a_0 \\ 1 & b_2 & b_1 & b_0 & 0 \\ 0 & 1 & b_2 & b_1 & b_0 \end{pmatrix}$$
So after we replace each $u_i$ and $v_j$ with $t u_i$ and $t v_j$, the entries are multiplied elementwise by this matrix (dots mark the positions where $S$ has a zero entry):

$$\begin{pmatrix} 1 & t & t^2 & \cdot & \cdot \\ \cdot & 1 & t & t^2 & \cdot \\ \cdot & \cdot & 1 & t & t^2 \\ 1 & t & t^2 & t^3 & \cdot \\ \cdot & 1 & t & t^2 & t^3 \end{pmatrix}$$
It's hard to guess what happens to the determinant as a whole from this. We improve matters by (as Kendig puts it) packing the matrix with additional powers of $t$. Multiply each row by a power of $t$ (here rows 1 through 5 by $1, t, t^2, 1, t$) to make the columns uniform, thus:

$$\begin{pmatrix} 1 & t & t^2 & \cdot & \cdot \\ \cdot & t & t^2 & t^3 & \cdot \\ \cdot & \cdot & t^2 & t^3 & t^4 \\ 1 & t & t^2 & t^3 & \cdot \\ \cdot & t & t^2 & t^3 & t^4 \end{pmatrix}$$

Now column $c$ carries the uniform factor $t^{c-1}$.
Since multiplying a column by a factor multiplies the determinant by the same factor, the packed matrix multiplies $\det(S)$ by $t^{1+2+3+4} = t^{10}$, in our special case. We packed with additional factors $t^{1+2+1} = t^4$, again for this case. Net effect: multiplying the $u_i$'s and $v_j$'s by $t$ multiplies $\det(S)$ by $t^{10}/t^4 = t^6$, as desired. Kendig does the (straightforward) algebra for general $m$ and $n$. So the determinant is homogeneous of degree $mn$ in the $u_i$'s and $v_j$'s.
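The scaling claim is also easy to test symbolically; here it is for the $m = n = 2$ toy case, where $t^{mn} = t^4$:

```python
# Scaling every root by t scales det(S) by t^(mn); m = n = 2 monic case.
from sympy import symbols, expand, Matrix

x, t, u1, u2, v1, v2 = symbols('x t u1 u2 v1 v2')
E = expand((x - u1)*(x - u2))
F = expand((x - v1)*(x - v2))
a = [E.coeff(x, k) for k in range(3)]
b = [F.coeff(x, k) for k in range(3)]
S = Matrix([[a[2], a[1], a[0], 0],
            [0,    a[2], a[1], a[0]],
            [b[2], b[1], b[0], 0],
            [0,    b[2], b[1], b[0]]])
detS = expand(S.det())

scaled = detS.subs({u1: t*u1, u2: t*u2, v1: t*v1, v2: t*v2},
                   simultaneous=True)
print(expand(scaled - t**4*detS) == 0)   # True: homogeneous of degree mn
```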
Our goal is in sight. We've shown $h$ is constant; now to show it's 1. For this, we expand the product, picking one term to focus on; then we locate the matching term in the determinant.
Say we choose the $-v_j$ in each factor $(u_i - v_j)$ of the product; that gives us one term of the expansion. Each $v_j$ appears in $m$ factors, one for each $u_i$, so our result is $(-1)^{mn} v_1^m \cdots v_n^m = ((-1)^n v_1 \cdots v_n)^m = b_0^m$. In $\det(S)$, if we go down the main diagonal we get $b_0^m$. In any other term of the "sum of products" formula for the determinant, either some $a$ row contributes an entry other than its leading 1 — a zero (killing the term) or an $a_i$ with $i < m$, which drags in $u$'s — or every $a$ row contributes its 1, which pins those rows to the diagonal columns, and chasing the zeros then forces each $b$ row to contribute its $b_0$, reproducing the diagonal term. So the main diagonal is the only term of the determinant formula contributing a term that is all $v$'s. Matching the all-$v$ terms on both sides, the constant $h$ must be 1.
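One last sympy check, again for the $m = n = 2$ toy case: setting the $u$'s to 0 extracts the all-$v$ part of $\det(S)$, which matches the diagonal product, and multivariate division gives $h = 1$ exactly:

```python
# The diagonal term b_0^m and the quotient h, for m = n = 2 monic.
from sympy import symbols, expand, div, Matrix

x, u1, u2, v1, v2 = symbols('x u1 u2 v1 v2')
E = expand((x - u1)*(x - u2))
F = expand((x - v1)*(x - v2))
a = [E.coeff(x, k) for k in range(3)]
b = [F.coeff(x, k) for k in range(3)]
S = Matrix([[a[2], a[1], a[0], 0],
            [0,    a[2], a[1], a[0]],
            [b[2], b[1], b[0], 0],
            [0,    b[2], b[1], b[0]]])
detS = expand(S.det())

diag = S[0, 0]*S[1, 1]*S[2, 2]*S[3, 3]                 # = b_0**2 here
print(expand(detS.subs({u1: 0, u2: 0}) - diag) == 0)   # True: all-v term

product = expand((u1-v1)*(u1-v2)*(u2-v1)*(u2-v2))
h, r = div(detS, product, u1, u2, v1, v2)
print(h, r)                                            # 1 0 : h really is 1
```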
What about the general case, with arbitrary leading coefficients $a_m$ and $b_n$? Throw them in as additional variables; let's just call them $a$ and $b$. The $a_i$'s and $b_j$'s acquire an additional factor; that is, the "new" $a_i$ is $a \cdot a_i$ and the "new" $b_j$ is $b \cdot b_j$. Each of the $n$ rows of $a$'s in $S$ picks up a factor $a$, and each of the $m$ rows of $b$'s a factor $b$, so the "new" $\det(S)$ is $a^n b^m$ times the "old" $\det(S)$. That's exactly the factor in front of the double product in (1). Since (1) held before (without the $a^n b^m$ on the right), it remains true after multiplying both sides by $a^n b^m$.
Closing remarks: we've actually shown that (1) is a polynomial identity in the ring $k[a, b, u_1, \ldots, u_m, v_1, \ldots, v_n]$. (We didn't even use the fact that $k$ is a field.)