Algebraic Geometry Jottings 11


The Resultant, Episode 2

By now you know the characters: the polynomials E(x) (degree m) and F(x) (degree n) with coefficients in an integral domain R, its fraction field K, and the extension field L of K in which E and F split completely:

E(x) = a(x-u_1)\cdots(x-u_m), \qquad F(x) = b(x-v_1)\cdots(x-v_n)

And we have our resultant res_x(E,F), an element of R, which is 0 if and only if E and F have a common root in L. When R=k[y] and K=k(y), that common root would be an algebraic function of y: what we’ve been calling a branch.

We assume, by the way, that m and n are both positive.

(In the Inside-the-Episode example, with the roles of x and y switched, we have the branches y=\pm\sqrt{1-x^2} for E(y)=y^2+(x^2-1) and y=\pm\sqrt{1-2x^2} for F(y)=y^2+(2x^2-1). The resultant is not zero, because the ellipses E and F don’t share a branch. Instead it’s res_y(E,F)=x^4.)

Even when E and F have no common branch, the resultant still has news to impart, when R=k[y]. We can have u_i(y)=v_j(y) at particular values of y (for some i and j). So res_x(E,F), a polynomial in y, tells us the possible values of y at the intersections of the curves E and F. (Inside-the-Episode example: the intersections occur where x^4=0, i.e., at x=0.)
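If you’d like to check that last computation by machine, here’s a quick SymPy snippet (my addition, just a sanity check; the post’s argument doesn’t depend on it):

```python
# Verify the Inside-the-Episode example with SymPy's built-in resultant.
from sympy import symbols, resultant

x, y = symbols('x y')
E = y**2 + (x**2 - 1)      # the first conic, as a polynomial in y over k[x]
F = y**2 + (2*x**2 - 1)    # the second, likewise

print(resultant(E, F, y))  # x**4, matching the text
```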

Let’s look at the abstract situation, and see what we can glean from that. Our theme: “As above, so below.” That is, the resultant belongs to R, while the common root (if any) belongs to L. What other assertions about L translate down to R?

First observation: E(x) and F(x) have a common root in L if and only if they have a common factor over L that’s not a constant.

Obviously a common root implies a nonconstant common factor, by the so-called factor theorem: if r is a root of f(x), then (x−r) is a factor. But the converse is also easy: E(x) splits into linear factors in L[x], and since L[x] is a UFD (because L is a field), any nonconstant common factor must be a product of some of those linear factors. So our common factor has a root in L, which naturally must also be a root of F(x).

“Having a common root” doesn’t translate between R and L, or even between K and L. But “having a nonconstant common factor” might. Obviously it translates upwards. How about downwards? In other words, if E(x) and F(x) have no nonconstant common factor in R[x] (or K[x])—if they are coprime there—are they coprime in L[x]?

Remarkably, yes, when R is a UFD. (Remarkable because being irreducible isn’t preserved as we “go upwards”. Standard example: x^2+1 is irreducible in ℝ[x] but not in ℂ[x].) From Gauss’s lemma we get: “coprime in R[x]” implies “coprime in K[x]”. Now, K[x] is a PID, so if E and F are coprime, then pE+qF=1 for some polynomials p, q ∈ K[x]. But that equation still holds in L[x], and it implies that any common factor of E and F divides 1. So it’s a unit, and E and F are coprime in L[x].
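Here’s a small illustration of that Bézout identity, via SymPy’s extended Euclidean algorithm for polynomials (the particular E and F are mine, chosen to keep the arithmetic tiny):

```python
# pE + qF = 1 for coprime E, F in K[x], here with K = Q.
from sympy import symbols, gcdex, expand

x = symbols('x')
E = x**2 + 1
F = x**2 - 1               # coprime to E: no common root

p, q, g = gcdex(E, F, x)   # extended Euclid: p*E + q*F == g = gcd(E, F)
print(g)                   # 1, since E and F are coprime
print(expand(p*E + q*F))   # 1, confirming the identity
```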

The expression pE+qF has made its entrance onto the stage! It will play a major role in the rest of “The Resultant” miniseries, and even beyond.

Observe: if E and F are not coprime, then E=AB and F=BC for some A≠0, C≠0, and nonconstant B. So EC=(AB)C=A(BC)=AF, or EC−AF=0. Taking p=C and q=−A (note that deg(C)=deg(F)−deg(B)<deg(F) and deg(A)=deg(E)−deg(B)<deg(E), since B is nonconstant), we conclude there are nonzero polynomials p and q such that

pE+qF=0
with deg(p)<deg(F),  deg(q)<deg(E)

Conversely, if we have pE+qF=0 with the degree inequalities, then pE=–qF, so E divides qF. E doesn’t divide q because deg(q)<deg(E). Using unique factorization, it follows that some irreducible (nonconstant) factor of E divides F, so E and F are not coprime. A bonus: this argument works equally well in R[x], K[x], and L[x].
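A concrete instance (my example, not from the post), with B=x−2 as the common factor:

```python
# When E and F share the factor B, p = C and q = -A give pE + qF = 0,
# with the degree bounds deg(p) < deg(F) and deg(q) < deg(E).
from sympy import symbols, expand

x = symbols('x')
A, B, C = x - 1, x - 2, x - 3
E = expand(A*B)            # (x-1)(x-2), degree 2
F = expand(B*C)            # (x-2)(x-3), degree 2

p, q = C, -A               # both of degree 1 < 2
print(expand(p*E + q*F))   # 0
```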

The upshot: being coprime and not being coprime both boil down to the existence of certain polynomials in K[x], and if they exist down in K[x], they also exist up in L[x].

The “pE+qF=1” argument relied on K[x] being a PID. When R=k[y], the integral domain R[x] (aka k[x,y]) isn’t a PID, just a UFD. But the equation pE+qF=1 in K[x] still tells us something about R[x]. Namely, K is the fraction field of R, so we can clear the denominators in the coefficients of p and q, getting PE+QF=D in R[x], with D in R.
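In the running example (roles of x and y switched again, so R=k[x] and K=k(x)): p=−1/x^2 and q=1/x^2 give pE+qF=1 in K[y], and clearing the x^2 denominator yields −E+F=x^2. So P=−1, Q=1, and D=x^2. A quick check:

```python
# Clearing denominators: P*E + Q*F = D with P = -1, Q = 1, D = x**2 in R = k[x].
from sympy import symbols, expand, simplify

x, y = symbols('x y')
E = y**2 + (x**2 - 1)
F = y**2 + (2*x**2 - 1)

print(expand(-E + F))             # x**2: that's D, down in R
print(simplify((-E + F)/x**2))    # 1: the identity pE + qF = 1, up in K[y]
```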

Remember the coordinate ring Γ(E∩F) of post 7? The quotient k[x,y]/(E,F)? Notice that the ideal (E,F) of k[x,y] is the set {PE+QF : P,Q ∈ k[x,y]}, and its smallest enclosing ideal in K[x] is {pE+qF : p,q ∈ K[x]}, where K=k(y).

With all this as motivation, we’re ready for the final plot twist of Episode 2. Let’s look at the homomorphism

K[x]⊕K[x] → K[x]
⟨p,q⟩ ↦ pE+qF

Call it Φ. If we regard K[x]⊕K[x] and K[x] as vector spaces over K, then Φ is a linear operator. This turns our thoughts to its null space, its image, stuff like that. K[x] is infinite-dimensional over K. Which is OK, but finite-dimensional vector spaces are easier to deal with. But now recall the degree inequalities from above: deg(p)<deg(F)=n, deg(q)<deg(E)=m. Let’s write K_m[x] for the space of all polynomials over K of degree <m, likewise K_n[x]; these have dimensions m and n as vector spaces over K. So we can restrict Φ to get

K_n[x]⊕K_m[x] → K_{m+n}[x]
⟨p,q⟩ ↦ pE+qF

We observed earlier that E and F are not coprime if and only if pE+qF=0 for some nonzero p ∈ K_n[x] and q ∈ K_m[x]. Our restricted Φ is singular if and only if E and F are not coprime.

It’s easy enough to write the matrix for the restricted Φ, over the bases {1,…,x^{n−1}; 1,…,x^{m−1}} and {1,…,x^{m+n−1}}. You might think I repeated some elements when I wrote {1,…,x^{n−1}; 1,…,x^{m−1}}. Not so. We get the basis of K_n[x]⊕K_m[x] by concatenating the bases of K_n[x] and K_m[x]. So (for example) those two 1’s represent different basis elements, namely ⟨1,0⟩ and ⟨0,1⟩, sent by Φ to E and F respectively.

Suppose E(x)=a_0+···+a_m x^m, F(x)=b_0+···+b_n x^n. Here’s the matrix, picking the special case m=2 and n=3, with the bases shown above and at right:

\begin{array}{cc}
\begin{array}{ccccc} 1\;\; & x & x^2 & 1 & x \end{array} \\
\begin{bmatrix}
a_0 &     &     & b_0 &     \\
a_1 & a_0 &     & b_1 & b_0 \\
a_2 & a_1 & a_0 & b_2 & b_1 \\
    & a_2 & a_1 & b_3 & b_2 \\
    &     & a_2 &     & b_3
\end{bmatrix}
&
\begin{array}{l} 1 \\ x \\ x^2 \\ x^3 \\ x^4 \end{array}
\end{array}

Blank entries are zeros. For example, the basis element ⟨x,0⟩, topping the second column, is sent to a_0x+a_1x^2+a_2x^3; the basis element ⟨0,x⟩, topping the last column, is sent to b_0x+b_1x^2+b_2x^3+b_3x^4.

This is called the Sylvester matrix. (Usually transposed, and often with the rows/columns rearranged. Conventions differ. I’ll switch to the traditional form in the next post.)

The determinant of the Sylvester matrix is therefore zero exactly when E and F have a nonconstant common factor. Hey, that’s just like the resultant! Maybe the Sylvester determinant equals the resultant?

Indeed it does (up to a factor of ±1, depending on conventions). We’ll see why in Episode 4.
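If you want to experiment, here’s a SymPy sketch of the construction; the helper sylvester below is mine, following this post’s column convention rather than the traditional transposed form:

```python
# Matrix of the restricted Phi: columns E, xE, ..., x^(n-1)E, F, xF, ..., x^(m-1)F;
# rows indexed by 1, x, ..., x^(m+n-1).
from sympy import symbols, zeros, expand

def sylvester(a, b):
    """a = [a0, ..., am], b = [b0, ..., bn]: coefficient lists of E and F,
    constant terms first."""
    m, n = len(a) - 1, len(b) - 1
    M = zeros(m + n, m + n)
    for j in range(n):                  # column j carries x^j * E
        for i, c in enumerate(a):
            M[i + j, j] = c
    for j in range(m):                  # column n+j carries x^j * F
        for i, c in enumerate(b):
            M[i + j, n + j] = c
    return M

# The Inside-the-Episode conics, as polynomials in y over k[x]:
x = symbols('x')
a = [x**2 - 1, 0, 1]       # E(y) = y^2 + (x^2 - 1)
b = [2*x**2 - 1, 0, 1]     # F(y) = y^2 + (2x^2 - 1)
print(expand(sylvester(a, b).det()))   # x**4: the resultant again
```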

