The matrix M is a Vandermonde matrix (biography) if Mi,j = Mi,0j. Here is a simple example.
1 1 1 1 1 2 4 8 1 3 9 27 1 4 16 64
As you can see, the jth column is the first column raised to the j power. Let x be the first column vector, so that Mi,j = xij. In fact, let the elements of x be unknowns, variables that we will fill in later. Thus the determinant of M becomes a polynomial in n variables, x0 through xn-1.
Evaluate the Vandermonde determinant via det2(M), the sum of permutation products. Consider the binomial xl-xk for any l > k. For every product with xki×xlj there is another with xkj×xli. These products come from permutations with opposite parity, so subtract them and find something divisible by xl-xk. This holds for every l > k.
Multivariable polynomials over the integers exhibit unique factorization (not proved here), so the unique factorization of the determinant includes xl-xk for each l > k. Furthermore, their product gives an expression of degree (n2-n)/2. Consider any permutation product in the determinant of M. It's degree is 0+1+2+…+n-1 = (n2-n)/2. The factors identified thus far create an expression that divides the determinant, and has the same degree, hence we can only be off by a constant.
run down the main diagonal and note that the product of xii appears in the determinant with coefficient 1. Multiply the binomials together: (x1-x0) × (x2-x0) × (x2-x1) × (x3-x0) × …, and one of the resulting terms, taking the first variable from each binomial, gives the same expression, with a coefficient of 1. Therefore the determinant of M is the product of xl-xk, for 0 ≤ k < l < n.
In our example, the determinant is
(2-1)×(3-1)×(4-1)×(3-2)×(4-2)×(4-3) = 12.
As a corollary, any set of distinct Vandermonde vectors, drawn from any integral domain, is linearly independent. The determinant is the product of nonzero differences, and is nonzero.
In the earlier 4×4 example, scale the second row by 217. Now the row runs from 217 to 220, yet the matrix remains nonsingular. Do this for every row, and the matrix starts at exponent 17 instead of exponent 0.
Let's use the matrix to evaluate the functions six*p(x), where si is the base for the ith row, and p() is a fixed polynomial. To start, replace x with 0 through n-1, whence six gives a vandermonde matrix. If some of the integers from 0 to n-1 are roots of p, advance the exponents, as we did above. Make sure the exponents, from left to right, are not roots of p. This can always be done, since p has finitely many roots. Now, multiply the jth column by p(x), where x is the exponent associated with the jth column. Scaling a column keeps the matrix nonsingular. The exponential functions, represented by the rows, are multiplied by a particular polynomial, and the result is evaluated at n successive integers in the domain. Since the matrix remains nonsingular, the functions are linearly independent. Different exponentials, times a fixed polynomial, become linearly independent functions.