Tensor Calculus
Partial Derivative of a Tensor
Partial differentiation of a tensor is in general not a tensor. Depending on the circumstance, we will represent the partial derivative of a tensor in the following way
(3.1)
where we have
taken the special case of a contravariant vector
We now show explicitly that the partial derivative of a contravariant vector cannot be a tensor. Consider the transformation relation for such a tensor.
(3.2) |
|
|
Differentiating
with respect to coordinate
(3.3) |
|
|
Using the chain rule this becomes:
(3.4) |
|
|
Expanding this out we get:
If
only the first term on the right-hand side were present, then this would be the
usual tensor transformation law for a tensor of type (1,1). However, the presence of the second term
prevents
This
problem arises because of the very definition of the derivative. Differentiation involves
comparing a quantity evaluated at two neighbouring points, P
and Q say, dividing by some parameter representing the separation
of P and Q, and then taking the limit as
this parameter goes to zero. In the
case of a contravariant vector field
(3.6) |
|
|
for some
appropriate parameter
in the form ,
(3.7) |
|
|
we see that
(3.8) |
|
|
and
(3.9) |
|
|
This
involves the transformation matrix evaluated at different points! Thus
it is clear that
To define a tensor derivative we shall introduce a quantity called an affine connection and use it to define covariant differentiation. We will then introduce a tensor called a metric and from it build a special affine connection, called the metric connection, and again we will define covariant differentiation but relative to this specific connection.
The Affine Connection and Covariant Differentiation
Consider
a contravariant vector field
to first
order. If we denote the second term by
(3.11)
then
By the same
argument as in previous discussion of the partial derivative,
(3.12)
tensorial. It is natural to require that
|
|
and the minus sign is introduced to agree with convention.
We
have therefore introduced a set of
(3.14)
In
other words, it is the difference between the vector
(3.15)
or in terms of the semi-colon notation
(3.16)
Note
that in the formula the differentiation index
or equivalently (exercise)
If the second term on the right-hand side were absent, then this would be the usual transformation law for a tensor of type (1,2). However, the presence of the second term reveals that the transformation law is linear inhomogeneous. (3.17) or (3.18) is called an affine connection [or sometimes simply a connection or affinity]. A manifold with a continuous connection prescribed on it is called an affine manifold. From another point of view, the existence of the inhomogeneous term in the transformation law is not surprising if we are to define a tensorial derivative, since its role is to compensate for the second term that occurs in (3.5).
We next define the covariant derivative of a scalar field to be the same as its partial derivative, i.e.
(3.19) |
|
|
If we now demand that covariant differentiation satisfies the usual product rule of calculus, then we find
(3.20)
Notice
again that the differentiation index comes last in the
(3.21)
It
follows directly from the transformation laws that the sum of two connections
is not a connection or a tensor.
However, the difference of two connections is a tensor of type
(1,2), because the inhomogeneous term cancels out in the transformation. For the same reason, the anti-symmetric part
of a
(3.22) |
|
|
is a tensor (called the torsion tensor). If the torsion tensor vanishes, then the connection is symmetric, i.e.
(3.23) |
|
|
Affine Geodesics
If
(3.24) |
|
|
that is,
whenever the tangent vector field to the congruence is
We next define the
absolute derivative of a tensor
written
by the following relation
|
|
The
tensor
|
|
This is a
first-order ordinary differential equation for
Using this notation, an affine geodesic is defined as a privileged curve along which the tangent vector is propagated parallel to itself. In other words, the parallely propagated vector at any point of the curve is parallel, that is, proportional to the tangent vector at that point:
(3.27) |
|
|
Using (3.25), the equation for an affine geodesic can be written in the form
(3.28) |
|
|
or equivalently (exercise)
(3.29) |
|
|
The last result is
very important and so we shall establish it afresh from first principles using
the notation of the last section. Let
the neighbouring points P and Q on C
be given by
(3.30) |
|
|
to first order in
|
|
The vector
(3.32) |
|
|
The vector already at Q is
(3.33) |
|
|
to first order in
(3.34)
where we have
written the proportionality factor as
(3.35) |
|
|
Note that
If
the curve is parameterized in such a way that
(3.36) |
|
|
or equivalently
(3.37) |
|
|
An affine parameter s is only defined up to an affine transformation (exercise)
(3.38) |
|
|
where
Similarly, as long as the points are sufficiently 'close', any point can be joined to any other point by a unique geodesic. However, in the large, geodesics may focus, that is, meet again as shown in the following diagram.