Metric tensor

In the mathematical field of differential geometry, a metric tensor is a type of function which takes as input a pair of tangent vectors v and w at a point of a surface (or higher dimensional differentiable manifold) and produces a real number scalar g(v, w) in a way that generalizes many of the familiar properties of the dot product of vectors in Euclidean space. In the same way as a dot product, metric tensors are used to define the length of and angle between tangent vectors. Through integration, the metric tensor allows one to define and compute the length of curves on the manifold.

A metric tensor is called positive definite if it assigns a positive value g(v, v) > 0 to every nonzero vector v. A manifold equipped with a positive definite metric tensor is known as a Riemannian manifold. On a Riemannian manifold, the curve connecting two points that (locally) has the smallest length is called a geodesic, and its length is the distance that a passenger in the manifold needs to traverse to go from one point to the other. Equipped with this notion of length, a Riemannian manifold is a metric space, meaning that it has a distance function d(p, q) whose value at a pair of points p and q is the distance from p to q. Conversely, the metric tensor itself is the derivative of the distance function (taken in a suitable manner). Thus the metric tensor gives the infinitesimal distance on the manifold.

While the notion of a metric tensor was known in some sense to mathematicians such as Carl Gauss from the early 19th century, it was not until the early 20th century that its properties as a tensor were understood by, in particular, Gregorio Ricci-Curbastro and Tullio Levi-Civita, who first codified the notion of a tensor. The metric tensor is an example of a tensor field.

The components of a metric tensor in a coordinate basis take on the form of a symmetric matrix whose entries transform covariantly under changes to the coordinate system. Thus a metric tensor is a covariant symmetric tensor. From the coordinate-independent point of view, a metric tensor is defined to be a nondegenerate symmetric bilinear form on each tangent space that varies smoothly from point to point.

Introduction

Carl Friedrich Gauss in his 1827 Disquisitiones generales circa superficies curvas (General investigations of curved surfaces) considered a surface parametrically, with the Cartesian coordinates x, y, and z of points on the surface depending on two auxiliary variables u and v. Thus a parametric surface is (in today's terms) a vector valued function

\vec{r}(u,v) = ( x(u,v), y(u,v), z(u,v) )

depending on an ordered pair of real variables (u,v), and defined in an open set D in the uv-plane. One of the chief aims of Gauss' investigations was to deduce those features of the surface which could be described by a function which would remain unchanged if the surface underwent a transformation in space (such as bending the surface without stretching it), or a change in the particular parametric form of the same geometrical surface.

One natural such invariant quantity is the length of a curve drawn along the surface. Another is the angle between a pair of curves drawn along the surface and meeting at a common point. A third such quantity is the area of a piece of the surface. The study of these invariants of a surface led Gauss to introduce the predecessor of the modern notion of the metric tensor.

Arclength

If the variables u and v are taken to depend on a third variable, t, taking values in an interval [a, b], then $\scriptstyle{\vec{r}(u(t),v(t))}$ will trace out a parametric curve in parametric surface M. The arclength of that curve is given by the integral

\begin{align} s &= \int_a^b\left\|\frac{d}{dt}\vec{r}(u(t),v(t))\right\|\,dt \\ &= \int_a^b \sqrt{u'(t)^2\,\vec{r}_u\cdot\vec{r}_u + 2u'(t)v'(t)\, \vec{r}_u\cdot\vec{r}_v+ v'(t)^2\,\vec{r}_v\cdot\vec{r}_v}\,\,\, dt , \end{align}

where $\left\| \cdot \right\|$ represents the Euclidean norm. Here the chain rule has been applied, and the subscripts denote partial derivatives ( $\scriptstyle \vec{r}_u=\tfrac{\partial \vec{r}}{\partial u}$ , $\scriptstyle \vec{r}_v=\tfrac{\partial \vec{r}}{\partial v}$ ). The integrand is the restriction^[1] to the curve of the square root of the (quadratic) differential

$ds^2 = E \,du^2 + 2F \,du\, dv + G\, dv^2 ,$

(1)

where

$E=\vec r_u\cdot\vec r_u, \quad F=\vec r_u\cdot\vec r_v , \quad G=\vec r_v\cdot \vec r_v .$

(2)

The quantity ds in (1) is called the line element, while ds² is called the first fundamental form of M. Intuitively, it represents the principal part of the square of the displacement undergone by $\scriptstyle{\vec{r}(u,v)}$ when u is increased by du units, and v is increased by dv units.

Using matrix notation, the first fundamental form becomes

\begin{align} ds^2 &= \begin{bmatrix} du&dv \end{bmatrix} \begin{bmatrix} E&F\\ F&G \end{bmatrix} \begin{bmatrix} du\\dv \end{bmatrix}\\ \end{align}

Coordinate transformations

Suppose now that a different parameterization is selected, by allowing u and v to depend on another pair of variables u′ and v′. Then the analog of (2) for the new variables is

$E'=\vec r_{u'}\cdot\vec r_{u'}, \quad F'=\vec r_{u'}\cdot\vec r_{v'}, \quad G'=\vec r_{v'}\cdot \vec r_{v'}.$

(2')

The chain rule relates E′, F′, and G′ to E, F, and G via the matrix equation

$\begin{bmatrix} E'&F'\\ F'&G' \end{bmatrix} = \begin{bmatrix} \frac{\partial u}{\partial u'}&\frac{\partial u}{\partial v'}\\ \frac{\partial v}{\partial u'}&\frac{\partial v}{\partial v'} \end{bmatrix}^\mathrm{T} \begin{bmatrix} E&F\\ F&G \end{bmatrix} \begin{bmatrix} \frac{\partial u}{\partial u'}&\frac{\partial u}{\partial v'}\\ \frac{\partial v}{\partial u'}&\frac{\partial v}{\partial v'} \end{bmatrix}$

(3)

where the superscript T denotes the matrix transpose. The matrix with the coefficients E, F, and G arranged in this way therefore transforms by the Jacobian matrix of the coordinate change

J=\begin{bmatrix} \frac{\partial u}{\partial u'}&\frac{\partial u}{\partial v'}\\ \frac{\partial v}{\partial u'}&\frac{\partial v}{\partial v'} \end{bmatrix}.

A matrix which transforms in this way is one kind of what is called a tensor. The matrix

\begin{bmatrix} E&F\\ F&G \end{bmatrix}

with the transformation law (3) is known as the metric tensor of the surface.

Invariance of arclength under coordinate transformations

Ricci-Curbastro & Levi-Civita (1900) first observed the significance of a system of coefficients E, F, and G, that transformed in this way on passing from one system of coordinates to another. The upshot is that the first fundamental form (1) is invariant under changes in the coordinate system, and that this follows exclusively from the transformation properties of E, F, and G. Indeed, by the chain rule,

\begin{bmatrix} du\\dv \end{bmatrix} =\begin{bmatrix} \frac{\partial u}{\partial u'} & \frac{\partial u}{\partial v'}\\ \frac{\partial v}{\partial u'} & \frac{\partial v}{\partial v'} \end{bmatrix} \begin{bmatrix} du'\\dv' \end{bmatrix}

so that

\begin{align} ds^2 &= \begin{bmatrix} du&dv \end{bmatrix} \begin{bmatrix} E&F\\ F&G \end{bmatrix} \begin{bmatrix} du\\dv \end{bmatrix}\\ &=\begin{bmatrix} du'&dv' \end{bmatrix} \begin{bmatrix} \frac{\partial u}{\partial u'} & \frac{\partial u}{\partial v'}\\ \frac{\partial v}{\partial u'} & \frac{\partial v}{\partial v'} \end{bmatrix}^\mathrm{T} \begin{bmatrix} E&F\\ F&G \end{bmatrix} \begin{bmatrix} \frac{\partial u}{\partial u'} & \frac{\partial u}{\partial v'}\\ \frac{\partial v}{\partial u'} & \frac{\partial v}{\partial v'} \end{bmatrix} \begin{bmatrix} du'\\dv' \end{bmatrix}\\ &= \begin{bmatrix} du'&dv' \end{bmatrix} \begin{bmatrix} E'&F'\\ F'&G' \end{bmatrix} \begin{bmatrix} du'\\dv' \end{bmatrix}\\ &=(ds')^2. \end{align}

Length and angle

Another interpretation of the metric tensor, also considered by Gauss, is that it provides a way in which to compute the length of tangent vectors to the surface, as well as the angle between two tangent vectors. In contemporary terms, the metric tensor allows one to compute the dot product of tangent vectors in a manner independent of the parametric description of the surface. Any tangent vector at a point of the parametric surface M can be written in the form

\mathbf{p} = p_1\vec{r}_u + p_2\vec{r}_v

for suitable real numbers p₁ and p₂. If two tangent vectors are given

\mathbf{a} = a_1\vec{r}_u + a_2\vec{r}_v

\mathbf{b} = b_1\vec{r}_u + b_2\vec{r}_v

then using the bilinearity of the dot product,

\begin{align} \mathbf{a} \cdot \mathbf{b} &= a_1 b_1 \vec{r}_u\cdot\vec{r}_u + a_1b_2 \vec{r}_u\cdot\vec{r}_v + b_1a_2 \vec{r}_v\cdot\vec{r}_u + a_2 b_2 \vec{r}_v\cdot\vec{r}_v\\ &= a_1 b_1 E + a_1b_2 F + b_1a_2 F + a_2b_2G \\ &=\begin{bmatrix} a_1 & a_2 \end{bmatrix} \begin{bmatrix} E&F\\F&G \end{bmatrix} \begin{bmatrix} b_1 \\ b_2 \end{bmatrix} \end{align}.

This is plainly a function of the four variables a₁, b₁, a₂, and b₂. It is more profitably viewed, however, as a function that takes a pair of arguments a = [a₁ a₂] and b = [b₁ b₂] which are vectors in the uv-plane. That is, put

g(\mathbf{a}, \mathbf{b}) = a_1b_1 E + a_1b_2 F + b_1a_2 F + a_2b_2G.

This is a symmetric function in a and b, meaning that

g(\mathbf{a}, \mathbf{b}) = g(\mathbf{b}, \mathbf{a}).

It is also bilinear, meaning that it is linear in each variable a and b separately. That is,

g(\lambda\mathbf{a}+\mu\mathbf{a'},\mathbf{b}) = \lambda g(\mathbf{a},\mathbf{b}) + \mu g(\mathbf{a'},\mathbf{b}),\quad\text{and}

g(\mathbf{a}, \lambda\mathbf{b}+\mu\mathbf{b'}) = \lambda g(\mathbf{a},\mathbf{b}) + \mu g(\mathbf{a},\mathbf{b'})

for any vectors a, a′, b, and b′ in the uv plane, and any real numbers μ and λ.

In particular, the length of a tangent vector a is given by

\left\| \mathbf{a} \right\| = \sqrt{g(\mathbf{a},\mathbf{a})}

and the angle θ between two vectors a and b is calculated by

\cos\theta = \frac{g(\mathbf{a},\mathbf{b})}{ \left\| \mathbf{a} \right\| \, \left\| \mathbf{b} \right\| } .

Area

The surface area is another numerical quantity which should depend only on the surface itself, and not on how it is parameterized. If the surface M is parameterized by the function $\vec{r}(u,v)$ over the domain D in the uv-plane, then the surface area of M is given by the integral

\iint_D \left|\vec{r}_u\times\vec{r}_v\right|\,du\,dv

where × denotes the cross product, and the absolute value denotes the length of a vector in Euclidean space. By Lagrange's identity for the cross product, the integral can be written

\begin{align} \iint_D &\sqrt{(\vec{r}_u\cdot\vec{r}_u)(\vec{r}_v\cdot\vec{r}_v)-(\vec{r}_u\cdot\vec{r}_v)^2}\,du\,dv\\ &\quad=\iint_D\sqrt{EG-F^2}\,du\,dv\\ &\quad=\iint_D\sqrt{\operatorname{det}\begin{bmatrix}E&F\\ F&G\end{bmatrix}} \, du\, dv\end{align}

where det is the determinant.

Definition

Let M be a smooth manifold of dimension n; for instance a surface (in the case n = 2) or hypersurface in the Cartesian space Rⁿ⁺¹. At each point p ∈ M there is a vector space T_pM, called the tangent space, consisting of all tangent vectors to the manifold at the point p. A metric at p is a function g_p(X_p,Y_p) which takes as inputs a pair of tangent vectors X_p and Y_p at p, and produces as an output a real number (scalar), so that the following conditions are satisfied:

g_p is bilinear. A function of two vector arguments is bilinear if it is linear separately in each argument. Thus if U_p, V_p, Y_p are three tangent vectors at p and a and b are real numbers, then

g_p(aU_p+bV_p,Y_p) = ag_p(U_p,Y_p)+bg_p(V_p,Y_p),\ \ \text{and}

g_p(Y_p,aU_p+bV_p) = ag_p(Y_p,U_p)+bg_p(Y_p,V_p).\,

g_p is symmetric.^[2] A function of two vector arguments is symmetric provided that for all vectors X_p and Y_p,

g_p(X_p,Y_p) = g_p(Y_p,X_p).\,

g_p is nondegenerate. A bilinear function is nondegenerate provided that, for every tangent vector X_p ≠ 0, the function

Y_p\mapsto g_p(X_p,Y_p)

obtained by holding X_p constant and allowing Y_p to vary is not identically zero. That is, for every X_p ≠ 0 there exists a Y_p such that g_p(X_p,Y_p) ≠ 0.

A metric tensor g on M assigns to each point p of M a metric g_p in the tangent space at p in a way that varies smoothly with p. More precisely, given any open subset U of manifold M and any (smooth) vector fields X and Y on U, the real function

g(X,Y)(p) = g_p(X_p,Y_p)\,

is a smooth function of p.

Components of the metric

This section assumes some familiarity with coordinate vectors.

The components of the metric in any basis of vector fields, or frame, f = (X₁, ..., X_n) are given by^[3]

$g_{ij}[\mathbf{f}] = g\left(X_i,X_j\right).$

(4)

The n² functions g_ij[f] form the entries of an n×n symmetric matrix, G[f]. If

v = \sum_{i=1}^n v^iX_i,\quad w = \sum_{i=1}^n w^iX_i

are two vectors at p ∈ U, then the value of the metric applied to v and w is determined by the coefficients (4) by bilinearity:

g(v,w) = \sum_{i,j=1}^n v^iw^jg\left(X_i,X_j\right) = \sum_{i,j=1}^n v^iw^jg_{ij}[\mathbf{f}]

Denoting the matrix (g_ij[f]) by G[f] and arranging the components of the vectors v and w into column vectors v[f] and w[f],

g(v,w) = \mathbf{v}[\mathbf{f}]^\mathrm{T} G[\mathbf{f}] \mathbf{w}[\mathbf{f}] = \mathbf{w}[\mathbf{f}]^\mathrm{T} G[\mathbf{f}]\mathbf{v}[\mathbf{f}]

where v[f]^T and w[f]^T denote the transpose of the vectors v[f] and w[f], respectively. Under a change of basis of the form

\mathbf{f}\mapsto \mathbf{f}' = \left(\sum_k X_ka_{k1},\dots,\sum_k X_ka_{kn}\right) = \mathbf{f}A

for some invertible n × n matrix A = (a_ij), the matrix of components of the metric changes by A as well. That is,

G[\mathbf{f}A] = A^\mathrm{T} G[\mathbf{f}]A

or, in terms of the entries of this matrix,

g_{ij}[\mathbf{f}A] = \sum_{k,\ell=1}^n a_{ki}g_{k\ell}[\mathbf{f}]a_{\ell j}.

For this reason, the system of quantities g_ij[f] is said to transform covariantly with respect to changes in the frame f.

Metric in coordinates

A system of n real valued functions (x¹, ..., xⁿ), giving a local coordinate system on an open set U in M, determines a basis of vector fields on U

\mathbf{f}=\left(X_1=\frac{\partial}{\partial x^1},\dots,X_n=\frac{\partial}{\partial x^n}\right).

The metric g has components relative to this frame given by

g_{ij}[\mathbf{f}] = g\left(\frac{\partial}{\partial x^i},\frac{\partial}{\partial x^j}\right).

Relative to a new system of local coordinates, say

y^i = y^i(x^1,x^2,\dots,x^n),\quad i=1,2,\dots,n

the metric tensor will determine a different matrix of coefficients,

g_{ij}[\mathbf{f}'] = g\left(\frac{\partial}{\partial y^i},\frac{\partial}{\partial y^j}\right).

This new system of functions is related to the original g_ij(f) by means of the chain rule

\frac{\partial}{\partial y^i} = \sum_{k=1}^n\frac{\partial x^k}{\partial y^i}\frac{\partial}{\partial x^k}

so that

g_{ij}[\mathbf{f'}]=\sum_{k,\ell=1}^n \frac{\partial x^k}{\partial y^i}g_{k\ell}[\mathbf{f}]\frac{\partial x^\ell}{\partial y^j}.

Or, in terms of the matrices G[f] = (g_ij[f]) and G[f′] = (g_ij[f′]),

G[\mathbf{f}'] = \left((Dy)^{-1}\right)^\mathrm{T} G[\mathbf{f}](Dy)^{-1}\,

where Dy denotes the Jacobian matrix of the coordinate change.

Signature of a metric

Main article: Metric signature

Associated to any metric tensor is the quadratic form defined in each tangent space by

q_m(X_m) = g_m(X_m,X_m),\quad X_m\in T_mM.

If q_m is positive for all non-zero X_m, then the metric is positive definite at m. If the metric is positive definite at every m ∈ M, then g is called a Riemannian metric. More generally, if the quadratic forms q_m have constant signature independent of m, then the signature of g is this signature, and g is called a pseudo-Riemannian metric.^[4] If M is connected, then the signature of q_m does not depend on m.^[5]

By Sylvester's law of inertia, a basis of tangent vectors X_i can be chosen locally so that the quadratic form diagonalizes in the following manner

q_m\left(\sum_i\xi^iX_i\right) = (\xi^1)^2+(\xi^2)^2+\cdots+(\xi^p)^2 - (\xi^{p+1})^2-\cdots-(\xi^n)^2

for some p between 1 and n. Any two such expressions of q (at the same point m of M) will have the same number p of positive signs. The signature of g is the pair of integers (p, n − p), signifying that there are p positive signs and n − p negative signs in any such expression. Equivalently, the metric has signature (p, n − p) if the matrix g_ij of the metric has p positive and n − p negative eigenvalues.

Certain metric signatures which arise frequently in applications are:

If g has signature (n, 0), then g is a Riemannian metric, and M is called a Riemannian manifold. Otherwise, g is a pseudo-Riemannian metric, and M is called a pseudo-Riemannian manifold (the term semi-Riemannian is also used).
If M is four-dimensional with signature (1, 3) or (3, 1), then the metric is called Lorentzian. More generally, a metric tensor in dimension n other than 4 of signature (1, n − 1) or (n − 1, 1) is sometimes also called Lorentzian.
If M is 2n-dimensional and g has signature (n, n), then the metric is called ultrahyperbolic.

Inverse metric

Let f = (X₁, ..., X_n) be a basis of vector fields, and as above let G[f] be the matrix of coefficients

g_{ij}[\mathbf{f}] = g(X_i,X_j) .

One can consider the inverse matrix G[f]⁻¹, which is identified with the inverse metric (or conjugate or dual metric). The inverse metric satisfies a transformation law when the frame f is changed by a matrix A via

$G[\mathbf{f}A]^{-1} = A^{-1}G[\mathbf{f}]^{-1}(A^{-1})^\mathrm{T}.$

(5)

The inverse metric transforms contravariantly, or with respect to the inverse of the change of basis matrix A. Whereas the metric itself provides a way to measure the length of (or angle between) vector fields, the inverse metric supplies a means of measuring the length of (or angle between) covector fields; that is, fields of linear functionals.

To see this, suppose that α is a covector field. To wit, for each point p, α determines a function α_p defined on tangent vectors at p so that the following linearity condition holds for all tangent vectors X_p and Y_p, and all real numbers a and b:

\alpha_p(aX_p+bY_p) = a\alpha_p(X_p)+b\alpha_p(Y_p).\,

As p varies, α is assumed to be a smooth function in the sense that

p\mapsto \alpha_p(X_p)

is a smooth function of p for any smooth vector field X.

Any covector field α has components in the basis of vector fields f. These are determined by

\alpha_i = \alpha(X_i),\quad i=1,2,\dots,n.

Denote the row vector of these components by

\alpha[\mathbf{f}] = \left[\alpha_1\ \ \alpha_2\ \ \dots\ \ \alpha_n\right].

Under a change of f by a matrix A, α[f] changes by the rule

\alpha[\mathbf{f}A] = \alpha[\mathbf{f}]A.

That is, the row vector of components α[f] transforms as a covariant vector.

For a pair α and β of covector fields, define the inverse metric applied to these two covectors by

$\tilde{g}(\alpha,\beta) = \alpha[\mathbf{f}]G[\mathbf{f}]^{-1}\beta[\mathbf{f}]^\mathrm{T}.$

(6)

The resulting definition, although it involves the choice of basis f, does not actually depend on f in an essential way. Indeed, changing basis to fA gives

\begin{align} \alpha[\mathbf{f}A]G[\mathbf{f}A]^{-1}\beta[\mathbf{f}A]^\mathrm{T} &= (\alpha[\mathbf{f}]A)\left(A^{-1}G[\mathbf{f}]^{-1}(A^{-1})^\mathrm{T}\right)A^\mathrm{T}\beta[\mathbf{f}]^\mathrm{T}\\ &=\alpha[\mathbf{f}]G[\mathbf{f}]^{-1}\beta[\mathbf{f}]^\mathrm{T}. \end{align}

So that the right-hand side of equation (6) is unaffected by changing the basis f to any other basis fA whatsoever. Consequently, the equation may be assigned a meaning independently of the choice of basis. The entries of the matrix G[f] are denoted by g^ij, where the indices i and j have been raised to indicate the transformation law (5).

Raising and lowering indices

Induced metric

Let U be an open set in Rⁿ, and let φ be a continuously differentiable function from U into the Euclidean space R^m, where m > n. The mapping φ is called an immersion if its differential is injective at every point of U. The image of φ is called an immersed submanifold.

Suppose that φ is an immersion onto the submanifold M ⊂ R^m. The usual Euclidean dot product in R^m is a metric which, when restricted to vectors tangent to M, gives a means for taking the dot product of these tangent vectors. This is called the induced metric.

Suppose that v is a tangent vector at a point of U, say

v = v^1\mathbf{e}_1+\dots+v^n\mathbf{e}_n

where e_i are the standard coordinate vectors in Rⁿ. When φ is applied to U, the vector v goes over to the vector tangent to M given by

\varphi_*(v) = \sum_{i=1}^n \sum_{a=1}^m v^i\frac{\partial \varphi^a}{\partial x^i}\mathbf{e}_a.

(This is called the pushforward of v along φ.) Given two such vectors, v and w, the induced metric is defined by

g(v,w) = \varphi_*(v)\cdot \varphi_*(w).

It follows from a straightforward calculation that the matrix of the induced metric in the basis of coordinate vector fields e is given by

G(\mathbf{e}) = (D\varphi)^\mathrm{T}(D\varphi)

where Dφ is the Jacobian matrix:

D\varphi = \begin{bmatrix} \frac{\partial\varphi^1}{\partial x^1}&\frac{\partial\varphi^1}{\partial x^2}&\dots&\frac{\partial\varphi^1}{\partial x^n}\\[1ex] \frac{\partial\varphi^2}{\partial x^1}&\frac{\partial\varphi^2}{\partial x^2}&\dots&\frac{\partial\varphi^2}{\partial x^n}\\ \vdots&\vdots&\ddots&\vdots\\ \frac{\partial\varphi^m}{\partial x^1}&\frac{\partial\varphi^m}{\partial x^2}&\dots&\frac{\partial\varphi^m}{\partial x^n}\\ \end{bmatrix}.

Intrinsic definitions of a metric

The notion of a metric can be defined intrinsically using the language of fiber bundles and vector bundles. In these terms, a metric tensor is a function

$g : TM\times_M TM\to \mathbf{R}$

(5)

from the fiber product of the tangent bundle of M with itself to R such that the restriction of g to each fiber is a nondegenerate bilinear mapping

g_p : T_pM\times T_pM \to \mathbf{R}.

The mapping (5) is required to be continuous, and often continuously differentiable, smooth, or real analytic, depending on the case of interest, and whether M can support such a structure.

Metric as a section of a bundle

By the universal property of the tensor product, any bilinear mapping (5) gives rise naturally to a section g_⊗ of the dual of the tensor product bundle of TM with itself

g_\otimes\in \Gamma\left((TM\otimes TM)^*\right).

The section g_⊗ is defined on simple elements of TM⊗TM by

g_\otimes(v\otimes w) = g(v,w)

and is defined on arbitrary elements of TM⊗TM by extending linearly to linear combinations of simple elements. The original bilinear form g is symmetric if and only if

g_\otimes\circ\tau = g_\otimes

where

\tau : TM\otimes TM\stackrel{\cong}{\to} TM\otimes TM

is the braiding map.

Since M is finite-dimensional, there is a natural isomorphism

(TM\otimes TM)^*\cong T^*M\otimes T^*M,

so that g_⊗ is regarded also as a section of the bundle T*M⊗T*M of the cotangent bundle T*M with itself. Since g is symmetric as a bilinear mapping, it follows that g_⊗ is a symmetric tensor.

Metric in a vector bundle

More generally, one may speak of a metric in a vector bundle. If E is a vector bundle over a manifold M, then a metric is a mapping

g : E\times_M E\to \mathbf{R}

from the fiber product of E to R which is bilinear in each fiber:

g_p : E_p \times E_p\to \mathbf{R}.

Using duality as above, a metric is often identified with a section of the tensor product bundle $\scriptstyle E^*\otimes E^*$ , (See metric (vector bundle).)

Tangent–cotangent isomorphism

Arclength and the line element

Suppose that g is a Riemannian metric on M. In a local coordinate system xⁱ, i = 1,2,...,n, the metric tensor appears as a matrix, denoted here by G, whose entries are the components g_ij of the metric tensor relative to the coordinate vector fields.

Let $\gamma (t)$ be a piecewise differentiable parametric curve in M, for a ≤t ≤ b. The arclength of the curve is defined by

L = \int_a^b \sqrt{ \sum_{i,j=1}^n g_{ij}(\gamma(t))\left({d\over dt}x^i\circ\gamma(t)\right)\left({d\over dt}x^j\circ\gamma(t)\right)}\,dt.

In connection with this geometrical application, the quadratic differential form

ds^2 = \sum_{i,j=1}^n g_{ij}(p)dx^i dx^j

is called the first fundamental form associated to the metric, while ds is the line element. When ds² is pulled back to the image of a curve in M, it represents the square of the differential with respect to arclength.

For a pseudo-Riemannian metric, the length formula above is not always defined, because the term under the square root may become negative. We generally only define the length of a curve when the quantity under the square root is always of one sign or the other. In this case, define

L = \int_a^b \sqrt{ \left|\sum_{i,j=1}^ng_{ij}(\gamma(t))\left({d\over dt}x^i\circ\gamma(t)\right)\left({d\over dt}x^j\circ\gamma(t)\right)\right|}\,dt \ .

Note that, while these formulas use coordinate expressions, they are in fact independent of the coordinates chosen; they depend only on the metric, and the curve along which the formula is integrated.

The energy, variational principles and geodesics

Given a segment of a curve, another frequently defined quantity is the (kinetic) energy of the curve:

E = \frac{1}{2} \int_a^b \sum_{i,j=1}^ng_{ij}(\gamma(t))\left({d\over dt}x^i\circ\gamma(t)\right)\left({d\over dt}x^j\circ\gamma(t)\right)\,dt. \

This usage comes from physics, specifically, classical mechanics, where the integral E can be seen to directly correspond to the kinetic energy of a point particle moving on the surface of a manifold. Thus, for example, in Jacobi's formulation of Maupertuis principle, the metric tensor can be seen to correspond to the mass tensor of a moving particle.

In many cases, whenever a calculation calls for the length to be used, a similar calculation using the energy may be done as well. This often leads to simpler formulas by avoiding the need for the square-root. Thus, for example, the geodesic equations may be obtained by applying variational principles to either the length or the energy. In the latter case, the geodesic equations are seen to arise from the principle of least action: they describe the motion of a "free particle" (a particle feeling no forces) that is confined to move on the manifold, but otherwise moves freely, with constant momentum, within the manifold.^[7]

Canonical measure and volume form

In analogy with the case of surfaces, a metric tensor on an n-dimensional paracompact manifold M gives rise to a natural way to measure the n-dimensional volume of subsets of the manifold. The resulting natural positive Borel measure allows one to develop a theory of integrating functions on the manifold by means of the associated Lebesgue integral.

A measure can be defined, by the Riesz representation theorem, by giving a positive linear functional Λ on the space C₀(M) of compactly supported continuous functions on M. More precisely, if M is a manifold with a (pseudo-)Riemannian metric tensor g, then there is a unique positive Borel measure μ_g such that for any coordinate chart (U,φ),

\Lambda f = \int_U f\,d\mu_g = \int_{\varphi(U)} f\circ\varphi^{-1}(x) \sqrt{|\det g|}\,dx

for all ƒ supported in U. Here det g is the determinant of the matrix formed by the components of the metric tensor in the coordinate chart. That Λ is well-defined on functions supported in coordinate neighborhoods is justified by Jacobian change of variables. It extends to a unique positive linear functional on C₀(M) by means of a partition of unity.

If M is in addition oriented, then it is possible to define a natural volume form from the metric tensor. In a positively oriented coordinate system (x¹,...,xⁿ) the volume form is represented as

\omega = \sqrt{|\det g|}\, dx^1\wedge\cdots\wedge dx^n

where the dxⁱ are the coordinate differentials and the wedge ∧ denotes the exterior product in the algebra of differential forms. The volume form also gives a way to integrate functions on the manifold, and this geometric integral agrees with the integral obtained by the canonical Borel measure.

Examples

The Euclidean metric

The most familiar example is that of elementary Euclidean geometry: the two-dimensional Euclidean metric tensor. In the usual $x$ - $y$ coordinates, we can write

g = \begin{bmatrix} 1 & 0 \\ 0 & 1\end{bmatrix}. \

The length of a curve reduces to the formula:

L = \int_a^b \sqrt{ (dx)^2 + (dy)^2}. \

The Euclidean metric in some other common coordinate systems can be written as follows.

Polar coordinates: $(r, \theta) \$

x = r \cos\theta

y = r \sin\theta

J = \begin{bmatrix}\cos\theta & -r\sin\theta \\ \sin\theta & r\cos\theta\end{bmatrix}.

g = J^\mathrm{T}J = \begin{bmatrix}\cos^2\theta+\sin^2\theta & -r\sin\theta \cos\theta + r\sin\theta\cos\theta \\ -r\cos\theta\sin\theta + r\cos\theta\sin\theta & r^2 \sin^2\theta + r^2\cos^2\theta\end{bmatrix}=\begin{bmatrix} 1 & 0 \\ 0 & r^2\end{bmatrix} \

by trigonometric identities.

In general, in a Cartesian coordinate system xⁱ on a Euclidean space, the partial derivatives $\partial/\partial x^i$ are orthonormal with respect to the Euclidean metric. Thus the metric tensor is the Kronecker delta δ_ij in this coordinate system. The metric tensor with respect to arbitrary (possibly curvilinear) coordinates $q^{i}$ is given by:

g_{ij} = \sum_{kl}\delta_{kl}{\partial x^k \over \partial q^i} {\partial x^l \over \partial q^j} = \sum_k\frac{\partial x^k}{\partial q^i}\frac{\partial x^k}{\partial q^j}.

The round metric on a sphere

The unit sphere in R³ comes equipped with a natural metric induced from the ambient Euclidean metric. In standard spherical coordinates $(\theta,\varphi)$ , with $\theta$ the colatitude, the angle measured from the z axis, and $\varphi$ the angle from the x axis in the xy plane, the metric takes the form

g = \left[\begin{array}{cc} 1 & 0 \\ 0 & \sin^2 \theta\end{array}\right].

This is usually written in the form

ds^2 = d\theta^2 + \sin^2\theta\,d\varphi^2.

Lorentzian metrics from relativity

Main article: Metric tensor (general relativity)

In flat Minkowski space (special relativity), with coordinates $r^\mu \rightarrow (x^0, x^1, x^2, x^3)=(ct, x, y, z) \ ,$ the metric is

g = \begin{bmatrix} 1 & 0 & 0 & 0\\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{bmatrix}. \

For a curve with—for example—constant time coordinate, the length formula with this metric reduces to the usual length formula. For a timelike curve, the length formula gives the proper time along the curve.

In this case, the spacetime interval is written as

ds^2 = c^2 dt^2 - dx^2 - dy^2 - dz^2 = dr^\mu dr_\mu = g_{\mu \nu} dr^\mu dr^\nu\

The Schwarzschild metric describes the spacetime around a spherically symmetric body, such as a planet, or a black hole. With coordinates $(x^0, x^1, x^2, x^3)=(ct, r, \theta, \varphi)$ , we can write the metric as

G = (g_{\mu\nu}) = \begin{bmatrix} (1-\frac{2GM}{rc^2}) & 0 & 0 & 0\\ 0 & -(1-\frac{2GM}{r c^2})^{-1} & 0 & 0 \\ 0 & 0 & -r^2 & 0 \\ 0 & 0 & 0 & -r^2 \sin^2 \theta \end{bmatrix}\,

where G (inside the matrix) is the gravitational constant and M represents the total mass-energy content of the central object.

Notes

↑ More precisely, the integrand is the pullback of this differential to the curve.
↑ In several formulations of classical unified field theories, the metric tensor was allowed to be non-symmetric; however, the antisymmetric part of such a tensor plays no role in the contexts described here, so it will not be further considered.
↑ The notation of using square brackets to denote the basis in terms of which the components are calculated is not universal. The notation employed here is modeled on that of Wells (1980). Typically, such explicit dependence on the basis is entirely suppressed.
↑ Dodson & Poston 1991, Chapter VII §3.04
↑ Vaughn 2007, §3.4.3
↑ For the terminology "musical isomorphism", see Gallot, Hulin & Lafontaine (2004, p. 75). See also Lee (1997, pp. 27–29)
↑ Sternberg 1983

References

Dodson, C. T. J.; Poston, T. (1991), Tensor geometry, Graduate Texts in Mathematics, 130 (2nd ed.), Berlin, New York: Springer-Verlag, ISBN 978-3-540-52018-4, MR 1223091
Gallot, Sylvestre; Hulin, Dominique; Lafontaine, Jacques (2004), Riemannian Geometry (3rd ed.), Berlin, New York: Springer-Verlag, ISBN 978-3-540-20493-0 .
Gauss, Carl Friedrich (1827), General Investigations of Curved Surfaces, New York: Raven Press (published 1965) translated by A.M.Hiltebeitel and J.C.Morehead; "Disquisitiones generales circa superficies curvas", Commentationes Societatis Regiae Scientiarum Gottingesis Recentiores Vol. VI (1827), pp. 99–146.
Hawking, S.W.; Ellis, G.F.R. (1973), The large scale structure of space-time, Cambridge University Press .
Kay, David (1988), Schaum's Outline of Theory and Problems of Tensor Calculus, McGraw-Hill, ISBN 978-0-07-033484-7 .
Kline, Morris (1990), Mathematical thought from ancient to modern times, Volume 3, Oxford University Press .
Lee, John (1997), Riemannian manifolds, Springer Verlag, ISBN 978-0-387-98322-6 .
Michor, Peter W. (2008), Topics in Differential Geometry, Graduate Studies in Mathematics, Vol. 93, Providence: American Mathematical Society (to appear).
Misner, Charles W.; Thorne, Kip S.; Wheeler, John A. (1973), Gravitation, W. H. Freeman, ISBN 0-7167-0344-0
Ricci-Curbastro, Gregorio; Levi-Civita, Tullio (1900), "Méthodes de calcul différentiel absolu et leurs applications", Mathematische Annalen, 54 (1): 125–201, doi:10.1007/BF01454201, ISSN 1432-1807
Sternberg, S. (1983), Lectures on Differential Geometry (2nd ed.), New York: Chelsea Publishing Co., ISBN 0-8218-1385-4
Vaughn, Michael T. (2007), Introduction to mathematical physics, Weinheim: Wiley-VCH Verlag GmbH & Co., ISBN 978-3-527-40627-2, MR 2324500
Wells, Raymond (1980), Differential Analysis on Complex Manifolds, Berlin, New York: Springer-Verlag

Tensors

Glossary of tensor theory

Scope

Mathematics	coordinate system multilinear algebra Euclidean geometry tensor algebra differential geometry exterior calculus tensor calculus

Physics Engineering	continuum mechanics electromagnetism transport phenomena general relativity computer vision

Notation

Tensor definitions

Operations

Related abstractions

Notable tensors

Mathematics	Kronecker delta Levi-Civita symbol metric tensor nonmetricity tensor Christoffel symbols Ricci curvature Riemann curvature tensor Weyl tensor torsion tensor

Physics	moment of inertia angular momentum tensor spin tensor Cauchy stress tensor stress–energy tensor EM tensor gluon field strength tensor Einstein tensor metric tensor (GR)

Mathematicians

This article is issued from Wikipedia - version of the 11/7/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.