Skip to main content

Section 5.1 Paths

We have now acquired a solid body of linear algebra knowledge. It’s time to apply it to the non-linear world!

Subsection 5.1.1 Curves vs. Paths

Recall that a path is defined to be a function
\begin{equation} \mb{r} : I \to \mathbb{R}^n\tag{5.1.1} \end{equation}
Here the domain \(I\) is a (potentially infinite) interval in \(\mathbb{R}\text{.}\) A curve is the image of one or more paths. In this course, we distinguish between a path and a curve, although some texts refer to paths as `parameterized curves’, which is a more explicit description of what they are - a path gives a (connected) curve a (non-linear) coordinate or parameter. Let us formalize this notion.

Definition 5.1.1.

Given a curve \(\mathcal{C}\) in a vector space \(V\text{,}\) we say that \(\mb{r} : I \to V\) parameterizes \(\mathcal{C}\) if it is one-to-one and its range is \(\mathcal{C}\) (except possibly at the endpoints). If \(I\) is a connected interval \((a,b)\text{,}\) \((a,b]\) or \([a,b]\text{,}\) we will say that \(\mathcal{C}\) is a connected curve.
There are three common notations we will encounter. If \(n = 2\) or \(n = 3\text{,}\) we may write
\begin{equation*} \mb{r} (t) = \twovec{x(t)}{y(t)} \hspace{.2in} \text{ or } \hspace{.2in} \mb{r} (t) = \threevec{x(t)}{y(t)}{z(t)} \end{equation*}
respectively. It is also very common to see these written out as row vectors with the notation
\begin{equation*} \mb{r} (t) = \left\langle x(t) , y(t) \right\rangle \hspace{.2in} \text{ or } \hspace{.2in} \mb{r} (t) = \left\langle x(t), y(t), z(t) \right\rangle. \end{equation*}
In the general case, we will write
\begin{equation*} \mb{r} (t) = \threevec{x_1 (t) }{ \vdots}{x_n (t)} \hspace{.2in} \text{ or } \hspace{.2in} \mb{r} (t) = \left\langle x_1 (t) , \ldots, x_n (t) \right\rangle. \end{equation*}
In this setting, it is common to write the path as \(\mb{x} (t)\) and we often use this instead of \(\mb{r} (t)\) in this case. Let us take a look at some common curves and their parameterizations.

Example 5.1.2. Parameterizing an ellipse.

For a fixed positive real number \(R\text{,}\) the path
\begin{equation*} \mb{r} (t) = \twovec{R \cos t}{R \sin t} \end{equation*}
will be used frequently as it parameterizes the circle \(\mathcal{C}_R\) of radius \(R\) centered at the origin. In this case, the circle is the curve and \(\mb{r} (t)\) is a path. Of course, here we would take the domain of \(\mb{r}(t)\) to be an interval like \([0, 2\pi]\) or \([0, 2\pi)\text{.}\) A variant of this is the path
\begin{equation*} \mb{r} (t) = \twovec{a \cos t}{b \sin t}. \end{equation*}
This parameterizes the ellipse given by
\begin{equation*} \left( \frac{x}{a} \right)^2 + \left( \frac{y}{b} \right)^2 = 1 . \end{equation*}
Before giving the next example, let’s define a line!

Definition 5.1.3.

Given a non-zero real vector space \(V\text{,}\) a line in \(V\) is any subset which equals the translation of a one-dimensional vector subspace in \(V\text{.}\)
The reason for including the translation is that vector subspaces always pass through the origin and we often want to consider lines which do not. Of course, we will consider translating by \(\mb{0}\) as an option, so lines passing through the origin do count as lines!

Example 5.1.4. Parameterizing lines.

A quick look at Definition 5.1.3 will confirm that a line \(\ell\) in \(V\) is indeed a curve, not a path. It does not come with any given parameterization. However, it does assert that there must be a vector \(\mb{v}_0\) and a one dimensional subspace \(W\) such that
\begin{equation*} \ell = W + \mb{v}_0 = \left\{ \mb{w} + \mb{v}_0 : \mb{w} \text{ a vector in } W \right\}. \end{equation*}
Given this information, we actually can parameterize \(\ell\) by choosing a basis (or in this case, a single non-zero vector) \(\{\mb{v} \}\) of \(W\text{.}\) Indeed, once we do this, we consider the function
\begin{equation*} \mb{r} (t) = \mb{v}_0 + t \mb{v}. \end{equation*}
Since every vector in \(W\) is uniquely written as a multiple \(t \mb{v}\) for some \(t \in \mathbb{R}\text{,}\) this will be a one-to-one correspondence from \(\mathbb{R}\) to our line \(\ell\text{.}\) This is called in some texts the vector parameterization of \(\ell\), \(\mb{v}_0\) is called the initial vector, and \(\mb{v}\) is called the direction vector. Of course, we could have chosen a non-zero multiple of \(\tilde{\mb{v}} = - 4 \mb{v}\) as our direction vector and
\begin{equation*} \tilde{\mb{r}} (t) = \mb{v}_0 - 4 t \mb{v} \end{equation*}
would be another path parameterizing \(\ell\text{.}\) Again, paths parameterizing the same curve may be different!
We recall the discussion from Section 1.3 about the fact that in the path setting, we assign a number to each point of the curve. So the path has much more information in it than the curve. In some ways, this information is useful because we can compute. But often, we are not as interested in the extra information as we are the curve itself. For example, we may want to know the length of the curve which does not depend on our parameterization.
In many ways, this difference is analogous to the difference between a real \(n\)-dimensional vector space \(V\) and \(\mathbb{R}^n\text{.}\) The former is something we may be interested in, but the latter is something we can compute with. A basis of \(V\) gives us a way of parameterizing \(V\) with coordinates in \(\mathbb{R}^n\) and studying important features. But enough reminiscing, back to paths!

Subsection 5.1.2 Tangent Vectors of Paths

All of the notation conventions for paths are with respect to paths with codomain \(\mathbb{R}^n\) (or \(\mathbb{C}^n\)), but we will promote this a bit by taking our codomain to be an arbitrary inner product space \(V\) over \(K\) and considering paths to be vector valued functions of one variable (which usually will be \(t\))
\begin{equation*} \mb{r} : \mathbb{R} \to V . \end{equation*}
Thus \(\mb{r} (t)\) will be a vector in \(V\) dependent on a `time’ parameter \(t\text{.}\) Of course, for finite dimensional real inner product spaces with a choice of basis, a path in this sense can be written with coordinates using the basis, and we may freely go back and forth once such a basis is chosen. In particular, we will often use the form of equation (5.1.1) in examples and exercises as it is where we can compute most effectively.
However, in some contexts it is helpful to keep the flexibility of working over complex numbers or without a predetermined choice of basis. In particular, some results and definitions are best expressed without these constraints. Here is an example.

Definition 5.1.5.

If \(\mb{r}_1 (t)\text{,}\) \(\mb{r}_2 (t)\) are paths in the inner product space \(V\) over \(K\text{,}\) \(W\) a vector space over \(K\text{,}\) \(T : V \to W\) a linear transformation, and \(\lambda\) is a scalar, then
  1. \((\mb{r}_1 + \lambda \mb{r}_2 ) (t) = \mb{r}_1 (t) + \lambda \mb{r}_2 (t)\) defines the sum of two paths,
  2. \(\left\langle \mb{r}_1 , \mb{r_2} \right\rangle (t) = \left\langle \mb{r}_1 (t) , \mb{r}_2 (t) \right\rangle\) defines the pairing of two paths and gives a scalar function of \(t\text{,}\)
  3. \(T ( \mb{r}_1 ) (t) = T ( \mb{r}_1 (t) )\) is the path of the composition of \(T\) with \(\mb{r}_1\text{.}\)
Thus we can do vector arithmetic on paths and compute angles and distances between paths (at varying times). However, the main ingredient that an inner product provides that allows for us to start doing calculus is distance. This is because without it (or a more general form of it called a topology), we cannot make sense of the following notion.

Definition 5.1.6.

Let \(I\) be an interval in \(\mathbb{R}\) with \(a\) in its interior and \(\mb{r} : I \to V\) a path in \(V\text{.}\) The tangent vector or velocity vector of \(\mb{r}\) at \(a\) is
\begin{equation} \mb{r}^\prime (a) = \lim_{h \to 0} \frac{1}{h} \left( \mb{r} (a + h) - \mb{r} (a) \right).\tag{5.1.2} \end{equation}
If this is well defined at all \(a\) in \(I\text{,}\) we will say that \(\mb{r}\) is differentiable and write \(\mb{r}^\prime\) for the path of tangent vectors \(\mb{r}^\prime : I \to V\text{.}\)
The tangent vector at \(a\) is also sometimes written as \(\mb{v} (a)\) (although we will refrain from using this notation to avoid confusion with vectors in \(V\)). It is also very common to see it written as
\begin{equation*} \dot{\mb{r}} (a) = \mb{r}^\prime (a) \end{equation*}
or,
\begin{equation*} \left. \frac{d}{dt} \right|_a\mb{r} (t) = \mb{r}^\prime (a). \end{equation*}
Now, I ask that the student look carefully at equation (5.1.2) and consider what is within the limit, namely
\begin{equation*} \frac{1}{h} \left( \mb{r} (a + h) - \mb{r} (a) \right) . \end{equation*}
For a given \(h\) (note that \(a\) is fixed), the object inside the parentheses is the difference of two vectors in \(V\text{,}\) so it is a vector in \(V\text{.}\) We are scalar multiplying this by the non-zero real scalar \(\frac{1}{h}\text{.}\) So altogether this expression is a vector for any \(h\text{.}\) In fact, it is a rescaled the displacement vector from \(\mb{r}\) at time \(a\) to \(\mb{r}\) at time \(a + h\text{.}\)
In fact, if \(V = \mathbb{R}\) and \(\mb{r} (t) = f(t)\) were our one variable function, this definition would simply be
\begin{equation*} f^\prime (a) = \lim_{h \to 0} \frac{f(a + h) - f(a)}{h} \end{equation*}
which every red blooded high school student knows as the derivative!
But as was mentioned above, the key ingredient that we need to make sense of the definition for higher dimensions \(V\) is the inner product (which gives us a definition of distance, and in turn, for the limit). Right now, this is just a comment and I ask students to make a mental note of it and consider reading through an introductory analysis book to gather a more in depth understanding of limits in many dimensions. Suffice it to say, many results that we had in one variable carry over to many variables. In particular, we have the following two very useful propositions.

Proof.

The proof of this fact follows from some basic analysis, so we will omit these details.
We also have the proposition.
We should read this last proposition in the following way. Part (1) says that if a path is differentiable, then its coordinate functions are differentiable for any coordinate system you choose. Part (2) is a converse to this - it says if the coordinates of \(\mb{r} (t)\) in any given basis are differentiable functions of \(t\text{,}\) then so is \(\mb{r} (t)\) and furthermore the tangent vector has coordinates that are derivatives of the coordinates. Let us write this as a corollary which is really just a rephrasing of part (2).

Proof.

With a little help of some basic analysis, we can prove Proposition 5.1.8. By the fact that a dual vector is a continuous function, a linear scalar function is a continuous map which implies that composing commutes with limits. But finding the tangent vector \(\mb{r}^\prime (a)\) is such a limit and we see that
\begin{equation*} \mb{v}^* ( \mb{r}^\prime (a) ) = \left. \frac{d}{dt}\right|_a \mb{v}^* ( \mb{r} (t)) . \end{equation*}
Thus, if \(\mb{r}\) is differentiable, then the left hand side exists which implies the right exists as well.
On the other hand, assume the right hand side exists for a basis of dual vectors \(\mb{v}^*_i\text{.}\) The \(\mb{v}^*_i\) have a dual basis \(\mb{v}_1, \ldots, \mb{v}_n\) as in equation (2.3.3) and any vector \(\mb{v}\) has the unique expression
\begin{equation*} \mb{v}^*_1 (\mb{v} ) \mb{v}_1 + \cdots + \mb{v}^*_n (\mb{v} ) \mb{v}_n = \mb{v} \end{equation*}
This is also true for \(\mb{r} (t)\) so we have
\begin{equation*} \mb{v}^*_1 (\mb{r} (t)) \mb{v}_1 + \cdots + \mb{v}^*_n (\mb{r} (t) ) \mb{v}_n = \mb{r} (t) \end{equation*}
Now, the existence of the derivatives of the coordinate functions and Proposition 5.1.7 gives
\begin{align*} \left(\left. \frac{d}{dt} \right|_a \mb{v}^*_1 (\mb{r} (t)) \right) \mb{v}_1 + \cdots + \left( \left. \frac{d}{dt} \right|_a \mb{v}^*_n (\mb{r} (t) ) \right) \mb{v}_n \amp = \left. \frac{d}{dt} \right|_a \left(\mb{v}^*_1 (\mb{r} (t)) \mb{v}_1 \right) + \cdots + \left. \frac{d}{dt} \right|_a \left( \mb{v}^*_n (\mb{r} (t) ) \mb{v}_n \right), \\ \amp = \left. \frac{d}{dt} \right|_a \left(\mb{v}^*_1 (\mb{r} (t)) \mb{v}_1 + \cdots + \mb{v}^*_n (\mb{r} (t) ) \mb{v}_n \right), \\ \amp = \mb{r}^\prime (a) . \end{align*}
With Corollary 5.1.9 in hand, we can calculate tangent vectors confidently.

Example 5.1.10. Computing tangent vectors.

Suppose
\begin{equation*} \mb{r} (t) = \threevec{e^{2t}}{\sin ( \pi t^2 )}{\ln (4 t)} . \end{equation*}
Its tangent vector can be computed by taking derivatives of the components so that
\begin{equation*} \mb{r}^\prime (t) = \threevec{2 e^{2t}}{2 \pi t \cos (\pi t^2)}{\frac{1}{t}}. \end{equation*}
This example illustrates the more general application of Corollary 5.1.9, if \(\mb{x}\) is a path in \(\mathbb{R}^n\) and you want to find its tangent vector, you need only take \(n\) derivatives of the coordinate functions and you are done.
Certainly, being able to compute tangent vectors is useful, but so is knowing that our prior rules and results still hold.

Proof.

For the first statement, we simply write \(\mb{r} (t)\) in coordinates for some basis \(\mathcal{B}\) and differentiate using Corollary 5.1.9.
\begin{align*} \frac{d}{dt} \left( f(t) \mb{r} (t) \right) \amp = \frac{d}{dt} \left( f(t) \mb{r} (t) \right), \\ \amp = \frac{d}{dt} \left( f(t) \threevec{x_1 (t)}{\vdots}{x_n (t)}_{\mathcal{B}} \right), \\ \amp = \frac{d}{dt} \left( \threevec{ f(t) x_1 (t)}{\vdots}{f(t) x_n (t)}_{\mathcal{B}} \right), \\ \amp = \threevec{ f^\prime(t) x_1 (t) + f(t) x_1^\prime (t) }{\vdots}{f^\prime(t) x_n (t) + f(t) x^\prime_n (t)}_{\mathcal{B}}, \\ \amp = \threevec{ f^\prime(t) x_1 (t) }{\vdots}{f^\prime(t) x_n (t) }_{\mathcal{B}} + \threevec{ f(t) x_1^\prime (t) }{\vdots}{f(t) x^\prime_n (t)}_{\mathcal{B}}, \\ \amp = f^\prime(t) \threevec{ x_1 (t) }{\vdots}{ x_n (t) }_{\mathcal{B}} + f(t) \threevec{ x_1^\prime (t) }{\vdots}{ x^\prime_n (t)}_{\mathcal{B}}, \\ \amp = f^\prime(t) \mb{r} (t) + f(t) \mb{r}^\prime (t). \end{align*}
The second statement is proved similarly and is left to the student.

Proof.

The proof is left as an exercise.

Subsection 5.1.3 The Geometry of Curves

There is much to say about the geometry of curves, and here we will define only a few important notions, saving for later a more detailed discussion.

Definition 5.1.13.

A differentiable path \(\mb{r} : I \to V\) has speed
\begin{equation*} \| \mb{r}^\prime (a) \| \end{equation*}
at \(a\text{.}\)
Note that speed and velocity are two different (but related) concepts. In particular, velocity is a vector, while speed is a non-negative scalar.

Example 5.1.14. Speed of a parameterized helix.

To find the speed of the helix
\begin{equation*} \mb{r} (t) = \threevec{ 2 \cos (2t)}{- 2 \sin (2t)}{3t} \end{equation*}
we first find the velocity
\begin{equation*} \mb{r}^\prime (t) = \threevec{ -4 \sin (2t)}{- 4 \cos (2t)}{3} \end{equation*}
and take the magnitude
\begin{equation*} \| \mb{r}^\prime (t) \| = \sqrt{ 16 \sin^2 (2t) + 16 \cos^2 (2t) + 9} = \sqrt{25} = 5. \end{equation*}
Just as in one variable calculus, we will wish to talk about vectors being `tangent’ to a curve (and latter a surface or subspace).

Definition 5.1.15.

Let \(\mathcal{C}\) be a curve. We say a vector \(\mb{v}\) is tangent to \(\mathcal{C}\) at \(\mb{v}_0\) if there is a differentiable parameterization \(\mb{r} : I \to \mathcal{C}\) so that \(\mb{r} (a) = \mb{v}_0\) and \(\mb{v}\) is a scalar multiple of \(\mb{r}^\prime (a)\text{.}\) The tangent line to \(\mathcal{C}\) is the line \(\mb{v}_0 + W\) where \(W\) is the subspace of tangent vectors to \(\mathcal{C}\) at \(\mb{v}_0\text{.}\)

Example 5.1.16. Tangent line to a hyperbola.

Consider the positive branch \(\mathcal{C}\) of the hyperbola
\begin{equation*} x^2 - y^2 = 1 . \end{equation*}
It was seen in Section 1.3 that this branch is parameterized by the hyperbolic trigonometric functions via
\begin{equation*} \mb{r} (t) = \twovec{ \cosh t}{\sinh t} . \end{equation*}
To write down a parametric equation for the tangent line to the curve at, say, \(t_0\text{,}\) we need only find the tangent vector
\begin{equation*} \mb{r}^\prime (t_0) = \twovec{\sinh t_0}{\cosh t_0} \end{equation*}
and use this as the direction vector to obtain
\begin{equation*} \ell (t) = \mb{r} (t_0 ) + t \mb{r}^\prime (t_0 ) = \twovec{ \cosh t_0 + t \sinh t_0} {\sinh t_0 + t \cosh t_0}. \end{equation*}
A can be useful to have a tangent vector to a curve which is a unit vector. This can be accomplished by normalizing the tangent vector. In fact, such a vector has a name.

Definition 5.1.17.

The unit tangent vector \(\mb{T} (p)\) to a curve \(\mathcal{C}\) at a point \(p\) is a unit vector which is tangent to \(\mathcal{C}\) at \(p\text{.}\) We call the function \(\mb{T}\) which assigns a point of \(\mathcal{C}\) to its unit tangent vector the unit vector frame of \(\mathcal{C}\text{.}\)
If the curve \(\mathcal{C}\) is parameterized by \(\mb{r} (t)\text{,}\) we may write \(\mb{T} (t)\) instead of \(\mb{T} (\mb{r} (t))\text{.}\)

Example 5.1.18. Finding the unit vector frame.

To find unit vectors of the helix
\begin{equation*} \mb{r} (t) = \threevec{ 2 \cos (2t)}{- 2 \sin (2t)}{3t} \end{equation*}
from Example 5.1.14, we simply normalize the velocity vector by dividing by the speed.
\begin{equation*} \mb{T} (t) = \threevec{ \frac{2}{5} \cos (2t)}{-\frac{2}{5} \sin (2t)}{\frac{3}{5}t} \end{equation*}
One of the reasons to consider unit tangent vectors is that they give us ways of projecting other vectors ‘along the curve’ at a point. We will see this approach when we integrate along curves later on.

Exercises 5.1.4 Exercises

1.

Give a path parameterizing the line in \(\mathbb{R}^5\) which passes through the vectors \(\left\langle 1, 4, -2, 0 , 1 \right\rangle\) and \(\left\langle 0, 2, -3, 1, 2 \right\rangle\text{.}\)

2.

Find the tangent vectors of the following paths
(a)
\begin{equation*} \mb{r} (t) = \twovec{\cos t}{\sin t} \end{equation*}
where \(\mb{r} : \mathbb{R} \to \mathbb{R}^2\text{.}\)
(b)
\begin{equation*} \mb{r} (t) = e^{i t} \end{equation*}
where \(\mb{r} : \mathbb{R} \to \mathbb{C}\text{.}\)
Compare your answers by relating \(\mathbb{R}^2\) to \(\mathbb{C}\) via the real linear isomorphism
\begin{equation*} T \left( \twovec{x}{y} \right) = x + i y . \end{equation*}
Explain your findings.

3.

Consider a slightly more general case than the previous exercise. Suppose \(\lambda\) is a complex number. Show that
\begin{equation*} \frac{d}{dt} e^{\lambda t} = \lambda e^{\lambda t}. \end{equation*}

4.

Find the speed and unit tangent vectors for the following paths
(a)
\begin{equation*} \mb{r} (t) = \twovec{\cos 3 t}{\sin 3 t} \end{equation*}
where \(\mb{r} : \mathbb{R} \to \mathbb{R}^2\text{..}\)
(b)
\begin{equation*} \mb{r} (t) = \left\langle 3\cos (t ) , \cos (2t), 3 \sin (t), \sin (2t) \right\rangle \end{equation*}
where \(\mb{r} : \mathbb{R} \to \mathbb{R}^4\text{.}\)

6.

Suppose \(\mb{r} (t)\) is a differentiable path that parameterizes \(\mathcal{C}\) and \(\mb{r} (0) = \mb{v}_0\text{.}\) We can reparameterize \(\mathcal{C}\) by composing \(\mb{r}\) with another differentiable function \(g (t)\) with \(g^\prime (t) \ne 0\text{.}\) Suppose \(g(0) = 0\) and write
\begin{equation*} \tilde{\mb{r}} (t) = \mb{r} (g(t)) \end{equation*}
for the new parametrization. Show that the tangent vectors \(\mb{r}^\prime (0)\) and \(\tilde{\mb{r}}^\prime (0\) are linearly dependent. This shows that a curve only has a tangent line (not a plane).

7.

The acceleration vector of a twice differentiable path \(\mb{r} (t)\) is \(\mb{r}^{\prime \prime} (t)\text{.}\) We say that a curve is arc length parameterized by \(\mb{r} (t)\) if the speed of \(\mb{r}(t)\) is \(1\) for all \(t\text{.}\) Show that if \(\mb{r} (t)\) is an arc length parameterization then its velocity and acceleration are orthogonal. In other words, show that
\begin{equation*} \left\langle \mb{r}^\prime (t) , \mb{r}^{\prime \prime} (t) \right\rangle = 0 \end{equation*}
for every \(t\text{.}\)