Complex Numbers Birth Trigonometry!

The Cosine and Sine Functions

We saw when we introduced complex numbers previously that multiplication of one complex number by another involves both rotation about the origin and scaling the distance from the same. Of course, if the complex numbers involved were both of unit magnitude, then only rotation was involved.

With this in mind, let us consider again the unit circle of complex values with unit magnitude. In particular, let us consider the real and imaginary parts of the $z$ on that unit circle where $\theta = arg(z)$ is known. We should recognize that knowing $\theta$ fixes exactly where this $z$ must be located in the complex plane. To see this, imagine one starts at $0$ (the origin) and then moves exactly one unit distance in a direction corresponding to $\theta$.

The real part of such a $z$ and its imaginary coefficient (and consequently the $x$ and $y$ coordinates of the corresponding point in the related coordinate plane -- also called the Cartesian plane, named after French mathematician Rene Descartes.) are also then fixed once we know $\theta$. In this way, we can think of these $x$ and $y$ coordinates as functions of $\theta$. To attach specific names to these functions, let us make the following definitions.

(...and don't worry, we'll explain exactly why we name them with these two specific names very soon!)

For any $\theta$, let $z = x + iy$ be the unique complex value on the unit circle (i.e., $|z| = 1$) with argument $\theta$. Then, define functions cosine and sine, denoted* $\cos \theta$ and $\sin \theta $ respectively, so that: $$\cos \theta = x = Re(z) \quad \textrm{ and } \quad \sin \theta = y = Im(z)$$

*Note that -- just like log functions -- we traditionally omit the parentheses around the input when doing so doesn't cause confusion.

As an ink-saving way to refer to the unique $z$ on the unit circle with a given argument $\theta$, we might find it useful to also define the following suggestively-named function:

Let $\textrm{cis}(\theta)$ be defined to be the unique complex value on the unit circle with argument $\theta$, which then requires $$\textrm{cis}(\theta) = \cos(\theta) + i \sin(\theta)$$

To visualize what these functions do, consider the diagram drawn on the left below. Of course, we can also (and more frequently do) draw such things instead on the related coordinate plane, as shown on the below right.

     

You may be curious about why we have included the shaded blue triangle in the diagram as well. This triangle -- formed by drawing a vertical line from the point on the unit circle to the real axis (or $x$-axis on the right) is called a reference triangle, and serves as a way to quickly compute sine and cosine values for certain commonly encountered angles. We will have more to say about it shortly.

For now, let us simply say that the cosine and sine functions form the basis of the area of mathematics called trigonometry, a word which traces back to the Greek words trigonan which means "triangle" and metron, meaning "to measure". In many ways, this reference triangle can be thought of as the triangle to which the word "trigonometry" owes its origin!

Polar vs. Rectangular Forms

The above discussion focused on points on the unit circle -- points $z$ in the complex plane at distance $1$ unit from the complex number $0$ (i.e., $|z|=1$). However, we can easily extend the ideas presented there into something applicable to every $z$ in the the complex plane.

Let us start with the simple fact that for every such $z$, we have $z = 1 \cdot z$.

Recalling that multiplying any complex number (including $1$) by $z$ rotates that complex number by $arg(z)$ and scales its magnitude by $|z|$, $z$ itself must be the result of rotating $1$ by $arg(z) = \theta$ (which takes us to $\textrm{cis}(\theta)$ on the unit circle), and then scaling the result by the positive real value $r=|z|$.

Recognizing that the set of all complex values whose magnitude is $r$ forms a circle of radius $r$ centered at the origin, where we end up after the scaling -- at $r \cdot \textrm{cis}(\theta)$ -- lies on this circle, as shown below.

This gives us an alternate way to describe complex numbers. We already know that we can write them all in the form $a+bi$ for appropriate real values of $a$ and $b$ -- with their locations coinciding with the point $(a,b)$ in the associated Cartesian plane. Now, we can also write every complex number $z$ in the form $r \cdot \textrm{cis}(\theta)$, where $r = |z|$ and $\theta = arg(z)$. We call this the polar form of $z$, a term coined by Italian mathematician Gregorio Fontana in the 18th century.


A "polar map" of Earth
As a useful metaphor to explain the word choice here, imagine the origin $0$ in the complex plane is earth's north pole. Certainly, one can get to any point on the planet from the north pole by picking an appropriate direction in which to travel, and then traveling (south) from the pole in that direction by the requisite distance. In a similar way, polar form gives us a way to describe how to get from the origin to any point in the complex plane. In this case, the direction to travel is specified by the angle of rotation $\theta$, and the requisite distance to travel is given by the magnitude $r$.

Interestingly, while $|z|$ is always positive, we can allow $r$ to be negative in polar form and everything still makes sense. We interpret a negative $r$ to be equivalent to instructions to "travel backwards" $r$ units, to continue the analogy just discussed. This is perfectly consistent with what we know already, as $$\begin{array}{rcl} -r \cdot \textrm{cis}(\theta) &=& r \cdot \textrm{cis}(\theta) \cdot (-1)\\ &=& r \cdot \textrm{cis}(\theta) \cdot \textrm{cis}(\pi)\\ &=& r \cdot \textrm{cis}(\theta+\pi) \end{array}$$ You can also get to this same result by recalling that negating a complex number (thinking in terms of transformations) is equivalent to reflecting it across the origin. This in turn can also be thought of as rotating it around the origin by $180^{\circ} = \pi$ radians.

Regardless of how you argue it, the direction of our actual movement ends up being directly opposite (backwards) from the direction we would have traveled if $r$ was positive.

We should note before concluding this section that -- and going back to our original $a+bi$ form for a moment -- complex numbers written in this way are said to be in rectangular form. The blue rectangle in the following image perhaps provides adequate inspiration for this name:

Some Basic Properties of the Sine and Cosine Functions

Whenever we develop a new function, there are always questions we should ask -- things like what's the domain? ..what's the image/range? ..what interesting properties does it have? ..etc. The below addresses some of these questions for our newly-defined cosine and sine functions.

The Square Root of $i$?

Recall that the field of complex numbers arose out of extending the real numbers with some mysterious value $i = \sqrt{-1}$. One might naturally wonder -- if dealing with the square root of a negative value required "expanding our universe of numbers" from $\mathbb{R}$ to $\mathbb{C}$, what about something like $\sqrt{i}$? Is this "universe expanding" too? ..or can we find a complex value that when squared yields $i$?

Upon a little reflection, there has to be at least one complex value that when squared yields $i$. To see this, consider how our previous exploration into products of complex numbers revealed the following two important facts that are true for any complex numbers $z_1$ and $z_2$: $$|z_1 \cdot z_2| = |z_1| \cdot |z_2| \quad \quad \textrm{ and } \quad \quad arg(z_1 \cdot z_2) = arg(z_1) + arg(z_2)$$ The first implies that the product of two complex numbers on the unit circle will also be on the unit circle. The second tells us that if we multiply a complex number by itself (i.e., square it), we will double the angle. Thus, taken together and noting that $arg(i)=90^{\circ} = \frac{\pi}{2}$ radians, it must be the case that $z = \textrm{cis}(\frac{\pi}{4}) = \textrm{cis}(45^{\circ})$ is one such square root of $i$, as suggested below.

This spawns two more questions though. We know $\textrm{cis}(\frac{\pi}{4}) = \cos(\frac{\pi}{4}) + \sin(\frac{\pi}{4})$, but what are the actual values of $\cos(\frac{\pi}{4})$ and $\sin(\frac{\pi}{4})$? Also, if $\textrm{cis}(\frac{\pi}{4})$ is one square root of $i$, are there others? (After all, for real numbers at least -- when there are any square roots of a value, there are two. One of these is positive, the other negative. However, we really can't speak of a "positive" complex value, nor a "negative" one. So what can we say?)

There are some nice ways leveraging some hopefully familiar results from geometry that we could employ to answer the first question. However, let us take a different approach so that we might answer both questions simultaneously.

Suppose there exists some complex number $z = a+bi$ (where $a,b \in \mathbb{R}$) that when squared yields $i$. That is to say, $$(a+bi)^2 = i$$ Of course, we can expand the left side and identify the real and imaginary parts of the result, and then note these same parts on the right side as well, as shown below. $$\begin{array}{rcl} a^2 + 2abi + b^2i^2 &=& i\\ a^2 + 2abi - b^2 &=& i\\ (a^2 - b^2) + (2ab)i &=& 0 + 1 \cdot i \end{array}$$ Equating the real parts between the left and right sides, and doing the same for the imaginary parts, we then have two equations that must hold: $$a^2 - b^2 = 0 \quad \quad \textrm{ and } \quad \quad 2ab = 1$$ If we solve the first for $a$ in terms of $b$ (using the "socks and shoes" strategy), we find $a = \pm b$.

Pursuing each case separately, we find...

Cube Roots of $i$

We can play the same game to find a complex value that when cubed yields $i$.

First, we can feel confident that at least one such complex value must exist. Certainly, $z = \textrm{cis}(\frac{\pi}{6}) = \textrm{cis}(30^{\circ})$ satisfies the requisite criteria, given the picture below and realizing that $$|z^3| = |z|^3 = 1^3 = 1 = |i| \quad \quad \textrm{and,}$$ $$arg(z^3) = 3 arg(z) = 3 \cdot \frac{\pi}{6} = \frac{\pi}{2} = arg(i)$$

Then, as before, let us suppose $z = a + bi$ (where $a, b \in \mathbb{R}$) cubed yields $i$, and expand the power and identify its real and imaginary parts: $$\begin{array}{rcl} a^3 + 3a^2 bi + 3ab^2 i^2 + b^3 i^3 &=& i\\ a^3 + 3a^2 bi - 3ab^2 - b^3 i &=& i\\ (a^3 - 3ab^2) + (3a^2b - b^3)i &=& 0 + 1\cdot i \end{array}$$ Equating the real parts above gives us one equation relating $a$ and $b$ -- equating the imaginary parts gives us another. $$a^3 - 3ab^2 = 0 \quad \quad \textrm{ and } \quad \quad 3a^2b - b^3 = 1$$ As before, we attempt to solve the first for $a$ in terms of $b$. In this case, if we factor the left side of the first equation and apply the zero product property, we find $$a(a^2 - 3b^2) = 0 \quad \textrm{ which implies } \quad a = 0, \quad \textrm{ or } \quad a^2 -3b^2 = 0$$ The last equation can be solved via the "socks and shoes" strategy, yielding $a = \pm b\sqrt{3}$, so we end up with three possible values for $a$ that can be substituted into the equation resulting from equating the imaginary parts above, $3a^2b-b^3=1$:

Yes, you heard correctly -- there are actually $3$ cube roots of $i$!

Further, if we mark where they occur on the unit circle, you will see they are symmetrically spaced around the circle -- quite beautifully, actually.

Of course, this gives us a couple more specific values of the cosine and sine functions. Noting that $\frac{\pi}{6}$ radians corresponds to the cube root in the first quadrant shown above, we have $$\cos(\frac{\pi}{6}) = \frac{\sqrt{3}}{2} \quad \quad \textrm{ and } \quad \quad \sin(\frac{\pi}{6}) = \frac{1}{2}$$

Exploiting Familiar Triangles and Symmetries

As mentioned previously, the values of $\cos \theta$ and $\sin \theta$ for the values of $\theta$ considered above can also be easily found using geometry. Using these values in conjunction with the symmetries seen in the unit circle, the values corresponding to other commonly encountered angles can also be easily found. We explore this idea below:

On Names and (Other) Trigonometric Functions

One might naturally wonder why the functions discussed above are named sine and cosine. The answer is fascinating! Considering the sine function first, let us begin our story in ancient India, where the Sanscrit word $jy\overline{a}$ is known to mean "bowstring". As an example, सज्य (sajya) means "furnished with a bowstring" or "strung (as a bow)".*

The connection to a bowstring is not hard to see -- consider the images below. If the circle on the right is a unit circle, note how the sine value associated with $\theta = m\angle BOC$ is the half the length of the (undrawn) "bowstring", segment $BC$. Coincidently, in the same image we can even see the "bow" as arc $\stackrel{\mbox{$\frown$}}{BAC}$, and the arrow $OA$ notched at $O$ to the (drawn) bowstring formed by the union of radii $OB$ and $OC$!

     
Aryabhata I
(statue in Pune, Western India)
a Greek Lyre

One naturally might wonder why the sine is referred to as a bowstring, when in truth it represents only half such a length. To address this, one should know that the first known table of values for the sine function can be found in the Aryabhatiya (c. 499).

Its author, Aryabhata I, actually initially uses the word ardha-jya to describe the value of the sine function -- and (appropriately) the Sanscrit word ardha means "half". However, Aryabhata eventually drops (presumably in the interests of expediency) the qualifier of ardha-, essentially abbreviating the sine value (a "half bowstring") with just the word for "bowstring".

*: See "The Practical Sanskrit-English Dictionary", 1890

Going backwards in time, the astronomers and mathematicians of India likely started using this word after reading the work of the greek Hipparchus, and seeing the greek word, χορδή (pronounced (khordḗ).

Certainly, given its pronunciation, this greek word connects to our modern "chord" of a circle -- and even the word "cord" in the sense of something like a string. More interestingly, this greek word translates directly as guts or intestines, or something made from these -- and in ancient Greece, strings of musical instruments (like the Greek Lyre shown below) and some bowstrings were made out of animal guts/sinew.**

**:As some literary evidence, for what its worth: one can find the following in the ancient greek story of the Iliad: "...he drew the bow, clutching at once the notched arrow and the string of ox's sinew" (from the Iliad 4 105-120, Murray)

Arabic astronomers then transcribe (not translate) the sanscrit word for bowstring ($jy\overline{a}$) essentially as jiba. Vowels are not always written in Arabic, and thus someone like $12^{th}$-century Gherardo of Cremona, who was translating an Arabic text on geometry into Latin, would have seen simply the Arabic equivalent to the letters "jb". As a consequence, Gherado failed to translate this word correctly, thinking it was another (i.e., جيب (jaib) -- which means "pocket".

In Latin, the word sinus means "curved or bent surface, fold, or hollow", has much in common with a "pocket".

Thus, sinus was the word to which Gherado translated to Latin the Arabic transcription of the Sanscrit version of "bowstring" (from India), which itself stemmed from the original Greek word for "chord". From there, one can easily see the final evolution into sine upon one more translation into English.

Whew! Isn't it amazing how we can trace the idea of the sine function from starting in one region of the world, and then moving to another, and another, and another still!

With regard to the origins of the word cosine, consider the following diagrams:

Clearly, $\theta_1$ and $\theta_2$ are complementary angles since their measures add to $90^{\circ}$ (or equivalently, their radian measures add to $\frac{\pi}{2}$).

Note that the two triangles containing angles marked in blue must be congruent as they are reflections of each other across the line $y=x$. The same can be said of the triangles containing angles marked in red. As such, we can see there is a relation between the sine and cosine of an angle $\theta_1$ and its complement $\theta_2$. Namely, $$\cos \theta_1 = \sin \theta_2 \quad \textrm{ and } \quad \sin \theta_1 = \cos \theta_2$$ Given complement to any angle $\theta$ (in radians) is $\frac{\pi}{2} - \theta$, we can equivalently say for any $\theta$ that

$$\textstyle{\cos \theta = \sin (\frac{\pi}{2} - \theta) \quad \textrm{ and } \quad \sin \theta = \cos (\frac{\pi}{2} - \theta)}$$

In this way, we see that the cosine of an angle is the sine of its complement. This important relationship is actually captured in the name of the cosine function. In Medieval Latin this function was expressed as complementi sinus (note the use of the word sinus discussed earlier). Around 1620, English mathematician Edmund Gunter abbreviated this with co.sinus -- which ultimately was contracted to the "cosine" we use today.

Recall, we defined the cosine and sine functions of an angle $\theta$ as the $x$ and $y$ coordinates of the point $(a,b)$ on the unit circle with $\arg(a+b) = \theta$. However, this is not the only route we could have taken to relate $\theta$, a point on the unit circle, and a unique pair of $x$ and $y$ values.

Rather than focusing on the coordinates of the point itself, perhaps we draw instead a tangent to the point in question and notice the $x$ and $y$ values where this tangent cuts through the two axes, as seen in the picture below at the points labeled $x$ and $y$.

We call the function that gives the $x$ coordinate where this tangent cuts through the $x$-axis the secant function, denoted by $\sec \theta$, noting that the Latin word secare means "to cut" into pieces, not unlike the modern word "section" (when used as a verb).

In a related way, we call the function that gives the $y$-coordinate where this tangent cuts through the $y$ axis the cosecant function, denoting this by $\csc \theta$, given that the relationship between the secant of an angle and its complement is similar to that of the sine of an angle and its complement. To see this, again note that under a reflection of the entire image above over the line $y=x$, the terminal side of the angle $\theta$ (i.e., the one labeled with unit length) would move to the same for the complement of $\theta$, and the red segment would fall on the $x$-axis, as the blue segment is now.

$$\textstyle{\sec \theta = \csc (\frac{\pi}{2} - \theta) \quad \textrm{ and } \quad \csc \theta = \sec (\frac{\pi}{2} - \theta)}$$

Alternatively, we can see this relation between complements upon discovering the relationship between the secant and cosine functions. Notice that one can easily argue $\triangle BZO$ above is similar to $\triangle ZPO$, upon which we have the proportion $\sec \theta = \frac{1}{c}$.

Recognizing the convenience of picking an angle in the first quadrant, which keeps $c = \cos \theta$ and $s = \sin \theta$ (note $c$ and $s$ are distances and thus always positive, but $\cos \theta$ and $\sin \theta$ are coordinates which can sometimes be negative.), we see that in this context the secant is simply the reciprocal of the cosine function.

In the same manner, using the similarity of $\triangle OZR$ and $\triangle ZPO$, we can argue $\csc \theta = \frac{1}{s}$. Nicely, the argument above easily extends to other quadrants, making this relationship true in general:

$$\sec \theta = \frac{1}{\cos \theta} \quad \textrm{ and } \quad \csc \theta = \frac{1}{\sin \theta}$$

Of course, drawing the tangent segment $\overline{RB}$ to the point $Z$ makes us wonder something. How long are the two segments $\overline{ZR}$ and $\overline{ZB}$ (shown below in orange and magenta, respectively) that comprise it?

Let us focus on $\overline{ZB}$ first. Note that $\triangle BZO$ must be similar to $\triangle ZPO$, so $ZB = \frac{s}{c}$, suggesting (at least for this quadrant) that $ZB$ is the quotient of the related sine and cosine values.

In other quadrants, we note that the quotient of the sine and cosine values can sometimes be negative (as they are both coordinates), but the segments produced there always have positive distance. Still, in these other quadrants similar arguments establish that the magnitudes of these two things always agree.

Using a similar argument involving $\triangle RZO$ and $\triangle ZPO$ establishes the length $ZR$ and the magnitude of $\cos \theta$ divided by $\sin \theta$ (i.e., the reciprocal of $ZB$).

As the lengths we are finding are both on the tangent line to the unit circle at the point corresponding to $\theta$, let us call the functions of $\theta$ that produce them the tangent and cotangent functions, denoted $\tan \theta$ and $\cot \theta$, defining these in the following way:

$$\tan \theta = \frac{\sin \theta}{\cos \theta} \quad \textrm{ and } \quad \cot \theta = \frac{\cos \theta}{\sin \theta}$$

The typical "co-function" relationship seen for other trigonometric function pairs (i.e., sine and cosine, secant and cosecant) holds for the two functions above as well. Again, think of reflecting the entire image above over the line $y=x$ to see this.

$$\textstyle{\tan \theta = \cot (\frac{\pi}{2} - \theta) \quad \textrm{ and } \quad \cot \theta = \tan (\frac{\pi}{2} - \theta)}$$