numpy
¶Plotting data in NumPy and pandas is handled by an external Python module called matplotlib. Like NumPy and pandas it is a large library and has been around for a while (first released in 2003). Hence we won't cover all its functionality in this lesson.
To see the wide range of possibilities you have with matplotlib see matplotlib example gallery, Nicolas P. Rougier's tutorial and Ben Root's tutorial after the course.
Here we will cover the basic uses of it and how it integrates with NumPy and in a later episode with pandas. While working through these examples you may want to refer to the matplotlib documentation.
The most common interface to matplotlib is its pyplot
module which provides a way to affect the current state of matplotlib directly. As with both NumPy and pandas, there is a conventional way to import matplot lib, which is as follows:
%config InlineBackend.figure_format = 'svg'
import numpy as np
import matplotlib.pyplot as plt
It's very useful to be able to quickly plot any data we have in front of us. As we will see on the next page, matplotlib's power comes from its configurability.
You may recall that in the "Introduction to Python" course, we met the sinc function. NumPy has its own implementation of the sinc function, which for those interested is defined as: $$ \text{sinc}(x):= \begin{cases} \frac{\sin(\pi x)}{\pi x} &\text{ if } x\neq{0}\\ 1 & \text{ if } x = 0 \end{cases} $$ This function often crops up in signal processing applications.
Beyond the abstract mathematical definition, we can use what we have learnt in Python to find out more about sinc.
First we pick 1001 points on the x-axis between -5 and 5 using the linspace
function. Then we can evaluate the sinc
function at all 1001 points (recall that we can perform bulk operations on NumPy arrays).
x = np.linspace(-5, 5, 1000)
# sinc(x) is defined to be sin(pi*x)/(pi*x) in numpy
y_sinc = np.sinc(x)
Now x
contains an array of x-coordinates and y_sinc
contains an array of y-coordinates. We can plot y against x by calling matplotlib's plot
function:
plt.plot(x, y_sinc)
You can see that it has plotted the sinc function between -5 and +5.
If you have ever drawn a graph before, you can probably remember someone shouting:
"But you haven't labelled your axes! And there's no title!"
We will go over this functionality now. Note that any functions called on the plt
object will affect the state of the matplotlib plot from that point on in the script, until the show
method is called.
Firstly, we can add some additional information to the line that we plot, by passing the keyword argument label
with a string that describes the line that is being drawn. Next, we call the title
function to set a title for the plot. The legend
function then causes the label that we set first to be displayed, by default, matplotlib will try and find the best position for the legend. Finally, calling the show
function displays the plot so we can see it.
plt.plot(x, y_sinc, label='sinc(x)')
# Set title and legend, then show plot
plt.title('The sinc function')
plt.legend()
plt.show()
Looking at our new plot, it seems better, but we have still not labelled our axes. We can do this by calling xlabel
and ylabel
to set the x and y labels respectively. We can also change the location of the legend if we didn't like the position that matplotlib chose for us, using the loc
keyword argument.
plt.plot(x, y_sinc, label='sinc(x)')
# Set title and legend, then show plot
plt.title('The sinc function')
plt.legend(loc='upper left')
plt.xlabel('x')
plt.ylabel('f(x)')
plt.show()
The resultant graph is now suitable for use as a diagram or in a paper. We can customise it further as we will see in the next part, but for now we will look at some other plots.