What is a Normal Probability Plot?
When you have a set of data that you think might have a normal distribution (i.e. a bell curve), a graph of your data can help you decide whether or not it is normal. A histogram is one option, but there is a more specialized type of plot you can create, called a normal probability plot. A normal probability plot graphs z-scores (normal scores) against your data set.
If the points fall along a straight, diagonal line, your data is approximately normally distributed. If the line is skewed to the left or right (that is, the points bend away from a straight line), your data is not normally distributed.
What is a Normal Probability Plot used for?
It can be easy to see with a histogram how data fits the norm, or skews from the norm.
With a normal probability plot, it can be easier to see individual data items that don’t quite fit a normal distribution. For example, if the upper right data item is clearly out of line with the rest of the data, that item doesn’t fit with a normal distribution.
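One rough way to put a number on "out of line" is to compute the correlation between the sorted data and their normal scores: the closer it is to 1, the straighter the plot. This is only a sketch, not something the plot itself requires. The function name `straightness`, the use of a correlation coefficient, and both data sets are assumptions made up for illustration; the normal scores use the (i - 0.375)/(n + 0.25) formula from the drawing steps in this article.

```python
from math import sqrt
from statistics import NormalDist, mean

def straightness(data):
    """Pearson correlation between the sorted data and their normal scores.

    Hypothetical helper for illustration: values near 1 suggest the points
    on a normal probability plot lie close to a straight line.
    """
    xs = sorted(data)
    n = len(xs)
    std_normal = NormalDist()  # standard normal: mean 0, standard deviation 1
    # Normal score for position i (1-based) in the ordered list
    zs = [std_normal.inv_cdf((i - 0.375) / (n + 0.25)) for i in range(1, n + 1)]
    mx, mz = mean(xs), mean(zs)
    sxz = sum((x - mx) * (z - mz) for x, z in zip(xs, zs))
    sxx = sum((x - mx) ** 2 for x in xs)
    szz = sum((z - mz) ** 2 for z in zs)
    return sxz / sqrt(sxx * szz)

# Made-up values, roughly symmetric around 10
clean = [9.6, 9.8, 9.9, 10.0, 10.1, 10.2, 10.4]
# Same values, but the largest one is replaced by an obvious outlier
with_outlier = clean[:-1] + [14.0]

print(round(straightness(clean), 3))
print(round(straightness(with_outlier), 3))
```

The second value comes out noticeably lower than the first, which matches what you would see on the plot: a single upper right point pulling away from the line.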
How to Draw a Normal Probability Plot
- Arrange your x-values in ascending order.
- Calculate f_i = (i - 0.375)/(n + 0.25), where i is the position of the data value in the ordered list and n is the number of observations.
- Find the z-score for each f_i (the z-value with an area of f_i to its left under the standard normal curve).
- Plot your x-values on the horizontal axis and the corresponding z-scores on the vertical axis.
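The steps above can be sketched in a few lines of Python using only the standard library. This is a minimal illustration, not the only way to do it: the function name `normal_scores` and the sample values are made up, and the z-score is found with `NormalDist().inv_cdf`, which returns the z-value with a given area to its left.

```python
from statistics import NormalDist

def normal_scores(data):
    """Return (sorted x-values, z-scores) for a normal probability plot."""
    xs = sorted(data)                        # Step 1: ascending order
    n = len(xs)                              # number of observations
    std_normal = NormalDist()                # mean 0, standard deviation 1
    zs = []
    for i in range(1, n + 1):                # i is the 1-based position in the list
        f_i = (i - 0.375) / (n + 0.25)       # Step 2: value of f_i for position i
        zs.append(std_normal.inv_cdf(f_i))   # Step 3: z-score for f_i
    return xs, zs

sample = [4.1, 5.3, 4.8, 6.0, 5.1]  # made-up values, for illustration only
xs, zs = normal_scores(sample)

# Step 4: each (x, z) pair is one point, x on the horizontal axis, z on the vertical
for x, z in zip(xs, zs):
    print(f"{x:5.1f}  {z:+.3f}")
```

With an odd number of observations, the middle value always gets a z-score of 0, and the z-scores are symmetric around it. The pairs can then be handed to any charting tool as a scatter plot.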
Normal probability plots aren’t normally drawn by hand, because the normal scores used for the plot can’t be looked up directly in a standard z-table (you would have to read the table in reverse for every data point). That’s why using technology like Minitab or SPSS is a good idea for these types of graphs. You can also use Excel to create a simple normal probability plot.
Note: It’s best to make a histogram of your data to make sure it’s normally distributed before you make a normal probability plot. That’s because it’s easier to see a bell curve on a histogram than it is to gauge whether or not your data falls on a straight line (or almost straight line).
A normal probability plot is one way you can tell if data fits a normal distribution (a bell curve). With this type of graph, z-scores are plotted against your data set. A straight line in a normal probability plot indicates that your data fits a normal probability distribution. A skewed line means that your data is not normal (“not normal” in this sense means that it doesn’t fit a bell curve). The steps below show how to create a normal probability plot in Minitab.
How to create a normal probability plot in Minitab
Step 1: Type your data into columns in a Minitab worksheet. Give your variables meaningful names in the first (blank) row (this makes it easier to build the plot when you select a variable name in Step 4).
Step 2: Click “Graph” on the toolbar and then click “Probability plot.”
Step 3: Click the “Single” probability plot image. This is the option you’re likely to use 99% of the time in elementary statistics.
Step 4: Choose a variable name and then click “Select” to move the variable name to the Graph Variables box. If you didn’t name your variables in Step 1, the variable names will be listed as column identifiers (C1, C2 etc.).
Step 5: Click “OK.” Minitab will create a normal probability graph in a new window.
Tip: Make a histogram in Minitab to see how well your data fits a normal distribution. Often a normal probability plot will appear to be fairly straight, but the data might not be a great match to a bell curve. Checking the histogram first will allow you to see if your data fits a bell curve before you make assumptions about your data using the normal probability plot.