Statistics How To

Q Q Plots: Simple Definition & Example

Descriptive Statistics: Definition & Charts and Graphs > Q Q plots

You may want to read this article first: What is a Quantile?

What is a Q Q Plot?

Q Q Plots (Quantile-Quantile plots) are plots of two quantiles against each other. A quantile is a fraction where certain values fall below that quantile. For example, the median is a quantile where 50% of the data fall below that point and 50% lie above it. The purpose of Q Q plots is to find out if two sets of data come from the same distribution. A 45 degree angle is plotted on the Q Q plot; if the two data sets come from a common distribution, the points will fall on that reference line.

q q plots

A Q Q plot showing the 45 degree reference line. Image: skbkekas|Wikimedia Commons.


The image above shows quantiles from a theoretical normal distribution on the horizontal axis. It’s being compared to a set of data on the y-axis. This particular type of Q Q plot is called a normal quantile-quantile (QQ) plot. The points are not clustered on the 45 degree line, and in fact follow a curve, suggesting that the sample data is not normally distributed.

How to Make a Q Q Plot

Sample question: Do the following values come from a normal distribution?
7.19, 6.31, 5.89, 4.5, 3.77, 4.25, 5.19, 5.79, 6.79.

Step 1: Order the items from smallest to largest.

  • 3.77
  • 4.25
  • 4.50
  • 5.19
  • 5.89
  • 5.79
  • 6.31
  • 6.79
  • 7.19

Step 2: Draw a normal distribution curve. Divide the curve into n+1 segments. We have 9 values, so divide the curve into 10 equally-sized areas. For this example, each segment is 10% of the area (because 100% / 10 = 10%).



Step 3: Find the z-value (cut-off point) for each segment in Step 3. These segments are areas, so refer to a z-table (or use software) to get a z-value for each segment.
The z-values are:

  • 10% = -1.28
  • 20% = -0.84
  • 30% = -0.52
  • 40% = -0.25
  • 50% = 0
  • 60% = 0.25
  • 70% = 0.52
  • 80% = 0.84
  • 90% = 1.28
  • 100% = 3.0

A few of the z-values plotted on the graph.



Step 4: Plot your data set values (Step 1) against your normal distribution cut-off points (Step 3). I used Open Office for this chart:
how to make a q q plot

The (almost) straight line on this q q plot indicates the data is approximately normal.




Note: This example used the standard normal distribution, but if think your data could have come from a different normal distribution (i.e. one with a different mean and standard deviation) then you could use that instead.

Q Q Plots and the Assumption of Normality

The assumption of normality is an important assumption for many statistical tests; you assume you are sampling from a normally distributed population. The normal Q Q plot is one way to assess normality. However, you don’t have to use the normal distribution as a comparison for your data; you can use any continuous distribution as a comparison (for example a Weibull distribution or a uniform distribution), as long as you can calculate the quantiles. In fact, a common procedure is to test out several different distributions with the Q Q plot to see if one fits your data well.

Check out our YouTube channel for hundreds of elementary stats and probability videos!

------------------------------------------------------------------------------

If you prefer an online interactive environment to learn R and statistics, this free R Tutorial by Datacamp is a great way to get started. If you're are somewhat comfortable with R and are interested in going deeper into Statistics, try this Statistics with R track.

Comments are now closed for this post. Need help or want to post a correction? Please post a comment on our Facebook page and I'll do my best to help!
Q Q Plots: Simple Definition & Example was last modified: October 12th, 2017 by Stephanie Glen

2 thoughts on “Q Q Plots: Simple Definition & Example

  1. lokta

    It will be nice if you left a “real example” showing two sets of data and how to deduce all the quantiles used and then the plot. Thanks.