# Dispersion in Statistics: Definition

Statistics Definitions > Dispersion

## What is Dispersion?

Dispersion in statistics is a way of describing how spread out a set of data is. When a data set has a large value, the values in the set are widely scattered; when it is small the items in the set are tightly clustered. Very basically, this set of data has a small value:
1, 2, 2, 3, 3, 4
…and this set has a wider one:
0, 1, 20, 30, 40, 100

The spread of a data set can be described by a range of descriptive statistics including variance, standard deviation, and interquartile range. Spread can also be shown in graphs: dot plots, boxplots, and stem and leaf plots have a greater distance with samples that have a larger dispersion and vice versa.

The larger the box, the more dispersion in a set of data. Image: Seton Hall University

## Measures of Dispersion.

In some processes, like manufacturing or measurement, low dispersion is associated with high precision. High dispersion is associated with low precision.

## Measures of Dispersion: Example

Let’s say you were asked to compare measures of dispersion for two data sets. Data set A has the items 97,98,99,100,101,102,103 and data set B has items 70,80,90,100,110,120,130. By looking at the data sets you can probably tell that the means and medians are the same (100) which technically are called “measures of central tendency” in statistics.

However, the range (which gives you an idea of how spread out the entire set of data is) is much larger for data set B (60) when compared to data set A (6). In fact, nearly all measures of dispersion would be ten times greater for data set B, which makes sense as the range is ten times larger. For example, take a look at the standard deviations for the two data sets:
Standard deviation for A: 2.160246899469287.
Standard deviation for B: 21.602468994692867.
The figure for data set B is exactly ten times that of A.

Warning: When using a calculator (or a formula), check to make sure you are using the correct setting (or formula) for your data. Many measures of dispersion (like the variance) have two different formulas, one for a population and one for a sample. If you aren’t sure if you have a sample or a population, read these articles:
What is a population in statistics?
Sample in statistics: What it is, how to find it.

Check out our statistics YouTube channel. Hundreds of basic videos for an array of elementary statistics topics.

------------------------------------------------------------------------------

If you prefer an online interactive environment to learn R and statistics, this free R Tutorial by Datacamp is a great way to get started. If you're are somewhat comfortable with R and are interested in going deeper into Statistics, try this Statistics with R track.

# 11 thoughts on “Dispersion in Statistics: Definition”

1. braydon cook

I just wanted to say I just discovered this website and i feel it will help me with my grade 12 data management course. my teacher isnt the greatest and i will share this website with the rest of my class!

2. Shikha

Can you explain the importance of dispersion in a context by taking some example where one data set has high dispersion and the other has low.

3. Andale Post author

Shikha,
Can you be a little more specific about what you mean by “importance”? A high dispersion isn’t more “important” than a low dispersion set, unless you’re wanting a very specific context (for example, if you’re looking for drug efficacy, you’d want the data to be dispersed tightly around the sweet spot of a cure!).
Thanks.

4. Andale Post author

Dispersion isn’t really a “method”, it just gives you an idea of how spread out any set of data is. That could be very useful depending on what you’re studying. For example, a neighborhood that has a very tightly spread per-capita income is going to be more homogenous than one with a set that’s tightly clustered around a point.

5. Cliff Doss

Does standard error assist you in determining which variable has more dispersion? If so how?