Probability and Statistics > Normal Distributions

**Contents**:

- What is a Normal distribution?
- The Standard Normal Model
- Normal Distribution Word Problems.
- Normal Distribution on the TI 89 Examples
- What is a Gaussian Distribution?
- Related Articles.

## What is a Normal distribution?

A normal distribution, sometimes called the bell curve, is a distribution that occurs naturally in many situations. For example, the bell curve is seen in tests like the SAT and GRE. The bulk of students will score the average (C), while smaller numbers of students will score a B or D. An even smaller percentage of students score an F or an A. This creates a distribution that resembles a bell (hence the nickname). The bell curve is symmetrical. Half of the data will fall to the left of the mean; half will fall to the right.

Many groups follow this type of pattern. That’s why it’s widely used in business, statistics and in government bodies like the FDA:

- Heights of people.
- Measurement errors.
- Blood pressure.
- Points on a test.
- IQ scores.
- Salaries.

The empirical rule tells you what percentage of your data falls within a certain number of standard deviations from the mean:

• 68% of the data falls within one standard deviation of the mean.

• 95% of the data falls within two standard deviations of the mean.

• 99.7% of the data falls within three standard deviations of the mean.

The standard deviation controls the spread of the distribution. A smaller standard deviation indicates that the data is tightly clustered around the mean; the normal distribution will be taller. A larger standard deviation indicates that the data is spread out around the mean; the normal distribution will be flatter and wider.

## Properties of a normal distribution

- The mean, mode and median are all equal.
- The curve is symmetric at the center (i.e. around the mean, μ).
- Exactly half of the values are to the left of center and exactly half the values are to the right.
- The total area under the curve is 1.

The Standard Normal Model

A standard normal model is a normal distribution with a mean of 1 and a standard deviation of 1.

## Standard Normal Model: Distribution of Data

One way of figuring out how data are distributed is to plot them in a graph. If the data is evenly distributed, you may come up with a **bell curve**. A bell curve has a small percentage of the points on both tails and the bigger percentage on the inner part of the curve. In the **standard normal model**, about 5 percent of your data would fall into the “tails” (colored darker orange in the image below) and 90 percent will be in between. For example, for test scores of students, the normal distribution would show 2.5 percent of students getting *very* low scores and 2.5 percent getting *very* high scores. The rest will be in the middle; not too high or too low. The shape of the standard normal distribution looks like this:

### Practical Applications of the Standard Normal Model

The standard normal distribution could help you figure out which subject you are getting good grades in and which subjects you have to exert more effort into due to low scoring percentages. Once you get a score in one subject that is higher than your score in another subject, you might think that you are better in the subject where you got the higher score. This is not always true.

You can only say that you are better in a particular subject if you get a score with a certain number of standard deviations above the mean. The standard deviation tells you how tightly your data is clustered around the mean; It allows you to compare different distributions that have different types of data — including different means.

For example, if you get a score of 90 in Math and 95 in English, you might think that you are better in English than in Math. However, in Math, your score is 2 standard deviations above the mean. In English, it’s only one standard deviation above the mean. It tells you that in Math, your score is far higher than most of the students (your score falls into the tail).

Based on this data, you actually performed better in Math than in English!

## Probability Questions using the Standard Model

Questions about standard normal distribution probability can *look* alarming but the key to solving them is understanding what the area under a standard normal curve represents. The total area under a standard normal distribution curve is 100% (that’s “1” as a decimal). For example, the left half of the curve is 50%, or .5. So the probability of a random variable appearing in the left half of the curve is .5.

Of course, not all problems are quite *that* simple, which is why there’s a z-table. All a z-table does is measure those probabilities (i.e. 50%) and put them in standard deviations from the mean. The mean is in the center of the standard normal distribution, and a probability of 50% equals zero standard deviations.

## Standard normal distribution: How to Find Probability (Steps)

**Step 1:** Draw a bell curve and shade in the area that is asked for in the question. The example below shows z >-0.8. That means you are looking for the probability that z is greater than -0.8, so you need to draw a vertical line at -0.8 standard deviations from the mean and shade everything that’s greater than that number.

**Step 2:** Visit the normal probability area index and find a picture that looks like your graph. Follow the instructions on that page to find the z-value for the graph. The z-value *is *the probability.

**Tip: **Step 1 is technically optional, but it’s *always* a good idea to sketch a graph when you’re trying to answer probability word problems. That’s because most mistakes happen not because you can’t do the math or read a z-table, but because you subtract a z-score instead of adding (i.e. you imagine the probability under the curve in the wrong direction. A sketch helps you cement in your head exactly what you are looking for.

## Normal Distribution Word Problems

This video shows one example of a normal distribution word problem. For more examples, read on below:

When you tackle normal distribution in a statistics class, you’re trying to find the area under the curve. The total area is 100% (as a decimal, that’s 1). ** Normal distribution problems** come in **six **basic types. How do you know that a word problem involves normal distribution? Look for the key phrase “*assume the variable is normally distributed*” or “*assume the variable is approximately normal*.” To solve a word problem you need to figure out which type you have.

- “Between”: Contain the phrase “between” and includes an upper and lower limit (i.e. “find the number of houses priced between $50K and 200K”).
- “More Than” or “Above”: contain the phrase “more than” or “above”.
- “Less Than”.
- Lower Cut Off Example (video)
- Upper Cut Off Example (video)
- Middle Percent Example (video)

## 1. “Between”

This how-to covers solving problems that contain the phrase “between” and includes an upper and lower limit (i.e. “find the number of houses priced between $50K and 200K”. Note that this is different from finding the “middle percentage” of something.

## Word problems with normal distribution: “Between”: Steps

**Step 1:** *Identify the parts of the word problem*. The word problem will identify:

- The mean (average or μ).
- Standard deviation (σ).
- Number selected (i.e. “choose one at random” or “select ten at random”).
- X: the numbers associated with “between” (i.e. “between $5,000 and $10,000” would have X as 5,000 and as $10,000).

In addition, you will be given EITHER:

- Sample size (i.e. 400 houses, 33 people, 99 factories, 378 plumbers etc.). OR
- You might be asked for a probability (in which case your sample size will most likely be everyone, i.e. “Journeyman plumbers” or “First year pilots.”

**Step 2:** *Draw a graph*. Put the mean you identified in Step 1 in the center. Put the number associated with “between” on the graph (take a guess at where the numbers would fall–it doesn’t have to be exact). For example, if your mean was $100, and you were asked for “hourly wages between $75 and $125”) your graph will look something like this:

**Step 3:***Figure out the z-scores*. Plug the first X value (in my graph above, it’s 75) into the z value formula and solve. The μ (the mean), is 100 from the sample graph. You can get these figures (including σ, the standard deviation) from your answers in step 1 :

- *Note: if the formula confuses you, all this formula is asking you to do is:
- subtract the mean from X
- divide by the standard deviation.

* Step 4: Repeat step 3 for the second X*.

**Step 5:** *Take the numbers from step 3 and 4 and use them to find the area in the **z-table**.*

If you were asked to find a probability in your question, go to step 6a. If you were asked to find a number from a specific given sample size, go to step 6b.

**Step 6a:**

*Convert the answer from step 5 into a percentage. *

- For example, 0.1293 is 12.93%.

That’s it–skip step 6b!

**Step 6b**

*Multiply the sample size (found in step 1) by the z-value you found in step 4*. For example, 0.300 * 100 = 30.

That’s it!

## 2. “More Than” or “Above”

This how-to covers solving normal distribution problems that contain the phrase “**more than**” (or a phrase like “above”).

**Step 1:** *Break up the word problem into parts. Find:*

- The mean (average or μ)
- Standard deviation (σ)
- A number (for example, “choose fifty at random” or “select 90 at random”)
- X: the number associated with the “less than” statement. For example, if you were asked to find “under $9,999” then X is 9,999.

**Step 2:** Find the sample from the problem. You’ll have either a specific size (like “1000 televisions”) or a general sample (“Every television”).

*Draw a picture if the problem with the mean and the area you are looking for. *For example, if the mean is $15, and you were asked to find what dinners cost more than $10, your graph might look like this:

**Step 3:** *Calculate the z-score* (plug your values into the z value formula and solve). Use your answers from step 1 :

Basically, all you are doing with the formula is subtracting the mean from X and then dividing that answer by the standard deviation.

**Step 4:** *Find the area using the z-score from step 3. Use the **z-table**.* Not sure how to read a z-table? See Step 1 of this post for an example: Area under a curve.).

**Step 6:** *Go to Step 6a to find a probability OR go to step 6b to calculate a certain number or amount.*

**Step 6a**

*Turn step 5’s answer into a percentage.*

- For example, 0.1293 is 12.93%.

Skip step 6b: you’re done!

**Step 6b**

*Multiply the sample size from Step 1 by the z-score from step 4*. For example, 0.500 * 100 = 50.

You’re done!

## 3. Less Than

This how-to covers solving **normal distribution word problems** that have the phrase “**less than**” (or a similar phrase such as “fewer than”).

## Normal distribution word problems less than: Steps

**Step 1:** *Break up the word problem into parts*:

- The mean (average or μ)
- Standard deviation (σ)
- Number selected (i.e. “choose one at random” or “select ten at random”)
- X: the number that goes with “less than” (i.e. “under $99,000” would list X as 99,000)

Plus, you will have EITHER:

- A specific sample size. For example, 500 boats, 250 sandwiches, 100 televisions etc.
- Everyone in the sample (you’ll be asked to find a probability). For example “first year medical students,” “Cancer patients” or “Airline pilots.”

**Step 2:** *Draw a picture* to help you visualize the problem. The following graph shows a mean of 15, and an area “under 4”):

**Step 3:** *Find the z value* by plugging the given values into the formula. The “X” in our sample graph is 4, and the μ (or mean) is 15. You can get these figures (including σ, the standard deviation) from your answers in step 1, where you identified the parts of the problem:

All you have to do to solve the formula is:

- Subtract the mean from X.
- Divide by the standard deviation.

**Step 4:** *Take the number from step 3, then use the **z-table* to find the area.

**Step 5:***To find a probability, go to step 6a. To find a number from a specific given sample size, go to step 6b.*

**Step 6a**

*Change the number from step 5 into percentage. *

- For example, 0.1293 is 12.93%.

That’s it!

**Step 6b**

*Multiply the sample size (found in step 1) by the z-value you found in step 4*. For example, 0.300 * 100 = 30.

That’s it!

## 4. Lower Cut Off

Sometimes on **a normal distribution word problem** you’ll be asked to find a “lower limit of an upper* percentage*” of something (i.e. “find the cut-off point to pass a certain exam where the upper 40% of test takers pass”). A lower cut off point is the point where scores will fall below that point. For example, you might want to find where the cut off point is for the bottom 10% of test takers.

Check out our YouTube channel for more worked problems.

## Normal Distribution TI 89 Examples

In **elementary statistics**, you’ll often be faced with a question that asks you the cut off points for a certain percentage of the normal distribution, like the top 90% or the top 10%. While working out these types of problems by hand is cumbersome, the **TI-89 graphing calculator** makes light work of finding cut off points for a top percentage with the **Inverse Normal** function. What you’re actually doing is looking for the cut off points for a certain percentile: for example, if you have a list of grades and you want to know what score is at the 99th percentile, you can use the inverse normal function to find that percentage cut-off point.

## 1. Finding Cut Off Points For a Top Percentage

**Sample problem**: Students at a certain college average 5 feet 8 inches (68 inches) tall. The heights are normally distributed, with a standard deviation of 2.5 inches. What is the value that separates the top 1% of heights from the rest of the population?

**Step 1:** Press APPS and use the scroll keys to highlight **Stats/List Editor**.

**Step 2:** Press F5 2 1 (this gets you to the Inverse Normal screen).

**Step 3:** Enter 0.99 in the **Area** box.

**Step 4:** Enter 68 in the **μ** box.

**Step 5:** Enter 2.5 in the **σ** box.

**Step 6:** Press ENTER.

**Step 7:** Read the results: **Inverse=73.8159** means that the cut off height for the 99th percentile is **73.8159 inches**.

That’s it!

## 2. Probability Proportion Example (NormalCDF function)

**Sample question**: A group of students with normally distributed salaries earn an average of $6,800 with a standard deviation of $2,500. What proportion of students earn between $6,500 and $7,300?

**Step 1:** Press APPS. Scroll to the **Stats/List Editor** and press ENTER.

**Step 2:** Press F5 4.

**Step 3:** Enter 6500 in the **lower value** box.

**Step 4:** Enter 7300 in the **upper value** box.

**Step 5:** Enter 6800 in the **μ** box.

**Step 6:** Enter 2500 in the **σ** box. Press ENTER.

**Step 7:** Read the answer. **Cdf=.127018**. In other words, .013, or 13% of students earn between $6,500 and $7,300.

## TI-89 Graphing a Normal Distribution Curve

The **TI-89** can not only calculate **z-scores** and return values for normal distributions, it can graph the normal distribution curve as well. Graphing a normal distribution can help you see what it is you are supposed to be looking for, and gives you one more tool in solving normal distribution problems. The TI-89 can graph a normal distribution curve with an area shaded for any value. For example, you could create a graph that is: less than a certain number, greater than a certain number, or in-between a certain set of numbers.

**Sample problem**: Draw a normal distribution curve for student salaries during a typical semester. The student salaries have a mean (average) of $6,800 and standard deviation of $2,500. Shade the area on the graph that corresponds to salaries between $7,300 and $9,000.

**Step 1:** Press APPS and select the **Stats/List Editor**.

**Step 2:** Press F2 3 and F2 4.

**Step 3:** Press F5 ) 1.

**Step 4:** Scroll down and enter 7300 in the **lower value** box.

**Step 5:** Scroll down and enter 9000 in the **upper value** box.

**Step 6:** Scroll down and enter 6800 in the **μ** box.

**Step 7:** Scroll down and enter 2500 in the **σ** box.

**Step 8:** Scroll down. Turn **Auto Scale** to “yes” by pressing the right scroll key, then the down scroll key to select yes. Press ENTER.

*That’s it!*

**Tip**: If you want to enter ∞ (infinity) as one of your lower or upper values, press the diamond key and then Catalog.

## What is a Gaussian Distribution?

Gaussian Distribution is another name for a normal distribution.

- In statistics, the normal distribution is called the normal curve.
- In the social sciences, it’s called the bell curve (because of it’s shape).
- In physics, it’s called the Gaussian distribution.

## Why the Different Names for the same Distribution?

Although de Moivre first described the normal distribution as an approximation to the binomial, Carl Friedrich Gauss used it in 1809 for the analysis of astronomical data on positions, hence the term Gaussian distribution.

## A Family of Curves

The Gaussian distribution is a continuous family of curves, all shaped like a bell. In other words, there are endless possibilities for the number of possible distributions, given the limitless possibilities for standard deviation measurements (which could be from 0 to infinity). The standard Gaussian distribution has a mean of 0 and a standard deviation of 1. The larger the standard deviation, the flatter the curve. The smaller the standard deviation, the higher the peak of the curve.

## What is a Gaussian Distribution Function?

A Gaussian distribution function can be used to describe physical events if the number of events is very large (see: Central Limit Theorem(CLT)). In simple terms, the CLT says that while you may not be able to predict what one item will do, if you have a whole ton of items, you can predict what they will do as a whole. For example, if you have a jar of gas at a constant temperature, the Gaussian distribution will enable you to figure out the probability that one particle will move at a certain velocity.

- Approximately 68% of events fall within one standard deviation of the mean.
- 95% fall within two standard deviations of the mean.
- 99% fall within three standard deviations from the mean .

For more info, see: the 68 95 99.7 rule.

The Gaussian distribution approximates the binomial and has a mean of n*p, where n is the number of events and p is the probability of any integer x (see: mean of the binomial distribution).

## Related Articles

- What is the 68-95-99.7 Rule?
- Gaussian Mixture Models.
- What is a Normal Probability Plot?
- How to Calculate a Z-Score in Statistics
- Find the area to the right of a z score.
- Using the Normal Approximation to solve a Binomial Problem
- What is the continuity correction factor?
- Area Under a Normal Distribution Curve Index
- Central Limit Theorem.
- The Skew Normal Distribution.
- Two Tailed Normal Curve.
- The Q Function.

If you prefer an online interactive environment to learn R and statistics, this *free R Tutorial by Datacamp* is a great way to get started. If you're are somewhat comfortable with R and are interested in going deeper into Statistics, try *this Statistics with R track*.

*Facebook page*and I'll do my best to help!

How would you find the summation of b(50, i, .3) for i= 11 to 50

Hello,

How can I take my data and create a standard divination curve? What part of the data do I use to find the points on the curve?

Thanks,

-Olivia

I’m not exactly sure what you mean by standard divination curve?

Hi,

How to find the problems where percentage is given and I’m asked to find the numbers? For example: If the professor wants to give grade A to the 15% of the students, grade B to the next 35% of the students, grade C to the next 30% of the students and grades D to the rest, what are the score boundaries for each grade?

To find the z-score at that boundary, look in the table for the top 15%, the next 35% etc. For example, an area of .35 in the right hand z-table would give you the z-score cutting off the top 15%.

I just need her with this problem:

The standard deviation of a normal distribution is 12 and 90% of the values are greater than 6. What is the value of the mean?

Professor, I genuinely love you for your explanations!

If 90% of the values are above the mean, that puts the cutoff at about -1.24 (from the z-table). That gives you the number of standard deviations left of the mean for your 90% cutoff (i.e. at 6). Note: You’ll get a slightly different z-value if you use a calculator. You should be able to do the arithmetic from there :)

I really doubt that salaries follow a normal distribution, am I wrong?

Salaries are said to have a normal distribution at the start of this article

Hi, do u happen to know on how to solve this question? A staff in a cafe found that duration a customer spends in the cafe per visit is normally distributed with mean 10k minutes and standard deviation k minutes. If 2 customers are selected randomly, find the probability that

(1) the first customer spends 20% more time than the second customer

(2) the total time of two customers is more than 18k minutes

Can you give me an idea of where you get stuck?

It would be really helpful if you solve and explain tis questin to me it says in a nor al distribution 30% of the items are under 45 and 8% are under 34. Find the mean and the standard deviation of the disribution?

I hope that you see this soon i would be really greatful to you.

You can take those two percentages and look in the z-table for a difference in z-scores (which are really just standard deviations). I would start there. You’re going to end up applying the z score formula to find the mean.

Can you please help me with this one?

Now assume the 229 sample to be random, irrespective of your response to a) and consider the following. The report shows that 37.1% of the respondents show the impact quantitatively. You know that this is US data, but you happen to have data on 24 Canadian companies, and you see that 7 of those companies show the impact quantitatively. Do you have reason to doubt the Canadian companies can be considered part of the same population as the survey respondents? Show your calculations and assume a cut-off point of 5%.

So what you have is:

n = 229

37.1% show impact (US)

7/24 show impact (CAN)

cut off = 5%

Are you given any other information in part a/…like the distribution of the data?

On your website you state:

• 68% of the data falls within one standard deviation of the mean.

• 95% of the data falls within two standard deviations of the mean.

• 99.7% of the date falls within three standard deviations of the mean.

The last line should read data and not date.

Thanks for the correction, George. It’s fixed.

how would you do this type of question?

A machine fills pop bottles with a standard deviation of 25 mL no matter what the setting is for the mean. At how many mililiters should the mean be set, so that 90% of the bottles will contain at least 500 mL?

Thanks so much

hie could you please help me with this question

A new calibration baseline has recently been installed. To accurately determine its length, a precise laser measuring device was set up, and 500 repeated observations were taken of the baseline. The observations follow a normal distribution with a mean of 125.0022 m and a standard deviation of 0.0011 m.

a]Approximately how many more observations would you need in order to have 400 of all observations to fall within +/- 0.001 m of the mean?

The mean is unknown, so I would use the z-score formula to solve for the mean.

Have a need a formula for Excel 2013 that will select data (stock returns) that are arranged in a one column many row array using a random normal distribution.

THK’s