Probability Distributions > Bernoulli Distribution

## What is a Bernoulli Distribution?

A Bernouilli distribution is a discrete probability distribution for a Bernouilli trial — a random experiment that has only two outcomes (usually called a “Success” or a “Failure”). For example, the probability of getting a heads (a “success”) while flipping a coin is 0.5. The probability of “failure” is 1 – P (1 minus the probability of success, which also equals 0.5 for a coin toss). It is a special case of the binomial distribution for n = 1. In other words, it is a binomial distribution with a single trial (e.g. a single coin toss).

The probability of a failure is labeled on the x-axis as 0 and success is labeled as 1. In the following Bernoulli distribution, the probability of success (1) is 0.4, and the probability of failure (0) is 0.6:

The probability density function (pdf) for this distribution is p^{x} (1 – p)^{1 – x}, which can also be written as:

The expected value for a random variable, X, from a Bernoulli distribution is:

E[X] = p.

For example, if p = .04, then E[X] = 0.4.

The variance of a Bernoulli random variable is:

Var[X] = p(1 – p).

## What is a Bernoulli Trial?

A **Bernoulli trial** is one of the simplest experiments you can conduct in probability and statistics. It’s an experiment where you can have one of two possible outcomes. For example, “Yes” and “No” or “Heads” and “Tails.” A few more examples:

**Coin tosses**: record how many coins land heads up and how many land tails up.**Births**: how many boys are born and how many girls are born each day.**Rolling Dice**: the probability of a roll of two die resulting in a double six.

**success**and

**failure**. Success doesn’t mean success in the usual way — it just refers to an outcome you want to keep track of. For example, you might want to find out how many boys are born each day, so you call a boy birth a “success” and a girl birth a “failure.” In the dice rolling example, a double six die roll would be your “success” and everything else rolled would be considered a “failure.”

## Independence

An important part of every Bernoulli trial is that each action must be independent. That means the probabilities must remain the same throughout the trials; each event must be completely separate and have nothing to do with the previous event.

Winning a scratch off lottery is an independent event. Your odds of winning on one ticket are the same as winning on any other ticket. On the other hand, drawing lotto numbers is a dependent event. Lotto numbers come out of a ball (the numbers aren’t replaced) so the probability of successive numbers being picked depends upon how many balls are left; when there’s a hundred balls, the probability is 1/100 that any number will be picked, but when there are only ten balls left, the probability shoots up to 1/10. While it’s possible to find those probabilities, it isn’t a Bernoulli trial because the events (picking the numbers) are connected to each other.

The Bernouilli process leads to several probability distributions:

## Relation to the Binomial Distribution

The Bernoulli distribution is closely related to the Binomial distribution. As long as each individual Bernoulli trial is independent, then the number of successes in a series of Bernoulli trails has a Binomial Distribution. The Bernoulli distribution can also be defined as the Binomial distribution with n = 1.

## Use in Epedemiology

In experiments and clinical trials, the Bernoulli distribution is sometimes used to model a single individual experiencing an event like death, a disease, or disease exposure. The model is an excellent indicator of the probability a person has the event in question.

- 1 = “event” (P = p)
- 0 = “non event” (P = 1 – p)

Bernoulli distributions are used in logistic regression to model disease occurrence.

**Reference:**

WSU. Retrieved Feb 15, 2016 from: www.stat.washington.edu/peter/341/Hypergeometric%20and%20binomial.pdf

If you prefer an online interactive environment to learn R and statistics, this *free R Tutorial by Datacamp* is a great way to get started. If you're are somewhat comfortable with R and are interested in going deeper into Statistics, try *this Statistics with R track*.

*Facebook page*and I'll do my best to help!

THANKS.