Statistics Definitions > Goodness of Fit Tests

The goodness of fit test is used to test if sample data fits a distribution from a certain population (i.e. a population with a normal distribution or one with a Weibull distribution). In other words, it tells you if your sample data represents the data you would expect to find in the actual population. Goodness of fit tests commonly used in statistics are:

## The Chi Square Goodness of Fit Test

The chi-square test is the most common of the goodness of fit tests and is the one you’ll come across in AP statistics or elementary statistics. The chi square can be used for discrete distributions like the binomial distribution and the Poisson distribution, while the The Kolmogorov-Smirnov and Anderson-Darling goodness of fit tests can only be used for continuous distributions.

Two potential disadvantages of chi square are:

- The chi square test can only be used for data put into classes (bins). If you have non-binned data you’ll need to make a frequency table or histogram before performing the test.
- Another disadvantage of the chi-square test is that it requires a sufficient sample size in order for the chi-square approximation to be valid.

There is another type of chi-square test, called the chi-square test for independence. The two are sometimes confused but they are quite different.

- The chi-square test for independence compares
**two**sets of data to see if there is a relationship. - The chi-square Goodness of fit is to fit
**one**categorical variable to a distribution.

Both tests use the chi-square statistic and distribution. For more information about calculating the chi square statistic, see:

The chi square test statistic (includes calculations): What is a chi square statistic?

## Running the Test

Typically, this test is run using software. The null hypothesis for the chi-square goodness of fit test is that the data comes from a specified distribution. The alternate hypothesis is that the data does not come from a specified distribution.

To interpret the test, you’ll need to choose an alpha level (1%, 5% and 10% are common). The chi-square test will return a p-value. If the p-value is small (less than the significance level), you can reject the null hypothesis that the data comes from the specified distribution.

## Less Common Goodness of Fit Tests used in Elementary Statistics

### Kolmogorov-Smirnov

Although this is called a test for normality, it actually doesn’t tell you whether a particular sample likely came from a normal population. Instead, it will tell you when it is unlikely that you have a normal distribution. One advantage to this test is that it doesn’t make any assumptions about the distribution of data. A sample can be compared to a distribution using a one-sample K–S test or two-sample K–S test. The test is usually performed using software (like SPSS), because critical values have to be calculated for each distribution and finding the tables of critical values isn’t an easy task. The test is usually recommended for large samples over 2000. For smaller samples, use Shapiro-Wilk.

### Anderson-Darling

This test is a modification of Kolmogorov-Smirnov. It is more sensitive to deviations in a distribution’s tails. Like the Kolmogorov-Smirnov, this test will tell you when it is unlikely that you have a normal distribution and is normally run using statistical software.

### Shapiro-Wilk

This test calculates a W value that will tell you if a random sample came from a normally distributed population. The test is recommended for samples up to n=2000.

------------------------------------------------------------------------------If you prefer an online interactive environment to learn R and statistics, this *free R Tutorial by Datacamp* is a great way to get started. If you're are somewhat comfortable with R and are interested in going deeper into Statistics, try *this Statistics with R track*.

*Facebook page*and I'll do my best to help!

Does chi squire tell us either positive or negative significance

Chi square doesn’t tell you anything about significance. It tells you how much difference exists between your observed counts and the counts you would expect if there were no relationship at all in the population. It’s always non-negative.