Large Enough Sample Condition
The Large Enough Sample Condition checks whether your sample is large enough for the statistical procedure you plan to use. A general rule of thumb is n ≥ 30, where n is your sample size. However, the right threshold depends on what you are trying to accomplish and what you know about the distribution. In general, the condition is met if any of the following is true:
- You have a symmetric distribution or unimodal distribution without outliers: a sample size of 15 is “large enough.”
- You have a moderately skewed distribution that is unimodal without outliers: a sample size between 16 and 40 is “large enough.”
- Your sample size is >40, as long as you do not have outliers.
- Your population has a normal distribution.
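The rules of thumb above can be sketched as a small checker. This is only an illustration: the function name, the `shape` labels, and the reading of "a sample size of 15" as "at least 15" are assumptions made for the example, not part of any standard library or textbook definition.

```python
def large_enough(n, shape="unknown", has_outliers=False, population_normal=False):
    """Return True if any of the listed rules of thumb is satisfied.

    shape is a hypothetical label: "symmetric", "moderate_skew", or "unknown".
    """
    # A normally distributed population satisfies the condition at any n.
    if population_normal:
        return True
    # Every remaining rule requires a sample without outliers.
    if has_outliers:
        return False
    # More than 40 observations, no outliers.
    if n > 40:
        return True
    # Symmetric, unimodal data: assumed here to mean n of at least 15.
    if shape == "symmetric" and n >= 15:
        return True
    # Moderately skewed, unimodal data: n between 16 and 40.
    if shape == "moderate_skew" and 16 <= n <= 40:
        return True
    return False


print(large_enough(20, shape="moderate_skew"))
print(large_enough(10, shape="moderate_skew"))
```

Judging the shape and spotting outliers still has to be done by looking at the data (for example with a histogram); the function only combines those judgments with the sample size.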
Central Limit Theorem and the Large Enough Sample Rule
The central limit theorem states that if the sample size is large enough, the sampling distribution of the sample mean will be approximately normal, even when the population itself is not. The general rule of n ≥ 30 applies.
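A quick simulation can make this concrete. The sketch below assumes a strongly right-skewed population (an exponential distribution with rate 1, chosen only for illustration) and compares the skewness of individual draws with the skewness of means of samples of size 30. The helper names and the number of repetitions are arbitrary choices for the example.

```python
import random
import statistics


def skewness(values):
    """Skewness as the average cubed z-score (population form)."""
    m = statistics.fmean(values)
    s = statistics.pstdev(values)
    return sum(((v - m) / s) ** 3 for v in values) / len(values)


def sample_means(n, reps, rng):
    """Means of `reps` samples, each of size n, from an exponential(1) population."""
    return [
        statistics.fmean([rng.expovariate(1.0) for _ in range(n)])
        for _ in range(reps)
    ]


rng = random.Random(0)  # fixed seed so the run is repeatable
raw = [rng.expovariate(1.0) for _ in range(2000)]  # individual observations
means = sample_means(30, 2000, rng)                # means of samples with n = 30

print("skewness of raw draws:   ", round(skewness(raw), 2))
print("skewness of sample means:", round(skewness(means), 2))
```

The sample means should come out far less skewed than the raw draws, which is the behavior the n ≥ 30 rule of thumb relies on. A more heavily skewed population may need a larger n before the approximation is reasonable.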
Chi Square and the Large Enough Sample Condition
There are three different tests that use the chi-square statistic; in each test, the assumptions and conditions are the same, including the Large Enough Sample Condition. To know if your sample is large enough to use chi-square, you must check the Expected Counts Condition: if the expected count in every cell is 5 or more, the cells meet the Expected Counts Condition and your sample is large enough. Note that the cutoff of 5 is somewhat arbitrary and open to interpretation. Some texts suggest that it’s okay to have a few expected counts less than 5 (no more than 20% of cells) as long as none are less than 1 (see Yates, Moore & McCabe, The Practice of Statistics, 1999).
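As a minimal sketch of the Expected Counts Condition for a two-way table, the code below computes each expected count as (row total × column total) / grand total and then applies either the strict rule (every cell at least 5) or the looser rule mentioned above. The function names and the example tables are made up for illustration; for a goodness-of-fit test the expected counts would instead be n times each hypothesized proportion.

```python
def expected_counts(observed):
    """Expected counts for a two-way table of observed counts."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    total = sum(row_totals)
    return [[r * c / total for c in col_totals] for r in row_totals]


def meets_expected_counts(observed, strict=True):
    """Check the Expected Counts Condition on the expected (not observed) counts."""
    flat = [e for row in expected_counts(observed) for e in row]
    if strict:
        # Strict rule: every expected count is 5 or more.
        return all(e >= 5 for e in flat)
    # Looser rule: at most 20% of cells below 5, and none below 1.
    below_5 = sum(e < 5 for e in flat)
    return min(flat) >= 1 and below_5 / len(flat) <= 0.20


table_a = [[12, 8], [3, 7]]                       # one observed count is below 5
table_b = [[10, 10, 10, 10, 4], [10, 10, 10, 10, 2]]

print(expected_counts(table_a))
print(meets_expected_counts(table_a))
print(meets_expected_counts(table_b), meets_expected_counts(table_b, strict=False))
```

Note that the condition is about expected counts: `table_a` contains an observed count of 3, yet its expected counts can still pass the strict rule.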
Calculating sample size
Calculating sample size can be one of the most confusing aspects of statistics, mostly because of all the rules (and rules of thumb) surrounding appropriate sizes for different distributions. You have to make sure your sample is sufficiently large, but not too large. For more techniques, see: Sample Size (includes techniques like using tables and calculators).
Reference: Chi-Square, Johns Hopkins.