Cluster Sampling in Statistics: Definition, Types

Statistics Definitions > Cluster Sampling

Cluster sampling is used when natural groups are present in a population. The whole population is subdivided into clusters, or groups, and random samples are then collected from each group.

Watch the video for an overview.

When to Use Cluster Sampling

Cluster sampling is typically used in market research. It’s used when a researcher can’t get information about the population as a whole, but they can get information about the clusters. For example, a researcher may be interested in data about city taxes in Florida. The researcher would compile data from selected cities and compile them to get a picture about the state. The individual cities would be the clusters in this case. Cluster sampling is often more economical or more practical than stratified sampling or simple random sampling.

Requirements

Cluster elements should be as heterogeneous as possible. In other words, the population should contain distinct subpopulations of different types.
Each cluster should be a small representation of the entire population.
Each cluster should be mutually exclusive. In other words, it should be impossible for each cluster to occur together. In the city tax example, it would be impossible for Miami city taxes and Jacksonville city taxes to occur together, so it fits the requirements for mutual exclusivity.

Types of Cluster Sampling

Single-stage cluster sampling: all the elements in each selected cluster are used.
Two-stage cluster sampling: where a random sampling technique is applied to the selected clusters. For example, once you’ve decided on your clusters, you could use simple random sampling to select your sample.

Difference Between Cluster Sampling and Stratified Sampling

For a stratified random sample, a population is divided into stratum, or sub-populations, before sampling. At first glance, the two techniques seem very similar. However, in cluster sampling the actual cluster is the sampling unit; in stratified sampling, analysis is done on elements within each strata. In cluster sampling, a researcher will only study selected clusters; with stratified sampling, a random sample is drawn from each strata.

Reference

Lohr, S. (2019). Sampling: Design and Analysis (Chapman & Hall/CRC Texts in Statistical Science) 2nd Edition. Routledge.