Statistics How To

What is a Bin in Statistics? Finding Class Interval

Statistics Definitions > What is a bin (or Class Interval) in statistics?

What is a Bin in statistics? Overview

In statistics, data is usually sorted in one way or another. You might sort the data into classes, categories, by range or placement on the number line. A bin — sometimes called a class interval — is a way of sorting data in a histogram. It’s very similar to the idea of putting data into categories.

What is a bin in statistics: Why not use “Categories” instead of Class Intervals?

When you put data into categories, you’re putting them into those categories without any thoughts about how that data might tell you something. Basic sorting into categories like male/female or yes/no does exist in statistics, but when it comes to making a histogram you’re aiming to make a chart that tells you some very good information about how your data is spread out. Therefore you want to carefully choose categories/classes. You can think of a bin as being a physical bin where you might sort objects into.
what is a bin in statistics


Imagine you’re working in a clothing store and want to know which shoe items is most popular in your inventory. If you only fill one bin, your bin might end up overflowing pretty fast and you’d have no information. You could try using different bins for flats, heels, sneakers and sandals. That might give you a better idea about your inventory. Or you could further add bins for black heels, white heels and so on. It’s the same principle when choosing bins for a histogram; you want to choose the right amount of bins to give you the information you need.

What is a bin in statistics: Choosing bins

Choosing bins can be done by hand for simple histograms in most cases. For example, if you are making a histogram for exam scores, choosing bins that matches grades (70-79, 80-89, 90-100) is a fairly obvious choice. You have two numbers associated with each bin: the low value (sometimes called bin low), which in this example would be 70, 80, 90 and the high value (sometimes called bin high) which for this example is 79 89 100.

In most cases though, choosing bins isn’t going to be that simple especially for large data sets. When dealing with large sets of numbers, you’re usually better off using technology like Microsoft Excel to create a histogram (how to create a histogram in excel 2007), because if your bin choice doesn’t make for a nice-looking diagram you can dynamically change the bin values without having to draw a graph.
That said, if you want to choose bin sizes by hand, watch this video or click the link below for the full article:

Next: How to choose bin sizes in statistics

------------------------------------------------------------------------------

If you prefer an online interactive environment to learn R and statistics, this free R Tutorial by Datacamp is a great way to get started. If you're are somewhat comfortable with R and are interested in going deeper into Statistics, try this Statistics with R track.

Comments are now closed for this post. Need help or want to post a correction? Please post a comment on our Facebook page and I'll do my best to help!
What is a Bin in Statistics? Finding Class Interval was last modified: October 15th, 2017 by Stephanie

8 thoughts on “What is a Bin in Statistics? Finding Class Interval

  1. Andale Post author

    Not, you could find bins for quantitative too. For example, heights or weights could be separated into bins.

  2. William Wendt

    But in Excel using the Analytic Pack add-in it when doing a Histogram is the Bin that it generates the upper bound, middle, or lower bound of the space?

  3. lisa

    you can interpret bin as the size or width of the category. Then you can put your stuff or data in this bin, if the bin size is large, it can accommodate more data in this bin, it the bin size is too large, we may lose the peak values. e.g. we have one bin for all the data.
    if it is small, we get more details about the data, but more computation effort.

  4. Andale Post author

    You have two numbers associated with each bin: the low value (sometimes called bin low), and the high value (sometimes called bin high). I added an example to the article, hope it helps clarify this.