Hypothesis Testing > Sign Test
What is the Sign Test?
The sign test is a statistical test to compare the sizes of two groups. It is a non-parametric or “distribution free” test, which means the test doesn’t assume the data comes from a particular distribution, like the normal distribution. The sign test is an alternative to a one sample t test or a paired t test. It can also be used for ordered (ranked) categorical data.
How to Calculate a Paired/Matched Sample Sign Test
Assumptions for the test (your data should meet these requirements before running the test) are:
- The data should be from two samples.
- The two dependent samples should be paired or matched. For example, depression scores from before a medical procedure and after.
To set up the test, put your two sets of sample data into a table (I used Excel). This set of data represents test scores at the end of Spring and the beginning of the Fall semesters. The hypothesis is that the summer break means a significant drop in test scores.
H0: No difference in median of the signed differences.
H1: Median of the signed differences is less than zero.
Step 2: Add a fourth column indicating the sign of the number in column 3.
Step 3: Count the number of positives and negatives.
- 4 positives.
- 12 negatives.
12 negatives seems like a lot, but we can’t say for sure that it’s significant (i.e. that it didn’t happen by chance) until we run the sign test.
Step 3: Add up the number of items in your sample and subtract any you had a difference of zero for (in column 3). The sample size in this question was 17, with one zero, so n = 16.
- .5 for the probability. The null hypothesis is that there are an equal number of signs (i.e. 50/50). Therefore, the test is a simple binomial experiment with a .5 chance of the sign being negative and .5 of it being positive (assuming the null hypothesis is true).
- 16 for the number of trials.
- 4 for the number of successes. “Successes” here is the smaller of either the positive or negative signs from Step 2.
The p-value is 0.038, which is smaller than the alpha level of 0.05. We can reject the null hypothesis and say there is a significant difference.
If you prefer an online interactive environment to learn R and statistics, this free R Tutorial by Datacamp is a great way to get started. If you're are somewhat comfortable with R and are interested in going deeper into Statistics, try this Statistics with R track.Comments are now closed for this post. Need help or want to post a correction? Please post a comment on our Facebook page and I'll do my best to help!