The term “U statistic” can have several meanings:
- Unbiased (U) statistics.
- Mann Whitney U Statistic.
- U statistic in L-estimators.
- Theil’s U.
1. Unbiased (U) statistics
Constructing U-statistics involves use of a kernel— a symmetric real valued function h(x,y). An (Unbiased) U statistic is any statistic that can be placed into the following form:
Although they look complicated, a U statistic is really just a type of generalized average (Abraham et. al, 2013).
2. Mann Whitney U Statistic
The result of performing a Mann Whitney U Test is a U Statistic. For details on that particular statistic, see: Mann Whitney U Test.
Theil proposed two U statistics, used in finance. The first (U1) is a measure of forecast accuracy (Theil, 1958, pp 31-42); The second (U2) is a measure of forecast quality (Theil, 1966, chapter 2).
Wharton University’s J. Scott Armstrong gleaned the following formulae from Theil’s original works; the same formula for U2 appeared in Brown and Rozeff (1978):
A = change in actual earnings,
P = predicted change in earnings.
If you perform a Google search for Theil’s U, you’ll find an interesting assortment of different formulae, each purporting to be “Theil’s U”. According to Armstrong, the existence of multiple formulas “…has caused some confusion.” For example, the occasional author refers to the Uncertainty Coefficient (unrelated to the U statistic) as Theil’s U, because Theil originally worked on its derivation. The end result is that if you need to use Theil’s U for some reason, the best idea is to refer to Theil’s original formulae, shown above. That said, if you’re reading about Theil’s U in the literature, you may want to make sure that the author is using the correct formula.
Abraham, N. et. al (2013). Measures of Complexity and Chaos. Springer Science & Business Media.
Armstrong, J. (No Title). Retrieved December 8, 2017 from: http://armstrong.wharton.upenn.edu/dictionary/definitions/theil%27s%20u.html
Brown, L. & Rozeff, M. (1978). The superiority of analyst forecasts as measures of expectations: Evidence from earnings. In The Journal of Finance, Vol. XXXIII, March. No. 1.
Kaigh, W.D. (1983), “Quantile Interval Estimation,” Communications in Statistics, Part A – Theory and Methods, 12,2427-2443.
Theil, H. (1958), Economic Forecasts and Policy. Amsterdam: North Holland.
Thiel, H. (1966), Applied Economic Forecasting. Chicago: Rand McNally.
If you prefer an online interactive environment to learn R and statistics, this free R Tutorial by Datacamp is a great way to get started. If you're are somewhat comfortable with R and are interested in going deeper into Statistics, try this Statistics with R track.Comments? Need to post a correction? Please post on our Facebook page.