The probability integral transform states that, for a continuous random variable
Suppose we have a random variable
What does
Let’s try to draw the pdf of
First, suppose we select the top 3% of the distribution, (i.e. values between the
The orange lines bound the top 3%.
For now, since we don’t know what the density of
However, recall that we selected the top 3% of the probability mass, so within the orange interval, the area under the curve must be
Now, think about the region between the
However, we note that all intervals have this same property (even arbitrarily small intervals): the width of each interval is equal to its corresponding probability mass. So, the pdf of
It is natural for me to suspect the pdf of
We use the above insight to illustrate the theorem.
Theorem (Probability Integral Transform):
Proof: the standard proof of the PIT found on the Wikipedia page.
Therefore,
P-value distribution under 1
The p-value of a test statistic
Define
Thus, we have shown that one-sided p-values are uniformly distributed under the null hypothesis.2
Acknowledgements
Thank you to Meimingwei Li, Raphael Rehms, Dr. Fabian Scheipl, Prof. Michael Schomaker, and J.P. Weideman for their helpful input.
Footnotes
Notation and proof from and inspired by Raphael Rehms’s exercise and solution from the Statistical Methods in Epidemiology course for master’s students at LMU Munich.↩︎
This holds only for divergence p-values, not decision p-values. My understanding is that divergence p-values are one-sided p-values. Thanks to Prof. Schomaker for this insight.↩︎