In statistics, every conjecture concerning the unknown probability distribution of a collection of random variables representing the observed data X in some study is called a statistical hypothesis. If we state one hypothesis only and the aim of the statistical test is to see whether this hypothesis is tenable, but not to investigate other specific hypotheses, then such a test is called a null hypothesis test.
The p-value is used in the context of null hypothesis testing in order to quantify the statistical significance of a result, the result being the observed value of the chosen statistic , The lower the p-value is, the lower the probability of getting that result if the null hypothesis were true. A result is said to be statistically significant if it allows us to reject the null hypothesis. All other things being equal, smaller p-values are taken as stronger evidence against the null hypothesis.
Definition of p-value
The p-value is the probability under the null hypothesis of obtaining a real-valued test statistic at least as extreme as the one obtained.
Consider an observed test statistic from an unknown distribution . Then the p-value is the prior probability of observing a test-statistic value at least as “extreme” as if the null hypothesis were true.
Formally:
-
One-sided right-tail test:
-
One-sided left-tail test:
-
Two-sided test:
If the distribution of is symmetric about zero:
In a significance test, the null hypothesis is rejected if the p-value is less than to a predefined threshold value , which is referred to as the alpha level or significance level. is not derived from the data, but rather is set by the researcher before examining the data. is commonly set to 0.05