P-value calculator
A p-value of 0.05 doesn't mean there's a 5% chance your hypothesis is wrong. It means that if the null hypothesis were true, there's a 5% chance of seeing data at least this extreme. The distinction matters: p-values don't tell you the probability that your theory is correct. They tell you how surprising your data is under the assumption that nothing interesting is happening.
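To make "chance of seeing data this extreme" concrete, here is a minimal sketch of how a two-sided p-value is computed from a z statistic under a standard normal null. It uses only the Python standard library; the function name and the 1.96 input are illustrative, not part of any particular calculator:

```python
import math

def two_sided_p_value(z: float) -> float:
    """Two-sided p-value for a z statistic under a standard normal null.

    Computes P(|Z| >= |z|) using the normal CDF, expressed via math.erf
    so no external libraries are needed.
    """
    # Phi(|z|): probability a standard normal falls below |z|
    phi = 0.5 * (1.0 + math.erf(abs(z) / math.sqrt(2.0)))
    # Both tails are equally extreme, hence the factor of 2
    return 2.0 * (1.0 - phi)

# A z statistic of about 1.96 is the classic 5% boundary:
print(round(two_sided_p_value(1.96), 3))  # ≈ 0.05
```

Any test statistic with a known null distribution works the same way: measure how far out in the tails your observation lands, and report the tail probability.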
Good to know
P < 0.05 is a convention, not a law. Ronald Fisher suggested 0.05 as a convenient threshold in 1925. It stuck. But there's nothing magical about 5%. Some fields use 0.01 or 0.001 for stricter standards. P-values of 0.049 and 0.051 are practically identical, yet one is "significant" and the other isn't. The threshold is arbitrary.
Statistical significance ≠ practical importance. With large enough sample sizes, tiny effects become "statistically significant." A drug that lowers blood pressure by 0.1 mmHg might achieve p < 0.001 with 100,000 participants, but that effect is clinically meaningless. Always ask: significant, but by how much?
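The blood-pressure example can be sketched with a one-sample z-test. The standard deviation of 10 mmHg below is an assumed value chosen only to illustrate the point; the effect of 0.1 mmHg is the same in both calls, and only the sample size changes:

```python
import math

def p_from_mean(effect: float, sd: float, n: int) -> float:
    """Two-sided p-value for a one-sample z-test of a mean difference.

    effect: observed mean difference; sd: standard deviation, assumed
    known here for simplicity; n: number of participants.
    """
    z = effect / (sd / math.sqrt(n))  # standard error shrinks as n grows
    phi = 0.5 * (1.0 + math.erf(abs(z) / math.sqrt(2.0)))
    return 2.0 * (1.0 - phi)

# Identical 0.1 mmHg effect, different sample sizes:
print(p_from_mean(0.1, 10.0, 1_000))    # large p: not significant
print(p_from_mean(0.1, 10.0, 100_000))  # small p: "significant", still 0.1 mmHg
```

The effect size never changes; only the standard error shrinks. That is why a p-value alone can't tell you whether a result matters.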
P-hacking is a real problem. Researchers who test many hypotheses and only report the significant ones inflate false positive rates. If you run 20 tests at α = 0.05, you expect one "significant" result by chance on average. Pre-registration and transparency help combat this.
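The 20-tests arithmetic can be checked directly. Under the null, a p-value is uniform on [0, 1], so a "significant" result is just a draw below α. This simulation (the function name and seeds are illustrative) counts false positives across many batches of 20 null tests:

```python
import random

def false_positives(num_tests: int = 20, alpha: float = 0.05,
                    seed: int = 0) -> int:
    """Count 'significant' results when every null hypothesis is true.

    Under the null a p-value is uniform on [0, 1], so a draw below
    alpha is a false positive.
    """
    rng = random.Random(seed)
    return sum(rng.random() < alpha for _ in range(num_tests))

# Average false positives per batch of 20 tests at alpha = 0.05:
batches = [false_positives(seed=s) for s in range(10_000)]
print(sum(batches) / len(batches))  # ≈ 1.0, i.e. 20 × 0.05

# Chance that at least one of the 20 tests comes up "significant":
print(1 - 0.95 ** 20)  # ≈ 0.64
```

So even with nothing going on, roughly two out of three such batches contain at least one result that looks significant, which is exactly why selective reporting is so misleading.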