📊

Statistics

Hypothesis testing, PCA, bootstrap resampling, normal distribution and other foundational tools of statistical inference — all interactive.

📊 14 simulations 🆕 Category added 2026-05-16

🧪 Simulations (14)

❓ Frequently asked questions

What is a p-value?

The p-value is the probability of observing data at least as extreme as what was measured, assuming the null hypothesis is true. A p-value below 0.05 traditionally suggests rejecting the null. But it does NOT measure the probability the null is correct — a common misinterpretation.

What is PCA?

Principal Component Analysis finds linear combinations of variables (components) that capture the most variance. The first component points along the direction of greatest spread; subsequent components are orthogonal. Used for dimensionality reduction, visualisation and feature engineering.

What is the central limit theorem?

CLT states that the sum (or average) of many independent random variables tends toward a normal distribution, regardless of the individual distributions. This is why the bell curve appears everywhere: heights, measurement errors, lab assays.

What is bootstrap resampling?

Bootstrapping draws random samples (with replacement) from your data to estimate the sampling distribution of a statistic. Useful when analytic confidence intervals are intractable — works for medians, ratios, correlations. The basic procedure: resample → compute → repeat 10 000 times.

When should I use a t-test vs a non-parametric test?

Use a t-test when data are approximately normal or n is large (CLT applies). Use Wilcoxon / Mann-Whitney when data are non-normal, ordinal, or have heavy outliers. The interactive simulations let you compare both on synthetic data.