Question 1

What is a p-value?

Accepted Answer

The p-value is the probability of observing data at least as extreme as what was actually observed, assuming the null hypothesis is true. By convention, a p-value below 0.05 is taken as grounds for rejecting the null. It does NOT measure the probability that the null is correct, which is a common misinterpretation.
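To make "at least as extreme, assuming the null is true" concrete, here is a minimal sketch of a permutation test for a difference in means. The function name, the two small samples, and the choice of 10 000 shuffles are illustrative assumptions, not part of the answer above.

```python
import random

def permutation_p_value(a, b, n_perm=10_000, seed=0):
    """Two-sided permutation p-value for a difference in means (illustrative helper)."""
    rng = random.Random(seed)
    observed = abs(sum(a) / len(a) - sum(b) / len(b))
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_perm):
        # Under the null, group labels are exchangeable, so shuffle them.
        rng.shuffle(pooled)
        left, right = pooled[:len(a)], pooled[len(a):]
        diff = abs(sum(left) / len(left) - sum(right) / len(right))
        # Small tolerance guards against floating-point rounding in the sums.
        if diff >= observed - 1e-12:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_perm + 1)

# Made-up example data.
treated = [5.1, 4.9, 6.2, 5.8, 6.0, 5.5]
control = [4.2, 4.8, 4.5, 5.0, 4.1, 4.4]
p = permutation_p_value(treated, control)
print(f"p = {p:.4f}")
```

The returned value is the share of label shuffles that produce a difference at least as large as the observed one. It says nothing about how likely the null itself is.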

Question 2

What is PCA?

Accepted Answer

Principal Component Analysis finds linear combinations of the variables (components) that capture the most variance. The first component points along the direction of greatest spread; each subsequent component captures the most remaining variance while being orthogonal to all earlier ones. It is used for dimensionality reduction, visualisation and feature engineering.
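As a rough illustration of "the direction of greatest spread", the sketch below finds the first component by power iteration on the sample covariance matrix. This is one of several ways to compute it; the helper name and the synthetic data (points scattered around the line y = 2x) are assumptions made for the example.

```python
import math
import random

def first_principal_component(rows, n_iter=200):
    """Return (unit direction, variance along it). Illustrative helper, not a library API."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    # Sample covariance matrix (d x d).
    cov = [[sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
           for i in range(d)]
    v = [1.0 / math.sqrt(d)] * d  # arbitrary starting direction
    for _ in range(n_iter):
        # Repeatedly applying the covariance matrix pulls v toward its top eigenvector.
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    variance = sum(v[i] * cov[i][j] * v[j] for i in range(d) for j in range(d))
    return v, variance

# Synthetic data: y is roughly 2x plus a little noise.
rng = random.Random(1)
rows = []
for _ in range(200):
    x = rng.gauss(0, 1)
    rows.append([x, 2 * x + rng.gauss(0, 0.1)])

direction, variance = first_principal_component(rows)
print("direction:", [round(c, 3) for c in direction])
print("variance along it:", round(variance, 3))
```

The recovered direction should have a slope close to 2, matching the line the data were generated around. In practice a library routine based on SVD would normally be used instead.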

Question 3

What is the central limit theorem?

Accepted Answer

The CLT states that the suitably standardized sum (or average) of many independent, identically distributed random variables with finite variance tends toward a normal distribution, whatever the shape of the individual distributions. This is a large part of why the bell curve appears so often: heights, measurement errors, lab assays.
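A quick way to see this is to average draws from a clearly non-normal distribution and watch the skewness of the averages shrink as the sample size grows. The sketch below uses the exponential distribution with rate 1 as an assumed example; the helper names and trial counts are illustrative.

```python
import random
import statistics

def sample_means(n, n_trials=5_000, seed=0):
    """Means of n exponential(1) draws, repeated n_trials times (illustrative helper)."""
    rng = random.Random(seed)
    return [statistics.fmean(rng.expovariate(1.0) for _ in range(n))
            for _ in range(n_trials)]

def skewness(xs):
    """Population skewness; it is 0 for a symmetric distribution such as the normal."""
    m = statistics.fmean(xs)
    s = statistics.pstdev(xs)
    return statistics.fmean(((x - m) / s) ** 3 for x in xs)

for n in (1, 5, 50):
    means = sample_means(n)
    print(f"n={n:>2}  mean={statistics.fmean(means):.3f}  "
          f"sd={statistics.stdev(means):.3f}  skew={skewness(means):.3f}")
```

As n increases, the spread of the averages narrows and their skewness moves toward zero, which is the behaviour the theorem describes.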

Question 4

What is bootstrap resampling?

Accepted Answer

Bootstrapping draws random samples (with replacement, each the same size as the original) from your data to estimate the sampling distribution of a statistic. It is useful when analytic confidence intervals are intractable, and works for medians, ratios and correlations. The basic procedure: resample → compute the statistic → repeat many times (10 000 is a common choice).
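The resample → compute → repeat loop can be sketched as follows, here producing a percentile confidence interval for a median. The percentile method is only one of several bootstrap interval constructions, and the function name, data and defaults are assumptions made for illustration.

```python
import random
import statistics

def bootstrap_ci(data, stat=statistics.median, n_boot=10_000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval (illustrative helper)."""
    rng = random.Random(seed)
    n = len(data)
    # Resample with replacement, compute the statistic, repeat n_boot times.
    estimates = sorted(stat(rng.choices(data, k=n)) for _ in range(n_boot))
    lo = estimates[int((alpha / 2) * n_boot)]
    hi = estimates[int((1 - alpha / 2) * n_boot) - 1]
    return lo, hi

# Made-up example data with one large value.
data = [2.1, 3.4, 1.9, 5.6, 4.4, 3.8, 2.7, 9.5, 3.1, 4.0]
lo, hi = bootstrap_ci(data)
print(f"sample median = {statistics.median(data):.2f}, 95% CI = ({lo:.2f}, {hi:.2f})")
```

Passing a different function as `stat` (for example `statistics.fmean`) reuses the same loop for another statistic.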

Question 5

When should I use a t-test vs a non-parametric test?

Accepted Answer

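Use a t-test when the data are approximately normal, or when n is large enough for the CLT to make the sample means approximately normal. Use a rank-based test (Mann-Whitney for two independent groups, Wilcoxon signed-rank for paired data) when the data are clearly non-normal, ordinal, or contain extreme outliers. The interactive simulations let you compare both on synthetic data.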
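To illustrate the sensitivity to outliers, the sketch below computes a Welch t statistic and a Mann-Whitney U statistic on the same made-up samples, before and after one value is replaced by an extreme one. Only the test statistics are computed, not p-values; the function names and data are assumptions for the example.

```python
import math
import statistics

def welch_t(a, b):
    """Welch's t statistic for two independent samples (unequal variances allowed)."""
    va, vb = statistics.variance(a), statistics.variance(b)
    return (statistics.fmean(a) - statistics.fmean(b)) / math.sqrt(va / len(a) + vb / len(b))

def mann_whitney_u(a, b):
    """U statistic for sample a: each pair with a > b counts 1, each tie counts 0.5."""
    return sum(1.0 if x > y else 0.5 if x == y else 0.0 for x in a for y in b)

# Made-up data; a_outlier swaps the last value of a for an extreme one.
a = [12.1, 13.4, 12.8, 14.0, 13.1, 12.6]
b = [10.2, 11.1, 10.8, 11.5, 10.9, 11.3]
a_outlier = a[:-1] + [60.0]

for label, sample in (("original", a), ("with outlier", a_outlier)):
    print(f"{label:>12}: t = {welch_t(sample, b):.2f}, U = {mann_whitney_u(sample, b):.1f}")
```

The outlier inflates the variance and shrinks the t statistic even though it lies further from the second group, while U depends only on the ordering of the values and does not change.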
