Preview — Pro guide

You are seeing a portion of this guide. Sign in and upgrade to unlock the full article, quizzes, and interview answers.

Sections

0/2

Related Guides

Hypothesis Testing for Data Scientists: p-values, Type I/II, Multiple Testing

Machine Learning

40m

A/B Testing & Experimentation at Scale

Machine Learning

70m

Statistics & Probability Foundations

Machine Learning

90m

Quiz

← Back to Library

Machine Learning·Intermediate

Non-parametric Tests: Mann–Whitney U, Kruskal–Wallis, Permutation Tests, and When Normality Fails

Revenue, dwell time, and latency are skewed — t-tests on raw values assume fragile things. This guide covers rank-based tests (Mann–Whitney, Kruskal–Wallis), exact permutation logic, median vs mean hypotheses, ties, and when robust alternatives (bootstrap, trimmed means) beat ranks for product A/B analysis.

34 min read 2 sections 1 interview questions

Mann-Whitney UWilcoxon Rank-SumKruskal-WallisPermutation TestNon-parametric StatisticsMedian TestHeavy-Tailed DataA/B TestingBootstrapscipy.statsRank-Based TestHodges-Lehmann

When Classical t-Test Assumptions Lie

The two-sample t-test assumes **approximately normal** sampling distribution of the mean (often satisfied by CLT for large ) and, for the equal-variance version, **homoskedasticity**. Skewed metrics (session duration, purchase amount) with **moderate n per arm** can produce **anti-conservative or misleading** p-values when outliers dominate the mean difference. **Non-parametric** rank tests replace values with **ranks**, down-weighting extreme tails automatically. They test **stochastic ordering** (Mann–Whitney: does treatment tend to produce larger values?) rather than equality of means — that distinction trips candidates.

IMPORTANT

Premium content locked

This guide is premium content. Upgrade to Pro to unlock the full guide, quizzes, and interview Q&A.

Upgrade to Pro Sign in to upgrade