SOLID STATE PRESS
← Back to catalog
The Chi-Square Distribution cover
Coming soon
Coming soon to Amazon
This title is in our publishing queue.
Browse available titles
Mathematics

The Chi-Square Distribution

Goodness-of-Fit, Tests of Independence, and Degrees of Freedom — A TLDR Primer

Chi-square shows up on AP Statistics exams, in introductory college stats courses, and in biology labs — and most students hit it cold, staring at a formula that looks more complicated than it is. This guide cuts straight to what you need: how the distribution works, how to build and interpret the test statistic, and how to run both major tests from hypotheses to conclusion.

**What's inside:** - The chi-square distribution itself — where it comes from, what its shape means, and how degrees of freedom control it - The observed-vs.-expected logic behind the test statistic, with clear worked examples including dice rolls and Mendel-style genetics problems - A full goodness-of-fit test, step by step, so you can handle any single-variable categorical question on an exam - The test of independence for two-way contingency tables — setting up expected counts, computing the statistic, and writing a proper conclusion - The expected-count-of-5 rule, when the test breaks down, and the mistakes students make most often - A forward look at where chi-square reappears: variance estimation, ANOVA, polling, and quality control

This primer is concise and to the point — no filler, no detours through material you won't be tested on. It's written for high school students in AP Statistics or introductory stats, early college students taking their first statistics course, and parents or tutors who need a quick, reliable refresher on chi-square tests for categorical data.

If your exam is coming up and you need to understand chi-square fast, pick this up and get to work.

What you'll learn
  • Explain what the chi-square distribution is and how degrees of freedom shape it
  • Compute expected counts and the chi-square statistic for categorical data
  • Run a goodness-of-fit test from hypotheses to conclusion
  • Run a test of independence on a two-way table and interpret the result
  • Recognize the assumptions, common errors, and limits of chi-square tests
What's inside
  1. 1. What the Chi-Square Distribution Is
    Introduce the chi-square distribution as the sum of squared standard normals, show its shape, and define degrees of freedom.
  2. 2. The Chi-Square Statistic: Observed vs. Expected
    Define the chi-square test statistic, explain why we square and divide, and walk through computing expected counts.
  3. 3. Goodness-of-Fit Test
    Run a full goodness-of-fit test, from hypotheses through p-value and conclusion, with worked dice and Mendel-style examples.
  4. 4. Test of Independence for Two-Way Tables
    Apply chi-square to two-way contingency tables to test whether two categorical variables are independent.
  5. 5. Assumptions, Pitfalls, and When Not to Use It
    Cover sample size rules, the expected-count-of-5 guideline, independence of observations, and common student errors.
  6. 6. Where Chi-Square Shows Up Next
    Connect chi-square to variance estimation, ANOVA, and real applications in genetics, polling, and quality control.
Published by Solid State Press
The Chi-Square Distribution cover
TLDR STUDY GUIDES

The Chi-Square Distribution

Goodness-of-Fit, Tests of Independence, and Degrees of Freedom — A TLDR Primer
Solid State Press

Contents

  1. 1 What the Chi-Square Distribution Is
  2. 2 The Chi-Square Statistic: Observed vs. Expected
  3. 3 Goodness-of-Fit Test
  4. 4 Test of Independence for Two-Way Tables
  5. 5 Assumptions, Pitfalls, and When Not to Use It
  6. 6 Where Chi-Square Shows Up Next
Chapter 1

What the Chi-Square Distribution Is

Start with what you already know. A standard normal distribution is the bell-shaped curve with mean 0 and standard deviation 1 — the $Z$-distribution from introductory statistics. Now ask a simple question: what happens if you take one of those $Z$-values, square it, and look at the distribution of that squared result? That object is the seed of the chi-square distribution.

Building the Distribution from Squared Normals

Take a random variable $Z$ drawn from a standard normal distribution. Form $Z^2$. Because squaring eliminates negative values, $Z^2$ is always greater than or equal to zero. Its distribution bunches near zero (small $Z$-values are common) and has a long right tail (large $Z^2$ values are rare but possible). That single-variable case is called a chi-square distribution with one degree of freedom, written $\chi^2(1)$.

Now take $k$ independent standard normal variables — call them $Z_1, Z_2, \ldots, Z_k$ — and add their squares:

$\chi^2 = Z_1^2 + Z_2^2 + \cdots + Z_k^2$

The result follows a chi-square distribution with $k$ degrees of freedom, written $\chi^2(k)$. That's the whole construction. No hidden machinery — it's literally a sum of squared standard normals.

Shape: Right-Skewed and Always Non-Negative

Because every term $Z_i^2$ is a square, the chi-square statistic can never be negative. That's its first defining feature: the distribution lives entirely on the interval $[0, \infty)$.

Its second feature is shape. With only a few degrees of freedom, the distribution is strongly right-skewed — it piles up near zero and trails off to the right. As the degrees of freedom increase, the distribution spreads out, its peak shifts rightward, and the skew gradually softens. By the time you reach roughly $k = 30$ or more, the curve starts to look approximately bell-shaped, though it never becomes perfectly symmetric.

A useful quick fact: for a $\chi^2(k)$ distribution, the mean equals $k$ and the variance equals $2k$. So as $k$ grows, both the center and the spread increase — the distribution migrates right and flattens out. You don't need to memorize these for most tests, but they explain the visual behavior.

Degrees of Freedom: What They Count

Degrees of freedom (often abbreviated df) measure how many independent pieces of information are free to vary after you've imposed constraints on your data. This phrase sounds abstract, so here is the concrete version: if you know five numbers must sum to 100, then once you've chosen four of them freely, the fifth is forced. You have four degrees of freedom, not five.

About This Book

If you are sitting in AP Statistics and the chi-square unit just started, this is the book you need. It is equally useful for any high school or intro college student working through categorical data analysis, whether that means preparing for an AP Statistics exam, catching up after missing class, or simply wanting a chi-square test explained for students in plain English rather than textbook prose.

This guide covers the chi-square distribution and degrees of freedom, the goodness-of-fit test for high school statistics, and the two-way table test of independence — the exact topics most AP Statistics study guides gloss over too quickly. Every term is defined, every formula is shown with worked numbers, and the chi-square practice problems and examples are built to match what appears on real exams. Statistics test prep for categorical data, done with ruthless cuts and no filler.

Read straight through once to build the mental framework. Then work every example yourself before reading the solution. Finally, attempt the problem set at the end to confirm you can execute independently.

Keep reading

You've read the first half of Chapter 1. The complete book covers 6 chapters in roughly fifteen pages — readable in one sitting.

Coming soon to Amazon