SOLID STATE PRESS
← Back to catalog
Polynomial Regression cover
Coming soon
Coming soon to Amazon
This title is in our publishing queue.
Browse available titles
Mathematics

Polynomial Regression

Curve Fitting, the Bias-Variance Tradeoff, and When Higher Degree Hurts — A TLDR Primer

Polynomial regression shows up in AP Statistics, intro data science courses, and the first week of any machine learning class — and most students hit it cold, with a textbook that buries the core idea under pages of theory before getting to anything useful.

This TLDR primer cuts straight to what matters. You'll learn how polynomial regression extends linear models to fit curved data, how the coefficients are actually computed using least squares and the normal equations, and how to tell whether a fit is genuinely good or just memorizing noise. The guide then tackles the concept that trips up almost every student encountering predictive modeling for the first time: the **bias-variance tradeoff**. Cranking up the polynomial degree always lowers error on your training data — and almost always hurts you on new data. This book shows you exactly why, and gives you concrete tools (train/test splits, k-fold cross-validation) to choose the right degree without guessing.

The final section covers the honest limits of polynomial models: why extrapolation fails, how multicollinearity between powers of *x* inflates uncertainty, and when a spline or a different model family is the smarter call.

Written for high school students in statistics or pre-calculus, early college students in data science or applied math, and anyone who wants to walk into a regression unit with real understanding instead of memorized formulas. Concise and to the point — no filler, no detours.

If polynomial regression is on your syllabus, start here.

What you'll learn
  • Set up a polynomial regression as a linear least-squares problem and solve it for small degrees by hand and conceptually for larger ones.
  • Interpret R-squared, residuals, and residual plots to judge whether a polynomial fit is appropriate.
  • Recognize overfitting and the bias-variance tradeoff, and use train/test splits or cross-validation to pick a degree.
  • Understand the limits of polynomial models — extrapolation failure, multicollinearity of powers, and when to prefer splines or other models.
What's inside
  1. 1. From Lines to Curves: What Polynomial Regression Is
    Introduces polynomial regression as an extension of linear regression that fits curved relationships while still being linear in its coefficients.
  2. 2. Fitting the Curve: Least Squares and the Normal Equations
    Walks through how the coefficients are actually computed by minimizing squared residuals, with a worked quadratic fit on a small dataset.
  3. 3. Reading the Fit: R-squared, Residual Plots, and What 'Good' Means
    Shows how to evaluate a polynomial fit using R-squared, adjusted R-squared, and visual diagnostics, and warns against common interpretation traps.
  4. 4. Overfitting and the Bias-Variance Tradeoff
    Explains why cranking up the degree always lowers training error but eventually wrecks predictions, and frames the tradeoff that drives model selection.
  5. 5. Choosing the Degree: Train/Test Splits and Cross-Validation
    Gives concrete procedures for picking the right polynomial degree using held-out data and k-fold cross-validation.
  6. 6. Limits and Alternatives: When Not to Use a Polynomial
    Covers extrapolation failure, multicollinearity between powers of x, and points toward splines, logistic regression, and other models for cases polynomials handle poorly.
Published by Solid State Press
Polynomial Regression cover
TLDR STUDY GUIDES

Polynomial Regression

Curve Fitting, the Bias-Variance Tradeoff, and When Higher Degree Hurts — A TLDR Primer
Solid State Press

Contents

  1. 1 From Lines to Curves: What Polynomial Regression Is
  2. 2 Fitting the Curve: Least Squares and the Normal Equations
  3. 3 Reading the Fit: R-squared, Residual Plots, and What 'Good' Means
  4. 4 Overfitting and the Bias-Variance Tradeoff
  5. 5 Choosing the Degree: Train/Test Splits and Cross-Validation
  6. 6 Limits and Alternatives: When Not to Use a Polynomial
Chapter 1

From Lines to Curves: What Polynomial Regression Is

Suppose you record how far a ball travels each second after you throw it. Plot those points and you will probably see a curve, not a line. Linear regression — fitting a straight line to data — cannot capture that shape. Polynomial regression can. It extends linear regression by letting the model bend, using powers of the input variable as additional predictors.

The setup you already know

Linear regression models the relationship between an input $x$ and an output $y$ as:

$y = \beta_0 + \beta_1 x + \varepsilon$

Here $\beta_0$ is the intercept, $\beta_1$ is the slope, and $\varepsilon$ (epsilon) is an error term that absorbs the gap between the model and the actual data point. The goal is to find the values of $\beta_0$ and $\beta_1$ that make the line fit the data as closely as possible. Section 2 covers exactly how that fitting works.

The problem is that real data is often curved. A straight line forces the relationship to be constant: every one-unit increase in $x$ changes $y$ by exactly $\beta_1$, no matter where you are on the $x$-axis. That assumption fails for things like projectile motion, population growth over short intervals, or how drug concentration in the bloodstream rises and then falls.

Adding powers of x

Polynomial regression fixes this by including higher powers of $x$ in the model. A polynomial of degree $n$ looks like:

$y = \beta_0 + \beta_1 x + \beta_2 x^2 + \beta_3 x^3 + \cdots + \beta_n x^n + \varepsilon$

Each term $\beta_k x^k$ adds one more bend the curve is allowed to make. A degree-1 polynomial is just linear regression. A degree-2 polynomial is a quadratic — it has one curve, like a parabola. A degree-3 polynomial is a cubic — it can have a rise, a fall, and a rise again (or the reverse). In general, a degree-$n$ polynomial can change direction at most $n-1$ times.

The numbers $\beta_0, \beta_1, \ldots, \beta_n$ are the coefficients. They are what you are solving for when you fit the model. The degree $n$ is something you choose before fitting; it is a hyperparameter — a structural decision about the model's shape, not something the fitting procedure determines automatically.

The key idea: linear in the coefficients, not in x

Here is the concept students most often find confusing: polynomial regression is called a linear model, even though the curve it fits is not a straight line. How?

About This Book

If you're staring down a statistics or data science course that assumes you already understand curve fitting and least squares, or you're an AP Statistics student who wants a sharper grip on regression analysis before the exam, this book was written for you. It also works for anyone exploring intro to machine learning math prerequisites — the student who knows algebra but hasn't yet seen why fitting a curve to data can go wrong.

This guide covers polynomial regression explained for students from the ground up: how least squares and the normal equations actually work, what R-squared tells you (and what it hides), and a bias-variance tradeoff simple explanation that makes overfitting and underfitting in machine learning genuinely click. It walks through cross-validation and model selection so degree-choosing stops feeling like guesswork. Concise and ruthless about cuts — no filler.

Read it straight through once, work every worked example on paper, then tackle the problem set at the end to confirm the ideas have landed.

Keep reading

You've read the first half of Chapter 1. The complete book covers 6 chapters in roughly fifteen pages — readable in one sitting.

Coming soon to Amazon