Bias variance preview

Last reviewed May 28, 2026 Content v20260528

Track mode: server_script
Means: Server runner
Reading: ~1 min
Level: intermediate

This lesson

This lesson teaches Bias variance preview: the data science mindset, methods, and communication habits behind evidence-based decisions.

Models can amplify historical bias—fairness and transparency are product requirements, not optional philosophy.

You will apply Bias variance preview in contexts like: Regulated domains, hiring models, credit scoring, and public-sector analytics.

Read the narrative, run Python in the playground (stdlib snippets now; install Jupyter, pandas, and scikit-learn locally for full notebooks), and complete MCQs to lock in vocabulary.

When you can explain the previous lesson's ideas in your own words.

The bias–variance tradeoff describes errors from oversimplified models (high bias) vs models that memorize noise (high variance). Good models balance both on unseen data.

High bias (underfitting)

Training error and test error both high
Model too simple for the pattern (linear line for curved data)

High variance (overfitting)

Training error low, test error much higher
Too many features, deep trees without regularization, small data

Mitigations

More relevant data, better features
Regularization (L1/L2), pruning, early stopping
Simpler model or ensemble with validation monitoring

Learning curves

Plot error vs training set size—if train and validation error stay far apart, variance is likely high; if both high, bias or noisy labels.

Important interview questions and answers

Q: Underfitting sign?
A: Poor performance on both train and validation.
Q: Overfitting sign?
A: Great train metrics, poor validation/test metrics.

Self-check

What is underfitting?
What is overfitting?
Name one way to reduce variance.

Tip: More features can increase variance—watch test performance.

Interview prep

Overfitting?: Low train error, high test error—high variance.

Interview tip Lesson completion confidence

Can you explain this lesson in 30 seconds without reading notes?

Self-reflection (saved on this device)

Not saved yet.

Playground

Runs on the configured server runner (dev: npm run runner with LEARNING_RUNNER_ENABLED=true). Output appears below the editor.

Code runner not available

Server runner is disabled. Set LEARNING_RUNNER_ENABLED=true and LEARNING_RUNNER_URL in .env (see .env.example).

Check yourself

Multiple choice — immediate feedback.

Discussion

Past discussion is visible to everyone. Only logged-in users can post comments and replies.

Starter discussion topics

Overfitting sign?
More features risk?

No discussion yet. Be the first to ask a question.