Pandas with NumPy

Last reviewed May 28, 2026 Content v20260528

Track mode: server_script
Means: Server runner
Reading: ~2 min
Level: intermediate

This lesson

This lesson teaches Pandas with NumPy: Pandas tabular manipulation—indexing, dtypes, reshaping, and analysis habits for real-world tables.

This track orients workflow; NumPy/Pandas tracks teach the tools you will use daily in notebooks.

You will apply Pandas with NumPy in contexts like: CSV/Parquet analysis, ETL notebooks, and ad hoc reporting.

Read the narrative, run `import pandas as pd` snippets with in-memory DataFrames (install pandas and numpy with pip if needed), inspect `.head()`, `.dtypes`, and complete MCQs.

Toward the end—consolidate before SciPy, sklearn-heavy projects, and interview prep.

Pandas and NumPy interoperate constantly: columns store ndarrays, arithmetic uses broadcasting, and to_numpy() exports matrices for custom kernels or sklearn.

NumPy under the hood

import pandas as pd
import numpy as np

df = pd.DataFrame({'a': [1, 2, 3]})
arr = df['a'].to_numpy()
print(type(arr), arr.dtype, arr.shape)

Crossing the boundary

df.to_numpy() — full 2D array (may copy if mixed dtypes)
df['col'].values — legacy alias; prefer to_numpy()
pd.DataFrame(arr, columns=[...]) — ndarray → labeled table
NumPy ufuncs on Series: np.sqrt(df['x'])

Alignment caveat

NumPy ops on raw arrays ignore index labels. Pandas Series ops align on index—can introduce NaN where labels mismatch. Use .values or to_numpy() when you want pure positional NumPy behavior.

Important interview questions and answers

Q: to_numpy vs values?
A: to_numpy is explicit modern API; values is legacy attribute on Series.
Q: Mixed dtype DataFrame?
A: to_numpy() may upcast to object—select numeric columns first for ML.

Self-check

Export a numeric column to ndarray.
Build a DataFrame from a 2D NumPy array.

Tip: Prefer to_numpy() over legacy .values for explicit exports.

Interview prep

to_numpy?: Explicit export to ndarray for sklearn and custom ufuncs.
Alignment caveat?: Pandas ops align indexes; raw NumPy ignores labels.

Interview tip Lesson completion confidence

Can you explain this lesson in 30 seconds without reading notes?

Self-reflection (saved on this device)

Not saved yet.

Playground

Runs on the configured server runner (dev: npm run runner with LEARNING_RUNNER_ENABLED=true). Output appears below the editor.

Code runner not available

Server runner is disabled. Set LEARNING_RUNNER_ENABLED=true and LEARNING_RUNNER_URL in .env (see .env.example).

Check yourself

Multiple choice — immediate feedback.

Discussion

Past discussion is visible to everyone. Only logged-in users can post comments and replies.

Starter discussion topics

values vs to_numpy?
Align shapes?

No discussion yet. Be the first to ask a question.