Privacy and AI

Last reviewed May 28, 2026 Content v20260528

Track mode

none

Means

Read / quiz

Reading

~2 min

Level

beginner

This lesson

This lesson teaches Privacy and AI: artificial intelligence concepts, limitations, and responsible use in modern software and data products.

Teams apply Privacy and AI in every serious AI project—skipping it leaves blind spots in analysis and reviews.

You will apply Privacy and AI in contexts like: Product planning, policy, engineering leadership, and responsible rollout discussions.

Study explanations, case studies, and MCQs—this topic is read/quiz focused without a code runner.

When you can explain the previous lesson's ideas in your own words.

AI often needs personal data—behavior, biometrics, health signals. Privacy principles: collect minimally, purpose-limit, secure, retain briefly, and honor user rights (access, deletion) where applicable.

Key concepts

PII / personal data — identifies or relates to a person
Purpose limitation — use data only for stated reasons
Data minimization — fewer fields, shorter retention
Anonymization vs pseudonymization — re-identification risk remains in many "anonymized" sets

ML-specific risks

Memorization—models regurgitate training snippets
Membership inference—guess if someone was in training set
Federated learning still needs threat modeling

Privacy-by-design habits

# Pseudonymize IDs in analytics tables
import hashlib

def pseudonym(user_id: str, salt: str) -> str:
    return hashlib.sha256(f"{salt}:{user_id}".encode()).hexdigest()[:16]

Practice: Reflect on ethics scenarios in writing—no code required. Optional snippets illustrate policy checks only.

Hashing alone is not sufficient if salt leaks or space is small—consult privacy engineers.

Important interview questions and answers

Q: GDPR lawful basis?
A: Consent, contract, legal obligation, etc.—must match processing activity.
Q: Delete user request?
A: Remove from stores and stop using in future training where required; model unlearning is hard—plan retention up front.

Self-check

Define data minimization.
Name one ML-specific privacy risk.

Tip: Plan deletion and retention before training—model unlearning is hard.

Interview prep

Data minimization?: Collect and retain only fields needed for the stated purpose.
Memorization risk?: Models may regurgitate sensitive training snippets.

Discussion

Past discussion is visible to everyone. Only logged-in users can post comments and replies.

Starter discussion topics

What part of this lesson needs a second read?
What would you try differently in a real project?

No discussion yet. Be the first to ask a question.