What is data poisoning, and why is loading a pickle model file dangerous?

Data poisoning is an attack where an adversary injects malicious or mislabeled examples into the training data to bias the model, create backdoors, or degrade it, and it is hard to detect because the model still trains successfully. Loading a pickle model is dangerous because Python's pickle executes arbitrary code on deserialization, so a malicious .pkl or .pt file from an untrusted source can run attacker code the moment you load it. Defenses include trusted data provenance and validation, and using safe formats like safetensors plus scanning model files.

What is MLSecOps, and what are the main threats across the ML lifecycle?

MLSecOps extends security practices across the whole ML lifecycle rather than just the deployed app, covering data, training, the model artifact, and serving. Key threats include data and model poisoning, adversarial evasion inputs, model theft or extraction, membership-inference and privacy leakage, and supply-chain risks like malicious model files and dependencies. Defenses span provenance and validation, robustness testing, access control and signing of artifacts, input monitoring, and scanning, integrated into the MLOps pipeline.

How would you defend an LLM application against prompt injection?

No single fix is complete, so defenses are layered: separate trusted instructions from untrusted data, constrain and least-privilege the tools and actions the model can take, validate and sanitize inputs and tool outputs, add output guardrails and injection classifiers, and keep a human in the loop for sensitive actions. Treat all external or retrieved content as untrusted.

Walk me through the full ML lifecycle from problem definition to model retirement.

The ML lifecycle spans eight phases: problem framing, data collection and validation, feature engineering, training and experimentation, offline evaluation, deployment, production monitoring, and retirement or retraining. Each phase has distinct owners, artefacts, and failure modes that an MLOps practice must systematise.

ML security (MLSecOps) — MLOps

The last lesson defended against harm the model causes by behaving exactly as trained — an unfair decision, an unexplained denial. But we ended by naming the threat a bias report can’t see: someone attacking the model on purpose. That adversary is this lesson. Where responsible-AI ops asks “is what the model honestly does acceptable?”, ML security asks the opposite question — “what happens when someone is actively trying to corrupt, steal, or weaponize it?”

Standard application security still applies to ML systems — but they add an attack surface ordinary software doesn’t have. The data can be poisoned, the model can be stolen or fooled, and the model file itself can carry malware. MLSecOps is folding these ML-specific threats into your security practice, and OWASP now maintains dedicated entries for them.

The ML attack surface

ML adds four attack vectors across the lifecycle: poisoning, theft, evasion, and supply chain.

Data & model poisoning — an attacker slips crafted samples into the training data to plant a backdoor: the model behaves normally until it sees a secret trigger, then misclassifies. OWASP tracks this as a top ML/LLM risk; the “PoisonGPT” demo made it concrete.
Model theft / extraction — by querying your prediction API enough, an attacker reconstructs a copy of your model (stealing the IP), or infers whether a specific record was in the training set (a privacy leak).
Evasion / adversarial inputs — small, deliberate perturbations to an input (imperceptible to humans) that flip the prediction — the classic “sticker on a stop sign” attack.
Supply chain — malicious dependencies, or a malicious model file. This is the sharpest practical risk for MLOps teams.

The pickle problem

Defenses that belong in the pipeline

Secure the supply chain — pin and scan dependencies, verify model provenance/signatures, and maintain an AI bill of materials (AIBOM) — the same SBOM/SLSA discipline as software, extended to data and models.
Validate training data — data validation tests and data contracts catch anomalous samples before they poison a model.
Lock down the model API — rate-limit and monitor prediction endpoints to make model extraction and membership-inference impractical.
Govern artifacts — only load models that passed through the registry with verified lineage; never pip install a random model server and point it at production.

In one breath

ML adds four attack vectors ordinary software doesn’t have: poisoning (crafted training samples that plant a backdoor — normal behavior until a secret trigger fires), theft/extraction (probing the prediction API to clone the model or infer who was in the training data), evasion (imperceptible adversarial perturbations that flip a prediction), and supply chain (malicious dependencies or a model file that runs code on load — pickle is as dangerous as a random shell script); the defenses are security discipline applied to ML artifacts — safe formats (Safetensors/ONNX), pinned-and-scanned dependencies with provenance/signing (an AIBOM), data-validation gates, locked-down APIs, and loading only registry-verified models — and the pragmatic order is to stop the practical attacks first, not every exotic one.

Practice

Before the quiz, sit with the sharpest practical risk the lesson names: the model file itself. Explain, to a teammate about to pip install a random model server and point it at production, why loading an untrusted checkpoint is “exactly as dangerous as running a random shell script,” and name the one format choice that defuses it. Then prioritize: of the four attack vectors, which two would you harden first for a typical production API, and why those over chasing adversarial-perturbation defenses?

Quick check

0/3

Q1What is a data-poisoning backdoor attack?

Q2Why is loading an untrusted model file dangerous?

Q3What is model extraction?

A question to carry forward

That closes MLOps. Look back at the whole arc: a model is 5% of the work, and the other 95% is the loop you built around it — a lifecycle, baselines, tracking, containers, CI/CD, serving, monitoring, retraining, a platform, and now its guardrails of cost, governance, and security. You can take a model from a notebook to a system that keeps working, safely and accountably, in production.

But every single lesson in this chapter quietly stood on something it never had to build: data, already cleaned, joined, and waiting. The feature store assumed a table of features; the training job assumed a dataset; the retraining window assumed rows by the million arriving on schedule. We have spent a whole section operating models and never once asked how the data that feeds them gets processed — and at the scale these systems run, that is not a small question. A real training set isn’t a CSV; it’s terabytes that won’t fit on one machine, let alone in pandas. So the question to carry forward, out of MLOps and into the next section, is the one underneath everything we just built: when the data is too big for a single computer, how do you process it at all? That is big data, and the section that answers it — Hadoop, the cluster, and Spark — begins with the most basic question of all: what does “big data” actually mean?

ML security (MLSecOps)

What you'll learn

Before you start

The ML attack surface

The pickle problem

Defenses that belong in the pipeline

In one breath

Practice

Quick check

A question to carry forward

Sign in to track your progress

Practice this in an interview

Related lessons

Explore further