What is the Model Context Protocol (MCP) and what problem does it solve?

MCP is an open protocol from Anthropic that standardizes how LLM applications discover and connect to external tools, data sources, and prompts through a common client-server interface. It replaces bespoke per-integration glue with a single protocol, so any MCP-compatible host can use any MCP server, and has been adopted across the broader ecosystem.

How do MCP tool poisoning, cross-server shadowing, and rug pulls differ?

Tool poisoning embeds malicious influence in a tool description or result. Cross-server shadowing lets one low-trust server steer use of another server's capability. A rug pull changes a previously reviewed schema, description, implementation, dependency, or destination later. Defend with isolated trust contexts, capability snapshots, change review, sandboxing and runtime authorization.

How do function/tool calling and LLM agents work at a high level?

Tool calling extends the LLM's output space to include structured function invocations. The model emits a JSON object naming a tool and its arguments; the runtime executes the tool and feeds the result back as a new message. An agent is a loop that repeats this cycle — observe, think, act — until the task is complete or a stopping condition is met.

What is tool use or function calling in LLMs, and how do you design good tools for an agent?

Function calling lets an LLM output a structured request to invoke an external function with arguments, which the runtime executes and feeds back, enabling agents to act in the world. Good tool design uses clear names and descriptions, minimal well-typed parameters, narrow single-purpose scope, least privilege, and informative error messages so the model can choose and call them reliably.

Code execution with MCP — Agentic AI

As agents get more MCP tools, a quiet scaling problem appears. The traditional approach loads every tool definition into the context up front, and routes every intermediate result back through the model. Connect a dozen MCP servers with a few hundred tools and fetch a large dataset, and you’re spending a fortune in tokens before the agent has done anything useful. Anthropic’s code execution with MCP flips this — and the savings are dramatic.

The problem: everything flows through the model

Two costs balloon as tool use grows:

Tool definitions — each tool’s schema and description sits in the context on every call, whether or not it’s used. Hundreds of tools = tens of thousands of tokens of pure overhead.
Intermediate data — fetch a 50,000-row result to filter it down to 3 rows, and all 50,000 rows pass through the context window on the way. The model pays to read data it’s only going to discard.

The fix: write code, run it out-of-context

In code mode, the MCP tools are presented as a code API the agent can program against. Instead of “call tool, read result, call tool, read result,” the agent writes a short script: call these tools, join and filter the results, return just the final answer. The fetching and crunching happen in a sandbox, outside the context window — only the small final result comes back to the model.

# Conceptually, the agent writes & runs code like this (in a sandbox):
orders = mcp.db.query("SELECT * FROM orders WHERE status='open'")  # 50k rows
overdue = [o for o in orders if days_since(o.due) > 30]            # filtered out-of-context
summary = f"{len(overdue)} overdue orders, total ${sum(o.amount for o in overdue):,}"
return summary    # ONLY this short string re-enters the model's context

The model never sees the 50,000 rows — just the one-line summary. Tool definitions are discovered and called as code, not pre-loaded as schemas. The result, on tool-heavy tasks, is up to a ~98.7% token reduction (Anthropic’s reported 150K → 2K), with matching cuts to cost and latency — and less context rot because the window stays small.

In one breath

With many MCP tools, two costs balloon: all tool definitions loaded up front, and every intermediate result routed back through the context window.
Code execution presents tools as a code API the agent programs against — it writes a short script that fetches, joins, and filters in a sandbox, returning only the final answer.
The big data never enters the window, so token use drops dramatically (Anthropic’s reported ~150K → 2K, ~98.7%), with matching cost/latency cuts and less context rot.
It’s context engineering in disguise: keep the window small, do the bulk work elsewhere.
The isolation is non-negotiable — model-written code must run in a sandbox (microVM/container) with no ambient credentials, egress controls, and resource limits.

Quick check

0/3

Q1What two things bloat the context when an agent has many MCP tools?

Q2How does code execution with MCP reduce tokens so dramatically?

Q3What's the essential safety requirement for this pattern?

Round out production readiness with observability & tracing and cost & latency control.

Code execution with MCP

What you'll learn

Before you start

The problem: everything flows through the model

The fix: write code, run it out-of-context

In one breath

Quick check

Quick check

Next

Sign in to track your progress

Practice this in an interview

Related lessons

Explore further