What is an AI agent, and how does it differ from a single LLM call?

An agent is an LLM placed in a loop where it reasons, chooses and calls tools or actions, observes the results, and repeats until a goal is met, rather than producing one response and stopping. The key differences are autonomy, tool use, memory and state, and multi-step control flow driven by the model's own decisions.

What is the Model Context Protocol (MCP) and what problem does it solve?

MCP is an open protocol from Anthropic that standardizes how LLM applications discover and connect to external tools, data sources, and prompts through a common client-server interface. It replaces bespoke per-integration glue with a single protocol, so any MCP-compatible host can use any MCP server, and has been adopted across the broader ecosystem.

When should you use a multi-agent system versus a single agent, and what is the supervisor versus swarm pattern?

Use multiple agents when a task decomposes into distinct specialties or parallel subtasks that exceed one agent's context or reliability; avoid it when a single agent suffices, since multi-agent systems add coordination overhead, latency, cost, and error propagation. A supervisor architecture has an orchestrator routing work to specialized sub-agents, while a swarm lets peer agents hand off control to one another without a central coordinator.

How do function/tool calling and LLM agents work at a high level?

Tool calling extends the LLM's output space to include structured function invocations. The model emits a JSON object naming a tool and its arguments; the runtime executes the tool and feeds the result back as a new message. An agent is a loop that repeats this cycle — observe, think, act — until the task is complete or a stopping condition is met.

A2A — the Agent2Agent Protocol — Agentic AI

In the MCP lesson you saw how one agent reaches down to its tools and data — a USB-C port for models. A2A (the Agent2Agent Protocol) solves the orthogonal problem: how one agent reaches across to another independent agent as a peer. The two are not rivals. The official spec calls them highly complementary: MCP is agent-to-tools; A2A is agent-to-agent. A real system uses both — A2A between agents, MCP inside each agent for its own tools.

The word that does all the work here is opaque. A2A’s central design principle is Opaque Execution: the agents “collaborate effectively without exposing their internal logic, memory, or proprietary tools.” That single constraint is what makes A2A a protocol and not just a function call — and it dictates every design choice that follows.

Two protocols, two directions

A2A runs horizontally between agents; each agent still uses MCP vertically for its own tools. Different layers, not competitors.

For the full landscape of MCP vs A2A vs ACP vs ANP, see the agent protocols overview. This lesson goes deep on A2A alone.

Step 1 — discovery via the Agent Card

You cannot delegate to an agent you cannot describe. So A2A starts with a discovery document: the Agent Card, a JSON “business card” that advertises an agent’s identity and what it can do. The canonical way to publish it is at a well-known URL — a plain HTTP GET away, following the RFC 8615 convention:

https://currency.example.com/.well-known/agent-card.json

A note on the path. The current spec uses /.well-known/agent-card.json. Earlier v0.x drafts used /.well-known/agent.json, and tooling often still accepts the old name for backward compatibility — but write new agents against agent-card.json.

A card carries the agent’s name, description, and provider; its A2A service url; a version; the capabilities it supports (notably streaming and pushNotifications); its security schemes (Bearer, OAuth2, API keys); default input/output modes; and a list of skills — each with an id, name, description, and example invocations. That skills array is the menu a client reads to decide whether this is the right agent for the job.

The well-known URL is only one of three discovery mechanisms the spec defines. For enterprises there are curated registries — a catalog you query by skill or tag — and for tightly-coupled systems there is direct configuration, a hardcoded URL or env var. The card format is identical; only how the client finds it changes.

Step 2–4 — the task, its lifecycle, and the artifact

Once the client has the card and picks a skill, it sends a task. This is the heart of A2A and the cleanest way to see how it differs from a tool call. A tool call is request → response: synchronous, stateless, done. A task is a stateful object with a lifecycle. It has an id, a contextId that groups related tasks, a status, an optional history of messages, and the artifacts it eventually produces.

Each message between the agents is built from typed parts — text, a file/url pointer, structured data (arbitrary JSON), or raw bytes. That is what makes A2A modality-independent: the same envelope carries a sentence, a spreadsheet, or an image. When the work finishes, the remote agent returns its results as artifacts (each an artifactId, a name, and its own parts) — the outputs of the task, as distinct from the conversational messages along the way.

The status walks a fixed set of states. These string values are fixed by the spec — you do not invent your own:

The happy path is submitted → working → completed. input-required and auth-required pause for the client. completed, failed, canceled and rejected are terminal.

The terminal-state rule is the gotcha worth memorizing: once a task is completed, failed, canceled, or rejected, it is immutable — you cannot reopen it. Follow-up work starts a new task in the same contextId, linked back via referenceTaskIds so the remote agent can infer continuity. This is deliberate: it keeps an auditable, append-only history across organizational boundaries.

Put the round-trip together and the whole handshake is just four HTTP exchanges:

Long-running tasks — stream or get called back

A currency conversion is instant. A “render this 90-second video” task is not. A2A gives a remote agent two ways to report progress on work that outlives a single request.

Streaming over SSE. The client calls the JSON-RPC method message/stream; the server replies 200 with Content-Type: text/event-stream and pushes Server-Sent Events. Each event’s data field carries a complete JSON-RPC response delivering an incremental status change or an artifact chunk. If the connection drops, tasks/resubscribe reconnects to the live stream.

Push notifications. Holding an SSE connection open for an hour is fragile. So for long-running or disconnected work, the client registers a webhook with tasks/pushNotificationConfig/set, and the remote agent makes server-initiated HTTP POST callbacks when the task updates — no polling, no held-open socket. The Agent Card’s capabilities advertise which of these (streaming, pushNotifications) an agent supports.

A2A is built on boring, sturdy web tech on purpose — HTTP, JSON-RPC 2.0, and SSE — with v1.0 adding gRPC and HTTP+JSON/REST bindings and version negotiation. Auth follows suit: schemes are declared in the Agent Card and negotiated out-of-band like any HTTP API, so credentials are never put inside the A2A message payload. v1.0 also adds cryptographically signed Agent Cards, so a client can verify an agent’s identity before trusting it across a trust boundary.

When do you actually need A2A?

Here is the honest engineering answer, because A2A is not free — it is a network hop, a contract, and an auth dance.

Reach for a plain in-process call (or MCP for tools) when the sub-agent is your own code: same process, same framework, sharing state. That is faster and simpler. A function call beats a protocol every time you are allowed to make one.
Reach for A2A when the other agent is a separate, independently deployed, opaque service — a different team, vendor, or framework, across a network or trust boundary — and you need standardized discovery (the Agent Card), long-running async task semantics with streaming or webhooks, and enterprise auth. In short: when you cannot, or should not, share internal state, and you need a vendor-neutral contract instead of a function signature.

That boundary — can I just call its function? — is the whole decision. A2A exists for every time the answer is no.

A note on governance

A2A is not a single vendor’s project. Google announced it on April 9, 2025, then donated it to the Linux Foundation on June 23, 2025, forming the Agent2Agent Protocol Project with founding members including AWS, Cisco, Google, Microsoft, Salesforce, SAP, and ServiceNow. The first stable release, A2A v1.0, shipped recently. Sources differ on the exact month — Google’s anniversary blog says March, some write-ups say January — so treat it as soft; the version jump from the v0.x drafts to a production 1.0 is solid. It is Apache-2.0 licensed and vendor-neutral.

In one breath

A2A connects independent, opaque agents as peers — orthogonal to MCP (agent-to-tools); a real system uses both.
Discovery is the Agent Card (/.well-known/agent-card.json): a JSON manifest of skills, URL, auth schemes, and capabilities the client reads to pick the right agent.
Work is a stateful task with a lifecycle (submitted → working → completed), built from typed parts, returning artifacts; terminal states are immutable — follow-ups start a new task in the same contextId.
Long-running work reports via SSE streaming or, more robustly, push-notification webhooks — no socket held open for an hour.
The whole decision is can I just call its function? — if yes, use an in-process call or MCP; reach for A2A only when the other agent is a separate, opaque, cross-boundary service.

Quick check

0/3

Q1What is the key difference between MCP and A2A?

Q2A client delegated a task that reached the 'completed' state. The user now asks a follow-up. What does A2A prescribe?

Q3Transfer: you're building a research agent. It needs to (a) read your internal Postgres and (b) hand a sub-question to a partner company's specialist agent for an hour-long analysis. Which protocols fit each, and how should progress on the long task be reported?

You now know how one agent delegates to another. For the wider map of competing and complementary standards — MCP, A2A, ACP, ANP and where each fits — read the agent protocols overview, and revisit MCP to see the tool-facing half of the same picture.

A2A — the Agent2Agent Protocol

What you'll learn

Before you start

Two protocols, two directions

Step 1 — discovery via the Agent Card

Step 2–4 — the task, its lifecycle, and the artifact

Long-running tasks — stream or get called back

When do you actually need A2A?

A note on governance

In one breath

Quick check

Quick check

Next

Sign in to track your progress

Practice this in an interview

Related lessons

Explore further