Explain the ReAct agent pattern and how it compares to Plan-and-Execute and Reflexion.

ReAct interleaves reasoning traces with actions step by step, deciding the next tool call based on the latest observation. Plan-and-Execute first drafts a full multi-step plan and then executes it, which takes fewer LLM calls and is more predictable on complex tasks but adapts poorly to unexpected results. Reflexion adds a self-reflection step: the agent critiques past failures and retries with that feedback.

What is an AI agent, and how does it differ from a single LLM call?

An agent is an LLM placed in a loop where it reasons, chooses and calls tools or actions, observes the results, and repeats until a goal is met, rather than producing one response and stopping. The key differences are autonomy, tool use, memory and state, and multi-step control flow driven by the model's own decisions.

What prompt engineering techniques should every LLM practitioner know?

The core toolkit is: system prompts (role and constraints), few-shot examples (format and tone anchoring), chain-of-thought (step-by-step reasoning), and output constraints (JSON schema, stop sequences). Used together, they close much of the gap between a capable base model and a production-ready feature.

How do function/tool calling and LLM agents work at a high level?

Tool calling extends the LLM's output space to include structured function invocations. The model emits a JSON object naming a tool and its arguments; the runtime executes the tool and feeds the result back as a new message. An agent is a loop that repeats this cycle — observe, think, act — until the task is complete or a stopping condition is met.
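To make the cycle concrete, here is a minimal sketch of the runtime side of one tool call. The wire format (a JSON object with `tool` and `arguments` keys), the `lookup_order` tool, and the shape of the returned message are all assumptions for illustration; real providers each define their own format.

```python
import json

# Hypothetical tool; the name and return value are illustrative only.
def lookup_order(order_id: str) -> dict:
    return {"order_id": order_id, "status": "shipped", "days_ago": 2}

# Registry mapping tool names the model may emit to Python callables.
TOOLS = {"lookup_order": lookup_order}

def run_tool_call(model_output: str) -> dict:
    """Parse a JSON tool call emitted by the model and execute it.

    Returns a message dict the runtime would append to the conversation.
    """
    call = json.loads(model_output)
    name = call["tool"]
    args = call.get("arguments", {})
    if name not in TOOLS:
        # Feed the error back to the model instead of crashing the loop.
        error = {"error": f"unknown tool: {name}"}
        return {"role": "tool", "name": name, "content": json.dumps(error)}
    result = TOOLS[name](**args)
    return {"role": "tool", "name": name, "content": json.dumps(result)}

# Stand-in for what a model might emit (assumed format, not a real API).
emitted = '{"tool": "lookup_order", "arguments": {"order_id": "A123"}}'
message = run_tool_call(emitted)
```

Returning unknown-tool errors as an ordinary tool message is one possible design choice: it lets the model see the mistake and correct it on the next turn.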

ReAct, Plan-Execute, Reflexion — Agentic AI

The design-patterns lesson covered the workflow shapes (prompt chaining, routing, parallelization). This lesson is about the reasoning loops an agent runs inside those shapes — how it actually decides what to do next. Three dominate the field, and knowing which to reach for is the difference between an agent that’s reliable and one that’s slow, expensive, or brittle.

Same task, three ways an agent can reason

Three reasoning loops every agent engineer should know. Each leaves a characteristic trace and carries its own tradeoffs. They're not interchangeable; the right one depends on the task. The sample trace below is the ReAct one.

Thought: I need the user's order status

Action: lookup_order(id)

Observation: status = shipped, 2 days ago

Thought: they asked about a refund, check policy

Action: get_policy('refund')

Observation: refundable within 30 days

Answer: yes, you're within the window…

LLM calls: high (one per step)

Adaptivity: high (reacts to each result)

Best for: dynamic tool-use where you can't enumerate steps up front

ReAct — reason and act, interleaved

ReAct (Reason + Act) is the workhorse. The agent loops: Thought → Action → Observation → Thought → … — it reasons about what to do, takes one action (a tool call), observes the result, and reasons again with that new information. Because it reacts to each observation, it handles dynamic situations gracefully — it doesn’t need to know the steps in advance.

The cost: an LLM call per step, so a long task is many calls (latency and money), and it can wander or loop if the stop condition is loose. ReAct is the default shape behind most tool-using agents.
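The loop and its step cap can be sketched in a few lines. This is a minimal illustration, not a reference implementation: `scripted_model` is a hypothetical stand-in for the LLM call that replays the order/refund trace shown earlier, and the two tools return canned strings.

```python
from typing import Callable

# Hypothetical tools with canned results, mirroring the sample trace.
def lookup_order(order_id: str) -> str:
    return "status = shipped, 2 days ago"

def get_policy(topic: str) -> str:
    return "refundable within 30 days"

TOOLS: dict[str, Callable[[str], str]] = {
    "lookup_order": lookup_order,
    "get_policy": get_policy,
}

def scripted_model(history: list[str]) -> dict:
    """Stand-in for an LLM call: picks the next step from observations so far."""
    seen = sum(line.startswith("Observation:") for line in history)
    if seen == 0:
        return {"thought": "I need the user's order status",
                "action": "lookup_order", "input": "A123"}
    if seen == 1:
        return {"thought": "They asked about a refund, check policy",
                "action": "get_policy", "input": "refund"}
    return {"thought": "I have what I need",
            "answer": "Yes, you're within the refund window."}

def react(question: str, model, max_steps: int = 5) -> tuple[str, list[str]]:
    history = [f"Question: {question}"]
    for _ in range(max_steps):      # hard cap so the loop cannot wander forever
        step = model(history)       # one model call per step
        history.append(f"Thought: {step['thought']}")
        if "answer" in step:
            return step["answer"], history
        observation = TOOLS[step["action"]](step["input"])
        history.append(f"Action: {step['action']}({step['input']!r})")
        history.append(f"Observation: {observation}")
    return "Stopped: step budget exhausted.", history

answer, trace = react("Can I get a refund on order A123?", scripted_model)
```

Calling `react(..., max_steps=1)` with the same scripted model returns the budget-exhausted message, which is the behavior you want when the stop condition is loose.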

Plan-and-Execute — decide everything first

Plan-and-Execute splits reasoning from doing. A planner LLM lays out all the steps up front; an executor then runs them, often without calling the planner again. This is cheaper and more predictable — one expensive planning call, then mechanical execution — and easier to debug because the plan is explicit.

The weakness: it’s blind to surprises. If step 2 returns something the plan didn’t anticipate, a pure plan-execute agent plows ahead. In practice you add a re-plan step when execution deviates — a hybrid that recovers some of ReAct’s adaptivity.
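A small sketch of the hybrid, under stated assumptions: `planner` and `executor` are hypothetical stand-ins (a real planner would be an LLM call), step names are made up, and a result starting with "unexpected" is the assumed signal that execution has deviated from the plan.

```python
from typing import Optional

def planner(goal: str, completed: list[str],
            surprise: Optional[str] = None) -> list[str]:
    """Stand-in for the planning LLM call; returns the remaining steps."""
    if surprise is not None:
        # Hypothetical re-plan: route around the unexpected result.
        return ["escalate_to_human"]
    return ["lookup_order", "get_policy", "draft_reply"]

def executor(step: str) -> str:
    """Stand-in for mechanical execution; no planner call involved."""
    results = {
        "lookup_order": "ok: shipped 2 days ago",
        "get_policy": "unexpected: policy not found",
        "draft_reply": "ok: reply drafted",
        "escalate_to_human": "ok: ticket opened",
    }
    return results[step]

def plan_and_execute(goal: str, max_replans: int = 1) -> list[tuple[str, str]]:
    log: list[tuple[str, str]] = []
    plan = planner(goal, completed=[])      # the one up-front planning call
    replans = 0
    while plan:
        step = plan.pop(0)
        result = executor(step)
        log.append((step, result))
        # Re-plan only when execution deviates and the budget allows it.
        if result.startswith("unexpected") and replans < max_replans:
            replans += 1
            done = [name for name, _ in log]
            plan = planner(goal, completed=done, surprise=result)
    return log

log = plan_and_execute("answer refund question")
```

With `max_replans=0` the same function behaves like a pure plan-execute agent: it records the unexpected result and still runs `draft_reply`, which is exactly the failure mode described above.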

Reflexion — try, critique, retry

Reflexion adds a self-correction loop: the agent makes an attempt, then a reflection step critiques its own output (“I cited the wrong policy section”), and it retries with that feedback. It trades extra passes for quality, and it shines when first attempts often fail and the result can be checked — code that must pass tests, structured extraction you can validate, math you can verify.
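The attempt, critique, retry cycle might look like the sketch below on a checkable task (a function that must pass tests). Everything here is illustrative: `attempt` and `reflect` are hypothetical stand-ins for LLM calls, and the task (sum of integers from 0 to n) was chosen only because it is easy to verify.

```python
from typing import Callable, Optional

def check(candidate: Callable[[int], int]) -> Optional[str]:
    """External verifier: returns None on success, or a failure description."""
    for n, expected in [(0, 0), (3, 6), (4, 10)]:
        got = candidate(n)
        if got != expected:
            return f"f({n}) returned {got}, expected {expected}"
    return None

def attempt(reflections: list[str]) -> Callable[[int], int]:
    """Stand-in for the generating LLM call, conditioned on past reflections."""
    if not reflections:
        return lambda n: sum(range(n))       # first try: off-by-one bug
    return lambda n: sum(range(n + 1))       # retry after reading the critique

def reflect(failure: str) -> str:
    """Stand-in for the critique LLM call; turns a failure into advice."""
    return f"Previous attempt failed ({failure}); include the upper bound n."

def reflexion(max_attempts: int = 3):
    reflections: list[str] = []
    for tries in range(1, max_attempts + 1):
        candidate = attempt(reflections)
        failure = check(candidate)
        if failure is None:
            return candidate, tries, reflections
        # Store the critique so the next attempt can use it.
        reflections.append(reflect(failure))
    return None, max_attempts, reflections   # budget exhausted

solution, tries, notes = reflexion()
```

The verifier is what makes the extra passes worthwhile: without a `check` the agent has no grounded signal to reflect on, which is why the pattern fits tests, validation, and verifiable math.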

In one breath

These are the reasoning loops an agent runs inside a workflow shape — how it decides the next move.
ReAct interleaves Thought → Action → Observation, reasoning again after each result — adaptive, but one LLM call per step and can wander.
Plan-and-Execute plans every step up front then runs them mechanically — cheaper and predictable, but blind to surprises unless you add a re-plan step.
Reflexion attempts, self-critiques, and retries — best when first tries fail and the result is checkable (tests, validation, math).
Pick the simplest loop the task needs (dynamic → ReAct, known multi-step → Plan-Execute, checkable-hard → Reflexion), combine when useful, and always cap the budget.
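As a rough illustration of that selection rule, the heuristic can be written as a tiny function. The parameter names and the precedence order (checkable-hard first, then known steps, then ReAct as the fallback) are assumptions made for this sketch, not a rule stated by the lesson.

```python
def choose_loop(steps_known_upfront: bool,
                result_checkable: bool,
                first_try_often_fails: bool) -> str:
    """Map coarse task traits to a reasoning loop (illustrative heuristic)."""
    if result_checkable and first_try_often_fails:
        return "Reflexion"           # checkable-hard: retries pay off
    if steps_known_upfront:
        return "Plan-and-Execute"    # known multi-step: plan once, run cheaply
    return "ReAct"                   # dynamic: decide after each observation

choice = choose_loop(steps_known_upfront=False,
                     result_checkable=False,
                     first_try_often_fails=False)
```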

Quick check

Q1. What characterizes the ReAct loop?

Q2. When is Plan-and-Execute the better choice over ReAct?

Q3. What problem does Reflexion specifically address?

These loops run inside a single agent. When one agent isn’t enough, see multi-agent orchestration — and the equally important question of when not to go multi-agent.
