/* global React */

function Chapter06() {
  return (
    <section className="chapter" id="ch-06" data-screen-label="06 Agent types">
      <div className="chapter-header">
        <div className="eyebrow">Chapter 06 · Agent types</div>
        <h1 className="chapter-title">ReAct, OpenAI tools, structured chat — what they really are.</h1>
        <p className="chapter-lede">
          "Agent type" sounds intimidating; each one is just a different convention for how the LLM signals "call a
          tool." Three flavors: text parsing (ReAct), native tool calling (OpenAI tools), and JSON-only (structured
          chat).
        </p>
      </div>

      <SectionTitle num="6.1">ReAct — Reason + Act in plain text</SectionTitle>
      <p>
        ReAct is the OG pattern (Yao et al., 2022). It predates native tool calling. The trick: ask the model to write a
        rigid text format and parse its output yourself.
      </p>
      <CodeBlock file="react_prompt.txt" lang="text">{`Answer the following question using available tools.

You have access to:
  get_weather(city: str) - returns current weather
  search_web(query: str) - searches the web

Use this exact format:
  Thought: <your reasoning>
  Action: <tool_name>
  Action Input: <JSON args>
  Observation: <tool result will be inserted here>
  ... (repeat as needed)
  Thought: I now know the final answer.
  Final Answer: <answer>

Question: {input}
{agent_scratchpad}`}</CodeBlock>

      <p>
        The runtime parses each <code>Thought / Action / Action Input</code> block, runs the tool, injects the
        <code>Observation:</code> back into the prompt, and the model continues. Watch one play out:
      </p>

      <ReactTrace />
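      <p>
        The parsing step itself is tiny. Here is an illustrative hand-rolled sketch (not LangChain's actual parser;
        the real one is <code>ReActSingleInputOutputParser</code>), assuming the exact format from the prompt above:
      </p>
      <CodeBlock file="react_parser.py">{`import json

def parse_react_step(text):
    """Extract the agent's next move from a ReAct-formatted completion."""
    if "Final Answer:" in text:
        return ("finish", text.split("Final Answer:")[-1].strip())
    action = action_input = None
    for line in text.splitlines():
        if line.startswith("Action:"):
            action = line.removeprefix("Action:").strip()
        elif line.startswith("Action Input:"):
            action_input = line.removeprefix("Action Input:").strip()
    if action is None or action_input is None:
        # One stray character and you land here -- the fragility in action
        raise ValueError("Malformed ReAct output")
    return ("call", action, json.loads(action_input))`}</CodeBlock>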

      <Callout kind="gotcha" title="ReAct is fragile">
        If the model emits malformed text (extra newline, wrong key) the parser breaks. Modern code uses native tool
        calling whenever the model supports it. Keep ReAct in your back pocket for local models without function
        calling.
      </Callout>

      <SectionTitle num="6.2">OpenAI tools agent — native tool calling</SectionTitle>
      <p>
        When the model API supports tool calling natively (GPT-4, Claude 3+, Gemini 1.5+), you don't need text parsing.
        The model returns a structured <code>tool_calls</code> field. LangChain's <code>create_tool_calling_agent</code>
        is the modern default:
      </p>
      <CodeBlock file="tool_calling_agent.py">{`from langchain.agents import AgentExecutor, create_tool_calling_agent
from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder

# Assumes model (a tool-calling chat model) and tools (a list of tools)
# are already defined.
prompt = ChatPromptTemplate.from_messages([
    ("system", "You are a helpful assistant."),
    MessagesPlaceholder("chat_history", optional=True),
    ("human", "{input}"),
    MessagesPlaceholder("agent_scratchpad"),
])

agent = create_tool_calling_agent(model, tools, prompt)
executor = AgentExecutor(agent=agent, tools=tools, verbose=True)

executor.invoke({"input": "What's the weather in Tokyo?"})`}</CodeBlock>

      <p>This is what most new code uses. It's the same idea as ReAct, but the loop is driven by structured fields instead of regex parsing.</p>
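      <p>
        What the API actually returns is worth seeing once. A simplified, illustrative payload in the OpenAI Chat
        Completions shape (field names follow the public API; the values here are invented):
      </p>
      <CodeBlock file="tool_calls_payload.py">{`import json

# Simplified assistant message requesting a tool call (OpenAI-style).
response_message = {
    "role": "assistant",
    "content": None,
    "tool_calls": [{
        "id": "call_abc123",
        "type": "function",
        "function": {
            "name": "get_weather",
            "arguments": '{"city": "Tokyo"}',  # arguments arrive as a JSON string
        },
    }],
}

# The "parsing" is just reading fields the SDK already deserialized:
call = response_message["tool_calls"][0]
args = json.loads(call["function"]["arguments"])`}</CodeBlock>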

      <SectionTitle num="6.3">Structured chat — JSON-everything</SectionTitle>
      <p>
        For models that <em>don't</em> have native tool calling but <em>do</em> reliably emit JSON (smaller open-source
        models, often), the structured chat agent forces JSON output:
      </p>
      <CodeBlock file="structured_chat.py">{`from langchain.agents import create_structured_chat_agent

# Same idea, but the model must respond with:
# {"action": "tool_name", "action_input": {...}}
# or
# {"action": "Final Answer", "action_input": "the answer"}

# The prompt must expose {tools}, {tool_names}, and {agent_scratchpad}.
agent = create_structured_chat_agent(model, tools, prompt)`}</CodeBlock>
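      <p>
        The parser behind this is barely more than <code>json.loads</code> plus one branch. A hand-rolled sketch
        (LangChain's real parser also handles JSON wrapped in markdown fences):
      </p>
      <CodeBlock file="structured_parser.py">{`import json

def parse_structured_step(text):
    payload = json.loads(text)  # malformed JSON raises loudly here
    if payload["action"] == "Final Answer":
        return ("finish", payload["action_input"])
    return ("call", payload["action"], payload["action_input"])`}</CodeBlock>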

      <SectionTitle num="6.4">The 3-way comparison — exactly what differs</SectionTitle>
      <p>
        Same loop, three different ways to extract "what does the model want to do next?" from its output. Here's a
        scannable side-by-side:
      </p>

      <div className="comparison-table">
        <table>
          <thead>
            <tr>
              <th></th>
              <th>ReAct</th>
              <th>Structured Chat</th>
              <th>Tool Calling</th>
            </tr>
          </thead>
          <tbody>
            <tr>
              <td>Who defines the format?</td>
              <td>LangChain prompt</td>
              <td>LangChain prompt</td>
              <td>Model provider (OpenAI/Anthropic)</td>
            </tr>
            <tr>
              <td>How is output parsed?</td>
              <td>Regex / string split</td>
              <td><code>json.loads()</code></td>
              <td>SDK deserializes API field</td>
            </tr>
            <tr>
              <td>Model requirement</td>
              <td>Can follow instructions</td>
              <td>Can output valid JSON</td>
              <td>Fine-tuned for tool use</td>
            </tr>
            <tr>
              <td>Failure mode</td>
              <td><span className="bad">Silent — wrong parse</span></td>
              <td><span className="warn">Loud — JSON error thrown</span></td>
              <td><span className="good">Rare — schema enforced</span></td>
            </tr>
            <tr>
              <td>Tool definitions injected into the prompt?</td>
              <td>Yes</td>
              <td>Yes</td>
              <td>No — goes in API <code>tools=</code> field</td>
            </tr>
            <tr>
              <td>AgentExecutor needed?</td>
              <td>✅ Yes</td>
              <td>✅ Yes</td>
              <td>✅ Yes</td>
            </tr>
          </tbody>
        </table>
      </div>

      <p>
        Notice the bottom row — the executor is identical across all three. The only thing that changes is the parser
        wrapped around the LLM call. That's why the framework can offer them as drop-in alternatives.
      </p>
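      <p>
        To make that concrete, here is the shared loop in miniature. This is a hypothetical sketch, not
        <code>AgentExecutor</code>'s real code; the per-agent-type parser is hidden inside <code>next_action</code>:
      </p>
      <CodeBlock file="shared_loop.py">{`def run_agent(next_action, tools, user_input, max_steps=10):
    steps = []  # the scratchpad: (decision, observation) pairs
    for _ in range(max_steps):
        # next_action wraps LLM call + parse: ReAct regex, json.loads,
        # or reading tool_calls. This is the only part that differs.
        decision = next_action(user_input, steps)
        if decision[0] == "finish":
            return decision[1]
        _, tool_name, args = decision
        observation = tools[tool_name](**args)  # Act
        steps.append((decision, observation))   # Observe
    raise RuntimeError("Agent exceeded max_steps")`}</CodeBlock>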

      <SectionTitle num="6.5">Choosing one (decision tree)</SectionTitle>

      <h4 className="mini-title">Timeline of need</h4>
      <CodeBlock file="timeline.txt" lang="text">{`2022 ──► ReAct            (any model, just follow text format)
2023 ──► Structured Chat  (models that reliably emit JSON)
2023 ──► Tool Calling     (GPT-4, Claude 3 — API-level support)
2024 ──► LangGraph        (replaces AgentExecutor entirely)`}</CodeBlock>

      <p>
        Each row was a response to a real ceiling in the row above. ReAct works on anything, but its parsing is brittle.
        Structured chat hardens the parser but still embeds the contract in the prompt. Tool calling pushes the contract
        into the API itself. LangGraph then says "the loop deserves to be a graph" and rewrites the runtime around state
        and persistence. Knowing the order makes the design choices feel obvious instead of arbitrary.
      </p>

      <CodeBlock file="decision_tree.txt" lang="text">{`Does the model API return structured tool_calls?
├── YES → create_tool_calling_agent        (GPT-4o, Claude, Gemini)
└── NO → Does it reliably output JSON?
         ├── YES → create_structured_chat_agent   (smaller open-source models)
         └── NO → ReAct                            (tiny/basic local models)

Special case: Need state, branching, parallelism, human-in-loop?
         └── Skip all of these → Use LangGraph`}</CodeBlock>

      <Callout kind="intuition" title="They all share the same loop">
        Don't be fooled by the names. All three "agent types" implement the same Reason → Act → Observe loop. The only
        difference is the wire format the LLM uses to say "call this tool." Pick whichever your model speaks best.
      </Callout>

      <CodeBlock file="all_three_same_shape.py">{`# All three follow this exact same pattern:
agent = create_tool_calling_agent(model, tools, prompt)   # just the "brain"
executor = AgentExecutor(agent=agent, tools=tools)         # adds the loop
executor.invoke({"input": "..."})                          # now it actually runs`}</CodeBlock>

      <p>
        Swap <code>create_tool_calling_agent</code> for <code>create_react_agent</code> or <code>create_structured_chat_agent</code> —
        the rest of the code is identical. The agent factory returns a Runnable that produces "next action"; the executor
        wraps it in the loop. <strong>That separation is the whole API surface.</strong>
      </p>
    </section>
  );
}

window.Chapter06 = Chapter06;
