Alpha Notice: These docs cover the v1-alpha release. Content is incomplete and subject to change. For the latest stable version, see the current LangChain Python or LangChain JavaScript docs.

Overview

Agents combine language models with tools to create systems that can reason about tasks, decide which tools to use, and iteratively work towards solutions. create_react_agent() provides a production-ready ReAct (Reasoning + Acting) agent implementation based on the paper ReAct: Synergizing Reasoning and Acting in Language Models.

ReAct frames an agent’s behavior as an interleaving of thought -> action -> observation steps, where the model writes out its reasoning, picks a tool, sees the tool’s result, and then repeats. ReAct reduces hallucinations and makes the decision process auditable: the agent can form hypotheses (thought), test them with tools (action), and update its plan based on feedback (observation). A ReAct loop runs until a stop condition is met - i.e., the model emits a final answer or a max-iterations limit is reached.

Under the hood, create_react_agent() builds a graph-based agent runtime. A graph consists of nodes (steps) and edges (connections) that define how your agent processes information. The agent moves through this graph, executing nodes like the model node (which calls the model), the tools node (which executes tools), or pre/post model hook nodes. Learn more about the graph API.
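Because the agent is a compiled graph, you can inspect that structure directly. A minimal sketch (assumes OPENAI_API_KEY is set; the printed node names are illustrative and may vary by version):
from langchain.agents import create_react_agent

agent = create_react_agent("openai:gpt-5", tools=[])

# get_graph() exposes the underlying graph; list its node ids
print(list(agent.get_graph().nodes))
# e.g. ['__start__', 'model', '__end__'] - a tools node appears once tools are passed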

Core components

Model

The model is the reasoning engine of your agent. It can be specified in multiple ways, supporting both static and dynamic model selection.

Static model

Static models are configured once when creating the agent and remain unchanged throughout execution. This is the most common and straightforward approach. To initialize a static model from a model identifier string:
from langchain.agents import create_react_agent

agent = create_react_agent(
    "openai:gpt-5", 
    tools=tools
)
Model identifier strings use the format provider:model (e.g. "openai:gpt-5") and support automatic inference (e.g. "gpt-5" will be inferred as "openai:gpt-5"). For more control over the model configuration, initialize a model instance directly:
from langchain.agents import create_react_agent
from langchain_openai import ChatOpenAI

model = ChatOpenAI(
    model="gpt-5", 
    temperature=0.1,
    max_tokens=1000,
    timeout=30
)
agent = create_react_agent(model, tools=tools)
Model instances give you complete control over configuration. Use them when you need to set specific parameters like temperature, max tokens, timeouts, or configure API keys, base URLs, and other provider-specific settings. Refer to the API reference to see available params and methods on your model.

Dynamic model

runtime: The execution environment of your agent, containing immutable configuration and contextual data that persists throughout the agent’s execution (e.g., user IDs, session details, or application-specific configuration).
state: The data that flows through your agent’s execution, including messages, custom fields, and any information that needs to be tracked and potentially modified during processing (e.g. user preferences or tool usage stats).
Dynamic models are selected at runtime based on the current state and context. This enables sophisticated routing logic and cost optimization. To use a dynamic model, you need to provide a function that receives the graph state and runtime and returns an instance of BaseChatModel with the tools bound to it using .bind_tools(tools), where tools is a subset of the tools parameter.
from langchain_openai import ChatOpenAI
from langchain.agents import create_react_agent, AgentState
from langgraph.runtime import Runtime

def select_model(state: AgentState, runtime: Runtime):
    """Choose model based on conversation complexity."""
    messages = state["messages"]
    message_count = len(messages)

    if message_count < 10:
        return ChatOpenAI(model="gpt-4.1-mini").bind_tools(tools)  # Cheaper model for short conversations
    else:
        return ChatOpenAI(model="gpt-5").bind_tools(tools)  # Stronger model for longer conversations

agent = create_react_agent(select_model, tools=tools)
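The runtime can drive selection as well. A hedged sketch, assuming the agent accepts a context_schema (as in recent LangGraph releases) and a hypothetical user_tier context field:
from dataclasses import dataclass

from langchain_openai import ChatOpenAI
from langchain.agents import create_react_agent, AgentState
from langgraph.runtime import Runtime

@dataclass
class Context:
    user_tier: str = "free"  # hypothetical field, for illustration only

def select_model_by_tier(state: AgentState, runtime: Runtime[Context]):
    """Route premium users to a stronger model."""
    name = "gpt-5" if runtime.context.user_tier == "premium" else "gpt-4.1-mini"
    return ChatOpenAI(model=name).bind_tools(tools)

agent = create_react_agent(select_model_by_tier, tools=tools, context_schema=Context)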
For model configuration details, see Models.

Tools

Tools give agents the ability to take actions. Agents go beyond simple model-only tool binding by facilitating:
  • Multiple tool calls in sequence triggered by a single prompt
  • Parallel tool calls when appropriate
  • Dynamic tool selection based on results
  • Tool retry logic and error handling
  • State persistence across tool calls
Tools can be provided to the agent as either:
  1. A list of tools (LangChain @tool, callable, or dict that represents a builtin provider tool)
  2. A configured ToolNode

Passing a list of tools

Passing a list of tools to the agent will create a ToolNode under the hood. This is the simplest way to set up a tool-calling agent:
from langchain_core.tools import tool
from langchain.agents import create_react_agent

@tool
def search(query: str) -> str:
    """Search for information."""
    return f"Results for: {query}"

@tool
def calculate(expression: str) -> str:
    """Perform calculations."""
    return str(eval(expression))  # Demo only: eval is unsafe on untrusted input

agent = create_react_agent(model, tools=[search, calculate])
If an empty tool list is provided, the agent will consist of a single LLM node without tool calling.
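For instance, a minimal sketch of a chat-only agent (reusing the model defined earlier):
from langchain.agents import create_react_agent

# With an empty tool list, the graph contains only the model node
chat_agent = create_react_agent(model, tools=[])
result = chat_agent.invoke({"messages": [{"role": "user", "content": "Hello!"}]})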

Passing a configured ToolNode

Alternatively, you can create a ToolNode directly and pass it to the agent. This allows you to customize the tool node’s behavior, such as handling tool errors:
from langgraph.prebuilt import ToolNode

tool_node = ToolNode(
    tools=[search, calculate],
    handle_tool_errors="Please check your input and try again."
)
agent = create_react_agent(model, tools=tool_node)
result = agent.invoke({"messages": [...]})
If an error occurs within ToolNode, the agent will return a ToolMessage to the model with the custom error message:
# result["messages"]
[
    ...
    ToolMessage(content="Please check your input and try again.", tool_call_id="..."),
    ...
]

To learn more about error handling in ToolNode, see ToolNode.

Tool use in the ReAct loop

Agents follow the ReAct (Reasoning + Acting) pattern, alternating between brief reasoning steps with targeted tool calls and feeding the resulting observations into subsequent decisions until they can deliver a final answer. Follow along as an agent applies ReAct to answer a user’s question:

Prompt: Identify the current most popular wireless headphones and verify availability.

================================ Human Message =================================

Find the most popular wireless headphones right now and check if they're in stock
Reasoning: “Popularity is time-sensitive, I need to use the provided search tool.”
Acting: Call search_products("wireless headphones")
================================== Ai Message ==================================
Tool Calls:
  search_products (call_abc123)
 Call ID: call_abc123
  Args:
    query: wireless headphones
================================= Tool Message =================================

Found 5 products matching "wireless headphones". Top 5 results: WH-1000XM5, ...
Reasoning: “I need to confirm availability for the top-ranked item before answering.”
Acting: Call check_inventory("WH-1000XM5")
================================== Ai Message ==================================
Tool Calls:
  check_inventory (call_def456)
 Call ID: call_def456
  Args:
    product_id: WH-1000XM5
================================= Tool Message =================================

Product WH-1000XM5: 10 units in stock
Reasoning: “I have the most popular model and its stock status. I can now answer the user’s question.”
Acting: Produce final answer
================================== Ai Message ==================================

I found wireless headphones (model WH-1000XM5) with 10 units in stock...
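A transcript like the one above can be approximated with stub tools. The search_products and check_inventory tools below are hypothetical stand-ins that return canned results, purely for illustration:
from langchain_core.tools import tool
from langchain.agents import create_react_agent

@tool
def search_products(query: str) -> str:
    """Search the product catalog by keyword."""
    # Stubbed result for illustration
    return f'Found 5 products matching "{query}". Top 5 results: WH-1000XM5, ...'

@tool
def check_inventory(product_id: str) -> str:
    """Check stock for a product ID."""
    return f"Product {product_id}: 10 units in stock"

agent = create_react_agent(model, tools=[search_products, check_inventory])
result = agent.invoke({
    "messages": [{"role": "user", "content": "Find the most popular wireless headphones right now and check if they're in stock"}]
})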
To learn more about tools, see Tools.

Prompt

You can shape how your agent approaches tasks by providing a prompt. The prompt parameter can be provided in several forms.

String

agent = create_react_agent(
    model,
    tools,
    prompt="You are a helpful assistant. Be concise and accurate."
)

SystemMessage

from langchain_core.messages import SystemMessage

agent = create_react_agent(
    model,
    tools,
    prompt=SystemMessage(content="You are a research assistant. Cite your sources.")
)

Callable

from langchain_core.messages import SystemMessage

def dynamic_prompt(state):
    user_type = state.get("user_type", "standard")
    system_msg = SystemMessage(
        content="Provide detailed technical responses." 
        if user_type == "expert" 
        else "Provide simple, clear explanations."
    )
    return [system_msg] + state["messages"]
agent = create_react_agent(model, tools, prompt=dynamic_prompt)

Runnable

from langchain_core.messages import SystemMessage
from langchain_core.runnables import RunnableLambda

prompt_runnable = RunnableLambda(
    lambda state: [
        SystemMessage(content=f"Current context: {state.get('context', 'general')}"),
        *state["messages"]
    ]
)
agent = create_react_agent(model, tools, prompt=prompt_runnable)
When no prompt is provided, the agent will infer its task from the messages directly.
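For example, a minimal sketch where all steering comes from the message history itself:
agent = create_react_agent(model, tools)

# No prompt parameter: instructions travel in the messages instead
result = agent.invoke({
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Summarize today's AI news."},
    ]
})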
For more details on message types and formatting, see Messages.

Advanced configuration

Structured Output

In some situations, you may want the agent to return an output in a specific format. LangChain provides a simple, universal way to do this with the response_format parameter.
from pydantic import BaseModel
from langchain.agents import create_react_agent

class ContactInfo(BaseModel):
    name: str
    email: str
    phone: str

agent = create_react_agent(
    model,
    tools=[search_tool],
    response_format=ContactInfo
)

result = agent.invoke({
    "messages": [{"role": "user", "content": "Extract contact info from: John Doe, john@example.com, (555) 123-4567"}]
})

result["structured_response"]
# ContactInfo(name='John Doe', email='john@example.com', phone='(555) 123-4567')
To learn about structured output, see Structured Output.

Memory

Agents maintain conversation history automatically through the message state. You can also configure the agent to use a custom state schema to remember additional information during the conversation. Information stored in the state can be thought of as the “short-term memory” of the agent:
from typing_extensions import Annotated
from langgraph.graph.message import add_messages
from langchain.agents import create_react_agent, AgentState

class CustomAgentState(AgentState):
    messages: Annotated[list, add_messages]
    user_preferences: dict

agent = create_react_agent(
    model,
    tools=tools,
    state_schema=CustomAgentState
)

# The agent can now track additional state beyond messages. This custom state can be accessed and updated throughout the conversation.
result = agent.invoke({
    "messages": [{"role": "user", "content": "I prefer technical explanations"}],
    "user_preferences": {"style": "technical", "verbosity": "detailed"},
})
To learn more about memory, including how to implement long-term memory that persists across sessions, see Memory.

Pre-model hook

The pre-model hook is an optional node that can process state before the model is called. Use cases include message trimming, summarization, and context injection. It must be a callable or runnable that takes the current graph state and returns a state update in the form of:
{
    # Will UPDATE the `messages` in the state
    "messages": [RemoveMessage(id=REMOVE_ALL_MESSAGES), ...],
    # Any other state keys that need to be propagated
    ...
}
Example of a pre-model hook that trims messages to fit the context window:
from langchain_core.messages import RemoveMessage
from langgraph.graph.message import REMOVE_ALL_MESSAGES
from langchain.agents import create_react_agent

def trim_messages(state):
    """Keep only the last few messages to fit context window."""
    messages = state["messages"]
    
    if len(messages) <= 3:
        return {"messages": messages}
    
    first_msg = messages[0]
    # Vary the window by message-count parity so an AI tool call isn't split from its tool result
    recent_messages = messages[-3:] if len(messages) % 2 == 0 else messages[-4:]
    new_messages = [first_msg] + recent_messages
    
    return {
        "messages": [
            RemoveMessage(id=REMOVE_ALL_MESSAGES),
            *new_messages
        ]
    }

agent = create_react_agent(
    model,
    tools=tools,
    pre_model_hook=trim_messages
)
messages must be provided and will be used as input to the agent node (i.e., the node that calls the LLM). The rest of the keys will be added to the graph state.
If you are returning messages in the pre-model hook, you should OVERWRITE the messages key by doing the following:
{
    "messages": [RemoveMessage(id=REMOVE_ALL_MESSAGES), *new_messages],
    ...
}

Post-model hook

The post-model hook is an optional node that can process the model’s response before tool execution. Use cases include validation, guardrails, and other post-processing. It must be a callable or runnable that takes the current graph state and returns a state update. Example of a post-model hook that filters out confidential information:
from langchain_core.messages import AIMessage, RemoveMessage
from langgraph.graph.message import REMOVE_ALL_MESSAGES

def validate_response(state):
    """Check model response for policy violations."""
    messages = state["messages"]
    last_message = messages[-1]
    
    if "confidential" in last_message.content.lower():
        return {
            "messages": [
                RemoveMessage(id=REMOVE_ALL_MESSAGES),
                *messages[:-1],
                AIMessage(content="I cannot share confidential information.")
            ]
        }
    
    return {}

agent = create_react_agent(
    model,
    tools=tools,
    post_model_hook=validate_response
)

Streaming

We’ve seen how the agent can be called with .invoke to get a final response. If the agent executes multiple steps, this may take a while. To show intermediate progress, we can stream back messages as they occur.
for chunk in agent.stream({
    "messages": [{"role": "user", "content": "Search for AI news and summarize the findings"}]
}, stream_mode="values"):
    # Each chunk contains the full state at that point
    latest_message = chunk["messages"][-1]
    if latest_message.content:
        print(f"Agent: {latest_message.content}")
    elif latest_message.tool_calls:
        print(f"Calling tools: {[tc['name'] for tc in latest_message.tool_calls]}")
For more details on streaming, see Streaming.

End-to-end examples

Simple search agent

A simple agent that answers questions by searching for information with the Tavily search tool. To run this example, you’ll need to set your Tavily and OpenAI API keys as environment variables.
from langchain_openai import ChatOpenAI
from langchain_tavily import TavilySearch
from langchain.agents import create_react_agent

search = TavilySearch(max_results=2)
tools = [search]

model = ChatOpenAI(model="gpt-5")
agent = create_react_agent(
    model, 
    tools,
    prompt="You are a helpful assistant that can search the web for information. You can use the search tool to find information. Answer concisely."
)
agent.invoke({
    "messages": [
        {"role": "user", "content": "Tell me about LangChain."}
    ]
})
# LangChain is an open-source framework designed for building applications based on large language models (LLMs). It enables developers to create and customize prompt chains or templates, connect language models to external data sources, and adapt language models to specific business contexts. 

# LangChain also provides platforms for building reliable agents, used by top engineering teams and global brands like Replit, AT&T, Home Depot, and Klarna. The platform includes LangGraph, a framework for building controllable agents and long-running workflows, and LangSmith, which offers evaluation and monitoring tools to help developers debug and improve agents at scale. LangChain supports various integrations and makes it easy for teams to build, deploy, and manage enterprise-grade agents efficiently.

# For more detailed information, you can visit [LangChain's LinkedIn page](https://www.linkedin.com/company/langchain) or their [website](https://langchain.com/).

Multi-step researcher agent

A more sophisticated agent that researches topics by collecting and summarizing multiple articles. This example uses a Tavily search tool to browse the web, a store to persist articles across sessions, a post_model_hook to track tool usage, a custom state_schema to hold the usage stats, and a checkpointer so follow-up invocations can continue the same conversation thread.
from langchain_openai import ChatOpenAI
from langchain_tavily import TavilySearch
from langchain_core.messages import AIMessage
from langchain_core.tools import tool
from langgraph.checkpoint.memory import MemorySaver
from langchain.agents import create_react_agent, AgentState
from langgraph.store.memory import InMemoryStore
import json

search = TavilySearch(max_results=3)

store = InMemoryStore()

class ResearcherState(AgentState):
    tool_usage_stats: dict[str, int]

@tool
def store_article(name: str, url: str, summary: str) -> str:
    """Store an article's information in memory."""
    namespace = ("research", "articles")
    existing = store.get(namespace, "collected_articles")
    articles = existing.value if existing else []
    
    articles.append({
        "name": name,
        "url": url,
        "summary": summary
    })
    
    store.put(namespace, "collected_articles", articles)
    return f"Stored article: {name}"

@tool  
def get_stored_articles() -> str:
    """Retrieve all stored articles."""
    namespace = ("research", "articles")
    existing = store.get(namespace, "collected_articles")
    
    if not existing:
        return "No articles stored yet."
    
    articles = existing.value
    if not articles:
        return "No articles stored yet."
    
    return json.dumps(articles, indent=2)

def track_tool_usage(state):
    """Track which tools are being called and add usage stats to state."""
    messages = state["messages"]
    last_message = messages[-1]

    if isinstance(last_message, AIMessage) and last_message.tool_calls:
        stats = dict(state.get("tool_usage_stats") or {})
        for tool_call in last_message.tool_calls:
            stats[tool_call["name"]] = stats.get(tool_call["name"], 0) + 1
        return {"tool_usage_stats": stats}

    # Final answer (no tool calls): report the accumulated stats
    if isinstance(last_message, AIMessage) and state.get("tool_usage_stats"):
        print(f"Tool usage summary: {json.dumps(state['tool_usage_stats'])}")
    return {}

model = ChatOpenAI(model="gpt-5")
tools = [search, store_article, get_stored_articles]
agent = create_react_agent(
    model,
    tools,
    post_model_hook=track_tool_usage,
    prompt="""You are a research assistant. When asked to find articles:
1. Search for relevant content
2. Store each article with a descriptive name, URL, and summary
3. Only perform at most one tool call at a time
4. Keep track of what you've collected
5. Continue until you have the requested number of articles
6. Before you finish, double check that you have the requested number of articles by calling the get_stored_articles tool
7. If you have the requested number of articles, stop and present them in an organized format""",
    checkpointer=MemorySaver(),
    store=store,
    state_schema=ResearcherState
)
Invoke a research session:
config = {"configurable": {"thread_id": "langchain-agents"}, "recursion_limit": 100}
agent.invoke({
        "messages": [
            {"role": "user", "content": "Find 5 relevant articles about LangChain and AI agents."}
        ]
    }, 
    config=config,
)
# Here are 5 relevant articles about LangChain and AI agents:

# 1) LangGraph — LangChain Blog
# - URL: https://blog.langchain.dev/langgraph
# - Summary: Official introduction to LangGraph (built on LangChain) explaining why cyclical graphs are useful for agent runtimes. Covers core concepts and how LangGraph enables robust, stateful agents with loops and branching beyond traditional DAGs.

# 2) LangGraph: Multi-Agent Workflows — LangChain Blog
# - URL: https://blog.langchain.dev/langgraph-multi-agent-workflows
# - Summary: Shows how to build multi-agent systems with LangGraph, including coordination patterns and examples. Compares to other frameworks (e.g., AutoGen) and links to repo examples for practical implementation.

# 3) Plan-and-Execute Agents — LangChain Blog
# - URL: https://blog.langchain.dev/planning-agents
# - Summary: Discusses plan-and-execute agent architectures in LangChain. Explains separating a planner LLM from a tool-execution runtime, variations like ReWOO, and a "joiner” component for dynamic replanning to improve speed and reliability.

# 4) LangGraph Tutorial: Building LLM Agents with LangChain (Zep)
# - URL: https://www.getzep.com/ai-agents/langgraph-tutorial/
# - Summary: Hands-on tutorial for building agents using LangGraph, including ReAct-style agents, tool integration (ToolNode), memory (MemorySaver), and guidelines for production-ready agent architectures.

# 5) Building AI agent systems with LangGraph (Medium)
# - URL: https://medium.com/pythoneers/building-ai-agent-systems-with-langgraph-9d85537a6326
# - Summary: Overview of using LangGraph (on top of LangChain) to build AI agents with cyclic, stateful graphs. Compares with LangChain’s AgentExecutor loop and highlights benefits for error handling, tool use, and adaptable multi-agent workflows.
We can then use the same thread_id to invoke a follow-up session where the agent maintains access to work performed in the previous session:
config = {"configurable": {"thread_id": "research-session-2"}, "recursion_limit": 100}
agent.invoke({
        "messages": [
            {"role": "user", "content": "Find 2 more articles."}
        ]
    }, 
    config=config,
)
# Here are 2 more relevant articles about LangChain and AI agents:

# 6) LangGraph Studio: The first agent IDE — LangChain Blog
# - URL: https://blog.langchain.com/langgraph-studio-the-first-agent-ide/
# - Summary: Official announcement of LangGraph Studio, an IDE for developing, visualizing, and debugging agentic applications. Explains why specialized tooling matters for agents and how to get started using Studio.

# 7) Tool calling — Python LangChain (Docs)
# - URL: https://python.langchain.com/docs/concepts/tool_calling/
# - Summary: Concept guide to LangChain’s standardized tool-calling interface. Covers binding tools to models, controlling tool choice, and how this underpins agent behavior and AgentExecutor loops.