Deep Agents CLI

A terminal interface for building agents with persistent memory. Agents maintain context across sessions, learn project conventions, and execute code with approval controls. The Deep Agents CLI has the following built-in capabilities:

File operations - read, write, and edit files in your project with tools that enable agents to manage and modify code and documentation.
Shell command execution - execute shell commands to run tests, build projects, manage dependencies, and interact with version control systems.
Web search - search the web for up-to-date information and documentation (requires Tavily API key).
HTTP requests - make HTTP requests to APIs and external services for data fetching and integration tasks.
Task planning and tracking - break down complex tasks into discrete steps and track progress through the built-in todo system.
Memory storage and retrieval - store and retrieve information across sessions, enabling agents to remember project conventions and learned patterns.
Human-in-the-loop - require human approval for sensitive tool operations.

Watch the demo video to see how the Deep Agents CLI works.

Quick start

Set your API key

Export as an environment variable:

export ANTHROPIC_API_KEY="your-api-key"

Or create a .env file in your project root:

ANTHROPIC_API_KEY=your-api-key

Run the CLI

uvx deepagents-cli

Give the agent a task

> Create a Python script that prints "Hello, World!"

The agent proposes changes with diffs for your approval before modifying files.

Additional installation and configuration options

Install locally if needed:

pip install deepagents-cli

The CLI uses Anthropic Claude Sonnet 4 by default. To use OpenAI:

export OPENAI_API_KEY="your-key"

Enable web search (optional):

export TAVILY_API_KEY="your-key"

API keys can be set as environment variables or in a .env file.

Configuration

Command-line options

Option	Description
`--agent NAME`	Use named agent with separate memory
`--auto-approve`	Skip tool confirmation prompts (toggle with `Ctrl+T`)
`--sandbox TYPE`	Execute in remote sandbox: `modal`, `daytona`, or `runloop`
`--sandbox-id ID`	Reuse existing sandbox
`--sandbox-setup PATH`	Run setup script in sandbox

CLI commands

Command	Description
`deepagents list`	List all agents
`deepagents help`	Show help
`deepagents reset --agent NAME`	Clear agent memory and reset to default
`deepagents reset --agent NAME --target SOURCE`	Copy memory from another agent

Interactive mode

Slash commands

Use these commands within the CLI session:

/tokens - Display token usage
/clear - Clear conversation history
/exit - Exit the CLI

Bash commands

Execute shell commands directly by prefixing with !:

!git status
!npm test
!ls -la

Keyboard shortcuts

Shortcut	Action
`Enter`	Submit
`Alt+Enter`	Newline
`Ctrl+E`	External editor
`Ctrl+T`	Toggle auto-approve
`Ctrl+C`	Interrupt
`Ctrl+D`	Exit

Set project conventions with memories

Agents store information in ~/.deepagents/AGENT_NAME/memories/ as markdown files using a memory-first protocol:

Research: Searches memory for relevant context before starting tasks
Response: Checks memory when uncertain during execution
Learning: Automatically saves new information for future sessions

Organize memories by topic with descriptive filenames:

~/.deepagents/backend-dev/memories/
├── api-conventions.md
├── database-schema.md
└── deployment-process.md

Teach the agent conventions once:

uvx deepagents-cli --agent backend-dev
> Our API uses snake_case and includes created_at/updated_at timestamps

It remembers for future sessions:

> Create a /users endpoint
# Applies conventions without prompting

Use remote sandboxes

Execute code in isolated remote environments for safety and flexibility. Remote sandboxes provide the following benefits:

Safety: Protect your local machine from potentially harmful code execution
Clean environments: Use specific dependencies or OS configurations without local setup
Parallel execution: Run multiple agents simultaneously in isolated environments
Long-running tasks: Execute time-intensive operations without blocking your machine
Reproducibility: Ensure consistent execution environments across teams

To use a remote sandbox, follow these steps:

Configure your sandbox provider (Runloop, Daytona, or Modal):

# Runloop
export RUNLOOP_API_KEY="your-key"

# Daytona
export DAYTONA_API_KEY="your-key"

# Modal
modal setup

Run the CLI with a sandbox:
```
uvx deepagents-cli --sandbox runloop --sandbox-setup ./setup.sh
```
The agent runs locally but executes all code operations in the remote sandbox. Optional setup scripts can configure environment variables, clone repositories, and prepare dependencies.

(Optional) Create a setup.sh file to configure your sandbox environment:

#!/bin/bash
set -e

# Clone repository using GitHub token
git clone https://x-access-token:${GITHUB_TOKEN}@github.com/username/repo.git $HOME/workspace
cd $HOME/workspace

# Make environment variables persistent
cat >> ~/.bashrc <<'EOF'
export GITHUB_TOKEN="${GITHUB_TOKEN}"
export OPENAI_API_KEY="${OPENAI_API_KEY}"
cd $HOME/workspace
EOF

source ~/.bashrc

Store secrets in a local .env file for the setup script to access.

Sandboxes isolate code execution, but agents remain vulnerable to prompt injection with untrusted inputs. Use human-in-the-loop approval, short-lived secrets, and trusted setup scripts only. Note that sandbox APIs are evolving rapidly, and we expect more providers to support proxies that help mitigate prompt injection and secrets management concerns.

Edit this page on GitHub or file an issue.

Connect these docs to Claude, VSCode, and more via MCP for real-time answers.

Get started

Core capabilities

Command line interface

Quick start

Configuration

Interactive mode

Set project conventions with memories

Use remote sandboxes

Get started

Core capabilities

Command line interface

​Quick start

​Configuration

​Interactive mode

​Set project conventions with memories

​Use remote sandboxes

Quick start

Configuration

Interactive mode

Set project conventions with memories

Use remote sandboxes