If you’re ready to deploy your app to Cloud, follow the Cloud deployment quickstart or the full setup guide. This page explains the Cloud managed architecture for reference.
- Fully managed infrastructure: LangChain handles all infrastructure, updates, scaling, and maintenance.
- Deploy from GitHub: Connect your repositories and deploy with a few clicks.
- Automated CI/CD: Build process is handled automatically by the platform.
- LangSmith UI: Full access to observability, evaluation, deployment management, and Studio.
| Who manages it | Where it runs | |
|---|---|---|
| LangSmith platform (UI, APIs, datastores) | LangChain | LangChain’s cloud |
| Your Agent Servers | LangChain | LangChain’s cloud |
| CI/CD for your apps | LangChain | LangChain’s cloud |

Get started
To deploy your first application to Cloud, follow the Cloud deployment quickstart or refer to the comprehensive setup guide.Cloud architecture and scalability
This section is only relevant for the cloud-managed LangSmith services available at https://smith.langchain.com and https://eu.smith.langchain.com.For information on the self-hosted LangSmith solution, please refer to the self-hosted documentation.
Architecture
The US-based LangSmith service is deployed in theus-central1 (Iowa) region of GCP.
The EU-based LangSmith service is now available (as of mid-July 2024) and is deployed in the
europe-west4 (Netherlands) region of GCP. If you are interested in an enterprise plan in this region, contact our sales team.Regional storage
The resources and services in this table are stored in the location corresponding to the URL where sign-up occurred (either the US or EU). Cloud-managed LangSmith uses Supabase for authentication/authorization and ClickHouse Cloud for data warehouse.| US | EU | |
|---|---|---|
| URL | https://smith.langchain.com | https://eu.smith.langchain.com |
| API URL | https://api.smith.langchain.com | https://eu.api.smith.langchain.com |
| GCP | us-central1 (Iowa) | europe-west4 (Netherlands) |
| Supabase | AWS us-east-1 (N. Virginia) | AWS eu-central-1 (Germany) |
| ClickHouse Cloud | us-central1 (Iowa) | europe-west4 (Netherlands) |
| LangSmith deployment | us-central1 (Iowa) | europe-west4 (Netherlands) |
Region-independent storage
Data listed here is stored exclusively in the US:- Payment and billing information with Stripe and Metronome
GCP services
LangSmith is composed of the following services, all deployed on Google Kubernetes Engine (GKE):- LangSmith Frontend: serves the LangSmith UI.
- LangSmith Backend: serves the LangSmith API.
- LangSmith Platform Backend: handles authentication and other high-volume tasks. (Internal service)
- LangSmith Playground: handles forwarding requests to various LLM providers for the Playground feature.
- LangSmith Queue: handles processing of asynchronous tasks. (Internal service)
- Google Cloud Storage (GCS) for runs inputs and outputs.
- Google Cloud SQL PostgreSQL for transactional workloads.
- Google Cloud Memorystore for Redis for queuing and caching.
- Clickhouse Cloud on GCP for trace ingestion and analytics. Our services connect to Clickhouse Cloud, which is hosted in the same GCP region, via a private endpoint.
- Google Cloud Load Balancer for routing traffic to the LangSmith services.
- Google Cloud CDN for caching static assets.
- Google Cloud Armor for security and rate limits. For more information on rate limits we enforce, please refer to this guide.

Allowlisting IP addresses
Egress from LangChain SaaS
All traffic leaving LangSmith services will be routed through a NAT gateway. All traffic will appear to originate from the following IP addresses:| US | EU |
|---|---|
| 34.59.65.97 | 34.13.192.67 |
| 34.67.51.221 | 34.147.105.64 |
| 34.46.212.37 | 34.90.22.166 |
| 34.132.150.88 | 34.147.36.213 |
| 35.188.222.201 | 34.32.137.113 |
| 34.58.194.127 | 34.91.238.184 |
| 34.59.97.173 | 35.204.101.241 |
| 104.198.162.55 | 35.204.48.32 |
Ingress into LangChain SaaS
The langchain endpoints map to the following static IP addresses:| US | EU |
|---|---|
| 34.8.121.39 | 34.95.92.214 |
| 34.107.251.234 | 34.13.73.122 |
api.smith.langchain.com, smith.langchain.com, beacon.langchain.com, eu.api.smith.langchain.com, eu.smith.langchain.com, eu.beacon.langchain.com).
API rate limits
LangSmith enforces rate limits on API endpoints to ensure service stability and fair usage. The following table shows the rate limits for different endpoints in both US and EU regions. Note that:- Rate limits are expressed as
count / intervalwhere count is the number of requests allowed within the interval (in seconds). For example,2000 / 10means 2000 requests per 10 seconds. - When no HTTP method is specified in the endpoint column, the rate limit applies to all HTTP methods for that endpoint.
- When a specific method is listed (e.g.,
POST,GET), the rate limit applies only to that method.
| Match / Endpoint (method) | Identity key | US prod limit | EU prod limit | Category |
|---|---|---|---|---|
OPTIONS, /info, */v1/metadata/submit | IP | 2000 / 10 | 2000 / 10 | High throughput |
/auth | x-api-key | 2000 / 10 | 2000 / 10 | High throughput |
/auth | x-user-id + IP | 2000 / 10 | 2000 / 10 | High throughput |
/v1/beacon | IP | 2000 / 10 | 2000 / 10 | High throughput |
/repos | x-api-key | 100 / 60 | 100 / 60 | Repository |
/repos | x-user-id + IP | 100 / 60 | 100 / 60 | Repository |
POST /runs/batch | x-api-key | 2000 / 10 | 2000 / 10 | High throughput |
POST /otel/v1/traces | x-api-key | 2000 / 10 | 2000 / 10 | Run ingest |
POST containing /charts | x-api-key | 750 / 600 | 750 / 600 | Charts |
POST containing /charts | x-user-id + IP | 750 / 600 | 750 / 600 | Charts |
POST /runs/multipart | x-api-key | 6000 / 10 | 6000 / 10 | Multipart ingest |
POST /runs/query | x-api-key | 15 / 10 | 15 / 10 | Run query (API) |
POST /runs/query | x-user-id + IP | 300 / 10 | 300 / 10 | Run query (User) |
/generate | x-api-key | 30 / 3600 | 30 / 3600 | Generation |
/generate | x-user-id + IP | 30 / 3600 | 30 / 3600 | Generation |
/commits | x-api-key | 10000 / 60 | 2000 / 60 | Commits |
/commits | x-user-id + IP | 10000 / 60 | 2000 / 60 | Commits |
DELETE /sessions or */trigger | x-api-key | 10 / 60 | 10 / 60 | Deletion |
DELETE /sessions or */trigger | x-user-id + IP | 30 / 60 | 30 / 60 | Deletion |
POST /runs (single run ingest) | x-api-key | 2000 / 10 | 2000 / 10 | Run ingest |
PATCH containing /runs | x-api-key | 2000 / 10 | 2000 / 10 | Run ingest |
POST /feedback | x-api-key | 2000 / 10 | 2000 / 10 | High throughput |
GET /runs/{uuid} or /api/v1/runs/{uuid} | x-api-key | 30 / 60 | 30 / 60 | Run lookup |
GET containing /examples | x-api-key | 5000 / 60 | 5000 / 60 | Examples |
Any request with x-api-key | x-api-key | 1000 / 10 | 1000 / 10 | Default (API key) |
Any request with x-user-id | x-user-id + IP | 1000 / 10 | 1000 / 10 | Default (User) |
/public/download | IP | 5000 / 60 | 5000 / 60 | Public download |
/runs/stats | x-api-key | 1 / 10 | 20 / 10 | Stats |
| All other IPs (catch-all) | IP | 100 / 60 | 100 / 60 | Public (catch-all) |
Rate limit categories
- High throughput: General high-volume endpoints for core operations like authentication, metadata, and feedback.
- Repository: Repository and prompt management operations.
- Run ingest: Individual trace/run ingestion endpoints for observability.
- Charts: Chart generation and visualization endpoints.
- Multipart ingest: Bulk run ingestion via multipart upload for high-volume tracing.
- Run query (API): API key-based run query operations with stricter limits for complex queries.
- Run query (User): User-based run query operations with higher limits for interactive use.
- Generation: AI-powered code and content generation endpoints (limited to prevent abuse).
- Commits: Prompt versioning and commit operations.
- Deletion: Session deletion and workflow trigger operations.
- Run lookup: Retrieving specific runs by UUID.
- Examples: Fetching dataset examples for few-shot prompting.
- Default (API key): Fallback rate limit for authenticated API requests not matching specific patterns.
- Default (User): Fallback rate limit for authenticated user requests not matching specific patterns.
- Public download: High-volume public download endpoints for shared resources.
- Stats: Run statistics and analytics endpoints (region-specific limits apply).
- Public (catch-all): Default rate limit for unauthenticated public access.
