Cloud

If you’re ready to deploy your app to Cloud, follow the Cloud deployment quickstart or the full setup guide. This page explains the Cloud managed architecture for reference.

The Cloud option is a fully managed model where LangChain hosts and operates all LangSmith infrastructure and services:

Fully managed infrastructure: LangChain handles all infrastructure, updates, scaling, and maintenance.
Deploy from GitHub: Connect your repositories and deploy with a few clicks.
Automated CI/CD: Build process is handled automatically by the platform.
LangSmith UI: Full access to observability, evaluation, deployment management, and Studio.

	Who manages it	Where it runs
LangSmith platform (UI, APIs, datastores)	LangChain	LangChain’s cloud
Your Agent Servers	LangChain	LangChain’s cloud
CI/CD for your apps	LangChain	LangChain’s cloud

Cloud deployment: LangChain hosts and manages all components including the UI, APIs, and your Agent Servers.

Get started

To deploy your first application to Cloud, follow the Cloud deployment quickstart or refer to the comprehensive setup guide.

Cloud architecture and scalability

This section is only relevant for the cloud-managed LangSmith services available at https://smith.langchain.com and https://eu.smith.langchain.com.For information on the self-hosted LangSmith solution, please refer to the self-hosted documentation.

LangSmith is deployed on Google Cloud Platform (GCP) and is designed to be highly scalable. Many customers run production workloads on LangSmith for LLM application observability, evaluation, and agent deployment

Architecture

The US-based LangSmith service is deployed in the us-central1 (Iowa) region of GCP.

The EU-based LangSmith service is now available (as of mid-July 2024) and is deployed in the europe-west4 (Netherlands) region of GCP. If you are interested in an enterprise plan in this region, contact our sales team.

Regional storage

The resources and services in this table are stored in the location corresponding to the URL where sign-up occurred (either the US or EU). Cloud-managed LangSmith uses Supabase for authentication/authorization and ClickHouse Cloud for data warehouse.

	US	EU
URL	https://smith.langchain.com	https://eu.smith.langchain.com
API URL	https://api.smith.langchain.com	https://eu.api.smith.langchain.com
GCP	us-central1 (Iowa)	europe-west4 (Netherlands)
Supabase	AWS us-east-1 (N. Virginia)	AWS eu-central-1 (Germany)
ClickHouse Cloud	us-central1 (Iowa)	europe-west4 (Netherlands)
LangSmith deployment	us-central1 (Iowa)	europe-west4 (Netherlands)

See the Regions FAQ for more information.

Region-independent storage

Data listed here is stored exclusively in the US:

Payment and billing information with Stripe and Metronome

GCP services

LangSmith is composed of the following services, all deployed on Google Kubernetes Engine (GKE):

LangSmith Frontend: serves the LangSmith UI.
LangSmith Backend: serves the LangSmith API.
LangSmith Platform Backend: handles authentication and other high-volume tasks. (Internal service)
LangSmith Playground: handles forwarding requests to various LLM providers for the Playground feature.
LangSmith Queue: handles processing of asynchronous tasks. (Internal service)

LangSmith uses the following GCP storage services:

Google Cloud Storage (GCS) for runs inputs and outputs.
Google Cloud SQL PostgreSQL for transactional workloads.
Google Cloud Memorystore for Redis for queuing and caching.
Clickhouse Cloud on GCP for trace ingestion and analytics. Our services connect to Clickhouse Cloud, which is hosted in the same GCP region, via a private endpoint.

Some additional GCP services we use include:

Google Cloud Load Balancer for routing traffic to the LangSmith services.
Google Cloud CDN for caching static assets.
Google Cloud Armor for security and rate limits. For more information on rate limits we enforce, please refer to this guide.

Allowlisting IP addresses

Egress from LangChain SaaS

All traffic leaving LangSmith services will be routed through a NAT gateway. All traffic will appear to originate from the following IP addresses:

US	EU
34.59.65.97	34.13.192.67
34.67.51.221	34.147.105.64
34.46.212.37	34.90.22.166
34.132.150.88	34.147.36.213
35.188.222.201	34.32.137.113
34.58.194.127	34.91.238.184
34.59.97.173	35.204.101.241
104.198.162.55	35.204.48.32

It may be helpful to allowlist these IP addresses if connecting to your own AzureOpenAI service or other endpoints that may be required by the Playground or Online Evaluation.

Ingress into LangChain SaaS

The langchain endpoints map to the following static IP addresses:

US	EU
34.8.121.39	34.95.92.214
34.107.251.234	34.13.73.122

You may need to allowlist these to enable traffic from your private network to LangSmith SaaS endpoints (api.smith.langchain.com, smith.langchain.com, beacon.langchain.com, eu.api.smith.langchain.com, eu.smith.langchain.com, eu.beacon.langchain.com).

API rate limits

LangSmith enforces rate limits on API endpoints to ensure service stability and fair usage. The following table shows the rate limits for different endpoints in both US and EU regions. Note that:

Rate limits are expressed as count / interval where count is the number of requests allowed within the interval (in seconds). For example, 2000 / 10 means 2000 requests per 10 seconds.
When no HTTP method is specified in the endpoint column, the rate limit applies to all HTTP methods for that endpoint.
When a specific method is listed (e.g., POST, GET), the rate limit applies only to that method.

Match / Endpoint (method)	Identity key	US prod limit	EU prod limit	Category
OPTIONS, `/info`, `*/v1/metadata/submit`	IP	2000 / 10	2000 / 10	High throughput
`/auth`	`x-api-key`	2000 / 10	2000 / 10	High throughput
`/auth`	`x-user-id` + IP	2000 / 10	2000 / 10	High throughput
`/v1/beacon`	IP	2000 / 10	2000 / 10	High throughput
`/repos`	`x-api-key`	100 / 60	100 / 60	Repository
`/repos`	`x-user-id` + IP	100 / 60	100 / 60	Repository
`POST /runs/batch`	`x-api-key`	2000 / 10	2000 / 10	High throughput
`POST /otel/v1/traces`	`x-api-key`	2000 / 10	2000 / 10	Run ingest
`POST` containing `/charts`	`x-api-key`	750 / 600	750 / 600	Charts
`POST` containing `/charts`	`x-user-id` + IP	750 / 600	750 / 600	Charts
`POST /runs/multipart`	`x-api-key`	6000 / 10	6000 / 10	Multipart ingest
`POST /runs/query`	`x-api-key`	15 / 10	15 / 10	Run query (API)
`POST /runs/query`	`x-user-id` + IP	300 / 10	300 / 10	Run query (User)
`/generate`	`x-api-key`	30 / 3600	30 / 3600	Generation
`/generate`	`x-user-id` + IP	30 / 3600	30 / 3600	Generation
`/commits`	`x-api-key`	10000 / 60	2000 / 60	Commits
`/commits`	`x-user-id` + IP	10000 / 60	2000 / 60	Commits
`DELETE /sessions` or `*/trigger`	`x-api-key`	10 / 60	10 / 60	Deletion
`DELETE /sessions` or `*/trigger`	`x-user-id` + IP	30 / 60	30 / 60	Deletion
`POST /runs` (single run ingest)	`x-api-key`	2000 / 10	2000 / 10	Run ingest
`PATCH` containing `/runs`	`x-api-key`	2000 / 10	2000 / 10	Run ingest
`POST /feedback`	`x-api-key`	2000 / 10	2000 / 10	High throughput
`GET /runs/{uuid}` or `/api/v1/runs/{uuid}`	`x-api-key`	30 / 60	30 / 60	Run lookup
`GET` containing `/examples`	`x-api-key`	5000 / 60	5000 / 60	Examples
Any request with `x-api-key`	`x-api-key`	1000 / 10	1000 / 10	Default (API key)
Any request with `x-user-id`	`x-user-id` + IP	1000 / 10	1000 / 10	Default (User)
`/public/download`	IP	5000 / 60	5000 / 60	Public download
`/runs/stats`	`x-api-key`	1 / 10	20 / 10	Stats
All other IPs (catch-all)	IP	100 / 60	100 / 60	Public (catch-all)

Rate limit categories

High throughput: General high-volume endpoints for core operations like authentication, metadata, and feedback.
Repository: Repository and prompt management operations.
Run ingest: Individual trace/run ingestion endpoints for observability.
Charts: Chart generation and visualization endpoints.
Multipart ingest: Bulk run ingestion via multipart upload for high-volume tracing.
Run query (API): API key-based run query operations with stricter limits for complex queries.
Run query (User): User-based run query operations with higher limits for interactive use.
Generation: AI-powered code and content generation endpoints (limited to prevent abuse).
Commits: Prompt versioning and commit operations.
Deletion: Session deletion and workflow trigger operations.
Run lookup: Retrieving specific runs by UUID.
Examples: Fetching dataset examples for few-shot prompting.
Default (API key): Fallback rate limit for authenticated API requests not matching specific patterns.
Default (User): Fallback rate limit for authenticated user requests not matching specific patterns.
Public download: High-volume public download endpoints for shared resources.
Stats: Run statistics and analytics endpoints (region-specific limits apply).
Public (catch-all): Default rate limit for unauthenticated public access.

For more information on rate limits and other service limits, refer to the Administration overview.

Edit this page on GitHub or file an issue.

Connect these docs to Claude, VSCode, and more via MCP for real-time answers.

Overview

Hybrid

Self-hosted

Get started

Cloud architecture and scalability

Architecture

Regional storage

Region-independent storage

GCP services

Allowlisting IP addresses

Egress from LangChain SaaS

Ingress into LangChain SaaS

API rate limits

Rate limit categories

Overview

Hybrid

Self-hosted

​Get started

​Cloud architecture and scalability

​Architecture

​Regional storage

​Region-independent storage

​GCP services

​Allowlisting IP addresses

​Egress from LangChain SaaS

​Ingress into LangChain SaaS

​API rate limits

​Rate limit categories

Get started

Cloud architecture and scalability

Architecture

Regional storage

Region-independent storage

GCP services

Allowlisting IP addresses

Egress from LangChain SaaS

Ingress into LangChain SaaS

API rate limits

Rate limit categories