# MODEL_RATE_LIMIT

<Note>
  Currently only used in `langchainjs` (JavaScript/TypeScript).
</Note>

You have exceeded the maximum number of requests that your model provider allows within a given time period, and the provider is temporarily rejecting further requests. The restriction is generally temporary and lifts once the limit resets.
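Because the restriction lifts once the limit resets, one common way to cope is to wait and retry. The following is a minimal sketch, not a LangChain API: `isModelRateLimit` and `retryOnRateLimit` are hypothetical helpers, and the check assumes the thrown error carries an `lc_error_code` property equal to `"MODEL_RATE_LIMIT"`. Verify that assumption against the errors your version actually throws.

```typescript
// Assumption: the thrown error exposes an `lc_error_code` string property.
function isModelRateLimit(err: unknown): boolean {
  return (
    typeof err === "object" &&
    err !== null &&
    (err as { lc_error_code?: unknown }).lc_error_code === "MODEL_RATE_LIMIT"
  );
}

// Hypothetical helper: retries `call` with exponential backoff, but only
// for rate-limit errors. Any other error is rethrown immediately.
async function retryOnRateLimit<T>(
  call: () => Promise<T>,
  maxAttempts: number = 5,
  baseDelayMs: number = 1000,
): Promise<T> {
  for (let attempt = 1; ; attempt++) {
    try {
      return await call();
    } catch (err) {
      if (!isModelRateLimit(err) || attempt >= maxAttempts) throw err;
      // Wait baseDelayMs, then 2x, 4x, ... before the next attempt.
      const delayMs = baseDelayMs * 2 ** (attempt - 1);
      await new Promise<void>((resolve) => setTimeout(resolve, delayMs));
    }
  }
}
```

A call site might look like `await retryOnRateLimit(() => model.invoke(prompt))`, where `model` and `prompt` stand in for your own objects. Doubling the delay gives the provider's window progressively more time to reset instead of retrying at a fixed pace.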

## Troubleshooting

To resolve this error, you can:

1. **Implement Rate Limiting**: Deploy a rate limiter to regulate the frequency of requests sent to the model.

2. **Implement Response Caching**: Use model response caching to reduce redundant requests when incoming queries are repetitive.

3. **Use Multiple Providers**: Distribute requests across multiple providers if your application architecture supports this approach.

4. **Contact Your Provider**: Reach out to your model provider to request an increase to your rate limits.
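The first two options can be sketched in plain TypeScript. This is an illustration under stated assumptions, not LangChain's own rate limiter or cache: `MinIntervalLimiter` and `withCache` are hypothetical names, and `callModel` stands in for whatever function sends a prompt to your model.

```typescript
// Hypothetical helper: spaces calls so that at most one starts every
// `minIntervalMs` milliseconds.
class MinIntervalLimiter {
  private readonly minIntervalMs: number;
  private nextSlot = 0;

  constructor(minIntervalMs: number) {
    this.minIntervalMs = minIntervalMs;
  }

  // Reserves the next free slot and returns how long to wait for it (ms).
  reserve(now: number = Date.now()): number {
    const start = Math.max(now, this.nextSlot);
    this.nextSlot = start + this.minIntervalMs;
    return start - now;
  }

  // Waits for the reserved slot, then runs the task.
  async schedule<T>(task: () => Promise<T>): Promise<T> {
    const waitMs = this.reserve();
    if (waitMs > 0) {
      await new Promise<void>((resolve) => setTimeout(resolve, waitMs));
    }
    return task();
  }
}

// Hypothetical helper: wraps a model call with an in-memory cache keyed by
// the exact prompt string, so repeated prompts do not trigger new requests.
function withCache(
  callModel: (prompt: string) => Promise<string>,
): (prompt: string) => Promise<string> {
  const cache = new Map<string, Promise<string>>();
  return (prompt: string): Promise<string> => {
    let pending = cache.get(prompt);
    if (pending === undefined) {
      pending = callModel(prompt);
      cache.set(prompt, pending);
      // Drop failed calls from the cache so they can be attempted again.
      pending.catch(() => cache.delete(prompt));
    }
    return pending;
  };
}
```

The two helpers compose: wrapping `(prompt) => limiter.schedule(() => callModel(prompt))` in `withCache` means only cache misses consume a rate-limited slot. An exact-match, unbounded in-memory cache is the simplest possible choice here; a real application would likely need eviction and a more deliberate cache key.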

