BasetenEmbeddings
features and configuration options, please refer to the API reference.
Overview
Baseten provides inference designed for production applications. Built on the Baseten Inference Stack, these APIs deliver enterprise-grade performance and reliability for leading open-source or custom models: https://www.baseten.co/library/.Integration details
Setup
To access Baseten embedding models you’ll need to create a Baseten account, get an API key, and install thelangchain-baseten
integration package.
Baseten embeddings are only available as dedicated models. You must deploy an embedding model from the Baseten model library before using this integration.
The embeddings functionality uses Baseten’s Performance Client for optimized performance, which is automatically included as a dependency.
Credentials
Head to https://app.baseten.co to sign up to Baseten and generate an API key. Once you’ve done this set the BASETEN_API_KEY environment variable:Installation
The LangChain Baseten integration lives in thelangchain-baseten
package:
Instantiation
Now we can instantiate our embeddings object using your deployed model’s URL:Indexing and Retrieval
Embedding models are often used in retrieval-augmented generation (RAG) flows, both as part of indexing data as well as later retrieving it. For more detailed instructions, please see our RAG tutorials. Below, see how to index and retrieve data using theembeddings
object we initialized above. In this example, we will index and retrieve a sample document in the InMemoryVectorStore
.
Direct Usage
Under the hood, the vectorstore and retriever implementations are callingembeddings.embed_documents(...)
and embeddings.embed_query(...)
to create embeddings for the text(s) used in from_texts
and retrieval invoke
operations, respectively.
You can directly call these methods to get embeddings for your own use cases.
Embed single texts
You can embed single texts or documents withembed_query
:
Embed multiple texts
You can embed multiple texts withembed_documents
:
API Reference
For detailed documentation onBasetenEmbeddings
features and configuration options, please refer to the API reference.
Connect these docs programmatically to Claude, VSCode, and more via MCP for real-time answers.