This page covers all LangChain integrations with Hugging Face Hub and libraries like transformers, sentence transformers, and datasets.Documentation Index
Fetch the complete documentation index at: https://docs.langchain.com/llms.txt
Use this file to discover all available pages before exploring further.
Chat models
ChatHuggingFace
We can use theHugging Face LLM classes or directly use the ChatHuggingFace class.
See a usage example.
LLMs
HuggingFaceEndpoint
We can use theHuggingFaceEndpoint class to run open source models via serverless Inference Providers or via dedicated Inference Endpoints.
See a usage example.
HuggingFacePipeline
We can use theHuggingFacePipeline class to run open source models locally.
See a usage example.
Embedding models
HuggingFaceEmbeddings
We can use theHuggingFaceEmbeddings class to run open source embedding models locally.
See a usage example.
HuggingFaceEndpointEmbeddings
We can use theHuggingFaceEndpointEmbeddings class to run open source embedding models via a dedicated Inference Endpoint.
See a usage example.
Text Embeddings Inference (TEI)
For self-hosted production serving of Sentence Transformers models, Hugging Face publishes Text Embeddings Inference, a dedicated inference server with batching and GPU support. Point LangChain at a TEI deployment viaHuggingFaceEndpointEmbeddings or see the dedicated TEI integration guide.
BGE embedding models
BGE models on Hugging Face are a strong open-source embedding family from the Beijing Academy of Artificial Intelligence (BAAI).BGE models are Sentence Transformers models, so use
HuggingFaceEmbeddings with encode_kwargs={"normalize_embeddings": True}. See a usage example.
Connect these docs to Claude, VSCode, and more via MCP for real-time answers.

