
BGE models on Hugging Face are a family of open-source embedding and reranking models published by the Beijing Academy of Artificial Intelligence (BAAI). BGE was one of the leading open-source embedding families in 2023 and 2024. Newer models on the MTEB leaderboard have since surpassed it on raw retrieval scores, but BGE (and BAAI/bge-m3 in particular) remains a widely used, well-balanced default for multilingual retrieval.
LangChain provides two ways to use BGE models:
  • HuggingFaceEmbeddings from langchain-huggingface: the generic Sentence Transformers class. Covers every BGE variant and is the recommended choice for new projects.
  • HuggingFaceBgeEmbeddings from langchain-community: a BGE-specific wrapper that automatically prepends the query instruction used by the older English v1.5 models. Convenient when you specifically want a v1.5 model and don’t want to manage the prompt yourself.

BAAI/bge-m3 and newer

pip install -qU langchain-huggingface
from langchain_huggingface import HuggingFaceEmbeddings

embeddings = HuggingFaceEmbeddings(
    model_name="BAAI/bge-m3",
    encode_kwargs={"normalize_embeddings": True},
)
BAAI/bge-m3 is trained without a query prompt, so no extra configuration is needed. normalize_embeddings=True is recommended for cosine similarity, per the model authors.
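A quick smoke test using the embeddings object from above (the query and document strings are illustrative):

vector = embeddings.embed_query("What is BGE-M3?")
print(len(vector))  # bge-m3 produces 1024-dimensional dense vectors

doc_vectors = embeddings.embed_documents(["BGE-M3 supports retrieval in 100+ languages."])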

BAAI/bge-*-en-v1.5 (quick path)

The older English v1.5 models expect the query to be prefixed with an instruction. The dedicated HuggingFaceBgeEmbeddings class handles that automatically:
pip install -qU langchain-community sentence-transformers
from langchain_community.embeddings import HuggingFaceBgeEmbeddings

embeddings = HuggingFaceBgeEmbeddings(
    model_name="BAAI/bge-large-en-v1.5",
    encode_kwargs={"normalize_embeddings": True},
)
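The class also exposes the instruction string directly via its query_instruction field, so you can override the built-in default. A minimal sketch, using the wording from the v1.5 model card (which differs slightly from the class's default):

embeddings = HuggingFaceBgeEmbeddings(
    model_name="BAAI/bge-large-en-v1.5",
    encode_kwargs={"normalize_embeddings": True},
    # Override the default instruction with the model card's exact wording
    query_instruction="Represent this sentence for searching relevant passages: ",
)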
The same configuration is possible through the generic HuggingFaceEmbeddings class, with the instruction passed explicitly via query_encode_kwargs:
from langchain_huggingface import HuggingFaceEmbeddings

embeddings = HuggingFaceEmbeddings(
    model_name="BAAI/bge-large-en-v1.5",
    encode_kwargs={"normalize_embeddings": True},
    query_encode_kwargs={
        "prompt": "Represent this sentence for searching relevant passages: ",
        "normalize_embeddings": True,
    },
)
Different BGE variants use different prompts; check each model card on Hugging Face for the exact string.
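As a quick check that the prompt is applied only on the query side (the strings and scores here are illustrative, not benchmarks): with normalize_embeddings=True, the dot product of two vectors equals their cosine similarity.

import numpy as np

# embed_query applies the query instruction; embed_documents does not
query_vec = embeddings.embed_query("how to bake sourdough bread")
doc_vecs = embeddings.embed_documents([
    "Sourdough needs a long, slow fermentation.",
    "BGE models are published by BAAI.",
])

# Normalized vectors: dot product == cosine similarity
scores = [float(np.dot(query_vec, v)) for v in doc_vecs]
print(scores)  # the sourdough passage should score higher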

Picking a BGE model

| Model | Params | Notes |
|---|---|---|
| BAAI/bge-small-en-v1.5 | 33M | Smallest English model, CPU-friendly |
| BAAI/bge-large-en-v1.5 | 335M | Stronger English, widely used baseline |
| BAAI/bge-m3 | 570M | Multilingual; dense, sparse, and multi-vector in one model |
For reranking (not embedding), see BAAI/bge-reranker-v2-m3 via the Cross Encoder Reranker guide.
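For orientation, a minimal reranking sketch (assumes an existing retriever named base_retriever; class paths are those in langchain and langchain-community at the time of writing):

from langchain.retrievers import ContextualCompressionRetriever
from langchain.retrievers.document_compressors import CrossEncoderReranker
from langchain_community.cross_encoders import HuggingFaceCrossEncoder

reranker_model = HuggingFaceCrossEncoder(model_name="BAAI/bge-reranker-v2-m3")
compressor = CrossEncoderReranker(model=reranker_model, top_n=3)

# base_retriever is assumed to be defined elsewhere (e.g., a vector store retriever)
retriever = ContextualCompressionRetriever(
    base_compressor=compressor, base_retriever=base_retriever
)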

More

See the Sentence Transformers integration page for GPU configuration, batch sizes, query/document prompts, and deployment options.
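As a starting point, a sketch of the common knobs (the device string assumes a CUDA GPU; batch_size is passed through to sentence-transformers' encode and should be tuned to available memory):

from langchain_huggingface import HuggingFaceEmbeddings

embeddings = HuggingFaceEmbeddings(
    model_name="BAAI/bge-m3",
    model_kwargs={"device": "cuda"},  # or "cpu" / "mps"
    encode_kwargs={
        "normalize_embeddings": True,
        "batch_size": 64,  # larger batches speed up bulk indexing
    },
)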