As of today (Jan 25th, 2024) BaichuanTextEmbeddings ranks #1 in C-MTEB (Chinese Multi-Task Embedding Benchmark) leaderboard.Leaderboard (Under Overall -> Chinese section): https://huggingface.co/spaces/mteb/leaderboardOfficial Website: https://platform.baichuan-ai.com/docs/text-EmbeddingAn API key is required to use this embedding model. You can get one by registering at https://platform.baichuan-ai.com/docs/text-Embedding.BaichuanTextEmbeddings support 512 token window and preduces vectors with 1024 dimensions.Please NOTE that BaichuanTextEmbeddings only supports Chinese text embedding. Multi-language support is coming soon.
Copy
Ask AI
from langchain_community.embeddings import BaichuanTextEmbeddingsembeddings = BaichuanTextEmbeddings(baichuan_api_key="sk-*")