Sometimes it is useful for a custom evaluator or summary evaluator to return multiple metrics. For example, if an LLM judge generates several metrics, a single LLM call that produces all of them saves time and money compared with one call per metric. To return multiple scores using the Python SDK, return a list of dictionaries/objects, one per metric.
- Python: Requires `langsmith>=0.2.0`
- TypeScript: Support for multiple scores is available in `langsmith@0.1.32` and higher
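
As an illustration of the list-of-dictionaries return shape, here is a minimal sketch of a custom evaluator that produces several metrics in one pass. It assumes the common LangSmith convention that each entry carries a `key` (the metric name) plus either a `score` (numerical metric) or a `value` (categorical metric). The `labels` field, the `verdict` metric, and the 0.5 threshold are hypothetical and chosen only for the example; a real evaluator would substitute its own logic, such as a single LLM-judge call.

```python
def multiple_scores(outputs: dict, reference_outputs: dict) -> list[dict]:
    """Compute several metrics in one pass and return one dict per metric."""
    # "labels" is a hypothetical field name; use whatever your run outputs
    # and reference outputs actually contain.
    predicted = set(outputs.get("labels", []))
    expected = set(reference_outputs.get("labels", []))

    true_positives = len(predicted & expected)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(expected) if expected else 0.0
    denominator = precision + recall
    f1 = 2 * precision * recall / denominator if denominator else 0.0

    return [
        # Numerical metrics: "key" names the metric, "score" holds its value.
        {"key": "precision", "score": precision},
        {"key": "recall", "score": recall},
        {"key": "f1", "score": f1},
        # Categorical metric: "value" holds a string instead of a number.
        {"key": "verdict", "value": "pass" if f1 >= 0.5 else "fail"},
    ]
```

A function like this would typically be passed to the SDK's `evaluate` call through its `evaluators` argument, in the same way as a single-metric evaluator.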


