When you call an LLM directly, outside of LangChain or a supported integration, you need to provide specific metadata so that LangSmith can display token counts, calculate costs, and let you open the run in the Playground with the correct provider and model. There are four requirements for a fully functional LLM trace:
  1. Set run_type="llm": pass run_type="llm" to @traceable. Enables LLM-specific rendering and token/cost display.
  2. Format inputs/outputs: use the OpenAI, Anthropic, or LangChain message format. Enables structured message rendering and Playground support.
  3. Set ls_provider and ls_model_name: pass both in metadata. Enables cost tracking and Playground model selection.
  4. Provide token counts: set usage_metadata on the run. Enables token counts and cost calculation.
If you are using LangChain OSS, the OpenAI wrapper, or the Anthropic wrapper, these details are handled automatically. The examples on this page use the traceable decorator/wrapper (the recommended approach for Python and JS/TS); the same requirements apply if you use the RunTree or the API directly.
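Putting the four requirements together, a traced custom LLM call looks roughly like the following sketch (the provider name, model name, and token counts are placeholders; each requirement is covered in detail below):

from langsmith import traceable

@traceable(
    run_type="llm",  # 1. LLM run type
    metadata={"ls_provider": "my_provider", "ls_model_name": "my_model"},  # 3. provider and model
)
def chat_model(messages: list):
    # ... call your custom model here ...
    return {
        # 2. LangChain-style message output (OpenAI or Anthropic formats also work)
        "messages": [
            {"role": "assistant", "content": [{"type": "text", "text": "The capital of France is Paris."}]}
        ],
        # 4. token counts for cost calculation
        "usage_metadata": {"input_tokens": 12, "output_tokens": 7, "total_tokens": 19},
    }

chat_model([{"role": "user", "content": "What is the capital of France?"}])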

Messages format

When tracing a custom model or a custom input/output format, the inputs and outputs must follow the LangChain message format, the OpenAI Chat Completions format, or the Anthropic Messages format. For more details, refer to the OpenAI Chat Completions or Anthropic Messages documentation. The LangChain format is:
inputs = {
  "messages": [
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "Hi, can you tell me the capital of France?"
        }
      ]
    }
  ]
}

outputs = {
  "messages": [
    {
      "role": "assistant",
      "content": [
        {
          "type": "text",
          "text": "The capital of France is Paris."
        },
        {
          "type": "reasoning",
          "text": "The user is asking about..."
        }
      ]
    }
  ]
}
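
For comparison, the same exchange expressed in the OpenAI Chat Completions style looks roughly like this (a sketch, not the full schema; see the OpenAI documentation for all fields):

inputs = {
  "messages": [
    {"role": "user", "content": "Hi, can you tell me the capital of France?"}
  ]
}

outputs = {
  "choices": [
    {
      "message": {
        "role": "assistant",
        "content": "The capital of France is Paris."
      }
    }
  ]
}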

Convert custom I/O formats into LangSmith-compatible formats

If you’re using a custom input or output format, you can convert it to a LangSmith-compatible format using the process_inputs/processInputs and process_outputs/processOutputs options on the @traceable decorator (Python) or traceable function (TS). These options accept functions that transform a trace’s inputs and outputs before they are logged to LangSmith: each function receives the raw inputs or outputs and returns a new dictionary with the processed data. Here’s a boilerplate example of how to use process_inputs and process_outputs to convert a custom I/O format into a LangSmith-compatible format:
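The sketch below assumes a custom model that takes a plain prompt string and returns a plain string; the converter functions and their names are illustrative, not part of the SDK:

from langsmith import traceable

def convert_inputs(inputs: dict) -> dict:
    # `inputs` holds the traced function's arguments, e.g. {"prompt": "..."}.
    # Return a LangSmith-compatible messages payload instead.
    return {"messages": [{"role": "user", "content": inputs["prompt"]}]}

def convert_outputs(output) -> dict:
    # `output` is whatever the traced function returned, here a plain string.
    # Return an OpenAI-style output so LangSmith can render it as a chat message.
    return {"choices": [{"message": {"role": "assistant", "content": output}}]}

@traceable(
    run_type="llm",
    process_inputs=convert_inputs,
    process_outputs=convert_outputs,
    metadata={"ls_provider": "my_provider", "ls_model_name": "my_model"},
)
def my_custom_model(prompt: str) -> str:
    # Call your custom model here; this stub returns a fixed string.
    return "The capital of France is Paris."

my_custom_model("Hi, can you tell me the capital of France?")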

Identify a custom model in traces

When using a custom model, it is recommended to also provide the following metadata fields so that you can identify the model when viewing and filtering traces.
  • ls_provider: The provider of the model, e.g. “openai”, “anthropic”, etc.
  • ls_model_name: The name of the model, e.g. “gpt-4.1-mini”, “claude-3-opus-20240229”, etc.
from langsmith import traceable

inputs = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "I'd like to book a table for two."},
]
output = {
    "choices": [
        {
            "message": {
                "role": "assistant",
                "content": "Sure, what time would you like to book the table for?"
            }
        }
    ]
}

@traceable(
    run_type="llm",
    metadata={"ls_provider": "my_provider", "ls_model_name": "my_model"}
)
def chat_model(messages: list):
    return output

chat_model(inputs)
This code will log the following trace:
(Screenshot: LangSmith UI showing the logged LLM call trace with a system and human input followed by an AI output.)
If you implement a custom streaming chat_model, you can “reduce” the outputs into the same format as the non-streaming version. This is currently only supported in Python.
def _reduce_chunks(chunks: list):
    all_text = "".join([chunk["choices"][0]["message"]["content"] for chunk in chunks])
    return {"choices": [{"message": {"content": all_text, "role": "assistant"}}]}

@traceable(
    run_type="llm",
    reduce_fn=_reduce_chunks,
    metadata={"ls_provider": "my_provider", "ls_model_name": "my_model"}
)
def my_streaming_chat_model(messages: list):
    for chunk in ["Hello, " + messages[1]["content"]]:
        yield {
            "choices": [
                {
                    "message": {
                        "content": chunk,
                        "role": "assistant",
                    }
                }
            ]
        }

list(
    my_streaming_chat_model(
        [
            {"role": "system", "content": "You are a helpful assistant. Please greet the user."},
            {"role": "user", "content": "polly the parrot"},
        ],
    )
)
If ls_model_name is not present in extra.metadata, LangSmith falls back to other fields in extra.metadata when estimating token counts. The fields are checked in the following order of precedence:
  1. metadata.ls_model_name
  2. inputs.model
  3. inputs.model_name
To learn more about how to use the metadata fields, refer to the Add metadata and tags guide.

Provide token and cost information

Token counts enable cost calculation and are displayed in the trace UI. There are two ways to provide them:
  • Set usage_metadata on the run tree: call get_current_run_tree() / getCurrentRunTree() inside your @traceable function and set the usage_metadata field. This does not change your function’s return value.
  • Return usage_metadata in the output: include usage_metadata as a top-level key in the dictionary your function returns, as shown in the sketch below.
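For example, the second approach only requires adding a top-level key to the return value (the token counts here are placeholders):

from langsmith import traceable

@traceable(
    run_type="llm",
    metadata={"ls_provider": "my_provider", "ls_model_name": "my_model"},
)
def chat_model(messages: list):
    # ... call your model here ...
    return {
        "choices": [
            {"message": {"role": "assistant", "content": "Sure, what time works for you?"}}
        ],
        # LangSmith reads this top-level key for token counts and cost calculation.
        "usage_metadata": {
            "input_tokens": 27,
            "output_tokens": 13,
            "total_tokens": 40,
        },
    }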

Supported usage_metadata fields

  • input_tokens (int): Total input/prompt tokens.
  • output_tokens (int): Total output/completion tokens.
  • total_tokens (int): Sum of input and output tokens (optional; can be inferred).
  • input_token_details (object): Breakdown by cache_read, cache_creation, audio, text, and image tokens.
  • output_token_details (object): Breakdown by reasoning, audio, text, and image tokens.
To send costs directly (for non-linear pricing), you can also include input_cost, output_cost, and total_cost fields. See Cost tracking for details on configuring model pricing and viewing costs in the UI.
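Assuming the cost fields are included alongside the token fields in usage_metadata, a fuller payload might look like this (all numbers are illustrative):

usage_metadata = {
    "input_tokens": 1100,
    "output_tokens": 250,
    "total_tokens": 1350,
    "input_token_details": {"cache_read": 800},   # tokens served from the prompt cache
    "output_token_details": {"reasoning": 200},   # reasoning/thinking tokens
    "input_cost": 0.0011,    # USD; overrides per-token pricing
    "output_cost": 0.0025,
    "total_cost": 0.0036,
}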

Time-to-first-token

If you are using traceable or one of our SDK wrappers, LangSmith will automatically populate time-to-first-token for streaming LLM runs. However, if you are using the RunTree API directly, you will need to add a new_token event to the run tree in order to properly populate time-to-first-token. Here’s an example:
from langsmith.run_trees import RunTree

run_tree = RunTree(
    name="CustomChatModel",
    run_type="llm",
    inputs={ ... }
)
run_tree.post()

llm_stream = ...

first_token = None
for token in llm_stream:
    if first_token is None:
        first_token = token
        run_tree.add_event({"name": "new_token"})

run_tree.end(outputs={ ... })
run_tree.patch()