Tracing

Tracing is the foundation of Marlo. It captures everything your agent does during a task, creating a complete record that powers rewards, learnings, and debugging.

Why Tracing Matters

Without tracing, agent failures are invisible. You might see a bad output, but you cannot tell which step caused it, what the model was thinking, or what data the tools returned. Tracing solves this by recording every action in order, so you can replay the entire run and pinpoint exactly where things went wrong.

Tracing also enables Marlo’s learning loop. Rewards are computed from traces. Learnings are generated from rewards. Without complete traces, the system cannot evaluate outcomes or generate useful guidance.

What Tracing Captures

For each task, Marlo records:

Agent Definition: The system prompt, available tools, and model configuration for the agent handling the task.
Task Input: The user request or message that started the task.
LLM Calls: Every call to a language model, including the messages sent, the model used, the response returned, and token usage.
Tool Calls: Every tool invocation, including the tool name, input arguments, output result, and any errors.
Reasoning: Internal chain-of-thought or reasoning steps recorded by the agent.
Task Output: The final response returned to the user.
Task Status: Whether the task completed successfully or failed with an error.

How Tracing Works

Tracing happens automatically when you use the SDK:

You wrap your agent logic in a marlo.task() context.
Inside the context, you call methods like task.input(), task.llm(), task.tool(), and task.output().
Each method records an event with a timestamp and the data you provide.
When the context exits, all events are sent to Marlo as a batch.

Events are linked together by a shared session ID (derived from your thread ID) and a unique task ID. This allows Marlo to group related events and reconstruct the full execution timeline.

Viewing Traces in the Dashboard

Once events are sent, you can view the trace in the Marlo dashboard:

Navigate to your project.
Select a thread from the list to see all tasks in that conversation.
Click on a task to see the full trace, including every LLM call, tool call, and the final output.
Expand individual events to see details like token usage, model parameters, and tool outputs.

The dashboard displays events in chronological order, making it easy to follow the agent’s decision-making process from start to finish.

Multi-Agent Tracing

For systems with multiple agents, Marlo traces each agent separately while preserving the parent-child relationship. When you use task.child() to delegate to a sub-agent, the child’s events are linked to the parent task. This creates a hierarchy in the dashboard, showing how the orchestrator delegated work and how each agent contributed to the final result.