Learnings

Learnings are reusable guidance generated from past task outcomes. They capture what works and what to avoid, so your agent improves automatically over time.

Why Learnings Matter

Agents make the same mistakes repeatedly because they have no memory of past failures. Each task starts fresh, with no knowledge of what went wrong before.

Learnings solve this by extracting patterns from rewards and turning them into actionable guidance. When your agent fails a task, Marlo generates a learning that describes what to do differently. The next time a similar task runs, that learning is injected into the agent’s context, helping it avoid the same mistake.

This creates a continuous improvement loop: failures become lessons, and lessons prevent future failures.

How Learnings Work

Learnings are generated automatically after a task receives a reward:

Reward Analysis: Marlo examines the reward rationale to understand what went wrong or what could be improved.
Learning Generation: A model extracts actionable guidance from the rationale, describing what the agent should do in similar situations.
Confidence Assignment: Each learning receives a confidence score between 0.0 and 1.0, indicating how reliable the guidance is based on supporting evidence.
Storage: The learning is stored and associated with the agent that produced it.
Application: When a new task starts, active learnings for that agent are fetched and can be injected into the agent’s context.

What a Learning Contains

Each learning includes:

Learning: The guidance itself, written as a clear instruction (e.g., “Always verify order ID format before calling lookup_order”).
Expected Outcome: What improvement this learning should produce (e.g., “Reduces tool call failures”).
Confidence: A score between 0.0 and 1.0 indicating how reliable this learning is.
Basis: The evidence or reasoning behind the learning.
Status: Whether the learning is active, retired, or pending review.

Fetching Learnings

Use task.get_learnings() to fetch active learnings for the current agent:


with marlo.task(thread_id="user-123", agent="support-agent") as task:
    task.input(user_message)
 
    # Fetch learnings
    learnings = task.get_learnings()
 
    # Build system prompt with learnings
    system_prompt = "You are a customer support agent."
    if learnings:
        active = learnings.get("active", [])
        if active:
            learnings_text = "\n".join(
                f"- {obj['learning']}" for obj in active if obj.get("learning")
            )
            if learnings_text:
                system_prompt += f"\n\nLearnings from past interactions:\n{learnings_text}"
 
    # Use system_prompt in your LLM call...

You decide where to place learnings in your agent’s context: in the system prompt, as a separate context block, or anywhere else that makes sense for your agent.

How Learnings Evolve

Learnings are not static. As more tasks run and more rewards are generated, Marlo updates existing learnings with new evidence:

Confidence Increases: When a learning is applied and the task succeeds, the confidence score increases.
Confidence Decreases: When a learning is applied and the task still fails, the confidence score decreases.
Refinement: If new evidence suggests a learning should be modified, Marlo updates the guidance while preserving the learning ID.
Retirement: Learnings with very low confidence are eventually retired and no longer surfaced.

Managing Learnings in the Dashboard

The dashboard provides tools to view, create, edit, and manage learnings for each agent.

Viewing Learnings

Navigate to your project in the dashboard.
Click Learnings in the sidebar.
Filter by agent to see learnings for a specific agent.
Click on a learning to see its full details, including the basis and linked tasks.

Creating a Learning Manually

Sometimes you know guidance that the system hasn’t discovered yet:

Click Create Learning in the Learnings view.
Select the target agent.
Enter the learning text (clear, actionable instruction).
Enter the expected outcome (what improvement this should produce).
Optionally set an initial confidence score.
Click Save.

Manual learnings are marked as “User Created” and follow the same confidence evolution as auto-generated learnings.

Editing a Learning

To refine or correct a learning:

Click on the learning in the Learnings view.
Click Edit.
Update the learning text, expected outcome, or confidence.
Add a note explaining the change.
Click Save.

Edits are tracked in the learning’s history.

Activating and Retiring Learnings

Control which learnings are surfaced to your agent:

Click on a learning to open its details.
Use the Status dropdown to change between:
- Active: Learning is included in get_learnings() responses.
- Retired: Learning is no longer surfaced but preserved for history.
- Pending: Learning is under review and not yet surfaced.
Click Save.

Bulk actions are available for managing multiple learnings at once.

Reviewing Learning Effectiveness

Track how well learnings are working:

Navigate to the Analytics section.
View success rates for tasks that applied each learning.
Identify learnings that consistently improve outcomes.
Flag learnings that don’t seem to help for manual review.

Exporting and Importing Learnings

For sharing learnings across projects or backing up:

In the Learnings view, click Export.
Select the learnings to export (or export all).
Download as JSON or CSV.

To import:

Click Import in the Learnings view.
Upload your file.
Review the learnings before confirming import.
Duplicates are detected and skipped.

Per-Agent Learnings

Learnings are scoped to individual agents. Each agent has its own set of learnings based on its own task history. In multi-agent systems, the orchestrator, researcher, and writer each develop their own learnings independently.

This means specialized agents get specialized guidance. A researcher agent learns how to search effectively, while a writer agent learns how to structure summaries clearly.

Best Practices

Writing Effective Manual Learnings

Be specific: “Verify order ID matches format ORD-XXXXX before calling lookup_order” is better than “Check inputs carefully”.
Focus on actions: Describe what the agent should do, not what went wrong.
Include context: Mention when the learning applies (e.g., “When handling refund requests…”).
Keep it concise: One clear instruction per learning.

Monitoring Learning Health

Review learnings monthly to retire outdated guidance.
Watch for conflicting learnings that might confuse the agent.
Track confidence scores—learnings stuck at low confidence may need editing.
Delete duplicate or redundant learnings to keep the context clean.

Iterating on Learnings

Start with auto-generated learnings from low-scoring tasks.
Review and refine the generated text for clarity.
Test the agent to see if the learning helps.
Adjust confidence based on observed results.
Retire learnings that no longer apply to your agent’s behavior.