AI agents that get smarter with every task
⭐ Star this repo if you find it useful!
ACE enables AI agents to learn from their execution feedback (what works, what doesn't) and continuously improve. No fine-tuning, no training data, just automatic in-context learning.
The framework maintains a Skillbook: a living document of strategies that evolves with each task. When your agent succeeds, ACE extracts patterns. When it fails, ACE learns what to avoid. All learning happens transparently in context.
- Self-Improving: Agents autonomously get smarter with each task
- 20-35% Better Performance: Proven improvements on complex tasks
- 49% Token Reduction: Demonstrated in browser automation benchmarks
- No Context Collapse: Preserves valuable knowledge over time
- Direct your favorite coding agent (Cursor, Claude Code, Codex, etc.) to the Quick Start Guide
- Prompt away!
```bash
pip install ace-framework
export OPENAI_API_KEY="your-api-key"
```

```python
from ace import ACELiteLLM

agent = ACELiteLLM(model="gpt-4o-mini")
answer = agent.ask("What does Kayba's ACE framework do?")
print(answer)  # "ACE allows AI agents to remember and learn from experience!"
```

Done! Your agent learns automatically from each interaction.
→ Quick Start Guide | → Setup Guide
Wrap your existing agent (browser-use, LangChain, custom) with ACE learning. Your agent executes tasks normally while ACE analyzes results and builds a skillbook of effective strategies.
The Skillbook acts as an evolving system prompt that automatically improves based on execution feedback; no manual prompt engineering required.
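To make the wrapping pattern concrete, here is a framework-agnostic sketch of the loop ACE runs around your agent. The class and method names below are illustrative stand-ins, not ACE's actual API; use the wrapper classes from the integrations table for real code.

```python
# Illustrative sketch of the wrap-and-learn pattern; these names are
# NOT ACE's real API (see the integrations table for the actual classes).

class LearningWrapper:
    """Wraps any callable agent and threads a skillbook through each run."""

    def __init__(self, agent_fn):
        self.agent_fn = agent_fn  # existing agent: (task, context) -> str
        self.skillbook = []       # learned strategies, injected as context

    def run(self, task: str) -> str:
        # 1. Inject learned skills as an evolving "system prompt".
        context = "\n".join(self.skillbook)
        result = self.agent_fn(task, context)
        # 2. In ACE, a Reflector LLM analyzes the execution here; this
        #    stub just records a trivial lesson to show the mechanism.
        self.skillbook.append(f"Attempted '{task}'; outcome: {result[:60]}")
        return result

# Any existing agent works; this stand-in just echoes its inputs.
echo_agent = lambda task, ctx: f"done (saw {len(ctx)} chars of learned context)"
wrapper = LearningWrapper(echo_agent)
print(wrapper.run("first task"))   # runs with an empty skillbook
print(wrapper.run("second task"))  # now sees the lesson from the first run
```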
Run coding tasks with Claude Code while ACE learns patterns from each execution, building expertise over time for your specific codebase and workflows.
Create new agents with built-in learning for customer support, data extraction, code generation, research, content creation, and task automation.
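For example, a new support-style agent needs nothing beyond the quick-start API shown above; the model choice and prompts here are illustrative:

```python
from ace import ACELiteLLM

# Any LiteLLM-supported model works; gpt-4o-mini is just an example.
support_agent = ACELiteLLM(model="gpt-4o-mini")

# Each call runs the full loop: answer using the current skillbook,
# reflect on the execution, then update the skillbook.
print(support_agent.ask("A customer's CSV export hangs at 99%. What should I check first?"))

# Later interactions can draw on skills learned from earlier ones.
print(support_agent.ask("Another customer reports the same stuck export. Fastest fix?"))
```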
A challenge where LLMs often hallucinate that a seahorse emoji exists (it doesn't).
In this example:
- The agent incorrectly outputs a horse emoji
- ACE reflects on the mistake without external feedback
- On the second attempt, the agent correctly realizes there is no seahorse emoji
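This two-attempt pattern is easy to reproduce with the quick-start agent; the prompt wording is illustrative, and the exact outputs depend on the model:

```python
from ace import ACELiteLLM

agent = ACELiteLLM(model="gpt-4o-mini")

# Attempt 1: the model may confidently produce a horse or fish emoji.
print(agent.ask("Show me the seahorse emoji."))

# ACE reflects on that execution automatically, so the second attempt
# runs with a skillbook entry about the non-existent emoji.
print(agent.ask("Show me the seahorse emoji."))
```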
Online Shopping Demo: ACE vs baseline agent shopping for 5 grocery items.
In this example:
- ACE learns to navigate the website over 10 attempts
- Performance stabilizes and step count decreases by 29.8%
- Token costs drop by 49.0% for the base agent, and by 42.6% when ACE overhead is included
→ Try it yourself & see all demos
In this example, Claude Code is enhanced with ACE and self-reflects after each execution while translating the ACE library from Python to TypeScript.
Python → TypeScript Translation:
| Metric | Result |
|---|---|
| Duration | ~4 hours |
| Commits | 119 |
| Lines written | ~14k |
| Outcome | Zero build errors, all tests passing |
| API cost | ~$1.50 (Sonnet for learning) |
ACE integrates with popular agent frameworks:
| Integration | ACE Class | Use Case |
|---|---|---|
| LiteLLM | `ACELiteLLM` | Simple self-improving agent |
| LangChain | `ACELangChain` | Wrap LangChain chains/agents |
| browser-use | `ACEAgent` | Browser automation |
| Claude Code | `ACEClaudeCode` | Claude Code CLI |
| Opik | `OpikIntegration` | Production monitoring and cost tracking |
→ Integration Guide | → Examples
Based on the ACE research framework from Stanford & SambaNova.
ACE uses three specialized roles that work together:
- Agent - Creates a plan using learned skills and executes the task
- Reflector - Analyzes what worked and what didn't after execution
- SkillManager - Updates the skillbook with new strategies based on reflection
Important: The three ACE roles are different specialized prompts using the same language model, not separate models.
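A minimal sketch of that loop, with a stubbed model call so it runs anywhere; everything below is illustrative, not ACE's internal code:

```python
# Illustrative three-role loop; all roles share ONE model, differing
# only in their specialized prompts. Not ACE's internal implementation.

def call_llm(prompt: str) -> str:
    return f"<model response to: {prompt[:50]}...>"  # stub for a real LLM call

def run_ace_step(task: str, skillbook: list[str]) -> None:
    skills = "\n".join(skillbook)

    # Agent: plan and execute the task using learned skills as context.
    execution = call_llm(f"Skills:\n{skills}\n\nTask: {task}")

    # Reflector: analyze what was helpful or harmful in the execution.
    reflection = call_llm(f"Critique this execution:\n{execution}")

    # SkillManager: turn the reflection into an incremental skillbook update.
    update = call_llm(f"Propose one skillbook update based on:\n{reflection}")
    skillbook.append(update)  # incremental: nothing is rewritten from scratch

skillbook: list[str] = []
run_ace_step("book the cheapest flight to Berlin", skillbook)
print(len(skillbook))  # 1 -> one new learned strategy after the step
```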
ACE observes each execution and internalizes:
- Successes → extract patterns that work
- Failures → learn what to avoid
- Tool usage → discover which tools work best for which tasks
- Edge cases → remember rare scenarios and how to handle them
The magic happens in the Skillbook, a living document of skills that evolves with experience. Key innovation: all learning happens in context through incremental updates; no fine-tuning, no training data, and complete transparency into what your agent learned.
```mermaid
---
config:
  look: neo
  theme: neutral
---
flowchart LR
    Skillbook[("`**Skillbook**<br>(Evolving Context)<br><br>Strategy Skills:<br>• Helpful skills<br>• Harmful patterns<br>• Neutral observations`")]
    Start(["**Query**<br>User prompt or question"]) --> Agent["**Agent**<br>Executes task using skillbook"]
    Agent --> Reflector
    Skillbook -. Provides Context .-> Agent
    Environment["**Task Environment**<br>Evaluates answer<br>Provides feedback"] -- Feedback +<br>Optional Ground Truth --> Reflector
    Reflector["**Reflector**<br>Analyzes execution and identifies what was helpful/harmful"]
    Reflector --> SkillManager["**SkillManager**<br>Produces improvement updates"]
    SkillManager --> UpdateOps["**Merger**<br>Merges updates into the skillbook"]
    UpdateOps -- Incremental<br>Updates --> Skillbook
    Agent <--> Environment
```
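To make the Skillbook node above concrete, here is one plausible shape for its entries. The field names are assumptions for illustration, not ACE's actual schema:

```python
from dataclasses import dataclass, field

# Illustrative schema; field names are assumptions, not ACE's real format.
@dataclass
class Skill:
    content: str            # the learned strategy, in plain language
    label: str              # "helpful", "harmful", or "neutral"
    helpful_count: int = 0  # reinforced when the skill aids a task
    harmful_count: int = 0  # flagged when following it caused a failure

@dataclass
class Skillbook:
    skills: list[Skill] = field(default_factory=list)

    def as_context(self) -> str:
        """Render the skills as the evolving in-context 'system prompt'."""
        return "\n".join(f"[{s.label}] {s.content}" for s in self.skills)

book = Skillbook()
book.skills.append(Skill("Verify an emoji exists before outputting it.", "helpful"))
print(book.as_context())  # [helpful] Verify an emoji exists before outputting it.
```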
- Quick Start Guide - Get running in 5 minutes
- Setup Guide - Installation, configuration, providers
- Integration Guide - Add ACE to existing agents
- API Reference - Complete API documentation
- Complete Guide to ACE - Deep dive into concepts
- Prompt Engineering - Advanced prompt techniques
- Agentic System Prompting - Automatically generate prompt improvements from past traces
- Examples - Ready-to-run code examples
- Benchmarks - Evaluate ACE performance
- Changelog - Recent changes
We love contributions! Check out our Contributing Guide to get started.
Based on the ACE paper and inspired by Dynamic Cheatsheet.
If you use ACE in your research, please cite:
```bibtex
@article{zhang2024ace,
  title={Agentic Context Engineering},
  author={Zhang et al.},
  journal={arXiv:2510.04618},
  year={2024}
}
```

⭐ Star this repo if you find it useful!
Built with ❤️ by Kayba and the open-source community.


