ContextEngine

An open-source context management platform for LLM-powered agents

The Problem

LLM agents experience context degradation well before hitting token limits. Quality drops significantly after ~50 tool calls, typically around 60-70% context capacity—a phenomenon we call "pre-rot".

Quality │
   100% │████████████████████████
        │                        ████████
    80% │                                ████████
        │                                        ████
    60% │                                            ████
        │────────────────────────────────────────────────────
            0%    25%    50%    65%    80%    100%
                      Token Usage

        │◄──── Safe ────►│◄ Pre-Rot ►│◄── Degraded ──►│

The Solution

ContextEngine treats context as a first-class data structure (graph-based, not flat strings) and applies intelligent compression through a tiered strategy:

Lossless (100% recoverable): Externalize payloads, deduplicate, collapse tool chains
Compaction (80-95% recoverable): Schema compression, entity-centric filtering
Summarization (last resort): Hierarchical, task-aware, and incremental summarization

Features

Graph-based Context: Typed nodes (messages, tool calls, artifacts) with relationships
Smart Compression: 10-20x compression with minimal information loss
Pre-rot Detection: Act before quality degrades, not after
Entity Tracking: NER-powered entity extraction and importance scoring
Semantic Search: Embedding-based similarity and duplicate detection
Recovery Manifests: Track all compression operations for potential rollback
Tiered Storage: Hot/warm/cold storage with automatic migration policies
Tool Caching: Semantic and exact-match caching with pattern detection
Predictive Prefetch: Learn tool patterns and prefetch likely next calls
Observable: OpenTelemetry tracing and Prometheus metrics built-in

Installation

# Using uv (recommended)
uv add context-core context-compression context-memory context-tools

# Or with pip
pip install context-core context-compression context-memory context-tools

Quick Start

from context_core import ContextGraph, TokenBudget, EntityTracker, SemanticIndex
from context_compression import CompressionPipeline, CompressionTier

# Create a context graph
graph = ContextGraph(session_id="my-session")

# Add messages and tool calls
graph.add_message(role="user", content="Find all Python files in the src directory")
call = graph.add_tool_call("glob", {"pattern": "src/**/*.py"})
graph.add_tool_result(call.id, ["src/main.py", "src/utils.py", "src/config.py"])

# Track token budget with pre-rot detection
budget = TokenBudget(total_tokens=100_000)
budget.allocate("context", graph.total_tokens)

if budget.status.needs_compression:
    # Apply intelligent compression
    pipeline = CompressionPipeline()
    results = pipeline.compress(graph, max_tier=CompressionTier.COMPACTION)

    print(f"Saved {sum(r.tokens_saved for r in results)} tokens")

Architecture

┌─────────────────────────────────────────────────────────────────┐
│                        ContextEngine                             │
├─────────────────────────────────────────────────────────────────┤
│                                                                  │
│  ┌─────────────┐  ┌─────────────┐  ┌─────────────┐              │
│  │context-core │  │  context-   │  │  context-   │              │
│  │             │  │ compression │  │   observe   │              │
│  │ • Graph     │  │             │  │             │              │
│  │ • Entities  │  │ • Pipeline  │  │ • Tracing   │              │
│  │ • Semantic  │  │ • Lossless  │  │ • Metrics   │              │
│  │ • Budget    │  │ • Compaction│  │ • Events    │              │
│  │ • Tokenizer │  │ • Summary   │  │             │              │
│  └─────────────┘  └─────────────┘  └─────────────┘              │
│                                                                  │
│  ┌─────────────┐  ┌─────────────┐  ┌─────────────┐              │
│  │  context-   │  │  context-   │  │  context-   │              │
│  │   memory    │  │    tools    │  │ multiagent  │  (Phase 4)   │
│  │             │  │             │  │             │              │
│  │ • Tiered    │  │ • Caching   │  │ • Broker    │              │
│  │ • Working   │  │ • Patterns  │  │ • Handoff   │              │
│  │ • Retrieval │  │ • Prefetch  │  │ • Sync      │              │
│  └─────────────┘  └─────────────┘  └─────────────┘              │
│                                                                  │
└─────────────────────────────────────────────────────────────────┘

Packages

Package	Description	Status	Tests
`context-core`	Graph, entities, semantic index, token budget	✅ Complete	358
`context-compression`	Compression pipeline with 9 strategies	✅ Complete	311
`context-observe`	OpenTelemetry tracing, Prometheus metrics	✅ Complete	-
`context-memory`	Storage backends, tiered storage, retrieval	✅ Complete	307
`context-tools`	Tool caching, patterns, prefetching	✅ Complete	283
`context-multiagent`	Broker, handoff, shared memory	📅 Phase 4	-

Compression Strategies

Lossless (100% Recoverable)

Strategy	Description	Compression
`ExternalizePayloads`	Move large outputs to external storage	2-5x
`DeduplicateSemantically`	Remove near-duplicate content	1.5-2x
`CollapseToolChains`	Merge sequential related tool calls	2-3x

Compaction (80-95% Recoverable)

Strategy	Description	Compression
`SchemaCompression`	Extract and reference repeated JSON schemas	2-4x
`EntityCentricCompression`	Keep only entity-relevant sentences	2-3x
`TaskRelevanceCompression`	Filter by current task relevance	2-4x

Summarization (Irreversible, Last Resort)

Strategy	Description	Compression
`HierarchicalSummarization`	Bottom-up multi-level summaries	5-10x
`TaskAwareSummarization`	Task-focused with relevance scoring	5-10x
`IncrementalSummarization`	Streaming updates to running summary	5-10x

Development

# Clone the repository
git clone https://github.com/Sean-Koval/context-engineering.git
cd context-engineering

# Install dependencies
uv sync

# Install all packages in development mode
uv pip install -e packages/context-core -e packages/context-compression -e packages/context-memory -e packages/context-tools -e packages/context-observe

# Run tests
uv run pytest

# Format and lint
uv run ruff format .
uv run ruff check --fix .

# Type check
uv run ty check .

Project Structure

context-engineering/
├── packages/
│   ├── context-core/           # Foundation package (358 tests)
│   │   ├── src/context_core/
│   │   │   ├── graph/          # ContextGraph, nodes, edges
│   │   │   ├── entities/       # EntityTracker, NER backends
│   │   │   ├── semantic/       # SemanticIndex, vector stores
│   │   │   ├── budget/         # TokenBudget, pre-rot detection
│   │   │   └── tokenizer/      # Tokenizer protocol, implementations
│   │   └── tests/
│   ├── context-compression/    # Compression pipeline (311 tests)
│   │   ├── src/context_compression/
│   │   │   ├── strategies/
│   │   │   │   ├── lossless/   # Externalize, deduplicate, collapse
│   │   │   │   ├── compaction/ # Schema, entity-centric, task
│   │   │   │   └── summarization/ # Hierarchical, task-aware, incremental
│   │   │   ├── recovery/       # Manifest, operations
│   │   │   └── pipeline.py     # CompressionPipeline orchestrator
│   │   └── tests/
│   ├── context-memory/         # Persistent storage (307 tests)
│   │   ├── src/context_memory/
│   │   │   ├── backends/       # FileSystem, SQLite, Postgres, Redis
│   │   │   ├── retrieval/      # Semantic, Entity, Temporal, Ensemble
│   │   │   ├── artifacts/      # Versioned artifact management
│   │   │   ├── tiered.py       # Hot/warm/cold tiered storage
│   │   │   ├── working.py      # Working memory with LRU cache
│   │   │   └── eviction.py     # Multi-tier eviction strategies
│   │   └── tests/
│   ├── context-tools/          # Tool optimization (283 tests)
│   │   ├── src/context_tools/
│   │   │   ├── cache/          # ToolCallCache, semantic matching
│   │   │   ├── patterns/       # ToolUsagePatterns, antipattern detection
│   │   │   ├── compression/    # ToolResultCompressor, schema extraction
│   │   │   └── prefetch/       # ToolPrefetcher, argument prediction
│   │   └── tests/
│   └── context-observe/        # Observability
│       ├── src/context_observe/
│       │   ├── tracer.py       # OpenTelemetry integration
│       │   ├── metrics.py      # Prometheus metrics
│       │   └── events.py       # Structured logging
│       └── tests/
├── specs/                      # Technical specifications
├── docs/                       # Research and analysis
├── INDEX.md                    # Implementation progress tracking
├── TASK_BOARD.md              # Granular task breakdown
└── MASTER_ROADMAP.md          # Vision and architecture

Roadmap

Phase	Focus	Status
Phase 1	Foundation (Graph, Entities, Semantic, Budget)	✅ Complete
Phase 2	Compression (Pipeline, 9 Strategies, Recovery)	✅ Complete
Phase 3	Memory & Tools (Storage, Caching, Patterns)	✅ Complete
Phase 4	Multi-Agent (Broker, Handoff, Sync)	📅 Planned

Key Metrics

Metric	Target	Current
Context utilization before degradation	90%+	✅
Reversible compression ratio	3-5x	✅
Total compression ratio	10-20x	✅
Test coverage	90%+	1,259 tests
Memory retrieval p99 latency	< 100ms	✅
Tool cache hit rate	> 60%	✅

Contributing

We welcome contributions! See CONTRIBUTING.md for guidelines.

For Coding Agents

Each spec file in specs/ contains:

Complete Python code with type hints
Pydantic models for all data structures
Implementation checklists
Test specifications

Use TASK_BOARD.md for granular task breakdown with dependencies.

License

MIT License - see LICENSE for details.

Acknowledgments

Built with uv for blazing fast package management
Uses NetworkX for graph operations
Embeddings powered by sentence-transformers
NER via spaCy

ContextEngine - Because context is too valuable to waste.

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
.claude		.claude
.github/workflows		.github/workflows
docs		docs
packages		packages
scripts/worktree		scripts/worktree
specs		specs
.gitignore		.gitignore
.mcp.json		.mcp.json
API_CONTRACTS.md		API_CONTRACTS.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
INDEX.md		INDEX.md
LICENSE		LICENSE
MASTER_ROADMAP.md		MASTER_ROADMAP.md
README.md		README.md
TASK_BOARD.md		TASK_BOARD.md
files.zip		files.zip
files.zip:Zone.Identifier		files.zip:Zone.Identifier
handoff.md		handoff.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ContextEngine

The Problem

The Solution

Features

Installation

Quick Start

Architecture

Packages

Compression Strategies

Lossless (100% Recoverable)

Compaction (80-95% Recoverable)

Summarization (Irreversible, Last Resort)

Development

Project Structure

Roadmap

Key Metrics

Contributing

For Coding Agents

License

Acknowledgments

About

Uh oh!

Releases

Packages

Contributors 2

Languages

License

Sean-Koval/context-engineering

Folders and files

Latest commit

History

Repository files navigation

ContextEngine

The Problem

The Solution

Features

Installation

Quick Start

Architecture

Packages

Compression Strategies

Lossless (100% Recoverable)

Compaction (80-95% Recoverable)

Summarization (Irreversible, Last Resort)

Development

Project Structure

Roadmap

Key Metrics

Contributing

For Coding Agents

License

Acknowledgments

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages