
L1-Pager — Virtual Memory for LLM Context Windows

Your agent never forgets. It just pages.

L1-Pager is a drop-in context garbage collector (CGC) for LangGraph agents. It watches your conversation, evicts large old messages to a heap, and restores them on demand — exactly like OS virtual memory, but for LLM context windows.

Without L1-Pager                   With L1-Pager
─────────────────                  ──────────────────────────────
┌──────────────────┐               ┌──────────────────────────────┐
│ System prompt    │               │ System prompt                │
│ Tool result 3KB  │ ← fills fast  │ [ptr: tool result evicted]   │ ← 12 chars
│ Tool result 2KB  │               │ [ptr: tool result evicted]   │
│ Tool result 4KB  │               │ Tool result 4KB              │ ← recent, kept
│ ...12 more turns │               │ ...12 more turns             │
│ 128K limit hit   │ ✗             │ Plenty of headroom           │ ✓
└──────────────────┘               └──────────────────────────────┘

When the model needs an evicted value, it calls fetch_evicted_page(page_id=...) and gets the raw content back — zero data loss, transparent to the rest of your graph.
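
The sketch below shows roughly what that round trip looks like if you did it by hand; in practice the interceptor (described under "How it works") performs it for you. It assumes the model has the fetch_evicted_page tool bound, and heap.get() plus the function name are hypothetical, illustrative names rather than the l1-pager API.

from langchain_core.messages import ToolMessage

async def resolve_page_faults(model, heap, messages, max_depth=3):
    """Re-invoke the model until it stops asking for evicted pages."""
    ai_msg = await model.ainvoke(messages)
    for _ in range(max_depth):
        faults = [c for c in (getattr(ai_msg, "tool_calls", []) or [])
                  if c["name"] == "fetch_evicted_page"]
        if not faults:
            break                                        # normal answer, no page fault
        messages = messages + [ai_msg]
        for call in faults:                              # look each page up in the heap
            raw = heap.get(call["args"]["page_id"])      # hypothetical heap accessor
            messages.append(ToolMessage(content=raw, tool_call_id=call["id"]))
        ai_msg = await model.ainvoke(messages)
    return ai_msg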


How it works

Turn N                     Turn N+3                  Turn N+6
──────────────             ──────────────             ──────────────
Tool result    →  Mark  →  [ptr: abc123…]  →  Model sees ptr
(3 000 tokens)    phase    stored in heap     calls fetch_evicted_page
                                               ↓
                                              Raw content re-injected
                                              Model answers with exact value
  1. Mark — O(n) scan each turn. Candidates: > 500 tokens AND > 3 turns old.
  2. Sweep — concurrent heap writes. Pointer replaces original message in-place (same id → LangGraph add_messages deduplication works correctly).
  3. Page fault — model calls fetch_evicted_page, interceptor handles it transparently before returning to your graph.

Overhead: p99 < 1ms on 400-message arrays (benchmarked, not estimated).
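
To make the mark and sweep steps concrete, here is a minimal sketch of the candidate predicate and the pointer stub. It assumes each message's creation turn is tracked in state; the names, the 4-characters-per-token estimate, and the exact stub format are illustrative, not l1-pager's internals.

from langchain_core.messages import ToolMessage

def is_candidate(msg, msg_turn: int, current_turn: int,
                 min_tokens: int = 500, min_turns_old: int = 3) -> bool:
    # Rough 4-chars-per-token estimate; a coarse threshold is enough for eviction.
    est_tokens = len(str(msg.content)) // 4
    return est_tokens > min_tokens and (current_turn - msg_turn) > min_turns_old

def make_pointer(msg, page_id: str, summary: str) -> ToolMessage:
    # The stub reuses the evicted message's id, so LangGraph's add_messages
    # reducer swaps it in place instead of appending a duplicate.
    return ToolMessage(
        content=f"<PAGE_FAULT_ID: {page_id} | SUMMARY: {summary}>",
        tool_call_id=getattr(msg, "tool_call_id", page_id),
        id=msg.id,
    )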


Install

Python

pip install l1-pager

# Optional: Redis heap backend
pip install "l1-pager[redis]"

TypeScript / JavaScript

npm install @l1-pager/core

Quickstart

Python — drop-in create_react_agent replacement

from langchain_anthropic import ChatAnthropic
from langchain_core.messages import HumanMessage
from l1_pager import EvictionConfig, build_heap
from l1_pager import create_l1_pager_react_agent

model  = ChatAnthropic(model="claude-sonnet-4-6")
config = EvictionConfig(min_tokens=500, min_turns_old=3)
heap   = build_heap(config)

# Exact same API as LangGraph's create_react_agent
agent = create_l1_pager_react_agent(model, tools=[...], heap=heap, config=config)

result = await agent.ainvoke({"messages": [HumanMessage(content="...")]})

Python — wrap your existing model

from l1_pager import L1Pager, EvictionConfig, build_heap

config = EvictionConfig()
heap   = build_heap(config)
pager  = L1Pager(model, heap, config)   # "model" is the chat model you already use

# Use pager.ainvoke() anywhere you'd call model.invoke()
response = await pager.ainvoke(messages, current_turn=turn_count)

TypeScript — LangGraph node

import { createL1PagerReactAgent } from "@l1-pager/core";
import { HumanMessage } from "@langchain/core/messages";

const agent = createL1PagerReactAgent(model, tools);

const result = await agent.invoke({
  messages: [new HumanMessage("...")],
});

TypeScript — lower-level

import { L1Pager, buildHeap, DEFAULT_EVICTION_CONFIG } from "@l1-pager/core";

const heap  = buildHeap(DEFAULT_EVICTION_CONFIG);
const pager = new L1Pager(model, heap, DEFAULT_EVICTION_CONFIG);

const { aiMessage, pageFaults, tokensFreed } = await pager.ainvoke(messages, turn);

Configuration

Python

from l1_pager import EvictionConfig, EvictionPolicy

config = EvictionConfig(
    min_tokens           = 500,            # don't evict messages smaller than this
    min_turns_old        = 3,              # don't evict messages newer than this
    max_heap_entries     = 128,            # LRU cap on in-memory heap
    entry_ttl_seconds    = 3_600,          # evicted pages expire after 1 hour
    max_page_fault_depth = 3,              # max re-fetch loops per model call
    policy               = EvictionPolicy.LRU,   # LRU | IMPORTANCE | HYBRID
)

TypeScript

import { EvictionConfig } from "@l1-pager/core";

const config: EvictionConfig = {
  minTokens: 500,
  minTurnsOld: 3,
  maxHeapEntries: 128,
  entryTtlSeconds: 3600,
  maxPageFaultDepth: 3,
  policy: "lru",
};

Heap backends

In-memory (default)

heap = build_heap(config, backend="memory")

Zero dependencies, LRU eviction, lives in process memory.

Redis (persistent, shared across workers)

Python

heap = await build_heap_async(config, backend="redis", redis_url="redis://localhost:6379")

TypeScript

import { buildHeapAsync } from "@l1-pager/core";

const heap = await buildHeapAsync(config, "redis", { redisUrl: "redis://localhost:6379" });

Each evicted page is stored as a Redis hash with a TTL. Access resets the TTL (LRU behaviour without a sorted set).
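
A minimal sketch of that scheme with redis-py's asyncio client; the key names and hash fields are assumptions for illustration, not l1-pager's actual storage layout.

import redis.asyncio as redis

r = redis.from_url("redis://localhost:6379")

async def put_page(page_id: str, content: str, summary: str, ttl: int = 3600) -> None:
    key = f"l1:page:{page_id}"
    await r.hset(key, mapping={"content": content, "summary": summary})
    await r.expire(key, ttl)                # page disappears if it is never touched again

async def get_page(page_id: str, ttl: int = 3600):
    key = f"l1:page:{page_id}"
    content = await r.hget(key, "content")
    if content is not None:
        await r.expire(key, ttl)            # each access resets the TTL (LRU-like)
    return content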


State schema

L1-Pager tracks its own counters as first-class graph state. These accumulate across turns and are accessible for observability:

Field                   Type                   Description
l1_turn_count           int                    Turns processed
l1_tokens_freed_total   int                    Estimated tokens freed across all sweeps
l1_page_fault_total     int                    Total page faults resolved
l1_eviction_log         list[EvictionRecord]   Full audit trail of every eviction
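
For example, the counters can be read straight off the invoke result. This is a sketch: it assumes the returned state dict exposes these keys alongside "messages".

from langchain_core.messages import HumanMessage

result = await agent.ainvoke({"messages": [HumanMessage(content="...")]})

print("turns processed:", result.get("l1_turn_count", 0))
print("tokens freed:   ", result.get("l1_tokens_freed_total", 0))
print("page faults:    ", result.get("l1_page_fault_total", 0))
for record in result.get("l1_eviction_log", []):
    print(record)                           # one audit-trail entry per eviction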

System prompt guidance

For best results, add this block to your system prompt so the model understands the compression format:

This conversation uses L1-Pager context compression.
Compressed messages appear as:  <PAGE_FAULT_ID: {hex} | SUMMARY: {text}>

If a question asks for SPECIFIC DATA inside a compressed reference,
call fetch_evicted_page(page_id="{hex}") to retrieve it first.
Do NOT guess — always fetch before answering precise values.
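
One way to wire that in, assuming you construct the system message yourself (how you attach it depends on how your graph is built):

from langchain_core.messages import HumanMessage, SystemMessage

L1_PAGER_GUIDANCE = (
    "This conversation uses L1-Pager context compression.\n"
    "Compressed messages appear as:  <PAGE_FAULT_ID: {hex} | SUMMARY: {text}>\n"
    "If a question asks for SPECIFIC DATA inside a compressed reference,\n"
    'call fetch_evicted_page(page_id="{hex}") to retrieve it first.\n'
    "Do NOT guess - always fetch before answering precise values.\n"
)

system = SystemMessage(content="You are a helpful assistant.\n\n" + L1_PAGER_GUIDANCE)
result = await agent.ainvoke({"messages": [system, HumanMessage(content="...")]})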

Benchmarks

Run against synthetic conversations on Apple M-series (no Redis):

Messages   p50      p95      p99
52         0.04ms   0.07ms   0.09ms
102        0.08ms   0.13ms   0.17ms
202        0.18ms   0.26ms   0.32ms
402        0.38ms   0.49ms   0.55ms

SLA target: 15ms. Actual worst-case p99: 0.55ms (27× headroom).

Run benchmarks yourself:

cd python && python benchmarks/bench_sweep.py

Project structure

l1-pager/
├── python/                    # PyPI package: l1-pager
│   ├── l1_pager/
│   │   ├── schema.py          # types, pointer helpers
│   │   ├── mark.py            # O(n) mark phase
│   │   ├── heap.py            # InMemoryHeap + RedisHeap
│   │   ├── summarizer.py      # extractive one-line summaries
│   │   ├── sweep.py           # concurrent eviction
│   │   ├── tools.py           # fetch_evicted_page tool schema
│   │   ├── interceptor.py     # L1Pager — the main class
│   │   ├── langgraph_integration.py
│   │   └── wrap.py            # @l1_pager_hook decorator
│   ├── tests/
│   └── benchmarks/
└── typescript/                # npm package: @l1-pager/core
    └── src/
        ├── schema.ts
        ├── mark.ts
        ├── heap.ts
        ├── redis_heap.ts
        ├── summarizer.ts
        ├── sweep.ts
        ├── tools.ts
        ├── interceptor.ts
        ├── langgraph_node.ts
        └── index.ts

Requirements

Python

  • Python ≥ 3.11
  • langchain-core >= 0.3
  • langgraph >= 0.2
  • redis >= 4.2 (optional, for Redis heap)

TypeScript

  • Node.js ≥ 18
  • @langchain/core >= 0.3 (peer dependency)
  • @langchain/langgraph >= 0.2 (peer dependency)
  • redis >= 4.0 (optional, for Redis heap)

License

MIT

