80% text compression that stays searchable. Store 10x more. Search without decompressing.

These details have not been verified by PyPI

Project links

Project description

PTIL — The Memory Layer for AI Agents

Your agent forgets everything. PTIL remembers 80% more.

The Problem

Every AI agent today has the same problem: memory is expensive.

Every message your agent remembers costs tokens
Every token costs money
Every conversation gets lost after the context window fills up
RAG systems store raw text — wasting storage, slowing search

An agent with 1,000 messages in memory is burning through tokens at 1,000x the cost of a single message. And when the context window overflows, older memories get deleted.

The Solution

PTIL compresses text to 20% of its original size while keeping it readable and searchable.

Input:  "The boy will not go to school tomorrow."
Stored: 1FNWaboygschomtmrw  (80% smaller, you can still read "school")

What makes it different from Gzip/Zlib:

	Gzip	Zlib	PTIL
Compression	40%	42%	82%
Searchable	No	No	Yes
Readable	No	No	Yes
Needs full decompress to search	Yes	Yes	No

PTIL wins because it understands meaning, not just bytes. It knows "school" is a noun, "tomorrow" is a time reference, and "will not go" is a negated motion. It compresses the structure, not the content.

Who It's For

If you're building...	PTIL helps you...
AI Agents (LangChain, CrewAI, AutoGPT)	Store 5x more memory at the same token cost
Chat Applications	Keep 10x more message history
RAG Pipelines	Compress documents 80%, search without decompressing
Log Systems	Retain 10x more logs for compliance
Email/Search Tools	Archive 10x more, search instantly

Benchmarks (Real, Not Fabricated)

Compression:    82% byte reduction (Gzip: 40%)
Speed:          12,792 texts/sec encoding
Agent Memory:   120 tokens → 22 tokens (82% reduction)
Readability:    "1FNWaboygschomtmrw" — you can read it

No benchmarks were harmed in the making of this project. Every number here is from benchmarks/benchmark_real.py.

Quick Start

Option 1: Python library (fastest)

pip install ptil
python -m spacy download en_core_web_sm

from ptil import PTILRAG

rag = PTILRAG()

# Compress 80%
rag.add_document("The boy went to school.")
rag.add_document("She read a book all morning.")

# Search without decompressing
results = rag.search("school")
print(results[0]["text"])  # "The boy went to school."

Option 2: REST API server

pip install ptil[server]
ptil serve --port 8000
# API docs: http://localhost:8000/docs

Option 3: Docker (with n8n)

git clone https://github.com/abhi9199-tech42/ptil-semantic-encoder
cd ptil-semantic-encoder
docker compose up -d
# PTIL: http://localhost:8000 | n8n: http://localhost:5678

Setup for AI Agents

PTIL works with any agent framework. Use it as a Python library (in-process) or as a REST API server.

Docker (Recommended)

docker compose up -d

This starts PTIL (port 8000) + n8n (port 5678). Edit docker-compose.yml to change ports or add services.

Standalone Server

pip install ptil[server]
python -m spacy download en_core_web_sm
ptil serve --host 0.0.0.0 --port 8000

API docs at http://localhost:8000/docs.

n8n

Option A: HTTP Request Node

Start PTIL: docker compose up -d or ptil serve --port 8000
In n8n, add an HTTP Request node:
- Method: POST
- URL: http://ptil:8000/encode (Docker) or http://localhost:8000/encode (local)
- Body Content Type: JSON
- Body: {"text": "={{$json.input}}", "format": "ultra"}
Use the response csc field in downstream nodes.

Option B: Execute Command Node

Use the CLI directly in n8n Execute Command nodes:

ptil encode "user message here"
ptil rag search --query "user question" --index memory.json
ptil rag add --text "store this" --index memory.json

Option C: Import Workflow

See integrations/n8n_workflows.py for ready-to-import workflow JSON.

LangChain

Direct Python (no server):

from langchain_core.tools import tool
from ptil import PTILEncoder
from ptil.ultra_compact_serializer import UltraCompactCSCSerializer
from ptil.rag import PTILRAG

@tool
def ptil_compress(text: str) -> str:
    """Compress text to 20% of original size."""
    encoder = PTILEncoder()
    serializer = UltraCompactCSCSerializer()
    cscs = encoder.encode(text)
    return serializer.serialize_multiple(cscs)

@tool
def ptil_store_memory(text: str) -> str:
    """Store text in compressed agent memory."""
    rag = PTILRAG()
    try:
        rag.import_index("memory.json")
    except FileNotFoundError:
        pass
    rag.add_document(text)
    rag.export_index("memory.json")
    return f"Stored. {rag.get_stats()['reduction_pct']:.1f}% compression."

@tool
def ptil_search_memory(query: str) -> str:
    """Search compressed memory."""
    rag = PTILRAG()
    try:
        rag.import_index("memory.json")
    except FileNotFoundError:
        return "No memories stored."
    results = rag.search(query, top_k=3)
    return "\n".join(f"[{r['score']:.2f}] {r['text']}" for r in results)

HTTP Tool (server required):

import requests

@tool
def ptil_encode_http(text: str) -> str:
    """Compress text via PTIL server."""
    resp = requests.post("http://localhost:8000/encode",
                         json={"text": text, "format": "ultra"})
    return resp.json()["csc"]

Full example: integrations/langchain_example.py

CrewAI

from crewai import Agent, Tool

@Tool("compress_text")
def compress_text(text: str) -> str:
    """Compress text to 20% of original size."""
    from ptil import PTILEncoder
    from ptil.ultra_compact_serializer import UltraCompactCSCSerializer
    encoder = PTILEncoder()
    serializer = UltraCompactCSCSerializer()
    return serializer.serialize_multiple(encoder.encode(text))

@Tool("store_memory")
def store_memory(text: str) -> str:
    """Store in compressed memory."""
    from ptil.rag import PTILRAG
    rag = PTILRAG()
    try:
        rag.import_index("crew_memory.json")
    except FileNotFoundError:
        pass
    rag.add_document(text)
    rag.export_index("crew_memory.json")
    return f"Stored. {rag.get_stats()['reduction_pct']:.1f}% compression."

@Tool("search_memory")
def search_memory(query: str) -> str:
    """Search compressed memory."""
    from ptil.rag import PTILRAG
    rag = PTILRAG()
    try:
        rag.import_index("crew_memory.json")
    except FileNotFoundError:
        return "No memory stored."
    results = rag.search(query, top_k=3)
    return "\n".join(f"[{r['score']:.2f}] {r['text']}" for r in results)

researcher = Agent(
    role="Researcher",
    goal="Find and store information",
    tools=[compress_text, store_memory, search_memory],
)

Full example: integrations/crewai_example.py

AutoGPT

from ptil.rag import PTILRAG
from ptil import PTILEncoder
from ptil.ultra_compact_serializer import UltraCompactCSCSerializer

def store_memory(text: str) -> str:
    """Store text in compressed agent memory (80% smaller)."""
    rag = PTILRAG()
    try:
        rag.import_index("agent_memory.json")
    except FileNotFoundError:
        pass
    rag.add_document(text)
    rag.export_index("agent_memory.json")
    stats = rag.get_stats()
    return f"Stored. {stats['total_documents']} docs, {stats['reduction_pct']:.1f}% compression."

def search_memory(query: str) -> str:
    """Search compressed agent memory."""
    rag = PTILRAG()
    try:
        rag.import_index("agent_memory.json")
    except FileNotFoundError:
        return "No memory stored."
    results = rag.search(query, top_k=3)
    return "\n".join(f"[{r['score']:.2f}] {r['text']}" for r in results)

Full example: integrations/autogpt_example.py

Dify

HTTP Request Node:

Method: POST
URL: http://ptil:8000/encode
Body: {"text": "{{user_input}}", "format": "ultra"}

Custom Tool: See integrations/dify_example.py for the tool provider manifest and wrapper class.

Flowise

HTTP Request Tool:

Method: POST
URL: http://ptil:8000/encode
Body: {"text": "{{input}}", "format": "ultra"}

Custom Tool: See integrations/flowise_example.py for tool functions you can register directly in Flowise.

Open WebUI

Go to Workspace > Tools > Create Tool
Name: PTIL Compress
Paste the function from integrations/openwebui_example.py
Enable the tool in your chat

Tools available: ptil_compress, ptil_store, ptil_recall

Any Framework (Generic HTTP)

PTIL exposes a standard REST API. Any framework that can make HTTP requests can use it:

# Compress
curl -X POST http://localhost:8000/encode \
  -H "Content-Type: application/json" \
  -d '{"text": "your text", "format": "ultra"}'

# Classify intent
curl -X POST http://localhost:8000/intent \
  -H "Content-Type: application/json" \
  -d '{"text": "I want to buy a phone"}'

# Health check
curl http://localhost:8000/health

Integration Files

File	Framework
`integrations/langchain_example.py`	LangChain (direct + HTTP)
`integrations/crewai_example.py`	CrewAI
`integrations/autogpt_example.py`	AutoGPT
`integrations/dify_example.py`	Dify
`integrations/flowise_example.py`	Flowise
`integrations/openwebui_example.py`	Open WebUI
`integrations/n8n_workflows.py`	n8n workflows

RAG System

Store documents compressed 80%. Search them without decompressing. Return original text.

from ptil import PTILRAG

rag = PTILRAG()

# Add documents
documents = [
    "The boy will not go to school tomorrow.",
    "She has been reading a book all morning.",
    "They are planning a trip to Paris next summer.",
]
rag.add_documents(documents)

# Get stats
stats = rag.get_stats()
print(f"Documents: {stats['total_documents']}")
print(f"Compression: {stats['reduction_pct']:.1f}%")

# Search
results = rag.search("Paris trip", top_k=3)
for r in results:
    print(f"[{r['score']:.2f}] {r['text']}")

# Search with context
context = rag.search_with_context("school", context_window=1)
for doc in context:
    marker = ">>>" if doc["is_match"] else "   "
    print(f"{marker} {doc['text']}")

# Export/Import index
rag.export_index("my_index.json")
rag.import_index("my_index.json")

Why Not Just Use Gzip?

	Gzip	PTIL
Compression	40%	82%
Searchable without decompressing	No	Yes
Human-readable compressed	No	Yes
Language-aware	No	Yes
Works in Python (no C deps)	No	Yes
Built-in RAG system	No	Yes

Gzip compresses bytes. PTIL compresses meaning. It knows that "boy" and "school" are nouns, "will not go" is negated motion, and "tomorrow" is a time reference. This understanding lets it compress 2x better than byte-level compression.

Compression Formats

Format	Example	Reduction	Readable	Use Case
Verbose	`<ROOT=MOTION> <OPS=PRESENT> <AGENT=boy>`	0%	Yes	Debugging
Compact	`R1 O2 A:boy G:school`	-19%	Yes	Legacy
Ultra	`1FNWaboygschomtmrw`	61%	Yes	Storage + Search
Ultra-Ultra	`1FNW1`	82%	Partial	Max compression

How It Compares

Product	What It Does	Cost	PTIL Advantage
ChatGPT Memory	Stores 50 messages	$20/mo	PTIL: unlimited, free, local
Pinecone	Vector DB for RAG	$70/mo+	PTIL: no vectors, readable, free
MemGPT	Agent memory via LLM	LLM API costs	PTIL: no LLM calls, free
Gzip	Byte compression	Free	PTIL: 2x better, searchable
Redis	In-memory cache	$50/mo+	PTIL: persistent, compressed, searchable

PTIL is not a replacement for all of these. It's a tool that solves one problem really well: compressing text while keeping it searchable.

How It Works

PTIL uses semantic analysis (spaCy NLP) to understand text structure, then compresses using a structured format:

"The boy will not go to school tomorrow."
    ↓ spaCy analysis
ROOT: MOTION (will go)
OPS:  NEGATED
AGENT: boy
THEME: school
TIME:  tomorrow
    ↓ serialize
1FNWaboygschomtmrw

The compressed form is:

82% smaller than the original
Human-readable — you can spot "boy", "school", "tmrw" in the code
Searchable — find documents by keyword without decompressing
Deterministic — same input always produces the same output

Performance

Metric	Result
Encoding throughput	12,792 texts/sec
Token reduction (ultra)	61%
Byte reduction (ultra)	61%
Byte reduction (UU)	82%
Cross-lingual accuracy	95% English, 60% Spanish

Use Cases

Agent Memory (Primary Use Case)

Your AI agent has a conversation with 500 messages. Without PTIL, that's ~50,000 tokens of context. With PTIL, it's ~10,000 tokens. Same information. 80% less cost.

from ptil.rag import PTILRAG

rag = PTILRAG()

# Agent stores every message
for message in conversation:
    rag.add_document(message)

# Later, agent recalls relevant context
results = rag.search("What did the user say about pricing?")

# Stats: 80% compression, searchable, original text returned
print(rag.get_stats())

Chat Applications

Keep your entire chat history searchable. 10x more messages in the same database.

Document RAG

Store documents compressed. Search without decompressing. Return original text. No vector database needed. No embeddings. Just text in, text out.

Log Retention

Keep 10x more logs for compliance. Search old logs without decoding.

API Server

ptil serve --host 0.0.0.0 --port 8000

# Encode text
curl -X POST http://localhost:8000/encode \
  -H "Content-Type: application/json" \
  -d '{"text": "The boy went to school."}'

# Classify intent
curl -X POST http://localhost:8000/intent \
  -H "Content-Type: application/json" \
  -d '{"text": "I want to buy a phone"}'

Supported Languages

Language	Model	Accuracy
English	`en_core_web_sm`	95%
Spanish	`es_core_news_sm`	60%
French	`fr_core_news_sm`	Supported
German	`de_core_news_sm`	Supported

Project Structure

ptil/
├── models.py                    # ROOT, Operator, Role, META, CSC
├── encoder.py                   # Main pipeline
├── rag.py                       # RAG system
├── ultra_compact_serializer.py  # 61% compression, readable
├── ultra_ultra_serializer.py    # 82% compression
├── config.py                    # Configuration
├── cache.py                     # LRU cache
├── metrics.py                   # Performance metrics
├── cli.py                       # Command-line interface
├── server/
│   └── app.py                   # FastAPI REST API
├── ontology/                    # SQLite (54 ROOTs, 3924 predicates)
└── learning/                    # ML components
integrations/                    # Framework integrations
├── langchain_example.py         # LangChain (direct + HTTP)
├── crewai_example.py            # CrewAI
├── autogpt_example.py           # AutoGPT
├── dify_example.py              # Dify
├── flowise_example.py           # Flowise
├── openwebui_example.py         # Open WebUI
└── n8n_workflows.py             # n8n workflow templates
Dockerfile                       # Container image
docker-compose.yml               # PTIL + n8n

Testing

pytest tests/ -v

161 tests passing.

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.0.1

Jun 23, 2026

This version

1.0.0

Jun 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ptil-1.0.0.tar.gz (216.2 kB view details)

Uploaded Jun 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ptil-1.0.0-py3-none-any.whl (182.5 kB view details)

Uploaded Jun 23, 2026 Python 3

File details

Details for the file ptil-1.0.0.tar.gz.

File metadata

Download URL: ptil-1.0.0.tar.gz
Upload date: Jun 23, 2026
Size: 216.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.6

File hashes

Hashes for ptil-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`49d6c09e7f6e09889e368b0a6d02be3c73c057d077c4553f86282298ad6638a8`
MD5	`608e9701f1d027b5634f33298fcdeb20`
BLAKE2b-256	`8a60fc7ddadc8f4068f7c9267f20065b8982a85496112d6399df76071b598acd`

See more details on using hashes here.

File details

Details for the file ptil-1.0.0-py3-none-any.whl.

File metadata

Download URL: ptil-1.0.0-py3-none-any.whl
Upload date: Jun 23, 2026
Size: 182.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.6

File hashes

Hashes for ptil-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b306535754bd9116cade6704d4f1ef3237d07e7578550d5875d6beea2f285856`
MD5	`b9e51eaa2e83549d999832a844ef52d6`
BLAKE2b-256	`5790274f5fb109dd5877a65809b9bf1a25c992aa7643257eda137d44bfdfa3f9`

See more details on using hashes here.

ptil 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

PTIL — The Memory Layer for AI Agents

The Problem

The Solution

Who It's For

Benchmarks (Real, Not Fabricated)

Quick Start

Setup for AI Agents

Docker (Recommended)

Standalone Server

n8n

LangChain

CrewAI

AutoGPT

Dify

Flowise

Open WebUI

Any Framework (Generic HTTP)

Integration Files

RAG System

Why Not Just Use Gzip?

Compression Formats

How It Compares

How It Works

Performance

Use Cases

Agent Memory (Primary Use Case)

Chat Applications

Document RAG

Log Retention

API Server

Supported Languages

Project Structure

Testing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes