A historical market simulation environment for testing AI trading agent reasoning against real market data.
Finlet
An evaluation harness for AI trading agents. Daily-timeframe. US equities. Long-only.
Test your AI agent's reasoning — not just its algorithms.
Finlet is an agent evaluation harness that tests whether AI trading agents make sound decisions with the information available to them. It provides a simulation clock that controls what data your agent can see, a Date Ceiling Enforcer that strips future data from every response, and a tamper-evident reasoning trace that logs every query and decision with SHA-256 checksums.
This is not a backtester. Backtesters test strategies against historical data. Finlet tests whether your agent reasons well given the same information a human analyst would have had on that date.
Why Finlet?
Most backtesting frameworks test algorithms. Finlet tests reasoning.
As a backtester, Finlet would lose to QuantConnect on every metric — data coverage, asset classes, speed. As an evaluation harness, it has no direct product competitor.
- Date Ceiling Enforcer — Defense-in-depth runtime enforcement strips any data past the simulation clock. Property-based tests prove no future item passes. No competitor has an equivalent.
- SHA-256 Reasoning Trace — Every query, decision, and order is logged with tamper-evident checksums. No backtester tracks agent reasoning.
- Cross-Scenario Leaderboard — Standardized evaluation with composite scoring across returns, risk, drawdown, reasoning quality, and information efficiency
- Real data sources — S3 Parquet data lake (prices, news, fundamentals, sentiment) is the primary hot path; SEC EDGAR filings, FRED economic indicators, and Finnhub (fallback for news + fundamentals when S3 is unavailable) round out the surface — not synthetic datasets
- MCP native — Connect Claude Code (or any MCP client) directly as a trading agent via 16 purpose-built tools
v1 Scope
Finlet v1 is intentionally scoped to daily-timeframe, US-equities, long-only evaluation. These are design choices, not gaps:
| Scope Decision | Rationale |
|---|---|
| EOD data only | Agent evaluation tests reasoning quality, not execution speed |
| US equities only | Deepest data coverage (price, filings, economic, news) |
| Long-only | Isolates reasoning from margin/borrow mechanics |
| No partial fills | Honest about what EOD data supports (no order book depth) |
| No live/paper trading | Evaluation harness, not trading platform |
See docs/V1_SCOPE.md for the complete scope document with rationale.
Quick Start
Finlet's launch user is an LLM agent invoking via MCP, not a human typing CLI commands. The finlet binary bootstraps credentials and, by default, runs a stdio bridge to hosted /mcp; self-host operators use finlet mcp serve --self-host for the local FastMCP server. The agent host (Claude Desktop, Codex CLI, Cursor, Windsurf, custom MCP client) invokes the tools. The web dashboard at finlet.dev is a read-only monitoring surface. Setup guide for agents: finlet.dev/setup. Machine-readable index: finlet.dev/llms.txt.
1. Install
pip install finlet
2. Authenticate
finlet auth login
Opens a browser to run the OAuth 2.1 + PKCE flow and persists a Bearer JWT at ~/.finlet/credentials (mode 0600). The default MCP stdio bridge refreshes that credential when needed before forwarding calls to hosted /mcp. Headless / CI runs may use finlet register --email you@example.com to mint an api_key and pass it via FINLET_API_KEY instead.
3. Wire finlet mcp serve into your agent host
Claude Desktop — add to ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) under mcpServers:
{
  "mcpServers": {
    "finlet": {
      "command": "finlet",
      "args": ["mcp", "serve"]
    }
  }
}
Codex CLI — run from your terminal:
codex mcp add finlet -- finlet mcp serve
Generic MCP Client (Cursor, Windsurf, custom) — point your client at the stdio bridge with the same JSON schema:
{
  "command": "finlet",
  "args": ["mcp", "serve"]
}
4. Talk to your agent
Open the agent host. Ask:
Create a session named 'my-first-sim' starting 2024-01-02 with $100,000
capital and tickers AAPL, MSFT, GOOGL. Then place a market BUY order
for 10 shares of AAPL and show me the portfolio.
The agent invokes the MCP tools (create_session, submit_order, get_portfolio) and reports back. The simulation clock starts FROZEN at the supplied start date; no future data leaks through the Date Ceiling Enforcer. See the MCP Tools section below or dashboard/agent-guide.html (/agent-guide) for the full tool inventory.
5. Watch your agent work in the monitoring dashboard
Visit https://finlet.dev (or http://localhost:8000 when running locally) to see the read-only monitoring surface for your agent's sessions: equity curve, positions, trade log, reasoning trace, plugin health, leaderboard. The dashboard observes — it does not control. All write paths are MCP-only post-2026-05-22; legacy write-path UI is preserved at legacy/dashboard-ui/. See docs/decisions/ui-removal-launch-cut.md and docs/decisions/v2-pure-mcp-migration.md for the rationale.
Direct REST API (alternative to CLI/MCP)
Any HTTP client works:
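import asyncio

import httpx


async def main(session_id: str) -> None:
    async with httpx.AsyncClient(base_url="http://localhost:8000") as client:
        # Read the current simulation clock state
        clock = await client.get(f"/sessions/{session_id}/clock")
        # Fetch price data as visible at the current sim time
        prices = await client.get(
            f"/sessions/{session_id}/market/price",
            params={"ticker": "AAPL", "period": "3mo"},
        )
        # Submit a market order; the reasoning text is logged in the trace
        await client.post(f"/sessions/{session_id}/trade/order", json={
            "side": "BUY",
            "ticker": "AAPL",
            "quantity": 50,
            "order_type": "MARKET",
            "reasoning": "Strong Q4 earnings beat, raising guidance, reasonable P/E",
        })


# Replace the placeholder with the id of an existing session
asyncio.run(main("YOUR_SESSION_ID"))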
Features
Simulation Clock
Three modes to control how time flows:
| Mode | Behavior |
|---|---|
| FROZEN | Clock is stopped. Agent can make unlimited queries at the current time. |
| STEPPING | Clock advances by a fixed interval when explicitly told to step. |
| CONTINUOUS | Clock advances in real-time at a configurable speed multiplier. |
The clock only moves forward when explicitly advanced. No implicit time progression.
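The forward-only rule can be illustrated with a minimal sketch. The `SimClock` and `ClockMode` names below are hypothetical and are not taken from Finlet's code; the sketch covers only the FROZEN and STEPPING modes and assumes a step is rejected unless its interval is positive.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
from enum import Enum


class ClockMode(Enum):
    FROZEN = "FROZEN"
    STEPPING = "STEPPING"


@dataclass
class SimClock:
    """Hypothetical forward-only clock (illustration, not Finlet's class)."""

    now: datetime
    mode: ClockMode = ClockMode.FROZEN

    def step(self, interval: timedelta) -> datetime:
        # Time moves only on an explicit call, and only forward.
        if interval <= timedelta(0):
            raise ValueError("interval must be positive")
        self.mode = ClockMode.STEPPING
        self.now += interval
        return self.now

    def freeze(self) -> None:
        # Stop the clock; queries may continue at the current time.
        self.mode = ClockMode.FROZEN


clock = SimClock(now=datetime(2024, 1, 2, tzinfo=timezone.utc))
clock.step(timedelta(days=1))  # explicit advance by one day
clock.freeze()
```

Keeping the advance behind a single method makes it easy to audit that nothing else mutates the current time.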
Date Ceiling Enforcer
Defense-in-depth protection against future data leakage:
- All timestamps in response data must be <= current sim clock time
- Items with timestamps after the ceiling are stripped
- Items without timestamps are excluded by default
- Strip counts are logged internally but never exposed to the agent (that would leak information about future data existence)
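The rules above could be sketched roughly as follows. This is an illustration under assumptions, not Finlet's implementation: the `enforce_ceiling` name, the `timestamp` key, and the use of timezone-aware `datetime` values are all hypothetical.

```python
from datetime import datetime, timezone
from typing import Any


def enforce_ceiling(
    items: list[dict[str, Any]], ceiling: datetime
) -> tuple[list[dict[str, Any]], int]:
    """Return (kept_items, strip_count) for a given sim-clock ceiling."""
    kept = []
    for item in items:
        ts = item.get("timestamp")  # assumed key; items without it are excluded
        if ts is not None and ts <= ceiling:
            kept.append(item)
    # The count is for internal logging only and is never sent to the agent.
    return kept, len(items) - len(kept)


ceiling = datetime(2024, 1, 2, tzinfo=timezone.utc)
items = [
    {"headline": "past", "timestamp": datetime(2023, 12, 29, tzinfo=timezone.utc)},
    {"headline": "future", "timestamp": datetime(2024, 1, 5, tzinfo=timezone.utc)},
    {"headline": "undated"},
]
visible, strip_count = enforce_ceiling(items, ceiling)
```

Only `visible` would be returned to the agent; `strip_count` stays on the server side.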
Portfolio Engine
Full portfolio tracking with computed metrics:
- Cash tracking, position management, P&L (realized + unrealized)
- Sharpe ratio, max drawdown, win rate
- Equity curve time series for charting
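As a rough illustration of two of these metrics, the sketch below computes max drawdown and an annualized Sharpe ratio from an equity curve. The function names, the 252-period annualization, and the zero default risk-free rate are assumptions for the example; Finlet's own formulas may differ.

```python
import math
from statistics import mean, stdev


def max_drawdown(equity: list[float]) -> float:
    """Largest peak-to-trough decline, as a fraction of the peak."""
    peak = equity[0]
    worst = 0.0
    for value in equity:
        peak = max(peak, value)
        worst = max(worst, (peak - value) / peak)
    return worst


def sharpe_ratio(
    equity: list[float], periods_per_year: int = 252, risk_free_rate: float = 0.0
) -> float:
    """Annualized Sharpe ratio from simple per-period returns."""
    returns = [b / a - 1.0 for a, b in zip(equity, equity[1:])]
    if len(returns) < 2:
        return 0.0  # not enough data for a standard deviation
    excess = [r - risk_free_rate / periods_per_year for r in returns]
    spread = stdev(excess)
    if spread == 0:
        return 0.0
    return mean(excess) / spread * math.sqrt(periods_per_year)


curve = [100.0, 120.0, 90.0, 130.0]
drawdown = max_drawdown(curve)  # peak 120 to trough 90
```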
Order System
Market, limit, and stop orders with full lifecycle tracking:
PENDING -> FILLED | CANCELLED | REJECTED
Each order supports an optional reasoning field for trace logging.
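The lifecycle above can be read as a small state machine in which PENDING is the only non-terminal state. The sketch below is one way to express it; the `OrderStatus` enum and `transition` helper are hypothetical names, not Finlet's API.

```python
from enum import Enum


class OrderStatus(Enum):
    PENDING = "PENDING"
    FILLED = "FILLED"
    CANCELLED = "CANCELLED"
    REJECTED = "REJECTED"


# Only PENDING orders may change state; the other three are terminal.
ALLOWED = {
    OrderStatus.PENDING: {
        OrderStatus.FILLED,
        OrderStatus.CANCELLED,
        OrderStatus.REJECTED,
    },
}


def transition(current: OrderStatus, new: OrderStatus) -> OrderStatus:
    """Return the new status, or raise if the move is not allowed."""
    if new not in ALLOWED.get(current, set()):
        raise ValueError(f"cannot move order from {current.value} to {new.value}")
    return new


status = transition(OrderStatus.PENDING, OrderStatus.FILLED)
```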
Plugin System
Extensible data source architecture. Built-in plugins connect to real financial APIs:
| Plugin | Data Type | API Key | Notes |
|---|---|---|---|
| PricePlugin | Price (OHLCV) | None | S3 Parquet-backed historical price data. Hot path — no Finnhub fallback. |
| FundamentalsPlugin | Fundamentals | None | S3 Parquet quarterly financials. Default for fundamentals queries. |
| NewsPlugin | News | None | S3 Parquet historical news headlines. Default for news queries. |
| SentimentPlugin | Analyst Ratings | None | S3 Parquet analyst ratings, price targets, consensus. |
| EDGAR | SEC Filings | None* | 10-K, 10-Q, 8-K, 13-F, Form 4. *Requires User-Agent email. |
| FRED | Economic | Free key | GDP, unemployment, CPI, rates. ALFRED vintage dates. |
| Finnhub | News + Fundamentals (fallback) | User key | Fallback only — used when S3 news or fundamentals are unavailable. 60 calls/min free tier. |
# Configure plugin API keys
finlet plugins add finnhub --api-key=YOUR_KEY
finlet plugins add fred --api-key=YOUR_KEY
Plugin configuration is stored at ~/.finlet/plugins.json (never committed to git).
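The fallback relationship in the plugin table (S3 first, Finnhub only when S3 news or fundamentals are unavailable) might look like the sketch below. Everything here is hypothetical: the `fetch_with_fallback` helper, the convention that an unavailable source returns `None` or raises `ConnectionError`, and the stub fetchers stand in for real plugins.

```python
from typing import Callable, Optional

# A fetcher takes a ticker and returns rows, or None when it has no data.
Fetcher = Callable[[str], Optional[list[dict]]]


def fetch_with_fallback(
    ticker: str, primary: Fetcher, fallback: Fetcher
) -> tuple[Optional[list[dict]], str]:
    """Try the primary source; use the fallback only if it is unavailable."""
    try:
        result = primary(ticker)
    except ConnectionError:
        result = None
    if result is not None:
        return result, "primary"
    return fallback(ticker), "fallback"


def s3_news_down(ticker: str) -> Optional[list[dict]]:
    # Stub that simulates an unreachable primary source.
    raise ConnectionError("S3 unreachable")


def finnhub_news(ticker: str) -> Optional[list[dict]]:
    # Stub that simulates a fallback response.
    return [{"ticker": ticker, "headline": "example headline"}]


rows, source = fetch_with_fallback("AAPL", s3_news_down, finnhub_news)
```

Per the table, price data has no such fallback, so a helper like this would apply to news and fundamentals only. Whatever the source, the response would still pass through the Date Ceiling Enforcer.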
Reasoning Trace
Every agent interaction is logged:
- Action type: What kind of query or action (price, news, filing, order, etc.)
- Sim time + real time: When it happened in simulation and wall clock
- Request params: What was requested
- Response summary: What was returned
- Reasoning: Agent's own explanation for the action
- Latency: How long the operation took
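One common way to make such a log tamper-evident is to chain SHA-256 checksums so that each entry's hash also covers the previous one. The sketch below shows that idea. It is an assumption about the technique, since the README states only that entries carry SHA-256 checksums, and all function and field names are hypothetical.

```python
import hashlib
import json


def entry_checksum(entry: dict, prev_checksum: str) -> str:
    """SHA-256 over the previous checksum plus a canonical JSON payload."""
    payload = json.dumps(entry, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256((prev_checksum + payload).encode("utf-8")).hexdigest()


def append_entry(trace: list[dict], entry: dict) -> None:
    prev = trace[-1]["checksum"] if trace else ""
    trace.append({"entry": entry, "checksum": entry_checksum(entry, prev)})


def verify(trace: list[dict]) -> bool:
    """Recompute every checksum; any edited entry breaks the chain."""
    prev = ""
    for record in trace:
        if record["checksum"] != entry_checksum(record["entry"], prev):
            return False
        prev = record["checksum"]
    return True


trace: list[dict] = []
append_entry(trace, {"action": "price", "params": {"ticker": "AAPL"}})
append_entry(trace, {"action": "order", "reasoning": "earnings beat"})
intact = verify(trace)
```

Editing any stored entry after the fact changes its recomputed hash, so `verify` returns `False` from that point on.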
Leaderboard
Standardized evaluation across 5 scenarios with composite scoring. Agents are scored on:
- Portfolio returns vs. benchmark
- Risk-adjusted returns (Sharpe ratio)
- Maximum drawdown
- Reasoning quality
- Information efficiency (returns per API call)
Opt in to data sharing to appear on the public leaderboard and compare your agent against others.
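The information efficiency criterion, described above as returns per API call, can be made concrete with a small sketch. The function name and the choice of total fractional return as the numerator are assumptions; the README does not specify how the composite score weights this figure.

```python
def information_efficiency(total_return: float, api_calls: int) -> float:
    """Return earned per API call (total_return is a fraction, e.g. 0.12)."""
    if api_calls <= 0:
        raise ValueError("api_calls must be positive")
    return total_return / api_calls


# Two hypothetical agents with the same return but different query volumes.
frugal = information_efficiency(0.12, 40)
chatty = information_efficiency(0.12, 400)
```

Under this reading, an agent that reaches the same return with fewer queries scores higher on this dimension.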
Architecture
+-------------------------------------------------------------+
| AI Trading Agent |
| (Claude Code, custom bot, etc.) |
+----------+------------------------------+--------------------+
| MCP (stdio) | REST API
v v
+-------------------------------------------------------------+
| Finlet Server |
| +-----------+ +-----------+ +----------------------+ |
| | MCP | | FastAPI | | Static Dashboard | |
| | Server | | Routes | | (HTML/JS/CSS) | |
| +-----+-----+ +-----+-----+ +----------------------+ |
| +-------+-------+ |
| v |
| +------------------------------------------------------+ |
| | Session Engine | |
| | +---------+ +-----------+ +------------------+ | |
| | | Clock | | Portfolio | | Order Executor | | |
| | | (frozen) | | Engine | | | | |
| | +---------+ +-----------+ +------------------+ | |
| +------------------------+-------------------------------+ |
| v |
| +------------------------------------------------------+ |
| | Date Ceiling Enforcer | |
| | (strips data after sim clock time) | |
| +------------------------+-------------------------------+ |
| v |
| +------------------------------------------------------+ |
| | Plugin Registry | |
| | +----------+ +---------+ +-------+ +------+ +---------+ | |
| | | Price | | S3 News | | EDGAR | | FRED | | Finnhub | | |
| | |(S3 Parqt)| | + Fund. | |(filing| |(econ)| |(fallback| | |
| | | | | + Sent. | | | | | | news+fd)| | |
| | +----------+ +---------+ +-------+ +------+ +---------+ | |
| +------------------------------------------------------+ |
| | |
| +------------------------v-------------------------------+ |
| | SQLite (per-session DB) | |
| | ~/.finlet/sessions/{id}/session.db | |
| +------------------------------------------------------+ |
+-------------------------------------------------------------+
API Reference
Base URL: http://localhost:8000 (local) or https://finlet.dev (cloud)
Sessions
| Method | Endpoint | Description |
|---|---|---|
| POST | /sessions | Create a new session |
| GET | /sessions | List all sessions |
| GET | /sessions/{id} | Get session state |
| DELETE | /sessions/{id} | End a session |
| POST | /sessions/{id}/configure | Update session config |
Clock
| Method | Endpoint | Description |
|---|---|---|
| GET | /sessions/{id}/clock | Current clock state |
| POST | /sessions/{id}/clock/freeze | Freeze the clock |
| POST | /sessions/{id}/clock/step | Step by interval |
| POST | /sessions/{id}/clock/step-to | Step to a specific time |
| POST | /sessions/{id}/clock/play | Start continuous mode |
| POST | /sessions/{id}/clock/stop | Stop continuous mode |
Market Data
| Method | Endpoint | Description |
|---|---|---|
| GET | /sessions/{id}/market/price | Price data (OHLCV) |
| POST | /sessions/{id}/market/search-news | Search news articles |
| GET | /sessions/{id}/market/fundamentals | Company fundamentals |
| GET | /sessions/{id}/market/filings | SEC filings |
| GET | /sessions/{id}/market/economic | Economic indicators |
Trading
| Method | Endpoint | Description |
|---|---|---|
| POST | /sessions/{id}/trade/order | Submit an order |
| GET | /sessions/{id}/trade/orders | List orders |
| GET | /sessions/{id}/trade/orders/{oid} | Order detail |
| DELETE | /sessions/{id}/trade/orders/{oid} | Cancel an order |
Portfolio
| Method | Endpoint | Description |
|---|---|---|
| GET | /sessions/{id}/portfolio | Portfolio state + metrics |
| GET | /sessions/{id}/portfolio/history | Equity curve time series |
| GET | /sessions/{id}/trace | Reasoning trace log |
Full interactive docs at /docs when the server is running.
MCP Tools
When connected via MCP, your AI agent can call the tools below. The table is not the full inventory: the Quick Start, for example, also uses create_session.
| Tool | Description |
|---|---|
| get_price_data | Fetch OHLCV price data for a ticker |
| search_news | Search news articles by keyword/ticker |
| get_fundamentals | Get company financial fundamentals |
| get_filings | Retrieve SEC filings |
| get_economic_data | Get economic indicators (GDP, CPI, etc.) |
| submit_order | Place a buy/sell order |
| get_portfolio | View current portfolio state |
| get_sim_time | Check current simulation time |
| advance_time | Step the simulation clock forward |
| freeze_time | Freeze the simulation clock |
CLI Reference
The finlet binary is the minimal bootstrap shell. All agent operations (session create / list / delete, advance, order, portfolio, status, benchmark *) go through the MCP tool surface invoked by your agent host — not the CLI. See docs/decisions/v2-pure-mcp-migration.md for the v2.0.0 rationale.
finlet serve # Start API server + dashboard (operator surface)
finlet mcp serve # Stdio bridge to hosted /mcp — wire into your agent host
finlet mcp serve --self-host # Local FastMCP server for self-host operators
finlet auth login # OAuth 2.1 + PKCE browser bootstrap
finlet auth logout # Clear ~/.finlet/credentials
finlet auth status # Report local credential state
finlet register --email EMAIL # Mint a bootstrap api_key (CI / headless)
finlet plugins list # Show plugin status
finlet plugins add NAME # Configure a plugin
--api-key TEXT # API key for the plugin
finlet plugins remove NAME # Clear a plugin's credentials
finlet manual [TOPIC] # Local markdown topic viewer
finlet --version # Diagnostics
Tech Stack
- Python 3.12+ — Async throughout, type hints everywhere
- FastAPI — REST API with automatic OpenAPI docs
- MCP SDK — Model Context Protocol server for LLM integration
- SQLite — Per-session database via aiosqlite + SQLModel
- TradingView Lightweight Charts — Dashboard charting (MIT, via CDN)
Contributing
- Clone the repo
- Install in dev mode: pip install -e ".[dev]"
- Run tests: pytest
- Follow the code conventions in CLAUDE.md
Key rules:
- All datetimes in UTC internally
- Async functions everywhere — no sync I/O in the hot path
- Error messages must be specific and actionable
- Every plugin response goes through the date ceiling enforcer
- Mock external APIs in tests — no real network calls
Disclaimers
Finlet is provided for educational and research purposes only. It is not financial advice and should not be used as the basis for any investment decisions.
Past performance of simulated trading strategies does not guarantee future results. All market data is historical and provided by third-party sources subject to their respective terms of service.
Finlet is not affiliated with any stock exchange, broker, or financial institution.
License
Proprietary