Skip to main content

Tailor: a local-first MCP framework that lets any MCP-speaking AI work with your own data — your Wardrobe stays on your machine, with server-side analytics, consent gates, cost gates, and a durable audit log.

Project description

Tailor — your AI works with your data, on your machine

Tailor is a personal AI server with research-grade trust. A local-first framework that lets any MCP-speaking AI (Claude Desktop, Cline, Cursor, or a local model via Ollama) work with your own data — without that data leaving your machine. Every action your AI takes gets recorded in a durable audit log; every result is stamped for reproducibility.

It also turns a $200/month AI bill into a $2/month one — and makes the AI materially better at your question. Most analytical questions resolve to a server-computed summary instead of a raw-stream dump (return the answer, not the data). Daily workflows that would burn hundreds of dollars a month against a hosted LLM run for single digits through Tailor, because the AI's context window goes to reasoning over your question and your prior work instead of shuffling streams it would have to re-aggregate itself.

Today the worked-out recipe is health research — the first recipe shipped end-to-end, not the platform's identity. Future recipes (knowledge work, quantified self, household, creative archives) compose on the same engine.

Your Wardrobe is what Tailor governs on your behalf: the structured collection of your data and prior analytical work that lives entirely on your machine. Not clothes — your stuff. Your Wardrobe accumulates themes (questions you keep returning to), moments (observations worth remembering), evidence (data that grounds your themes), audit history (every action your AI took on your behalf), and the source data itself. Tailor curates your Wardrobe — adds to it, retrieves from it, governs how the AI reaches into it — and never sends any of it to a service you didn't choose.

Install

uv tool install tailor-mcp

Bootstrap your first project with tailor pilot (multi-subject CSV setup wizard) or tailor tour (a guided walkthrough on bundled synthetic fixtures from the HIP Lab realistic demo). No data leaves your machine at any point. The bundled HIP Lab CSV fixtures (S001S016) shipped inside the wheel are synthetic by construction — random-walk traces sized to mimic real cohort shapes, not real participant data.

Architecture

LLM client <--> RouterMCP (validate -> circuit break -> consent -> cost
                           -> execute -> PHI scrub -> audit + provenance)
                   |                  \           \
              ChildMCP                  VaultLayer  LocalLLMLayer
   (one per data source                 (reorientation  (local-LLM
    e.g. CSV directory,                  tier;           guardian; opt-in
    Strava API, FHIR bundle)             Obsidian        via user_config)
                                         vault + index)

Children ship in the framework today:

  • csv_dir — wrap a local directory of per-subject CSV files; 7 tools (file detail, summary report, cohort summary, force decline, downsampled stream, raw stream, file list)
  • running — Strava API wrapper as a worked example; 12 tools across pace, heart rate, GPS, run reports, trend reports
  • template — runnable starting point for new data sources; copy + rename to wrap your own data

Three-tier access model

Tailor enforces data minimization server-side, not in the AI's prompt:

Tier What the AI sees Gate
1 — Free Server-computed reports (splits, cohort summaries, decline metrics) None
2 — Consent Downsampled streams (5–30s intervals) Per-domain biometric consent
3 — Cost Per-timestamp streams Consent + cost approval

Most analytical questions resolve at Tier 1 — zero raw biometric data leaves the machine, and the AI's context goes to reasoning rather than to data shuffling.

Security pipeline

Every tool call passes through six layers, cheapest first:

  1. Parameter validation — type/range/pattern, reject before any work
  2. Circuit breaker — block domain after 3 consecutive failures
  3. Consent gate — per-domain biometric consent, revocable
  4. Cost gate — pre-estimate tokens before execution
  5. PHI scrubber — institutional PHI-stripping seam (no-op default; subclass per child)
  6. Audit log + token ledger — every call logged to SQLite

Every successful result also carries a _meta block stamped with package version, tool name, UTC timestamp, domain, tier, scrubber identifier, and token counts — minimum-viable provenance for results that may end up in a paper.

Problems Tailor is built against

  1. Data governance. Hosted LLMs are the wrong home for sensitive participant data. The tier model and local-first processing are the structural response.
  2. Reproducibility. LLM-assisted analyses in chat windows leave no durable trace. The audit log and _meta provenance stamps make every result traceable.
  3. Longitudinal analytical memory. Observations made in one session disappear when the chat ends. The Wardrobe (themes, moments, evidence, append-only) is the response.
  4. AI economics. Tier-1 server-side computation — return the answer, not the data — is simultaneously a cost lever (token-per-question collapses by 1–2 orders of magnitude on most analytical questions) and a cognition lever (freed context goes to reasoning over the analyst's prior work, not to data shuffling). The same architectural choice that satisfies the data-governance problem also makes the AI materially better at the question and reduces cost-per-question by 10–100×.

Where to read more

The project landing page at https://saahasmuthineni.github.io/tailor-mcp-landing/ describes the project's stage and audience. The source repository is currently in invited evaluation; full design documentation (34 numbered ADRs, design notes, roadmap) is private until Tailor completes its first beachhead deployment with a research lab.


Built by Saahas Muthineni. If you received this URL personally and have questions, reply through whatever channel he sent it through.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tailor_mcp-7.0.13.tar.gz (1.4 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tailor_mcp-7.0.13-py3-none-any.whl (1.3 MB view details)

Uploaded Python 3

File details

Details for the file tailor_mcp-7.0.13.tar.gz.

File metadata

  • Download URL: tailor_mcp-7.0.13.tar.gz
  • Upload date:
  • Size: 1.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for tailor_mcp-7.0.13.tar.gz
Algorithm Hash digest
SHA256 dbaadf6bee55f3c6d2ffa7cb9c2d5f90f473d8c2c8791f8d5846cd9536d91966
MD5 cd9ad7472a339881e010e46c050d2fea
BLAKE2b-256 a9b264bdfe5df6a7cba60a54c87260816d29ee285224db2a04aa224a1c868ffb

See more details on using hashes here.

File details

Details for the file tailor_mcp-7.0.13-py3-none-any.whl.

File metadata

  • Download URL: tailor_mcp-7.0.13-py3-none-any.whl
  • Upload date:
  • Size: 1.3 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for tailor_mcp-7.0.13-py3-none-any.whl
Algorithm Hash digest
SHA256 68494d1112deedf6fe71861d69526774059ae8a85b06d90b6a64268d7d0be6ab
MD5 6e3569567b8105ef8654ede3f8da38c8
BLAKE2b-256 b92430841ae506a805638b39c9848a8e510b46afc373b893be53d4130eb8a541

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page