Continuation-free membership inference on closed language models via sample self-concentration

These details have not been verified by PyPI

Project links

Project description

leakit

Continuation-free membership inference for closed language models.

leakit tells you whether a document was likely in a model's training set using nothing but its sampling API. No logits, no log-probabilities, and -- unlike prior sampling attacks such as SaMIA -- no need to know the document's true continuation. You give it the opening of a document; it samples several continuations and measures how much they agree with each other. Training documents pull the model's continuation distribution toward the memorised text, so the samples concentrate; novel documents leave the distribution diffuse.

This is the reference implementation of the self-concentration attack from the paper "Leak It: Continuation-Free Membership Inference on Closed Language Models via Sample Self-Concentration."

Install

curl -fsSL https://raw.githubusercontent.com/victormaricato/leakit/main/install.sh | bash

or, directly, with any of:

uv tool install leakit       # recommended
pipx install leakit
pip install leakit

Use

leakit talks to any OpenAI-compatible endpoint. Set the API key for the service you are probing -- the key maps to whatever provider --base-url points at -- then run it.

export LEAKIT_API_KEY="sk-..."        # or OPENAI_API_KEY

# OpenAI
leakit --model gpt-4o-mini suspect.txt

# Anything OpenAI-compatible (OpenRouter, Anthropic compat route, vLLM, Together, local server)
leakit --model anthropic/claude-3.5-sonnet \
       --base-url https://openrouter.ai/api/v1 \
       --api-key-env OPENROUTER_API_KEY \
       -n 32 suspect.txt

# Compare a candidate against known non-member documents (relative percentile)
leakit --model gpt-4o-mini --calibrate clean/*.txt suspect.txt

# Pipe text in, get JSON out
cat article.txt | leakit --model gpt-4o-mini --json

Output:

document     score  samples
-----------------------------
suspect.txt  0.4213   32/32

A higher score means the sampled continuations agree more, which correlates with membership. The absolute scale is model-dependent, so interpret scores relatively: score several documents together, or use --calibrate with a set of documents you know were not in training to get a percentile.

Key options

Flag	Meaning	Default
`--model`	model id passed to the API	required
`--base-url`	OpenAI-compatible endpoint	OpenAI
`--api-key-env`	env var holding the key	`LEAKIT_API_KEY`, then `OPENAI_API_KEY`
`-n, --samples`	continuations per document	16
`--max-tokens`	tokens per continuation	64
`--temperature`	sampling temperature	1.0
`--prefix-chars`	chars of each doc used as the prefix (0 = whole doc)	256
`--statistic`	`word-jaccard` (parameter-free) or `kgram`	`word-jaccard`
`--mode`	`chat` (closed APIs) or `completion` (base models)	`chat`
`--calibrate`	non-member baseline file(s) for a percentile	off
`--json`	machine-readable output	off

For base/text-completion models (e.g. self-hosted Pythia/Llama base), use --mode completion to sample the raw continuation distribution. For chat/instruct models, the default chat mode asks the model to continue the passage verbatim.

Python API

from leakit import LeakIt

scorer = LeakIt(model="gpt-4o-mini", n_samples=32)   # reads LEAKIT_API_KEY/OPENAI_API_KEY
result = scorer.score(open("suspect.txt").read())
print(result.score, result.n_returned)

The raw statistics are exposed too:

from leakit import self_concentration_word_jaccard
self_concentration_word_jaccard(["a b c", "a b c", "x y z"])

Responsible use

leakit is a privacy-auditing and red-teaming tool: use it to test models you own or are authorised to assess. The self-concentration signal is a statistical indicator, not proof of membership; calibrate before drawing conclusions.

License

MIT.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.0

Jul 2, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

leakit-0.1.0.tar.gz (14.8 kB view details)

Uploaded Jul 2, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

leakit-0.1.0-py3-none-any.whl (12.4 kB view details)

Uploaded Jul 2, 2026 Python 3

File details

Details for the file leakit-0.1.0.tar.gz.

File metadata

Download URL: leakit-0.1.0.tar.gz
Upload date: Jul 2, 2026
Size: 14.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.10.13

File hashes

Hashes for leakit-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`4c17187610d93831287a1055673c6cd394aa12a4cbee95342e8275de2ff5f3a5`
MD5	`b8687344e768bf84ed3747dd3b7d0f0a`
BLAKE2b-256	`420348713e2910b743f3cccbcbb9ea0880a61b5d5f32dc05590707f34a14393b`

See more details on using hashes here.

File details

Details for the file leakit-0.1.0-py3-none-any.whl.

File metadata

Download URL: leakit-0.1.0-py3-none-any.whl
Upload date: Jul 2, 2026
Size: 12.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.10.13

File hashes

Hashes for leakit-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`839a833f03467958a89f8900e89096991d90d3d1902ee24303b2b04093c2988e`
MD5	`8c3ef6f447092285611642f71d3a0d9a`
BLAKE2b-256	`01e9528eb5a324b86e059dec11c93a5c68a65a9b5d93a09e5d9740364d2a4a12`

See more details on using hashes here.

leakit 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

leakit

Install

Use

Key options

Python API

Responsible use

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes