Statistical monitoring for legal document revision — detect semantic drift in contracts and policies

These details have not been verified by PyPI

Project links

Project description

LegalDrift

Statistical monitoring for legal document revision.

LegalDrift detects when the substantive meaning of a legal document shifts between versions. It is designed for lawyers, contract managers, and compliance officers who need to verify that a redlined contract, an updated policy, or a renegotiated agreement has not introduced unintended semantic changes.

This is not a replacement for legal review. It is a screening tool to flag passages that merit closer human attention.

When to Use LegalDrift
When Not to Use It
Installation
Quick Start
How It Works
Command-Line Usage
Python API
Localized Drift Detection
Drift History & Audit Logs
Concept Extraction
Statistical Methods
Limitations & Caveats
Contributing
License

When to Use LegalDrift

You might find this useful if you:

Review multiple redline rounds of the same contract and want a sanity check that nothing was silently broadened or narrowed.
Maintain a template library and need to verify that a "minor update" has not altered the substantive scope of a standard clause.
Track regulatory compliance documents (e.g., privacy policies, AI governance frameworks) across jurisdictions where a wording tweak can change legal effect.
Compare an executed agreement against the final draft to confirm no last-minute substitutions occurred.

When Not to Use It

LegalDrift is intentionally narrow in scope. It will not help you with:

Determining whether a change is legally valid or enforceable. It flags that text has shifted; it does not opine on whether the shift is permissible.
Detecting purely formatting or numbering changes. It operates on semantic embeddings, not on character-level diffs. A renumbered Section 3.2 that says the exact same thing will not trigger a drift alert.
Short, low-entropy text. A one-sentence NDA addendum will not yield meaningful statistical results.
Cross-lingual comparison. The current embedding model is English-centric.
Finding typos or grammatical errors. These are invisible to the semantic layer.

If you need a traditional redline tool, use a document comparison suite (e.g., Workshare, Litera Compare, or your document editor's built-in comparison). LegalDrift complements those tools; it does not replace them.

Installation

pip install legaldrift

For development:

git clone https://github.com/OsamaMoftah/LegalDrift.git
cd legaldrift
pip install -e ".[dev]"
pytest

Requirements

Python 3.8 or later
numpy, scipy, scikit-learn, pandas, matplotlib
sentence-transformers (optional; required for the Legal-BERT embedding backend)

If sentence-transformers is unavailable, LegalDrift falls back to a deterministic hash-based embedding. The fallback is faster and offline, but less sensitive to nuanced semantic shifts. For production use, we recommend installing the optional dependency.

Quick Start

1. Detect drift between two contracts

legaldrift detect contract_v1.txt contract_v2.txt

Example output (using the bundled sample contracts):

Drift Detection Results
========================================
Drift Detected: True
P-value: 0.0012
Confidence: 99.88%
Severity: 0.2417
Effect Size: 0.8421

Individual Tests:
  ks_test: p=0.0034
  mannwhitney: p=0.0011
  mmd: p=0.0008
  energy: p=0.0029

A low p-value (typically < 0.05) indicates that the two documents occupy measurably different regions of the embedding space. In plain English: the second version says something meaningfully different from the first.

2. Analyze a single document for legal concepts

legaldrift analyze contract.txt

Output:

Document Analysis
========================================
Document ID: contract
Jurisdiction: US
Word Count: 1,247
Character Count: 8,932

Legal Concepts Detected:
  - data_protection
  - obligation
  - permission
  - transparency

3. Compare section by section

legaldrift chunks contract_v1.txt contract_v2.txt

Output:

Chunked Drift Detection Results
============================================================
🔴 [DRIFT] 3. PAYMENT
    p=0.0034, severity=0.2417

🟢 [OK] 1. DEFINITIONS
    p=0.8912, severity=0.0124

🟡 [NEW] 10. AI GOVERNANCE
    added in current document

This tells you where to look, not just whether something changed.

How It Works

LegalDrift converts each document into a high-dimensional vector using a language model trained on legal text (Legal-BERT). It then treats the collection of vectors from each document as a statistical distribution and asks: are these two distributions the same, or have they diverged?

Four non-parametric tests are run in parallel:

Kolmogorov-Smirnov — compares the overall shape of the distributions.
Mann-Whitney U — tests for location shifts (mean or median drift).
Maximum Mean Discrepancy (MMD) — a kernel-based measure of distributional distance.
Energy Distance — a geometric measure of separation between sample clouds.

The four p-values are combined via Fisher's method. The result is a single p-value and a severity score. There is no machine-learning classifier; there are no training labels. This is pure statistical hypothesis testing, which means the results are interpretable and the false-positive rate is controlled by your chosen threshold (default: 0.05).

For a deeper technical explanation, see docs/architecture.md.

Command-Line Usage

LegalDrift exposes a single CLI entry point with subcommands.

Global flags

Flag	Description
`-j, --jurisdiction`	Default jurisdiction tag (US, EU, DE, UK, etc.)
`--no-legal-bert`	Force the hash-based fallback embedder
`-v, --verbose`	Debug logging

Subcommands

Subcommand	Purpose
`detect`	Full-document drift test between two files
`analyze`	Concept extraction and metadata for one file
`chunks`	Section-by-section drift test
`compare`	Run LegalDrift alongside baseline methods (ADWIN, DDM, HDP)
`history`	Query saved drift records

Examples

# Detect with EU jurisdiction tag
legaldrift -j EU detect privacy_2024.txt privacy_2025.txt

# Save the result to an audit log
legaldrift detect v1.txt v2.txt \
  --history drift.db \
  --notes "Post-AI Act update" \
  --tags gdpr ai-act eu

# Query prior detections
legaldrift history --path drift.db --drift-only --limit 20

Python API

Full-document comparison

from legaldrift import LegalDocument, EmbeddingEngine, DriftDetector

doc1 = LegalDocument(
    text=open("contract_v1.txt").read(),
    document_id="2024-001",
    jurisdiction="DE"
)

doc2 = LegalDocument(
    text=open("contract_v2.txt").read(),
    document_id="2024-001-r1",
    jurisdiction="DE"
)

engine = EmbeddingEngine()
detector = DriftDetector(threshold=0.05)

emb1 = engine.encode([doc1.text])
emb2 = engine.encode([doc2.text])

result = detector.detect(emb1, emb2)

print(f"Drift: {'YES' if result.drift_detected else 'NO'}")
print(f"p-value: {result.p_value:.4f}")
print(f"Severity: {result.severity:.4f}")

Section-level comparison

from legaldrift import chunk_by_sections, align_chunks

chunks1 = chunk_by_sections(doc1)
chunks2 = chunk_by_sections(doc2)

for c1, c2 in align_chunks(chunks1, chunks2):
    if c1 is None:
        print(f"[ADDED] {c2.metadata.get('header', 'Section')}")
        continue
    if c2 is None:
        print(f"[REMOVED] {c1.metadata.get('header', 'Section')}")
        continue

    e1 = engine.encode([c1.text])
    e2 = engine.encode([c2.text])
    r = detector.detect(e1, e2)

    if r.drift_detected:
        print(f"[DRIFT] Section {c1.chunk_index}: p={r.p_value:.4f}")
    else:
        print(f"[OK]    Section {c1.chunk_index}")

Drift history and audit trails

from legaldrift import DriftHistory

history = DriftHistory(path="audit.db", backend="sqlite")

history.save(
    baseline_id="2024-001",
    current_id="2024-001-r1",
    result=result,
    notes="Reviewed by J. Smith, 15 Jan 2025",
    tags=["ai-act", "high-risk"]
)

# Retrieve all drift-positive records
records = history.query(drift_detected=True, limit=50)
for r in records:
    print(r.timestamp, r.baseline_id, r.result["p_value"])

Localized Drift Detection

Full-document comparison has a well-known weakness: a large, mostly unchanged contract can dilute a significant change in one clause. LegalDrift addresses this by splitting documents into semantically coherent chunks (paragraphs, sections, or sentences) and comparing each pair.

Three chunking strategies are available:

chunk_by_paragraphs() — respects paragraph boundaries; merges very short paragraphs.
chunk_by_sections() — splits on numbered or titled headers (e.g., "1. DEFINITIONS", "Article 3"); falls back to paragraph chunking if no headers are found.
chunk_by_sentences() — finest granularity; useful for short agreements or isolated clauses.

Chunk alignment is index-based by default. If you have a custom similarity matrix (e.g., from a clause-matching preprocessor), you can pass it to align_chunks() for greedy nearest-neighbor alignment.

Drift History & Audit Logs

Legal and compliance workflows require reproducibility. LegalDrift can persist every detection run to a JSON file or an SQLite database, along with:

Timestamp (UTC)
Baseline and current document IDs
Full test results (p-values, severity, effect size)
Human-readable notes
Tags for categorization

This turns the tool from an ad-hoc script into a lightweight audit system. The SQLite backend supports indexed queries by document ID, drift status, date range, and tags.

Concept Extraction

LegalDrift includes a lightweight regex-based extractor that flags common legal concept classes without requiring a heavy NLP pipeline:

Concept	Example trigger phrases
`obligation`	"shall", "must", "is required"
`permission`	"may", "is permitted", "has the right"
`prohibition`	"shall not", "must not", "is prohibited"
`data_protection`	"GDPR", "personal data", "privacy"
`high_risk`	"high risk", "conformity assessment"
`automated_decision`	"automated decision", "algorithmic processing"
`transparency`	"transparency", "explainable AI"
`human_oversight`	"human oversight", "meaningful human control"

The extractor is deliberately simple. It is intended as a first-pass triage tool, not as a substitute for a trained legal analyst or a full clause-ontology system.

Statistical Methods

LegalDrift combines four complementary tests. Each test is sensitive to a different kind of distributional shift:

Test	What it detects	Notes
Kolmogorov-Smirnov	Shape differences in cumulative distributions	Non-parametric; works across all PCA-reduced dimensions
Mann-Whitney U	Location shifts (median/mean drift)	Robust to outliers; also run across all dimensions
MMD	General distributional divergence in kernel space	Computationally heavier; permutation-based p-value
Energy Distance	Geometric separation of point clouds	Related to the Wasserstein metric; intuitive geometric interpretation

Fisher's method combines the four p-values into a single chi-squared statistic. This meta-test gains power when multiple individual tests show weak but consistent signals, and it remains valid even if one test is misspecified.

The p-value is not a probability that "the contract changed." It is the probability of observing this much separation between the two embedding distributions if the underlying semantic content had remained identical. A low p-value is therefore evidence against the null hypothesis of "no semantic drift."

Limitations & Caveats

We believe tools that touch legal documents owe their users an honest account of what they cannot do.

Statistical, not legal, significance. A p-value of 0.01 means the embeddings are different. It does not mean the difference is legally material. A shifted comma in a damages cap can be legally catastrophic yet statistically invisible; a stylistic rewrite of a boilerplate clause can be legally trivial yet statistically detectable.
Embedding bias. Legal-BERT was trained on a corpus of US and EU legal text. Its semantic space may not accurately represent non-Western legal traditions, highly technical scientific agreements, or domain-specific jargon (e.g., maritime salvage law, biotech licensing).
Single-sample documents. When comparing one document to one document, the detector has limited statistical power. Chunking improves this, but the tool is most reliable when you have multiple samples per version (e.g., a corpus of standard-form contracts from 2024 versus 2025).
No temporal modeling. The baselines named ADWIN, DDM, and HDP are included for comparative benchmarking. They are not implemented as true streaming change detectors; they are offline two-sample approximations.
No format parsing. LegalDrift expects plain text. You must extract text from PDF, Word, or HTML before ingestion.

If your use case involves high-stakes transactional work or regulatory submissions, treat LegalDrift as a screening layer, not a sign-off layer.

Contributing

We welcome contributions from legal informaticists, data scientists, and practitioners. Please see CONTRIBUTING.md for setup instructions, testing requirements, and our code of conduct.

If you are a legal professional with no programming background but a concrete use case, please open a GitHub Issue. User stories are as valuable as pull requests.

License

MIT License. See LICENSE for the full text.

LegalDrift is provided as-is, without warranty of any kind. The authors and contributors assume no liability for decisions made on the basis of its output. Always consult a qualified legal professional before acting on contract analysis.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.0

Apr 27, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

legaldrift-0.1.0.tar.gz (37.4 kB view details)

Uploaded Apr 27, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

legaldrift-0.1.0-py3-none-any.whl (27.8 kB view details)

Uploaded Apr 27, 2026 Python 3

File details

Details for the file legaldrift-0.1.0.tar.gz.

File metadata

Download URL: legaldrift-0.1.0.tar.gz
Upload date: Apr 27, 2026
Size: 37.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for legaldrift-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`4f4d2e8f2d83b15fb46224c3d5732594342f23eb2133ee608e5255738b2ca51c`
MD5	`c79efc9511384939e084633efb574321`
BLAKE2b-256	`d8dcac1d53a6cfa9f38c6317e2b991739a5d63cbf5d4700b9f98ab006146dc06`

See more details on using hashes here.

File details

Details for the file legaldrift-0.1.0-py3-none-any.whl.

File metadata

Download URL: legaldrift-0.1.0-py3-none-any.whl
Upload date: Apr 27, 2026
Size: 27.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for legaldrift-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`616ebb6c646359f4a5d2f088e5b44c3c02895b3d0f9e80ad8bb8a67191c0d888`
MD5	`b8f7308c07607bfc9650e605534fca7a`
BLAKE2b-256	`9ca73f971e62cbd4cc00abe3ab32a52f2216ddccb1f70a5c0b95b911b639de02`

See more details on using hashes here.

legaldrift 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

LegalDrift

Table of Contents

When to Use LegalDrift

When Not to Use It

Installation

Requirements

Quick Start

1. Detect drift between two contracts

2. Analyze a single document for legal concepts

3. Compare section by section

How It Works

Command-Line Usage

Global flags

Subcommands

Examples

Python API

Full-document comparison

Section-level comparison

Drift history and audit trails

Localized Drift Detection

Drift History & Audit Logs

Concept Extraction

Statistical Methods

Limitations & Caveats

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes