Skip to main content

Data Race Detector for Free-Threading Python

Project description

Threadcheck

Python License Platform CI Ruff

Python data race detector for the free-threading (no-GIL) era. Detects concurrent access to shared mutable state in multi-threaded Python programs through static analysis and runtime instrumentation.


Problem

Python 3.14 (2026) introduces free-threading, removing the Global Interpreter Lock (GIL). This enables true parallel execution of multi-threaded code, but the ecosystem lacks debugging tools for concurrency bugs. Go has -race, C++ has ThreadSanitizer, Java has SpotBugs. Python has nothing comparable without recompiling the interpreter with Clang and TSan.

threadcheck is a pure-Python race detector that installs with pip and works out of the box across Linux, Windows, and macOS.


Features

  • Static analysis -- scans AST for shared mutable state (global, nonlocal, class attributes, module-level lists/dicts) and missing lock protection
  • Runtime detection -- instruments code via AST transformation at import time; tracks memory accesses with vector clocks and detects happens-before violations (read-write and write-write races)
  • Lock-aware suppression -- understands threading.Lock/RLock and with-based synchronization; supports nested locks (with lock1, lock2:) with automatic per-lock vector clock tracking
  • Cross-module analysis -- two-pass scan collects Thread(target=...) and executor.submit/map targets across all files in a directory
  • Confidence scoring -- each warning tagged HIGH / MEDIUM / LOW based on thread context and lock coverage
  • Configuration -- .threadcheckignore file (gitignore-style patterns + file:line suppression) and [tool.threadcheck] section in pyproject.toml
  • Multiple output formats -- terminal (grouped by file, colorized), JSON, SARIF v2.1.0, and self-contained HTML report
  • pytest plugin -- automatic race detection during test execution via --threadcheck flag
  • Free-threading compatibility checker -- threadcheck compat scans installed packages for C extensions and checks FT ABI tags

Installation

pip install threadcheck

Requires Python 3.12+. Python 3.14+ is recommended for free-threading features.


Quick Start

Static Analysis

Scan a file or directory for potential race conditions without running any code:

threadcheck scan my_project/

Output (grouped by file, with severity icons and per-file summary):

  [1/2] my_project/counter.py
  --------------------------------
    [!] HIGH [unsafe_global] line 8:8
          Global variable `counter` modified without lock
          suggestion: Use `threading.Lock()` to protect `counter`
    [i] LOW [thread_usage] line 10:11
          Thread creation detected (target=increment)

  [2/2] my_project/worker.py
  --------------------------------
    [!] HIGH [shared_mutable] line 15:8
          Module-level mutable object `results.append()` called from multiple threads

Total: 2 issue(s) in 2 file(s) (0 error(s), 2 warning(s), 0 info(s))

Runtime Detection

Execute a script with instrumentation to detect actual data races:

threadcheck run my_script.py

Output for a racing script:

Data races detected:

  [!] `counter`
    |-- Thread-28928 (write) at my_script.py:8
    |-- Thread-9888 (write) at my_script.py:8
    \-- No happens-before relationship between accesses
       (10000 overlapping accesses)

A script protected with locks reports:

No data races detected

Free-threading Compatibility Check

Check whether your project's dependencies support free-threading:

threadcheck compat

Output:

threadcheck compat - Free-threading compatibility check
Python 3.13.10

  [OK] numpy                 C extension has free-threading tag (cp313t-)
  [??] torch                 C extension without free-threading tag
  [OK] pytest                pure Python, no C extensions

Total: 3 package(s) - 2 compatible, 1 need verification, 0 not installed

HTML Report

threadcheck scan my_project/ -o report.html

Generates a self-contained HTML report with dark/light theme, sortable table, and summary cards.

Quiet / Verbose Modes

threadcheck scan my_project/ -q     # one-line summary only
threadcheck scan my_project/ -v     # include source code snippets
threadcheck scan my_project/        # default grouped output

Configuration

.threadcheckignore

Create a .threadcheckignore file in your project root (gitignore-style patterns):

# Ignore generated files
generated/*.py
build/*.py

# Ignore specific lines in a specific file
src/legacy.py:42          # suppress line 42
src/legacy.py:50-60       # suppress lines 50-60

# Negation (do not ignore)
*.py
!important.py

pyproject.toml

[tool.threadcheck]
ignore = [
    "build/*",
    "generated/*.py",
]

Both sources are merged automatically.


Output Formats

Flag Format Use Case
(default) Terminal (colorized, grouped) Interactive use
-q One-line summary Quick status
-v Terminal + source snippets Debugging
--json JSON CI pipelines
--sarif SARIF v2.1.0 GitHub CodeQL integration
-o report.html Self-contained HTML Team reports

Commands

Command Description
scan <path> Static race analysis of file or directory
run <script> Execute script with runtime race detection
compat [path] Check free-threading compatibility of dependencies

Library Usage

from threadcheck import analyze_file, analyze_path

warnings = analyze_file("my_module.py")
warnings = analyze_path("src/")

for w in warnings:
    print(f"{w.file}:{w.line} [{w.confidence.value}] {w.message}")
from threadcheck.dynamic.tracker import ThreadCheckTracker
from threadcheck.dynamic.transform import transform_and_compile

code = transform_and_compile(source, "script.py")
ThreadCheckTracker.start()
exec(code, {"_threadcheck_tracker": ThreadCheckTracker})
ThreadCheckTracker.stop()

print(ThreadCheckTracker.format_races())
ThreadCheckTracker.reset()

pytest Integration

python -m pytest tests/ --threadcheck

The plugin hooks into pytest_runtest_call and reports race warnings as test failures.


Architecture

Static Analysis Pipeline

  1. Parse source into AST
  2. Identify shared mutable state: globals, nonlocals, class attributes (self.x), module-level mutable objects (lists, dicts, sets)
  3. Detect thread creation sites (threading.Thread, executor.submit/map)
  4. Cross-reference with lock usage (with lock:, nested with lock1, lock2:)
  5. Assign confidence: HIGH (thread target, no lock), MEDIUM (thread present, no lock), LOW (suspicious pattern, no thread context)
  6. Report findings with repair suggestions

Runtime Detection Pipeline

  1. Parse source into AST
  2. Identify shared variables per function scope (global, nonlocal declarations)
  3. Transform AST: inject read_before() for reads and write_before() for writes of shared variables; inject lock_acquire()/lock_release() around with blocks
  4. Compile and execute transformed code under a tracker that maintains per-thread vector clocks
  5. On lock acquire, merge the lock's clock into the thread's clock (happens-before)
  6. On lock release, save the thread's clock to the lock
  7. After execution, scan access log for conflicting operations (concurrent writes OR write-read pairs with no happens-before relationship)
  8. Report detected races with thread IDs, source locations, and overlap counts

Free-threading Compatibility Check

  1. Parse project dependencies from pyproject.toml or requirements.txt
  2. For each installed package, scan for C extension files (.pyd/.so)
  3. If no C extensions found -> COMPATIBLE
  4. If C extension filename contains free-threading ABI tag (cp313t-/cpython-313t-) -> COMPATIBLE
  5. Otherwise -> NEEDS_VERIFICATION

Project Structure

threadcheck/
|-- pyproject.toml
|-- src/threadcheck/
|   |-- __init__.py
|   |-- __main__.py
|   |-- _version.py           # single version source
|   |-- _tid.py               # platform thread ID (swapped per-platform in CI)
|   |-- cli.py                # argument parsing + dispatch
|   |-- config.py             # .threadcheckignore + pyproject.toml loader
|   |-- pytest_plugin.py      # --threadcheck flag for pytest
|   |-- static/
|   |   |-- analyzer.py       # static analysis entry point
|   |   |-- visitors.py       # 5 AST visitors
|   |   |-- lock_tracker.py   # lock usage analysis
|   |   \-- models.py         # RaceWarning, Severity, Confidence
|   |-- dynamic/
|   |   |-- __main__.py       # run_script entry point
|   |   |-- transform.py      # AST transformation engine
|   |   |-- tracker.py        # runtime tracker with vector clocks
|   |   |-- clock.py          # vector clock implementation
|   |   \-- hook.py           # sys.meta_path import hook
|   |-- compat/
|   |   |-- checker.py        # C extension FT tag scanner
|   |   \-- models.py         # FTCompatResult, CompatStatus
|   \-- reporting/
|       |-- formatter.py      # terminal / JSON output
|       |-- sarif.py          # SARIF v2.1.0 output
|       \-- html.py           # HTML report output
|-- tests/
|   |-- fixtures/             # 11 fixture files with known races
|   |-- test_static_analyzer.py
|   |-- test_dynamic_detector.py
|   |-- test_formatter.py
|   |-- test_sarif.py
|   |-- test_compat.py
|   |-- test_config.py
|   \-- test_pytest_plugin.py
|-- demo/
|   |-- race_example.py       # sample with intentional races
|   \-- run_demo.py           # demo runner for all output formats
\-- README.md

Platform Support

Platform Python CI
Linux (x86_64) 3.12, 3.13, 3.14 ubuntu-latest
Windows (amd64) 3.12, 3.13, 3.14 windows-latest
macOS (ARM64) 3.12, 3.13, 3.14 macos-latest

Thread IDs: uses native_id (gettid) on Linux/macOS, threading.get_ident() on Windows.


Roadmap

  • v0.0.1.2a1 (current): Round A (core gap fill) + Round B (DX) -- static and dynamic analysis, lock tracking, cross-module analysis, pytest plugin, FT compat checker, HTML reports, configuration, enhanced output
  • v0.2.0 (next): Round C -- Thread.join() happens-before, threading.Atomic support, function call chain tracking, deadlock detection
  • v1.0.0 (future): Round D -- GitHub Action, pre-commit hook, VS Code integration, stable API

Limitations

  • Static analysis may produce false positives (reports a race that cannot occur at runtime) and false negatives (misses races involving indirect sharing through aliases or containers)
  • Runtime detection modifies the AST before execution; code that introspects its own source or frame objects may behave differently
  • Runtime instrumentation incurs overhead (approximately 2-5x slowdown for typical code)
  • Lock tracking supports threading.Lock, threading.RLock, and standard with-based patterns; other synchronization primitives (threading.Event, threading.Condition, third-party libraries) are not tracked
  • Cross-module analysis handles Thread(target=...) and executor.submit/map but does not perform full inter-procedural data-flow analysis

License

MIT


Contributing

Contributions are welcome. Please open an issue or submit a pull request on GitHub.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

threadcheck-0.0.1.2a6.tar.gz (44.9 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

threadcheck-0.0.1.2a6-py3-none-win_amd64.whl (33.5 kB view details)

Uploaded Python 3Windows x86-64

threadcheck-0.0.1.2a6-py3-none-manylinux_2_28_x86_64.whl (33.3 kB view details)

Uploaded Python 3manylinux: glibc 2.28+ x86-64

threadcheck-0.0.1.2a6-py3-none-macosx_11_0_arm64.whl (33.3 kB view details)

Uploaded Python 3macOS 11.0+ ARM64

File details

Details for the file threadcheck-0.0.1.2a6.tar.gz.

File metadata

  • Download URL: threadcheck-0.0.1.2a6.tar.gz
  • Upload date:
  • Size: 44.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for threadcheck-0.0.1.2a6.tar.gz
Algorithm Hash digest
SHA256 d249c8b17c8634103e5500e63336b4087a8c930ba9b727598fdfc886f0831f28
MD5 0b61f1a15d2e03854bd4d30a2d9c6b4d
BLAKE2b-256 bc50f0b76ac711d44f5eddc62c41a4c59c6b06a9bd01d2ee7ae12ada47e2f920

See more details on using hashes here.

Provenance

The following attestation bundles were made for threadcheck-0.0.1.2a6.tar.gz:

Publisher: release.yml on ChidcGithub/Threadcheck

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file threadcheck-0.0.1.2a6-py3-none-win_amd64.whl.

File metadata

File hashes

Hashes for threadcheck-0.0.1.2a6-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 3c4be1ba68dfe3254dd6ff241ab4fab5c6005938d750416f2760cf6c90928962
MD5 23cb6010557c8ef9ddc5cf6861fce824
BLAKE2b-256 f068bdfd0fd1be3c6e4ed94bcef1c26310a7f5b55891ab3d0fa7629955e553b9

See more details on using hashes here.

Provenance

The following attestation bundles were made for threadcheck-0.0.1.2a6-py3-none-win_amd64.whl:

Publisher: release.yml on ChidcGithub/Threadcheck

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file threadcheck-0.0.1.2a6-py3-none-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for threadcheck-0.0.1.2a6-py3-none-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 167b24e60c8f64b7926a8c1c3e197c8659484e7b703157dee10a2b4b6376013e
MD5 f0eedcce3b03b7bc07f3b55a19362e51
BLAKE2b-256 b0bc5438248f960eac367d6936c3d0d1fb2035a2d729e578672571faa1409798

See more details on using hashes here.

Provenance

The following attestation bundles were made for threadcheck-0.0.1.2a6-py3-none-manylinux_2_28_x86_64.whl:

Publisher: release.yml on ChidcGithub/Threadcheck

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file threadcheck-0.0.1.2a6-py3-none-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for threadcheck-0.0.1.2a6-py3-none-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 420f8a6d4e75e5d111a3de3f38e4990b1706611c5e3dc4844a117e3d8d84eb63
MD5 f0d1ba7eaa4b4c4367a5c38144fc8ed5
BLAKE2b-256 44b0b9b4cefef9551e74729b8e714ee9f258c87ef366df397665285facd2524c

See more details on using hashes here.

Provenance

The following attestation bundles were made for threadcheck-0.0.1.2a6-py3-none-macosx_11_0_arm64.whl:

Publisher: release.yml on ChidcGithub/Threadcheck

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page