Skip to main content

Data Race Detector for Free-Threading Python

Project description

Threadcheck

Python License Platform CI Ruff

Python data race detector for the free-threading (no-GIL) era. Detects concurrent access to shared mutable state in multi-threaded Python programs through static analysis and runtime instrumentation.


Problem

Python 3.14 (2026) introduces free-threading, removing the Global Interpreter Lock (GIL). This enables true parallel execution of multi-threaded code, but the ecosystem lacks debugging tools for concurrency bugs. Go has -race, C++ has ThreadSanitizer, Java has SpotBugs. Python has nothing comparable without recompiling the interpreter with Clang and TSan.

threadcheck is a pure-Python race detector that installs with pip and works out of the box across Linux, Windows, and macOS.


Features

  • Static analysis -- scans AST for shared mutable state (global, nonlocal, class attributes, module-level lists/dicts) and missing lock protection
  • Runtime detection -- instruments code via AST transformation at import time; tracks memory accesses with vector clocks and detects happens-before violations (read-write and write-write races)
  • Lock-aware suppression -- understands threading.Lock/RLock and with-based synchronization; supports nested locks (with lock1, lock2:) with automatic per-lock vector clock tracking
  • Cross-module analysis -- two-pass scan collects Thread(target=...) and executor.submit/map targets across all files in a directory
  • Confidence scoring -- each warning tagged HIGH / MEDIUM / LOW based on thread context and lock coverage
  • Configuration -- .threadcheckignore file (gitignore-style patterns + file:line suppression) and [tool.threadcheck] section in pyproject.toml
  • Multiple output formats -- terminal (grouped by file, colorized), JSON, SARIF v2.1.0, and self-contained HTML report
  • pytest plugin -- automatic race detection during test execution via --threadcheck flag
  • Free-threading compatibility checker -- threadcheck compat scans installed packages for C extensions and checks FT ABI tags

Installation

pip install threadcheck

Requires Python 3.12+. Python 3.14+ is recommended for free-threading features.


Quick Start

Static Analysis

Scan a file or directory for potential race conditions without running any code:

threadcheck scan my_project/

Output (grouped by file, with severity icons and per-file summary):

  [1/2] my_project/counter.py
  --------------------------------
    [!] HIGH [unsafe_global] line 8:8
          Global variable `counter` modified without lock
          suggestion: Use `threading.Lock()` to protect `counter`
    [i] LOW [thread_usage] line 10:11
          Thread creation detected (target=increment)

  [2/2] my_project/worker.py
  --------------------------------
    [!] HIGH [shared_mutable] line 15:8
          Module-level mutable object `results.append()` called from multiple threads

Total: 2 issue(s) in 2 file(s) (0 error(s), 2 warning(s), 0 info(s))

Runtime Detection

Execute a script with instrumentation to detect actual data races:

threadcheck run my_script.py

Output for a racing script:

Data races detected:

  [!] `counter`
    |-- Thread-28928 (write) at my_script.py:8
    |-- Thread-9888 (write) at my_script.py:8
    \-- No happens-before relationship between accesses
       (10000 overlapping accesses)

A script protected with locks reports:

No data races detected

Free-threading Compatibility Check

Check whether your project's dependencies support free-threading:

threadcheck compat

Output:

threadcheck compat - Free-threading compatibility check
Python 3.13.10

  [OK] numpy                 C extension has free-threading tag (cp313t-)
  [??] torch                 C extension without free-threading tag
  [OK] pytest                pure Python, no C extensions

Total: 3 package(s) - 2 compatible, 1 need verification, 0 not installed

HTML Report

threadcheck scan my_project/ -o report.html

Generates a self-contained HTML report with dark/light theme, sortable table, and summary cards.

Quiet / Verbose Modes

threadcheck scan my_project/ -q     # one-line summary only
threadcheck scan my_project/ -v     # include source code snippets
threadcheck scan my_project/        # default grouped output

Configuration

.threadcheckignore

Create a .threadcheckignore file in your project root (gitignore-style patterns):

# Ignore generated files
generated/*.py
build/*.py

# Ignore specific lines in a specific file
src/legacy.py:42          # suppress line 42
src/legacy.py:50-60       # suppress lines 50-60

# Negation (do not ignore)
*.py
!important.py

pyproject.toml

[tool.threadcheck]
ignore = [
    "build/*",
    "generated/*.py",
]

Both sources are merged automatically.


Output Formats

Flag Format Use Case
(default) Terminal (colorized, grouped) Interactive use
-q One-line summary Quick status
-v Terminal + source snippets Debugging
--json JSON CI pipelines
--sarif SARIF v2.1.0 GitHub CodeQL integration
-o report.html Self-contained HTML Team reports

Commands

Command Description
scan <path> Static race analysis of file or directory
run <script> Execute script with runtime race detection
compat [path] Check free-threading compatibility of dependencies

Library Usage

from threadcheck import analyze_file, analyze_path

warnings = analyze_file("my_module.py")
warnings = analyze_path("src/")

for w in warnings:
    print(f"{w.file}:{w.line} [{w.confidence.value}] {w.message}")
from threadcheck.dynamic.tracker import ThreadCheckTracker
from threadcheck.dynamic.transform import transform_and_compile

code = transform_and_compile(source, "script.py")
ThreadCheckTracker.start()
exec(code, {"_threadcheck_tracker": ThreadCheckTracker})
ThreadCheckTracker.stop()

print(ThreadCheckTracker.format_races())
ThreadCheckTracker.reset()

pytest Integration

python -m pytest tests/ --threadcheck

The plugin hooks into pytest_runtest_call and reports race warnings as test failures.


Architecture

Static Analysis Pipeline

  1. Parse source into AST
  2. Identify shared mutable state: globals, nonlocals, class attributes (self.x), module-level mutable objects (lists, dicts, sets)
  3. Detect thread creation sites (threading.Thread, executor.submit/map)
  4. Cross-reference with lock usage (with lock:, nested with lock1, lock2:)
  5. Assign confidence: HIGH (thread target, no lock), MEDIUM (thread present, no lock), LOW (suspicious pattern, no thread context)
  6. Report findings with repair suggestions

Runtime Detection Pipeline

  1. Parse source into AST
  2. Identify shared variables per function scope (global, nonlocal declarations)
  3. Transform AST: inject read_before() for reads and write_before() for writes of shared variables; inject lock_acquire()/lock_release() around with blocks
  4. Compile and execute transformed code under a tracker that maintains per-thread vector clocks
  5. On lock acquire, merge the lock's clock into the thread's clock (happens-before)
  6. On lock release, save the thread's clock to the lock
  7. After execution, scan access log for conflicting operations (concurrent writes OR write-read pairs with no happens-before relationship)
  8. Report detected races with thread IDs, source locations, and overlap counts

Free-threading Compatibility Check

  1. Parse project dependencies from pyproject.toml or requirements.txt
  2. For each installed package, scan for C extension files (.pyd/.so)
  3. If no C extensions found -> COMPATIBLE
  4. If C extension filename contains free-threading ABI tag (cp313t-/cpython-313t-) -> COMPATIBLE
  5. Otherwise -> NEEDS_VERIFICATION

Project Structure

threadcheck/
|-- pyproject.toml
|-- src/threadcheck/
|   |-- __init__.py
|   |-- __main__.py
|   |-- _version.py           # single version source
|   |-- _tid.py               # platform thread ID (swapped per-platform in CI)
|   |-- cli.py                # argument parsing + dispatch
|   |-- config.py             # .threadcheckignore + pyproject.toml loader
|   |-- pytest_plugin.py      # --threadcheck flag for pytest
|   |-- static/
|   |   |-- analyzer.py       # static analysis entry point
|   |   |-- visitors.py       # 5 AST visitors
|   |   |-- lock_tracker.py   # lock usage analysis
|   |   \-- models.py         # RaceWarning, Severity, Confidence
|   |-- dynamic/
|   |   |-- __main__.py       # run_script entry point
|   |   |-- transform.py      # AST transformation engine
|   |   |-- tracker.py        # runtime tracker with vector clocks
|   |   |-- clock.py          # vector clock implementation
|   |   \-- hook.py           # sys.meta_path import hook
|   |-- compat/
|   |   |-- checker.py        # C extension FT tag scanner
|   |   \-- models.py         # FTCompatResult, CompatStatus
|   \-- reporting/
|       |-- formatter.py      # terminal / JSON output
|       |-- sarif.py          # SARIF v2.1.0 output
|       \-- html.py           # HTML report output
|-- tests/
|   |-- fixtures/             # 11 fixture files with known races
|   |-- test_static_analyzer.py
|   |-- test_dynamic_detector.py
|   |-- test_formatter.py
|   |-- test_sarif.py
|   |-- test_compat.py
|   |-- test_config.py
|   \-- test_pytest_plugin.py
|-- demo/
|   |-- race_example.py       # sample with intentional races
|   \-- run_demo.py           # demo runner for all output formats
\-- README.md

Platform Support

Platform Python CI
Linux (x86_64) 3.12, 3.13, 3.14 ubuntu-latest
Windows (amd64) 3.12, 3.13, 3.14 windows-latest
macOS (ARM64) 3.12, 3.13, 3.14 macos-latest

Thread IDs: uses native_id (gettid) on Linux/macOS, threading.get_ident() on Windows.


Roadmap

  • v0.0.1.2 (current): Round A (core gap fill) + Round B (DX) -- static and dynamic analysis, lock tracking, cross-module analysis, pytest plugin, FT compat checker, HTML reports, configuration, enhanced output
  • v0.2.0 (next): Round C -- Thread.join() happens-before, threading.Atomic support, function call chain tracking, deadlock detection
  • v1.0.0 (future): Round D -- GitHub Action, pre-commit hook, VS Code integration, stable API

Limitations

  • Static analysis may produce false positives (reports a race that cannot occur at runtime) and false negatives (misses races involving indirect sharing through aliases or containers)
  • Runtime detection modifies the AST before execution; code that introspects its own source or frame objects may behave differently
  • Runtime instrumentation incurs overhead (approximately 2-5x slowdown for typical code)
  • Lock tracking supports threading.Lock, threading.RLock, and standard with-based patterns; other synchronization primitives (threading.Event, threading.Condition, third-party libraries) are not tracked
  • Cross-module analysis handles Thread(target=...) and executor.submit/map but does not perform full inter-procedural data-flow analysis

License

MIT


Contributing

Contributions are welcome. Please open an issue or submit a pull request on GitHub.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

threadcheck-0.0.1.2.tar.gz (47.4 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

threadcheck-0.0.1.2-py3-none-win_amd64.whl (34.3 kB view details)

Uploaded Python 3Windows x86-64

threadcheck-0.0.1.2-py3-none-manylinux_2_28_x86_64.whl (34.1 kB view details)

Uploaded Python 3manylinux: glibc 2.28+ x86-64

threadcheck-0.0.1.2-py3-none-macosx_11_0_arm64.whl (34.0 kB view details)

Uploaded Python 3macOS 11.0+ ARM64

File details

Details for the file threadcheck-0.0.1.2.tar.gz.

File metadata

  • Download URL: threadcheck-0.0.1.2.tar.gz
  • Upload date:
  • Size: 47.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for threadcheck-0.0.1.2.tar.gz
Algorithm Hash digest
SHA256 9becf82ffe5cdd50cc5acf650073ba440540d7bae77088b519e63c9c1ace5959
MD5 52295c86161b572417c3ee5ebef13fa3
BLAKE2b-256 e3507ba31cf7d230c137207645100b17887f20f21af746bed25d4ebb81723599

See more details on using hashes here.

Provenance

The following attestation bundles were made for threadcheck-0.0.1.2.tar.gz:

Publisher: release.yml on ChidcGithub/Threadcheck

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file threadcheck-0.0.1.2-py3-none-win_amd64.whl.

File metadata

  • Download URL: threadcheck-0.0.1.2-py3-none-win_amd64.whl
  • Upload date:
  • Size: 34.3 kB
  • Tags: Python 3, Windows x86-64
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for threadcheck-0.0.1.2-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 72cae754a4c423ee1da527f61ff9edc8199b7200f6c4adecff4befbb0f772c7c
MD5 0576c215f6aee711396554524c36235e
BLAKE2b-256 080cc37d02e2981111845d91b2d8481444e6b363001b5ae07c5b58dad10c5a02

See more details on using hashes here.

Provenance

The following attestation bundles were made for threadcheck-0.0.1.2-py3-none-win_amd64.whl:

Publisher: release.yml on ChidcGithub/Threadcheck

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file threadcheck-0.0.1.2-py3-none-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for threadcheck-0.0.1.2-py3-none-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 c66eb15420471066bd991bc201529ad0a23f2c111582c222f99b0bb14aa868c0
MD5 5f271c9eba44047c0a87dd5408acdcb7
BLAKE2b-256 317f14a5f32e94cf46835c108423e3f35fc8ed1eb0d2603424acfd924916e929

See more details on using hashes here.

Provenance

The following attestation bundles were made for threadcheck-0.0.1.2-py3-none-manylinux_2_28_x86_64.whl:

Publisher: release.yml on ChidcGithub/Threadcheck

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file threadcheck-0.0.1.2-py3-none-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for threadcheck-0.0.1.2-py3-none-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 62cf2ca3d16da3a769ab1289db3e70c32cc09a41c5a7e7c341d3f65cc6ae148b
MD5 732eda642cf251b41a0751880ff07006
BLAKE2b-256 51cf6cada2eb79dafa89a5047e73a5c45e721501c56c524e90d19a45cbf5eb68

See more details on using hashes here.

Provenance

The following attestation bundles were made for threadcheck-0.0.1.2-py3-none-macosx_11_0_arm64.whl:

Publisher: release.yml on ChidcGithub/Threadcheck

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page