CiteSleuth

Verify citations, spot hallucinations.

CiteSleuth ingests BibTeX, DOCX, PDF, or plain text references and verifies them against public scholarly data providers. It produces machine-readable JSON and human-readable Markdown reports with suggested fixes.

Install

pip install citesleuth
# or from source
python -m pip install -e .

Usage

citesleuth verify refs.bib
citesleuth verify paper.pdf --format md
citesleuth verify paper.docx --out report.json
citesleuth verify refs.bib --write-bib refs.verified.bib --apply-fixes
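For scripted use, the commands above can be driven from Python. The following is a minimal sketch, assuming the citesleuth executable is on PATH and that --out writes a JSON report as in the example above. The helper names build_verify_command and run_verify are hypothetical, and the report schema is not specified here, so the result is loaded as opaque JSON.

```python
import json
import subprocess
from pathlib import Path


def build_verify_command(source, out=None, fmt=None):
    """Assemble a `citesleuth verify` argument list (hypothetical helper)."""
    cmd = ["citesleuth", "verify", str(source)]
    if fmt is not None:
        cmd += ["--format", fmt]
    if out is not None:
        cmd += ["--out", str(out)]
    return cmd


def run_verify(source, out):
    """Run the CLI and return the parsed report.

    Assumption: the file written to `out` is JSON; its fields are not
    documented in this README, so no keys are accessed here.
    """
    subprocess.run(build_verify_command(source, out=out), check=True)
    return json.loads(Path(out).read_text(encoding="utf-8"))
```

Passing the arguments as a list rather than a shell string avoids quoting problems with file names that contain spaces.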

Common flags:

  • --offline: use cache only
  • --no-cache: disable cache writes
  • --max-rps: global cap on requests per second
  • --provider-delay: per-provider delay, e.g. semanticscholar=1.1,dblp=2.0,openlibrary=0.5
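The --provider-delay value is a comma-separated list of provider=delay pairs. As an illustration of that format only (this is not CiteSleuth's own parser, and parse_provider_delay is a hypothetical name), such a string could be turned into a mapping like this:

```python
def parse_provider_delay(spec):
    """Parse 'name=delay,name=delay' into a dict of floats (illustrative only)."""
    delays = {}
    for item in spec.split(","):
        item = item.strip()
        if not item:
            # Tolerate a trailing comma or empty segment.
            continue
        name, sep, value = item.partition("=")
        if not sep or not name.strip():
            raise ValueError(f"expected name=delay, got {item!r}")
        # float() raises ValueError on a missing or non-numeric delay.
        delays[name.strip()] = float(value)
    return delays


delays = parse_provider_delay("semanticscholar=1.1,dblp=2.0,openlibrary=0.5")
```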

Tests

python -m pip install -e ".[test]"
pytest
cargo test -p citesleuth-core

Rust + Node

Core matching logic lives in crates/citesleuth-core. Node bindings are in crates/citesleuth-napi.

cargo test -p citesleuth-core
cd crates/citesleuth-napi
npm install
npm run build

Quick Node usage:

const citesleuth = require('citesleuth')
const result = citesleuth.normalizeTitle('Attention Is All You Need')
console.log(result)
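The README does not describe what normalizeTitle returns. As a rough idea of what title normalization for citation matching commonly involves, here is a Python sketch that folds accents, lowercases, and strips punctuation. This is an assumption for illustration, not the algorithm in crates/citesleuth-core, and normalize_title is a hypothetical name.

```python
import re
import unicodedata


def normalize_title(title):
    """Illustrative title normalization; not the citesleuth-core algorithm."""
    # Decompose accented characters and drop the combining marks.
    text = unicodedata.normalize("NFKD", title)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    # Lowercase, then replace each run of non-alphanumerics with one space.
    text = re.sub(r"[^a-z0-9]+", " ", text.lower())
    return text.strip()
```

Comparing normalized strings makes a match robust to differences in case, punctuation, and spacing between a reference list and a provider's record.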

Notes

  • Text-based PDFs only (no OCR in v1).
  • Semantic Scholar is most reliable with an API key. Set SEMANTIC_SCHOLAR_API_KEY or pass --api-key-semanticscholar.
  • Open Library asks clients to send a descriptive User-Agent string; use --user-agent to override the default.
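A wrapper script that accepts both the flag and the environment variable has to pick one when both are set. The sketch below assumes an explicitly passed value takes precedence over SEMANTIC_SCHOLAR_API_KEY. That ordering is an assumption, not documented CiteSleuth behavior, and resolve_semantic_scholar_key is a hypothetical helper.

```python
import os


def resolve_semantic_scholar_key(cli_value=None, environ=None):
    """Return the API key to use, or None if none is configured.

    Assumed precedence: explicit value first, then the environment variable.
    """
    env = os.environ if environ is None else environ
    if cli_value:
        return cli_value
    # Treat an empty variable the same as an unset one.
    return env.get("SEMANTIC_SCHOLAR_API_KEY") or None
```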

Download files

Source Distribution

citesleuth-0.1.1.tar.gz (26.7 kB)

Uploaded Source

Built Distribution

citesleuth-0.1.1-py3-none-any.whl (23.6 kB)

Uploaded Python 3

File details

Details for the file citesleuth-0.1.1.tar.gz.

File metadata

  • Download URL: citesleuth-0.1.1.tar.gz
  • Size: 26.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for citesleuth-0.1.1.tar.gz
Algorithm Hash digest
SHA256 d63e9a5068fcf6e9b9e232d16623b7296da9d52149f3c8f967438e68bb8ad074
MD5 2dcd174bad504a8f72947d11b03cae54
BLAKE2b-256 353ac6b98938d49c14f5c1145befe2269b8c16979d46e9ae45ff5a4600ebaadf
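A downloaded file can be checked against the SHA256 digest listed above with Python's standard hashlib module. This is a minimal sketch: the helper names are hypothetical, and the path passed to verify_file is assumed to be wherever the archive was saved locally.

```python
import hashlib

# SHA256 digest listed above for citesleuth-0.1.1.tar.gz.
EXPECTED_SHA256 = "d63e9a5068fcf6e9b9e232d16623b7296da9d52149f3c8f967438e68bb8ad074"


def sha256_hex(data):
    """Return the hex SHA256 digest of a bytes object."""
    return hashlib.sha256(data).hexdigest()


def verify_file(path, expected_hex=EXPECTED_SHA256):
    """Return True if the file at `path` hashes to `expected_hex`.

    Reads the whole file into memory, which is fine for an archive
    of a few tens of kilobytes.
    """
    with open(path, "rb") as fh:
        return sha256_hex(fh.read()) == expected_hex.lower()
```

pip can perform the same check at install time when a requirements file pins hashes and is installed with --require-hashes.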

Provenance

The following attestation bundles were made for citesleuth-0.1.1.tar.gz:

Publisher: release.yml on Haayhur/CiteSleuth

File details

Details for the file citesleuth-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: citesleuth-0.1.1-py3-none-any.whl
  • Size: 23.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for citesleuth-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 4a596dde5b888195081a5513714d59286ae2f039b3d48d8257562cdf0640b86d
MD5 9ef9e7df36b524a3ef72d1a12e7d5a83
BLAKE2b-256 cedd9f13004ad67d2cfb920415a563091e0cc8d957e819de72b2d6ba2ad21dd5

Provenance

The following attestation bundles were made for citesleuth-0.1.1-py3-none-any.whl:

Publisher: release.yml on Haayhur/CiteSleuth
