Self-hosted bibliometric data preparation tool (Scopus + Web of Science)
Project description
Self-hosted, reproducible bibliometric data preparation for Web of Science & Scopus.
BibexPy v2.0.0 "Helium" merges, filters, harmonizes, enriches and exports WoS + Scopus records through a local web interface — with full provenance — and keeps your licensed exports on your own machine. It prepares analysis-ready datasets for VOSviewer, Biblioshiny, BibTeX, RIS, Excel and more.
Install
pip install bibexpy
bibexpy # launches the local web UI (browser opens automatically)
Requires Python 3.10+ only — no Node.js/npm needed (the interface ships precompiled inside the package). Works on Windows, macOS and Linux.
bibexpy --port 8080 # custom port
bibexpy --no-browser # server only
bibexpy --storage ./data # custom storage folder
bibexpy --version
Defaults: UI at http://127.0.0.1:6060, data under ~/.bibexpy/storage, settings/API keys
under ~/.bibexpy/.env (managed from the in-app Settings page). Press Ctrl+C to stop.
Highlights (v2)
- One-click Smart Merge — probabilistic record linkage (DOI + Jaro–Winkler), confidence scoring, optional borderline review, and a copy-ready methodology paragraph.
- ORCID-first author disambiguation + address harmonization (organization roll-up, country standardization).
- Multi-source enrichment (CrossRef, OpenAlex, Scopus, DataCite, Unpaywall, Europe PMC, Semantic Scholar) with reverse-DOI recovery — verifiable sources only.
- Reproducible, preset-based filtering and a bibliometrically weighted quality dashboard.
- Full provenance: append-only audit log, snapshots, isolated analyses, auto-generated methodology narrative.
- Structured export: WoS, VOSviewer TSV, BibTeX, RIS, CSV, TSV, XLSX.
Links
Website · Docs · GitHub · Paper (SoftwareX)
Citation
Kara, B. C., Şahin, A., & Dirsehan, T. (2025). BibexPy: Harmonizing the bibliometric symphony of Scopus and Web of Science. SoftwareX, 30, 102098. https://doi.org/10.1016/j.softx.2025.102098
License
GPL-3.0-or-later
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file bibexpy-2.0.0-py3-none-any.whl.
File metadata
- Download URL: bibexpy-2.0.0-py3-none-any.whl
- Upload date:
- Size: 12.2 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
aa68496cd6a236f206359397e22b4f51cbcf4fcdb39a3976aa05cdb900d9249e
|
|
| MD5 |
a03b4ea49d68ddb30a1d9e29aca41be5
|
|
| BLAKE2b-256 |
94d4f606fc3eaba92d9fd61fe5a801f61be8d5f26d740897c7eeb4dd2c227ba1
|
Provenance
The following attestation bundles were made for bibexpy-2.0.0-py3-none-any.whl:
Publisher:
release.yml on bcankara/BibexPy
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
bibexpy-2.0.0-py3-none-any.whl -
Subject digest:
aa68496cd6a236f206359397e22b4f51cbcf4fcdb39a3976aa05cdb900d9249e - Sigstore transparency entry: 1770722228
- Sigstore integration time:
-
Permalink:
bcankara/BibexPy@b66f16a38370bdc1e0e784ecb7df8f5e02981a17 -
Branch / Tag:
refs/tags/v2.0.0 - Owner: https://github.com/bcankara
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@b66f16a38370bdc1e0e784ecb7df8f5e02981a17 -
Trigger Event:
push
-
Statement type: