Skip to main content

Toolkit for civic paper production using Python and LaTeX

Project description

civic-paperkit

PyPI Python versions License: MIT CI Status Docs

Automatically download research data linked to your bibliography references.

civic-paperkit fetches datasets, CSVs, and supplementary materials referenced in your academic papers. It reads your .bib file and a companion metadata file to archive all your sources.


Installation

pip install civic-paperkit

Quick Start

  1. Create a refs_meta.yaml file mapping your BibTeX keys to data sources:
# refs_meta.yaml
cdc_pmdr:
  notes: "CDC Maternal Mortality Rates 2018-2022"
  assets:
    - url: "https://data.cdc.gov/api/views/e2d5-ggg7/rows.csv"
      filename: "maternal_mortality.csv"

smith2024:
  assets:
    - page_url: "https://example.org/supplementary"
      allow_ext: [".csv", ".xlsx", ".zip"]
  1. Run the tool:
ci-paperkit --bib paper/refs.bib --meta paper/refs_meta.yaml
  1. Find your data in data/raw/<bibkey>/

Features

  • Download direct file URLs (CSV, Excel, PDF, etc.)
  • Scrape pages for data files
  • Organize downloads by citation key
  • Checksum verification (optional)
  • Configurable output directories

Requirements

  • Python 3.12+
  • A BibTeX file with your references
  • A YAML metadata file mapping references to data sources

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

civic_paperkit-0.0.2.tar.gz (43.6 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

civic_paperkit-0.0.2-py3-none-any.whl (15.9 kB view details)

Uploaded Python 3

File details

Details for the file civic_paperkit-0.0.2.tar.gz.

File metadata

  • Download URL: civic_paperkit-0.0.2.tar.gz
  • Upload date:
  • Size: 43.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for civic_paperkit-0.0.2.tar.gz
Algorithm Hash digest
SHA256 9b678c8f2cd56109d9126e02b3245e29fbfd1001bf41c6cc8517bfb031714b68
MD5 c23e886ddb1604aa94024c20f408d423
BLAKE2b-256 9c3a681e5184d6bdc5b0b84f1e2dc1cad115a4af5b73a2505a8e0ab7e59b0b83

See more details on using hashes here.

Provenance

The following attestation bundles were made for civic_paperkit-0.0.2.tar.gz:

Publisher: release.yml on civic-interconnect/civic-paperkit

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file civic_paperkit-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: civic_paperkit-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 15.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for civic_paperkit-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 2bb91335c26cfd06082246044b33e409e0212993b2aced7e88ba6064de5af7df
MD5 50c9e5d9f8887da6039fe78cdc73aa5d
BLAKE2b-256 cbcee69dc61fa5d86605bba3e92a7424139a24172c5c382ab1ee3636858d3e98

See more details on using hashes here.

Provenance

The following attestation bundles were made for civic_paperkit-0.0.2-py3-none-any.whl:

Publisher: release.yml on civic-interconnect/civic-paperkit

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page