Skip to main content

Reproducibility simplified.

Project description

Calkit

Calkit simplifies reproducibility, acting as a layer on top of Git, DVC, Zenodo, and more, such that all all aspects of the research process can be fully described in a single repository.

Why does reproducibility matter?

If your work is reproducible, that means that someone else can "run" it and get the same results or outputs. This is a major step towards addressing the replication crisis and has some major benefits for both you as an individual and the research community:

  1. You will avoid mistakes caused by, e.g., running an old version of a script and including a figure that wasn't created after fixing a bug in the data processing pipeline.
  2. Since your project is "runnable," it's more likely that someone else will be able to reuse part of your work to run it in a different context, thereby producing a bigger impact and accelerating the pace of discovery. If someone can take what you've done and use it to calculate a prediction, you have just produced truly useful knowledge.

Why another tool/platform?

Git, GitHub, DVC, Zenodo et al. are amazing tools/platforms, but their use involves multiple fairly difficult learning curves. Our goal is to provide a single tool and platform to unify all of these so that there is a single, gentle learning curve. However, it is not our goal to hide or replace these underlying components. Advanced users can use them directly, but new users aren't forced to, which helps them get up and running with less effort and training. Calkit should help users understand what is going on under the hood without forcing them to work at that lower level of abstraction.

Installation

Simply run

pip install calkit-python

Cloud integration

The Calkit cloud platform (https://calkit.io) serves as a project management interface and a DVC remote for easily storing all versions of your data/code/figures/publications, interacting with your collaborators, reusing others' research artifacts, etc.

After signing up, visit the settings page and create a token. Then run

calkit config set token ${YOUR_TOKEN_HERE}

Then, inside a project repo you'd like to connect to the cloud, run

calkit config setup-remote

This will setup the Calkit DVC remote, such that commands like dvc push will allow you to push versions of your data or pipeline outputs to the cloud for safe storage and sharing with your collaborators.

How it works

Calkit creates a simple human-readable "database" inside the calkit.yaml file, which serves as a way to store important information about the project, e.g., what question(s) it seeks to answer, what files should be considered datasets, figures, publications, etc. The Calkit cloud reads this database and registers the various entities as part of the entire ecosystem such that if a project is made public, other researchers can find and reuse your work to accelerate their own.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

calkit_python-0.0.5.tar.gz (13.5 kB view details)

Uploaded Source

Built Distribution

calkit_python-0.0.5-py3-none-any.whl (16.0 kB view details)

Uploaded Python 3

File details

Details for the file calkit_python-0.0.5.tar.gz.

File metadata

  • Download URL: calkit_python-0.0.5.tar.gz
  • Upload date:
  • Size: 13.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.6

File hashes

Hashes for calkit_python-0.0.5.tar.gz
Algorithm Hash digest
SHA256 5923daa69e190eba25187631647fed81f98a501a421f9f381b27751a945e90ba
MD5 cc6c2e131f24c91e2611b7b3e2455107
BLAKE2b-256 da5278aac7370350e913de69618f8babdf1f46197b7aaaa64c31f1762ffe6c3b

See more details on using hashes here.

File details

Details for the file calkit_python-0.0.5-py3-none-any.whl.

File metadata

File hashes

Hashes for calkit_python-0.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 c5b40504f67fd0000de3a042a3d657f32c2d2ce329c0e3c5b41a99c3d5ce83c3
MD5 ae62314d2cab109e334ef8606391f5a7
BLAKE2b-256 4779c1d064e30fb0d6a4cb021c225c44792d3db3f9c79bf904a9fd0ee5217eca

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page