Download a dataset via DOI or landing page URL

🗃️ DaGet

Simple utility to download datasets from data repositories.

The goal of this project is to explore machine-readable metadata and to learn more about writing Python packages.

⚠️ This script is in early development and needs testing ⚠️

Installation

PyPI

To install daget using pip:

pip install daget

Usage

Download a dataset via DOI or landing page URL:

daget https://doi.org/10.5878/331q-3p13 ./destination

Or use the short-form DOI:

daget 10.5878/331q-3p13 ./destination
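
Both forms above identify the same dataset. As a minimal sketch of how a short-form DOI could be turned into a resolvable URL (the helper name `to_url` and the accepted `doi:` prefix are assumptions for illustration, not daget's actual code):

```python
DOI_RESOLVER = "https://doi.org/"


def to_url(identifier: str) -> str:
    """Return a resolvable URL for a DOI, or pass an existing URL through."""
    identifier = identifier.strip()
    # Full URLs (DOI links or landing pages) are used as given.
    if identifier.startswith(("http://", "https://")):
        return identifier
    # Accept an optional "doi:" prefix (assumption, for convenience).
    if identifier.lower().startswith("doi:"):
        identifier = identifier[4:]
    # All DOIs start with the directory indicator "10.".
    if identifier.startswith("10."):
        return DOI_RESOLVER + identifier
    raise ValueError(f"not a DOI or URL: {identifier!r}")


print(to_url("10.5878/331q-3p13"))
```

Requesting the resulting URL and following redirects would then lead to the repository's landing page.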

Example

$ daget 10.7910/DVN/LFH4H8 ./test
destination:   /home/user/test
landing page:  https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/LFH4H8
 355.4KiB  Annex.pdf
 115.6KiB  List of items for measuring conceptions of democracy.csv
 81.1KiB  List of items for measuring conceptions of democracy.xlsx
 11.5KiB  Script - Part 1 Conceptions of democracy.R
 3.0KiB  Script - Part 2 Analyze items used to measure conceptions of democracy.R
 73.3KiB  Spreadsheet - Conceptions of democracy 10-2021.csv
 64.9KiB  Spreadsheet - Conceptions of democracy 10-2021.xlsx
 704.8KiB  downloaded 
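
The sizes in the output above use binary units (1 KiB = 1024 bytes), and the last line is the sum of the individual files. A small sketch of how such labels could be produced (the function name `format_size` is hypothetical and not taken from daget):

```python
def format_size(num_bytes: int) -> str:
    """Format a byte count with binary units and one decimal place."""
    size = float(num_bytes)
    units = ["B", "KiB", "MiB", "GiB", "TiB"]
    for unit in units[:-1]:
        if size < 1024:
            return f"{size:.1f}{unit}"
        size /= 1024
    # Anything larger is reported in the biggest unit.
    return f"{size:.1f}{units[-1]}"


print(format_size(3072))
```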

Supported data repositories with file metadata

Alternatives

Improve the script

Adding support for additional repositories requires test cases and investigation into how to get file metadata from the landing page.
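
One way to investigate a new repository is to check whether its landing page embeds machine-readable metadata. The sketch below assumes the page contains a schema.org `Dataset` description as JSON-LD with `distribution` entries that carry a `contentUrl`; not every repository does this, and the names `JsonLdParser` and `list_files` are illustrative rather than part of daget.

```python
import json
from html.parser import HTMLParser


class JsonLdParser(HTMLParser):
    """Collect every <script type="application/ld+json"> document in a page."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.documents = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            self.documents.append(json.loads("".join(self._buffer)))


def list_files(html: str) -> list[dict]:
    """Return name and URL for each downloadable file found in the metadata."""
    parser = JsonLdParser()
    parser.feed(html)
    parser.close()
    files = []
    for doc in parser.documents:
        if not isinstance(doc, dict):
            continue
        distribution = doc.get("distribution", [])
        # A single file may be given as an object instead of a list.
        if isinstance(distribution, dict):
            distribution = [distribution]
        for entry in distribution:
            if isinstance(entry, dict) and "contentUrl" in entry:
                files.append({"name": entry.get("name"), "url": entry["contentUrl"]})
    return files
```

If `list_files` returns an empty list for a landing page, the repository probably exposes its file metadata some other way and needs a dedicated test case.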

Please help by testing and reporting issues.

Development

  • Check out this repository and open a terminal in this directory
  • Install dependencies: pip install -e . (an editable install that pulls in the dependencies declared in pyproject.toml)
  • Run the module without installing it: python3 -m daget

TODO

  • Add error handling
  • Check empty destination directory
  • Improve documentation
  • Package script for pip

Download files

Source Distribution

daget-0.7.tar.gz (5.3 kB)

Uploaded Source

Built Distribution

daget-0.7-py3-none-any.whl (6.5 kB)

Uploaded Python 3

File details

Details for the file daget-0.7.tar.gz.

File metadata

  • Download URL: daget-0.7.tar.gz
  • Upload date:
  • Size: 5.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for daget-0.7.tar.gz
Algorithm Hash digest
SHA256 4dde87bcb3cfda362b05a2d19b106773a26e1915af41d35c585cccac684c5a0e
MD5 fce7d378603fee0d4e4c00208ca0bd99
BLAKE2b-256 0b9b5cb35b76d0f6b5473d966f31446bd2f07d94e383692e3044148f25ca3803
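
A downloaded archive can be checked against the SHA256 digest listed above. This is a minimal sketch using only the standard library; the helper name `sha256_of` and the local file path in the comment are assumptions.

```python
import hashlib


def sha256_of(path: str, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Example (assumes the sdist was saved in the current directory):
# expected = "4dde87bcb3cfda362b05a2d19b106773a26e1915af41d35c585cccac684c5a0e"
# assert sha256_of("daget-0.7.tar.gz") == expected
```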

Provenance

The following attestation bundles were made for daget-0.7.tar.gz:

Publisher: python-publish.yml on borsna/daget

File details

Details for the file daget-0.7-py3-none-any.whl.

File metadata

  • Download URL: daget-0.7-py3-none-any.whl
  • Upload date:
  • Size: 6.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for daget-0.7-py3-none-any.whl
Algorithm Hash digest
SHA256 7c13ff144d7b65e8477ae1977414e1f7d7a5d1746ac7560435254d33d0eedd8e
MD5 e438aaaf8deccf4400e69c83ebaba680
BLAKE2b-256 9085a25d8a746495d25c0011f55bac3dc46ab338af2ebc0377d95696438f02fb

Provenance

The following attestation bundles were made for daget-0.7-py3-none-any.whl:

Publisher: python-publish.yml on borsna/daget
