Downloads data and metadata from GEO and SRA and creates standard PEPs.

Project description

geofetch is a command-line tool that downloads sequencing data and metadata from GEO and SRA and creates metadata tables in standard PEP format. geofetch is hosted on PyPI. You can convert the result of geofetch into unmapped BAM or FASTQ files with the included sraconvert command.

Key geofetch features:

  • Works with GEO and SRA metadata
  • Combines samples from different projects
  • Standardizes output metadata
  • Filters type and size of processed files (from GEO) before downloading them
  • Easy to use
  • Fast execution time
  • Can search GEO to find relevant data
  • Can be used either as a command-line tool or from within Python using an API
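As a rough illustration of the command-line use mentioned above, the following sketch builds a geofetch call from Python and runs it only if the tool is on the PATH. The helper names `build_geofetch_command` and `run_geofetch` are hypothetical, the accession `GSE95654` is only an example, and the `-i` and `--just-metadata` options are assumed to behave as described in the geofetch documentation, so check `geofetch --help` for your installed version.

```python
import shutil
import subprocess


def build_geofetch_command(accession, just_metadata=True):
    """Return the argument list for a geofetch call; nothing is executed here.

    Assumption: `-i` names the GEO/SRA accession and `--just-metadata`
    skips the raw data download, as in the geofetch documentation.
    """
    cmd = ["geofetch", "-i", accession]
    if just_metadata:
        cmd.append("--just-metadata")
    return cmd


def run_geofetch(accession, just_metadata=True):
    """Run geofetch if it is on PATH; return its exit code, or None if absent."""
    if shutil.which("geofetch") is None:
        return None
    cmd = build_geofetch_command(accession, just_metadata)
    return subprocess.run(cmd, check=False).returncode


if __name__ == "__main__":
    # Only prints the command; call run_geofetch(...) to actually execute it.
    print(" ".join(build_geofetch_command("GSE95654")))
```

Building the argument list separately from running it makes the command easy to inspect or log before any download starts.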

Docs


Documentation: https://pep.databio.org/geofetch/

Source Code: https://github.com/pepkit/geofetch/


Installation

To install geofetch, run:

pip install geofetch

or install the latest version from the GitHub repository:

pip install git+https://github.com/pepkit/geofetch.git
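After either install command, one way to confirm which version ended up in the current environment is to query the package metadata from Python. This is a minimal sketch using only the standard library; the helper name `installed_version` is hypothetical and not part of geofetch.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package):
    """Return the installed version string of `package`, or None if absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("geofetch")
    if found is None:
        print("geofetch is not installed in this environment")
    else:
        print(f"geofetch {found}")
```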

How to cite:

https://doi.org/10.1093/bioinformatics/btad069

@article{10.1093/bioinformatics/btad069,
    author = {Khoroshevskyi, Oleksandr and LeRoy, Nathan and Reuter, Vincent P and Sheffield, Nathan C},
    title = "{GEOfetch: a command-line tool for downloading data and standardized metadata from GEO and SRA}",
    journal = {Bioinformatics},
    volume = {39},
    number = {3},
    pages = {btad069},
    year = {2023},
    month = {03},
    abstract = "{The Gene Expression Omnibus has become an important source of biological data for secondary analysis. However, there is no simple, programmatic way to download data and metadata from Gene Expression Omnibus (GEO) in a standardized annotation format. To address this, we present GEOfetch—a command-line tool that downloads and organizes data and metadata from GEO and SRA. GEOfetch formats the downloaded metadata as a Portable Encapsulated Project, providing universal format for the reanalysis of public data. GEOfetch is available on Bioconda and the Python Package Index (PyPI).}",
    issn = {1367-4811},
    doi = {10.1093/bioinformatics/btad069},
    url = {https://doi.org/10.1093/bioinformatics/btad069},
    eprint = {https://academic.oup.com/bioinformatics/article-pdf/39/3/btad069/49407404/btad069.pdf},
}

Download files

Source Distribution

geofetch-0.12.7.tar.gz (47.8 kB)

Uploaded Source

Built Distribution

geofetch-0.12.7-py3-none-any.whl (41.5 kB)

Uploaded Python 3

File details

Details for the file geofetch-0.12.7.tar.gz.

File metadata

  • Download URL: geofetch-0.12.7.tar.gz
  • Upload date:
  • Size: 47.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.6

File hashes

Hashes for geofetch-0.12.7.tar.gz
Algorithm Hash digest
SHA256 c7820852dcb5a619a510616b0ca80f402a6a2a60f95d081ea7e174b387ec3e36
MD5 9379f4fd50920c658ca7069e71fc999b
BLAKE2b-256 54bcf17b9b521ba279f1691a183f1a8e94fc43c5ee302fc52881e0a21ada05d8
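The digests listed above can be used to check a downloaded archive before installing it. The sketch below is one way to do that with Python's standard library. It assumes the source distribution was saved in the current directory under its original file name, and the helper names `sha256_of_file` and `matches_expected` are hypothetical.

```python
import hashlib
import os

# SHA256 digest listed above for geofetch-0.12.7.tar.gz.
EXPECTED_SHA256 = "c7820852dcb5a619a510616b0ca80f402a6a2a60f95d081ea7e174b387ec3e36"
# Assumption: the archive sits in the current working directory.
ARCHIVE = "geofetch-0.12.7.tar.gz"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """True if the file's SHA256 equals the expected hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    if os.path.exists(ARCHIVE):
        print("digest matches:", matches_expected(ARCHIVE, EXPECTED_SHA256))
    else:
        print(f"{ARCHIVE} not found; download it first")
```

Reading in chunks keeps memory use flat regardless of file size. pip can perform a similar check itself when a requirements file pins hashes.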

File details

Details for the file geofetch-0.12.7-py3-none-any.whl.

File metadata

  • Download URL: geofetch-0.12.7-py3-none-any.whl
  • Upload date:
  • Size: 41.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.6

File hashes

Hashes for geofetch-0.12.7-py3-none-any.whl
Algorithm Hash digest
SHA256 039249b38e4553a460780b59cd8bdc9a4b5ff9337b6d6d1bc912bf93c35ab947
MD5 63f5fa661da48701ba4ab75d37c4e45e
BLAKE2b-256 7c5a200693e6fe0804385bedae2fce4bd775ec1a652f006274e754dac6ca98a5
