Skip to main content

Python library to access Gene Expression Omnibus Database (GEO)

Project description

https://img.shields.io/pypi/v/GEOparse.svg https://img.shields.io/travis/guma44/GEOparse.svg

Python library to access Gene Expression Omnibus Database (GEO).

GEOparse is python package that can be used to query and retrieve data from Gene Expression Omnibus database (GEO). The inspiration and the base for it is great R library GEOquery.

Features

  • Download GEO series, datasets etc. as SOFT files

  • Download supplementary files for GEO series to use them locally

  • Load GEO SOFT as easy to use and manipulate objects

  • Prepare your data for GEO upload

Installation

At the command line:

$ pip install GEOparse

TODO

There is still work to do so any contribution is welcome. Any bug/error that you report will improve the library.

The main issues are:

  • add checking for compatibility with SOFT files

  • expand GEOTypes objects with useful functions for differential expression analysis

  • share your idea

  • add more tests - that’s always good idea :)

History

1.0.2 (2017-11-01)

  • Bugfix: type name was depended on the order of entries

1.0.1 (2017-08-10)

  • Hotfix: wrong path split in Windows

1.0.0 (2017-07-21)

  • Many small bug fixes: * unknown subset types added to columns * silent=True is really silent * correct treatment of duplicated columns * illegal file names and no filtering of user input from GEO to create the file names * platform was not imported but used * fixed issues of python 2 and 3 compatibility

  • Logging replaced stdout and stderr + ability to set verbosity and log file

  • Return downloaded paths from download functions

  • Updated documentation according to Google docstring style guide

  • Tests update

  • Code refactored to be more PEP-8 friendly

0.1.10 (2017-03-27)

  • Important fix for SRA download

  • Fix duplicated columns issue

  • Python 2 and 3 open compatibility

0.1.9 (2017-03-10)

  • Added property phenotype_data to access phenotype data of GSE

  • Fixed windows issue with file names

  • replaced default download function with wgetter

  • Update documentation

  • Various bugfixes

0.1.8 (2016-11-02)

Thanks to Tycho Bismeijer:

  • Python 3 Compatibility

  • Bio.Entrez dependency optional

0.1.7 (2016-05-30)

Thanks to Simon van Heeringen:

  • bugfix in datasets with multiple associated relations

  • –split-files to fastq-dump to support paired-end experiments by default

  • parse a GPL that also contains series and sample information

  • gsm2fastq command to make download easier

  • initial Aspera download support

0.1.6 (2016-04-12)

  • Bugfixes

  • SRA function of GSE can now filter GSMs

0.1.5 (2016-02-03)

  • Added functions to download supplementary files including raw files from SRA

0.1.4 (2015-09-27)

  • Updated documentation including example

  • Updated tests: they now cover 80% of library with all important functions

  • Added pivot_and_annotate method to GSE object

  • Bugfixes

0.1.3 (2015-08-30)

  • Updated documentation

  • Added pivot_samples to GSE object

  • Code of GEOTypes was refactored

  • All objects now have to_soft function

  • Various bugfixes

0.1.2 (2015-08-23)

  • Added GDS support

  • Added to_soft methods to GSE, GSM and GPL

  • Added DATABASE entry support to GSE and GDS

0.1.1 (2015-08-16)

  • Brown-Bag release

0.1.0 (2015-08-16)

  • First release on PyPI.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

GEOparse-1.0.2.tar.gz (42.8 MB view details)

Uploaded Source

Built Distribution

GEOparse-1.0.2-py2.py3-none-any.whl (23.1 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file GEOparse-1.0.2.tar.gz.

File metadata

  • Download URL: GEOparse-1.0.2.tar.gz
  • Upload date:
  • Size: 42.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for GEOparse-1.0.2.tar.gz
Algorithm Hash digest
SHA256 c75eaac6171e1cbc5eece05a280a2ccd62e2cd397760bb162bec3255d8d5d271
MD5 2ebc997e8d3182f70a8569a0234c4a72
BLAKE2b-256 eed131670c423104645081f0b6f2d96eecb4554e98c6425ac165bc55261db357

See more details on using hashes here.

File details

Details for the file GEOparse-1.0.2-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for GEOparse-1.0.2-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 cac90dc8e88845610c2522fea8caafefb6e3934cbf2711d25044c6bdbb422df3
MD5 e75b994a81a08b504a9cd1914fb1dff9
BLAKE2b-256 fd96816e6a1e098fedf0ab72a7100868e00b4cb7dd30fd579008e4a79eac46a3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page