Skip to main content

A toolbox for working with the Allen Brain Atlas genetic data

Project description

This package provides a Python interface for fetching and working with the Allen Human Brain Atlas (AHBA) microarray expression data.

https://circleci.com/gh/rmarkello/abagen.svg?style=shield https://codecov.io/gh/rmarkello/abagen/branch/main/graph/badge.svg https://readthedocs.org/projects/abagen/badge/?version=latest https://img.shields.io/badge/License-BSD%203--Clause-blue.svg https://zenodo.org/badge/DOI/10.5281/zenodo.3451463.svg

Overview

In 2013, the Allen Institute for Brain Science released the Allen Human Brain Atlas, a dataset containing microarray expression data collected from six human brains (Hawrylycz et al., 2012) . This dataset has offered an unprecedented opportunity to examine the genetic underpinnings of the human brain, and has already yielded novel insight into e.g., adolescent brain development and functional brain organization.

However, in order to be effectively used in most analyses, the AHBA microarray expression data often needs to be (1) collapsed into regions of interest (e.g., parcels or networks), and (2) combined across donors. While this may potentially seem trivial, there are a number of analytic choices in these steps that can dramatically influence the resulting data and any downstream analyses. Arnatkevičiūte et al., 2019 provided a thorough treatment of this in a recent manuscript, demonstrating how the techniques and code used to prepare the raw AHBA data have varied widely across published reports.

The current Python package, abagen, aims to provide reproducible workflows for processing and preparing the AHBA microarray expression data for analysis.

Installation requirements

Currently, abagen works with Python 3.6+ and requires a few dependencies:

  • nibabel

  • numpy (>=1.14.0)

  • pandas (>=0.25.0), and

  • scipy

There are some additional (optional) dependencies you can install to speed up some functions:

  • fastparquet, and

  • python-snappy

These latter packages are primarily used to facilitate loading the (rather large!) microarray expression dataframes provided by the Allen Institute,

For detailed information on how to install abagen, including these dependencies, refer to our installation instructions.

Quickstart

At it’s core, using abagen is as simple as:

>>> import abagen
>>> expression = abagen.get_expression_data('myatlas.nii.gz')

where 'myatlas.nii.gz' points to a brain parcellation file.

This function can also be called from the command line with:

$ abagen --output-file expression.csv myatlas.nii.gz

For more detailed instructions on how to use abagen please refer to our user guide!

Development and getting involved

If you’ve found a bug, are experiencing a problem, or have a question about using the package, please head on over to our GitHub issues and make a new issue with some information about it! Someone will try and get back to you as quickly as possible, though please note that the primary developer for abagen (@rmarkello) is a graduate student so responses make take some time!

If you’re interested in getting involved in the project: welcome ✨! We’re thrilled to welcome new contributors. You should start by reading our code of conduct; all activity on abagen should adhere to the CoC. After that, take a look at our contributing guidelines so you’re familiar with the processes we (generally) try to follow when making changes to the repository! Once you’re ready to jump in head on over to our issues to see if there’s anything you might like to work on.

Citing abagen

For up-to-date instructions on how to cite abagen please refer to our documentation.

License Information

This codebase is licensed under the 3-clause BSD license. The full license can be found in the LICENSE file in the abagen distribution.

Reannotated gene information located at abagen/data/reannotated.csv.gz and individualized donor parcellations for the Desikan-Killiany atlas located at abagen/data/native_dk are taken from Arnatkevičiūte et al., 2018 and are separately licensed under the CC BY 4.0; these data can also be found on figshare.

Corrected MNI coordinates used to match AHBA tissues samples to MNI space located at abagen/data/corrected_mni_coordinates.csv are taken from the alleninf package, provided under the 3-clause BSD license.

All microarray expression data is copyrighted under non-commercial reuse policies by the Allen Institute for Brain Science (© 2010 Allen Institute for Brain Science. Allen Human Brain Atlas. Available from: Allen Human Brain Atlas).

All trademarks referenced herein are property of their respective holders.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

abagen-0.1.3.tar.gz (3.7 MB view details)

Uploaded Source

Built Distribution

abagen-0.1.3-py3-none-any.whl (3.7 MB view details)

Uploaded Python 3

File details

Details for the file abagen-0.1.3.tar.gz.

File metadata

  • Download URL: abagen-0.1.3.tar.gz
  • Upload date:
  • Size: 3.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.8.8

File hashes

Hashes for abagen-0.1.3.tar.gz
Algorithm Hash digest
SHA256 22225ce661d082cdbeeee131da2187b4c942d7b610c5a2b312140ab127aaaee6
MD5 0025febb61b87354ba15f66f84ce39d9
BLAKE2b-256 f8235db2a5bf3a116722a7081721382941ae0ea27b985be06efc48aae565f16a

See more details on using hashes here.

File details

Details for the file abagen-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: abagen-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 3.7 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.8.8

File hashes

Hashes for abagen-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 637a1b04c8adb6d6807b00116f686f7964b881b64c6b2a54a599b4bca52d8139
MD5 eae981d5ca2bd76db11dd51a23baafef
BLAKE2b-256 a26cd94d0131bc7cc55db3403b6063e4e2bc9448087b34529ccc113437687d30

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page