Package for exrtacting, processing and analyzing Intan and OpenEphys data
Project description
See the full documentation here.
Table of contents generated with markdown-toc
blechpy
This is a package to extract, process and analyze electrophysiology data recorded with Intan or OpenEphys recording systems. This package is customized to store experiment and analysis metadata for the BLECh Lab (Katz lab) @ Brandeis University, but can readily be used and customized for other labs.
Installation
I recommend installing miniconda to handle your virtual environments Create a miniconda environment with:
conda create -n blechpy python==3.7.13
conda activate blechpy
Now you can install this package simply with pip:
pip install blechpy
If you want to update blechpy to the latest version:
pip install blechpy -U
Now you can deal with all of your data from within an ipython terminal:
ipython
import blechpy
Ubuntu 20.04 LTS+
With Ubuntu 20 or higher, you will get a segmentation fault when importing blechpy because numba version 0.48 installed through pip is corrupted. You will need to reinstall it via conda
conda install numba=0.48.0
Usage
blechpy handles experimental metadata using data_objects which are tied to a directory encompassing some level of data. Existing types of data_objects include:
- dataset
- object for a single recording session
- experiment
- object encompasing an ordered set of recordings from a single animal
- individual recordings must first be processed as datasets
- project
- object that can encompass multiple experiments & data groups and allow analysis or group differences
Datasets
Right now this pipeline is only compatible with recordings done with Intan's 'one file per channel' or 'one file per signal type' recordings settings.
Starting with a raw dataset
Create dataset
With a brand new shiny recording you can initilize a dataset with:
dat = blechpy.dataset('path/to/recording/directory')
# or
dat = blechpy.dataset() # for user interface to select directory
This will create a new dataset object and setup basic file paths.
If you're working via SSH or just want a command-line interface instead of a GUI you can use the keyword argument shell=True
You should only do this when starting data processing for the first time. If you use it on a processed dataset, it will get overwritten.
Use blechpy.load_dataset() instead to load an existing dataset (see below)
Initialize Parameters
dat.initParams()
Initalizes all analysis parameters with a series of prompts. See prompts for optional keyword params. Primarily setups parameters for:
- Flattening Port & Channel in Electrode designations
- Common average referencing
- Labelling areas of electrodes
- Labelling digital inputs & outputs
- Labelling dead electrodes
- Clustering parameters
- Spike array creation
- PSTH creation
- Palatability/Identity Responsiveness calculations
Initial parameters are pulled from default json files in the dio subpackage. Parameters for a dataset are written to json files in a parameters folder in the recording directory
Useful dat.initParams() arguments:
- data_quality='hp' -increases strictness of clustering, total # of clusters, and spike-sorting window to -0.75 to 1s.
- car_keyword = 'bilateral64' -auto assigns channel mapping to match the Omnetics-connector open ephys 64 channel EIB with 2-site implantation
- car_keyword = '2site_OE64' -auto assigns channel mapping to match Hirose-connector Open Ephys 64 channel EIB with 2-site implantation
- shell = True -bypasses GUI interface in favor of shell interface, useful if working over SSH or GUI is broken
Basic Processing
The most basic data extraction workflow would be:
dat = blechpy.dataset('/path/to/data/dir/')
dat.initParams() # See fucntion docstring, lots of optional parameters to eliminate need for user interaction
dat.extract_data() # Extracts raw data into HDF5 store
dat.create_trial_list() # Creates table of digital input triggers
dat.mark_dead_channels() # View traces and label electrodes as dead, or just pass list of dead channels
dat.mark_dead_channels([dead channel indices]) #alternatively, if you already know which chanels are dead, you can pass them as an argument
dat.common_average_reference() # Use common average referencing on data. Repalces raw with referenced data in HDF5 store
dat.detect_spikes()
dat.blech_clust_run() # Cluster data using GMM
dat.blech_clust_run(data_quality='noisy') # alternative: re-run clustering with less strict parameters
dat.sort_spikes(electrode_number) # Split, merge and label clusters as units
check blechpy/datastructures/dataset.py to see what functions are available
Preferred Workflow:
This workflow uses some parameters with defualts which makes the workflow more convenient.
dat = blechpy.dataset('/path/to/data/dir/')
dat.initParams(data_quality = 'hp', car_keyword = '2site_OE64') # 'hp' parameter for stricter clustering criteria, '2site_OE64' automatically maps channels to hirose-connector 64ch OEPS EIB in 2-site implantation
dat.extract_data()
dat.create_trial_list()
dat.mark_dead_channels([channel numbers]) # pass a list of dead channels (i.e. [1,2,3]) to bypass GUI marking of dead channels. Requires that you note them during drive building &/ recording
dat.common_average_reference()
dat.detect_spikes()
dat.blech_clust_run(umap=True) # Cluster with UMAP instead of GMM, supposedly better clustering
dat.sort_spikes(electrode_number) # Split, merge and label clusters as units
Checking processing progress:
dat.processing_status
Can provide an overview of basic data extraction and processing steps that need to be taken.
Viewing a Dataset
Experiments can be easily viewed wih: print(dat)
A summary can also be exported to a text with: dat.export_to_text()
Loading an existing dataset
dat = blechpy.load_dataset() # load an existing dataset from .p file
# or
dat = blechpy.load_dataset('path/to/recording/directory')
# or
dat = blechpy.load_dataset('path/to/dataset/save/file.p')
Import processed dataset into dataset framework
dat = blechpy.port_in_dataset()
# or
dat = blechpy.port_in_dataset('/path/to/recording/directory')
Experiments
Creating an experiment
exp = blechpy.experiment('/path/to/dir/encasing/recordings')
# or
exp = blechpy.experiment()
This will initalize an experiment with all recording folders within the chosen directory.
Editing recordings
exp.add_recording('/path/to/new/recording/dir/') # Add recording
exp.remove_recording('rec_label') # remove a recording dir
Recordings are assigned labels when added to the experiment that can be used to easily reference exerpiments.
Held unit detection
exp.detect_held_units()
Uses raw waveforms from sorted units to determine if units can be confidently classified as "held". Results are stored in exp.held_units as a pandas DataFrame. This also creates plots and exports data to a created directory: /path/to/experiment/experiment-name_analysis
Analysis
The blechpy.analysis
module has a lot of useful tools for analyzing your data.
Most notable is the blechpy.analysis.poissonHMM
module which will allow fitting of the HMM models to your data. See tutorials.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file blechpy-2.1.38.tar.gz
.
File metadata
- Download URL: blechpy-2.1.38.tar.gz
- Upload date:
- Size: 146.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.4.2 requests/2.28.1 setuptools/65.6.3 requests-toolbelt/0.8.0 tqdm/4.30.0 CPython/3.8.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 10343ca6e35caa1440fbfe8e2e43a713c9d7e719e7a87abd62018de9c7595119 |
|
MD5 | 1471a2d4d4c117d828de33155c46e2e1 |
|
BLAKE2b-256 | d22a9091e52b28e93ad6580faefe0cb13b573300faa93923358537f7f7cff9cb |
File details
Details for the file blechpy-2.1.38-py3-none-any.whl
.
File metadata
- Download URL: blechpy-2.1.38-py3-none-any.whl
- Upload date:
- Size: 164.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.4.2 requests/2.28.1 setuptools/65.6.3 requests-toolbelt/0.8.0 tqdm/4.30.0 CPython/3.8.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2a0061effd611627068e1cfeb256220e3f545b10301b6d7179d4da67e3b279cf |
|
MD5 | 3ab3bf44748b03110ed562cb89ac2e75 |
|
BLAKE2b-256 | 110dc339e716fe456ebcb0142f7d9183086827688ff5e0fa7ead71a58880dca5 |