Pre-processing data tool for NHP Lab @ CMU

nhp-prep

This is a CLI tool for pre-processing historical data collected across multiple sites, including Seneca Zoo and Mellon Institute.

Requirements

This package requires Python 3.

Installing

To install this CLI tool, run the following command:

pip3 install nhp-prep

Updating

If you already have this tool installed, you can update it to the latest stable release by using the following command:

pip3 install -U nhp-prep

Alternatively, you can clone this repo and then run the following command from within the repository folder:

python3 setup.py install

Another way to install this solution is by running the following command from within the repository folder:

pip install -e .

Both of the above commands install the package into the current Python environment (globally, if no virtual environment is active), and nhp-prep will then be available on your system.

How to use

The tool is invoked with a command followed by that command's options:

nhp-prep COMMAND [OPTIONS]

The tool supports four use cases, each with its own command:

  1. Map columns from the prior format to the current format (reorder-columns)
nhp-prep reorder-columns -i <directory_with_files_to_reorder_columns_OR_unique_CSV_file> -o <output_directory> -r <file_with_reference_columns>
  2. Rename the files to follow the current standard (rename)
nhp-prep rename -i <directory_files_to_rename_OR_unique_CSV_file> -o <output_directory>

The current file name format is: YYYY-MM-DD_HHmmh_<experiment_name>_<Subject_name>_<Researcher_name_or_initials>_data.csv
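
As an illustration of the naming standard, here is a minimal Python sketch that builds and checks a file name in this format. It is not the code used by nhp-prep. The helper names (build_filename, parse_filename) and the example values are hypothetical, HHmm is assumed to mean a 24-hour hour and minute, and the experiment, subject, and researcher fields are assumed to contain no underscores.

```python
import re
from datetime import datetime

# Pattern for YYYY-MM-DD_HHmmh_<experiment>_<subject>_<researcher>_data.csv.
# Assumption: the three name fields contain no underscores.
FILENAME_PATTERN = re.compile(
    r"^(?P<date>\d{4}-\d{2}-\d{2})_(?P<time>\d{4})h_"
    r"(?P<experiment>[^_]+)_(?P<subject>[^_]+)_(?P<researcher>[^_]+)_data\.csv$"
)


def build_filename(when, experiment, subject, researcher):
    """Return a file name in the current standard (hypothetical helper)."""
    # Assumption: HHmm is the 24-hour hour and minute, followed by a literal "h".
    stamp = when.strftime("%Y-%m-%d_%H%Mh")
    return f"{stamp}_{experiment}_{subject}_{researcher}_data.csv"


def parse_filename(name):
    """Return the fields of a conforming file name, or None if it does not match."""
    match = FILENAME_PATTERN.match(name)
    return match.groupdict() if match else None


# Example values are made up for illustration only.
example = build_filename(datetime(2021, 3, 9, 14, 5), "taskname", "Subject1", "AB")
print(example)
print(parse_filename(example))
```

A check like parse_filename can be useful for spotting files that still need to go through the rename command.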

  3. Estimate the timestamp of each trial in historical data files, based on a column (timestamp-estimate)
nhp-prep timestamp-estimate -i <input_file>

Alternatively, you can pass the directory of the files to estimate the timestamp of each trial:

nhp-prep timestamp-estimate -i <directory_with_files_OR_unique_CSV_file>
Since v0.3.0

Because the previous three steps are common to all of the collected datasets, they have been merged into a single command (preparation-steps):

nhp-prep preparation-steps -i <input_directory> -o <output_directory>

The previous command runs steps 1 to 3 sequentially. The only command left out of the bundle is step 4 (sub-rename), since it applies only to the baboons' data and requires an additional reference file.
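
If you prefer to drive the bundled command from a Python script (for example, to process several datasets in a loop), the following is one possible sketch. It only uses the -i and -o options shown above. The function names and the directory names raw_data and prepared_data are hypothetical placeholders, and the command is only executed if nhp-prep is found on your PATH.

```python
import shutil
import subprocess


def preparation_command(input_dir, output_dir):
    """Build the argument list for `nhp-prep preparation-steps` (hypothetical helper)."""
    return ["nhp-prep", "preparation-steps", "-i", str(input_dir), "-o", str(output_dir)]


def run_preparation(input_dir, output_dir):
    """Run the bundled steps, or report the command if nhp-prep is not installed."""
    command = preparation_command(input_dir, output_dir)
    if shutil.which("nhp-prep") is None:
        print("nhp-prep is not on PATH; would have run:", " ".join(command))
        return None
    # check=True raises CalledProcessError if the tool exits with an error.
    return subprocess.run(command, check=True)


# Directory names below are placeholders for illustration only.
command = preparation_command("raw_data", "prepared_data")
print(" ".join(command))
```

Calling run_preparation("raw_data", "prepared_data") would then execute the command shown by the print statement.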

  4. Rename subjects according to a logs file, which must be provided (sub-rename)
nhp-prep sub-rename -r <file_with_columns_and_reference_subject_names> -i <directory_with_files_OR_unique_CSV_file> -o <output_directory>

You can also run nhp-prep --help to see the available commands and their corresponding usage.

If you want to know all the options available for a specific command, run the following:

nhp-prep COMMAND --help

Example:

nhp-prep sub-rename --help

Feedback

Please feel free to leave feedback in issues/PRs.
