Skip to main content

Scripts and Jupyter notebooks for 4DN wrangling

Project description

DCIC-wrangling

This is a collection of scripts and Jupyter notebooks that can be helpful when performing many data wrangling tasks. Most of the tools are specific to 4DN Nucleome wrangling needs, however may be modified to be more generally useful for certain tasks.

Install

Packaged with poetry can be installed using make, poetry or pip.

From the dcicwrangling directory - make build

If you already have poetry installed - poetry install

Or to pip install from PyPi - pip install dcicwrangling

All dependencies are installed by default - if for some reason you don't want to install pytest packages or invoke (used to launch notebooks) you can do poetry install --no-dev - not recommended.

Usage

Jupyter notebooks

There are a collection of commonly used jupyter notebooks in the notebooks/useful_notebooks directory. You can start a jupyter notebook server locally using invoke notebook from the top level directory. This should launch the server and open a browser page where the notebooks can be accessed.

IMPORTANT! - You should create your own folder in the notebooks directory named Yourname_scripts. This folder is where you should create, access and run your notebooks. If you want to start with one of the notebooks in the useful_notebooks directory please create a copy and move it to your own folder. This keeps the repository clean and organized. Please DO NOT run notebooks in the useful_notebooks directory and commit the results to the repository.

Scripts

The scripts directory contains some useful command line scripts. They can be run from the top level directory using python scripts/script_name --options. Using --help shows available options. In general, modified versions and bespoke scripts should not be committed to the repository - or alternatively committed to a separate non-master branch.

As scripts are developed and refined tool.poetry.scripts directives can be added to facilitate script usage - see pyproject.toml file example.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dcicwrangling-3.2.0.tar.gz (57.7 kB view details)

Uploaded Source

Built Distribution

dcicwrangling-3.2.0-py3-none-any.whl (69.0 kB view details)

Uploaded Python 3

File details

Details for the file dcicwrangling-3.2.0.tar.gz.

File metadata

  • Download URL: dcicwrangling-3.2.0.tar.gz
  • Upload date:
  • Size: 57.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.2 CPython/3.11.9 Linux/6.5.0-1025-azure

File hashes

Hashes for dcicwrangling-3.2.0.tar.gz
Algorithm Hash digest
SHA256 0365b00e13fd7d4be524c2e2d74e105d64250bbcfba7328f97d3b3dc985a3e7b
MD5 b9c9ecbc70aa316b615cdddfb8a9e28e
BLAKE2b-256 82144e84ffc1c8df231ad73009950a70b00acea7549c8d80fd8ff1c59f94a134

See more details on using hashes here.

File details

Details for the file dcicwrangling-3.2.0-py3-none-any.whl.

File metadata

  • Download URL: dcicwrangling-3.2.0-py3-none-any.whl
  • Upload date:
  • Size: 69.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.2 CPython/3.11.9 Linux/6.5.0-1025-azure

File hashes

Hashes for dcicwrangling-3.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 99f177e1d719a530f663d76efac7659eb22e05d225b6817bedf6b6c6e2e755fe
MD5 1cd4149f7013bc9deb53db43a8ec04a9
BLAKE2b-256 683efe286734dc36a8cbdd560910741cbd13ee24100f263a7db56f6f8ddeeeb7

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page