A lightweight module for research experiment reproducibility and analysis

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Skeletor

Skeletor attempts to provide a lightweight wrapper for research code with two goals: (1) make it easy to track experiment results and data for later analysis and (2) orchestrate many experiments in parallel without worrying too much. The first goal is satisfied using track for logging experiment metrics. You can get the experiment results in a nice Pandas DataFrame with it, it logs in a nice format, and it can back up to S3. The second goal is satisfied using ray to parallelize multi-gpu grid-searched experiment configurations.

99% of the work is being done by track and ray.

I added boilerplate model, architecture, and optimizer construction functions for some basic PyTorch setups. I will try to add more as time goes on, but I don't plan on adding TensorFlow things anytime soon.

Setup

Necessary packages are listed in setup.py. Just run pip install skeletor-ml to get started.

Basic Usage

All you really have to do is supply a supply_args(parser) function and an experiment_fn(parsed_args) function. The first one takes in an ArgumentParser object so you can supply your own arguments to the project. The second one will take in the parsed arguments and run your experiment.

To launch a single experiment, you can do something like

CUDA_VISIBLE_DEVICES=0 python train.py <my args> <experimentname>

To launch experiments in parallel, you can do something like

CUDA_VISIBLE_DEVICES=0,1,2,3 python train.py <config.yaml> --self_host=4 <experimentname>

Logs (track records) will be stored in <args.logroot>/<args.experimentname>.

Examples

You can find an example of running a grid search for training a residual network on CIFAR-10 in PyTorch in examples/train.py.

Getting experiment results

I added a utility in skeletor.proc for converting all track trial records for an experiment into a single Pandas DataFrame. It can also pickle it.

Help me out

I tried to erase boilerplate by adding basic experiment utilities as well as various models and dataloaders. I haven't added much yet. Feel free to port over other architectures and datasets into the repo via PRs.

Things to do

Add capability to register custom models, dataset loaders, and optimizers with the build_model, build_dataset, and build_optimizer functions.

Sometimes track doesn't install correctly from the setup.py. If this happens, just run pip install --upgrade git+https://github.com/richardliaw/track.git@master#egg=track first. `

...

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.1.4

Nov 22, 2018

0.1.3

Nov 5, 2018

0.1.2

Oct 23, 2018

0.1.1

Oct 21, 2018

This version

0.1

Oct 21, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

skeletor-ml-0.1.tar.gz (6.9 kB view hashes)

Uploaded Oct 21, 2018 Source

Built Distribution

skeletor_ml-0.1-py3-none-any.whl (9.7 kB view hashes)

Uploaded Oct 21, 2018 Python 3

Hashes for skeletor-ml-0.1.tar.gz

Hashes for skeletor-ml-0.1.tar.gz
Algorithm	Hash digest
SHA256	`7b381c13d278f277314c9ccf7870351aa9a58115f6fba07775cc497c74bd4bc9`
MD5	`9e35fc3d95ac747587766f7f3787fe4f`
BLAKE2b-256	`ca6e184ef84ba9fc559befbb44d219f9eefdef094aae33dac4a3d036870fd3fd`

Hashes for skeletor_ml-0.1-py3-none-any.whl

Hashes for skeletor_ml-0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`dac95a612ef5617354d52a5a902429c458139fdaf763c100468042c367c59326`
MD5	`a498f8c4e6105bb21ae0a0a3993cef70`
BLAKE2b-256	`378bbf1431cd3b892f2317110a4effda0f8090c3bd32d662fb778e719565a94f`