PyScaffold extension for Data Science projects
- advocates a proper Python package structure that can be shipped and distributed,
- uses a conda environment instead of something virtualenv-based and is thus more suitable for data science projects,
- more default configurations for Sphinx, py.test, pre-commit, etc. to foster clean coding and best practices.
Also consider using dvc to version control and share your data within your team.
The final directory structure looks like:
├── AUTHORS.rst <- List of developers and maintainers. ├── CHANGELOG.rst <- Changelog to keep track of new features and fixes. ├── LICENSE.txt <- License as chosen on the command-line. ├── README.md <- The top-level README for developers. ├── configs <- Directory for configurations of model & application. ├── data │ ├── external <- Data from third party sources. │ ├── interim <- Intermediate data that has been transformed. │ ├── processed <- The final, canonical data sets for modeling. │ └── raw <- The original, immutable data dump. ├── docs <- Directory for Sphinx documentation in rst or md. ├── environment.yaml <- The conda environment file for reproducibility. ├── models <- Trained and serialized models, model predictions, │ or model summaries. ├── notebooks <- Jupyter notebooks. Naming convention is a number (for │ ordering), the creator's initials and a description, │ e.g. `1.0-fw-initial-data-exploration`. ├── references <- Data dictionaries, manuals, and all other materials. ├── reports <- Generated analysis as HTML, PDF, LaTeX, etc. │ └── figures <- Generated plots and figures for reports. ├── scripts <- Analysis and production scripts which import the │ actual PYTHON_PKG, e.g. train_model. ├── setup.cfg <- Declarative configuration of your project. ├── setup.py <- Use `python setup.py develop` to install for development or | or create a distribution with `python setup.py bdist_wheel`. ├── src │ └── PYTHON_PKG <- Actual Python package where the main functionality goes. ├── tests <- Unit tests which can be run with `py.test`. ├── .coveragerc <- Configuration for coverage reports of unit tests. ├── .isort.cfg <- Configuration for git hook that sorts imports. └── .pre-commit-config.yaml <- Configuration of pre-commit git hooks.
Just install this package with
pip install pyscaffoldext-dsproject
and note that
putup -h shows a new option
Creating a data science project is then as easy as:
putup --dsproject my_ds_project
This project has been set up using PyScaffold 3.2. For details and usage information on PyScaffold see https://pyscaffold.org/.
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
|Filename, size||File type||Python version||Upload date||Hashes|
|Filename, size pyscaffoldext_dsproject-0.4-py2.py3-none-any.whl (12.7 kB)||File type Wheel||Python version py2.py3||Upload date||Hashes View|
|Filename, size pyscaffoldext-dsproject-0.4.tar.gz (20.9 kB)||File type Source||Python version None||Upload date||Hashes View|
Hashes for pyscaffoldext_dsproject-0.4-py2.py3-none-any.whl
Hashes for pyscaffoldext-dsproject-0.4.tar.gz