
Machine learning pipelines, especially for competitions such as Kaggle

Project description

MLComp logo

Distributed directed acyclic graph framework for machine learning with UI


The goal of MLComp is to provide tools for training, inference, and building complex pipelines (especially for computer vision) in a rapid, manageable way.
MLComp is compatible with Python 3.6+ and Unix operating systems.

Part of the Catalyst Ecosystem. See the Project manifest.


Features

  • Amazing UI
  • Catalyst support
  • Distributed training
  • Supervisor that controls computational resources
  • Synchronization of both code and data
  • Resource monitoring
  • Full pause-and-continue functionality in the UI
  • Automatic control of requirements
  • Code dumping (with syntax highlighting in the UI)
  • Kaggle integration
  • Hierarchical logging
  • Grid search
  • Experiment comparison
  • Customizable layout system

Contents

  • Screenshots
  • Installation
  • UI
  • Usage
  • Docs and examples
  • Environment variables

Screenshots

  • Dags
  • Computers
  • Reports
  • Code
  • Graph

More screenshots

Installation

  1. Install the MLComp package

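    # FFmpeg development headers (system dependencies for video processing)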
    sudo apt-get install -y \
    libavformat-dev libavcodec-dev libavdevice-dev \
    libavutil-dev libswscale-dev libavresample-dev libavfilter-dev
    
    pip install mlcomp
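    # set up MLComp's local config folder and apply database migrations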
    mlcomp init
    mlcomp migrate
    
  2. Set up your environment. Please see the Environment variables section

  3. Run the database, Redis, mlcomp-server, and mlcomp-workers:

    Variant 1: minimal (if you have 1 computer)

    Run everything necessary (mlcomp-server, mlcomp-workers, redis-server); this variant uses SQLite:

    mlcomp-server start --daemon=True
    

    Variant 2: full

    a. Change your Environment variables to use PostgreSQL

    b. Install rsync on each work computer

    sudo apt-get install rsync
    

    Ensure that every computer is reachable over the SSH protocol at the IP/PORT you specified in the Environment variables file (see the sanity-check sketch at the end of these steps).

    rsync will be invoked with the following commands:

    to upload

    rsync -vhru -e "ssh -p {target.port} -o StrictHostKeyChecking=no" \
    {folder}/ {target.user}@{target.ip}:{folder}/ --perms  --chmod=777
    

    to download

    rsync -vhru -e "ssh -p {source.port} -o StrictHostKeyChecking=no" \
    {source.user}@{source.ip}:{folder}/ {folder}/ --perms  --chmod=777
    

    c. Install NVIDIA apex for distributed training

    d. To run PostgreSQL, redis-server, and mlcomp-server, execute on your server computer:

    cd ~/mlcomp/configs/
    docker-compose -f server-compose.yml up -d
    

    e. Run on each worker computer:

    mlcomp-worker start
    

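    After completing these steps, it can help to sanity-check the setup. The commands below are a rough sketch: the port, user, and IP are placeholders that must match your Environment variables file, not project defaults.

    # check that another work computer is reachable over SSH at its IP/PORT
    ssh -p 2222 -o StrictHostKeyChecking=no user@192.0.2.10 'echo ssh ok'

    # check that rsync is installed on this computer
    rsync --version | head -n 1

    # on the server computer: check that the docker-compose services are running
    docker-compose -f ~/mlcomp/configs/server-compose.yml ps
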
UI

The web site is available at http://{WEB_HOST}:{WEB_PORT}

By default, it is http://localhost:4201
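As a quick sanity check that the server is responding on the default address (the host and port come from your Environment variables), you can request the page from a terminal:

    # prints the HTTP status code returned by the MLComp web server
    curl -s -o /dev/null -w "%{http_code}\n" http://localhost:4201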

The front end is built with AngularJS.

If you want to change it, please see the front end's README page

Usage

Run

mlcomp dag PATH_TO_CONFIG.yml

This command copies the files in the config's directory to the database.

Then, the server schedules the DAG considering free resources.
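For example, with a hypothetical project folder (the folder and file names below are illustrative, not taken from the MLComp examples):

    # the DAG config sits in the project folder next to the experiment code
    ls my_project
    # config.yml  experiment.py  requirements.txt

    # submit the DAG; the contents of my_project/ are copied to the database
    mlcomp dag my_project/config.yml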

For more information, please see the Docs

Docs and examples

API documentation and an overview of the library can be found in the Docs

You can find advanced tutorials and MLComp best practices in the examples folder of the repository.

The FileSync tutorial describes the data synchronization mechanism

Environment variables

The single file for setting up your computer's environment is located at ~/mlcomp/configs/.env

  • ROOT_FOLDER - folder to save MLComp files: configs, db, tasks, etc.
  • TOKEN - site security token. Please change it to any string
  • DB_TYPE. Either SQLITE or POSTGRESQL
  • POSTGRES_DB. PostgreSQL database name
  • POSTGRES_USER. PostgreSQL user
  • POSTGRES_PASSWORD. PostgreSQL password
  • POSTGRES_HOST. PostgreSQL host
  • PGDATA. PostgreSQL database files location
  • REDIS_HOST. Redis host
  • REDIS_PORT. Redis port
  • REDIS_PASSWORD. Redis password
  • WEB_HOST. MLComp site host. 0.0.0.0 means it is available from everywhere
  • WEB_PORT. MLComp site port
  • CONSOLE_LOG_LEVEL. log level for output to the console
  • DB_LOG_LEVEL. log level for output to the database
  • IP. IP of a work computer. The work computer must be reachable from other work computers at this IP/PORT
  • PORT. Port of a work computer. The work computer must be reachable from other work computers at this IP/PORT (SSH protocol)
  • MASTER_PORT_RANGE. Distributed port range for a work computer. 29500-29510 means that if this work computer is the master in distributed training, it will use the first free port from this range. Ranges of different work computers must not overlap.
  • NCCL_SOCKET_IFNAME. NCCL network interface.
  • FILE_SYNC_INTERVAL. File sync interval in seconds. 0 means file sync is off
  • WORKER_USAGE_INTERVAL. Interval in seconds of writing worker usage to DB
  • INSTALL_DEPENDENCIES. True/False. Whether or not to install dependent libraries
  • SYNC_WITH_THIS_COMPUTER. True/False. If False, other computers will not sync with this one
  • CAN_PROCESS_TASKS. True/False. If False, this computer does not process tasks

You can list your network interfaces with the ifconfig command. Please see the NVIDIA docs
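For illustration, a minimal ~/mlcomp/configs/.env for the single-computer SQLite variant could look like the sketch below; every value is a placeholder to adapt, not a project default.

    # placeholder values; see the variable descriptions above
    ROOT_FOLDER=/home/user/mlcomp
    TOKEN=change-me-to-a-random-string
    DB_TYPE=SQLITE
    REDIS_HOST=localhost
    REDIS_PORT=6379
    REDIS_PASSWORD=change-me
    WEB_HOST=0.0.0.0
    WEB_PORT=4201
    CONSOLE_LOG_LEVEL=INFO
    DB_LOG_LEVEL=INFO
    IP=192.0.2.10
    PORT=2222
    MASTER_PORT_RANGE=29500-29510
    NCCL_SOCKET_IFNAME=eth0
    FILE_SYNC_INTERVAL=0
    WORKER_USAGE_INTERVAL=10
    INSTALL_DEPENDENCIES=True
    SYNC_WITH_THIS_COMPUTER=True
    CAN_PROCESS_TASKS=True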

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mlcomp-20.3.1.tar.gz (7.0 MB)

Uploaded Source

Built Distribution

mlcomp-20.3.1-py2.py3-none-any.whl (14.3 MB)

Uploaded Python 2, Python 3

File details

Details for the file mlcomp-20.3.1.tar.gz.

File metadata

  • Download URL: mlcomp-20.3.1.tar.gz
  • Upload date:
  • Size: 7.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/45.1.0 requests-toolbelt/0.9.1 tqdm/4.39.0 CPython/3.6.9

File hashes

Hashes for mlcomp-20.3.1.tar.gz
  • SHA256: 47a7b964e59f10d245cbaa4125dd27eaf3c7a71b29c5ce7d4aed17ca8c6a4602
  • MD5: 692e3a54a6123da8f231f1f64aa5b5dd
  • BLAKE2b-256: fb04dfa6db754555fd4ad9ec93bd5465c115d07c075acb48498844193bd97acd

See more details on using hashes here.

File details

Details for the file mlcomp-20.3.1-py2.py3-none-any.whl.

File metadata

  • Download URL: mlcomp-20.3.1-py2.py3-none-any.whl
  • Upload date:
  • Size: 14.3 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/45.1.0 requests-toolbelt/0.9.1 tqdm/4.39.0 CPython/3.6.9

File hashes

Hashes for mlcomp-20.3.1-py2.py3-none-any.whl
  • SHA256: 980b749e1ab3a18d528d07732f2f28a192861ce5d7e221e0de34339060eaf599
  • MD5: dfae193f9aed18ea8cad0796ecddd098
  • BLAKE2b-256: 52cd6f60b9b24761e0e283890e9949d0f3196a387cce51f28029798477786273

See more details on using hashes here.
