a plugin that installs image processing tools for ngAutonML
Project description
ngAutonML
Image-specific docs coming soon to a repo near you!
ngAutonML
The ngAutonML project is an Automated Machine Learning solution intended to make it much easier to find good solutions to common machine learning problems, or to aid in prototyping a more complex solution. It aims to be simple to use for the bulk of machine learning problems, but maintains a high level of customizability for those situations that require more specific setups by more experiences machine learning professionals.
This project is the result of research performed as part of the D3M project run by DARPA. It is a reimplementation of many concepts used in that project, and is currently under heavy development.
Installation
At this time, the only installation available is via cloning the repository on GitLab:
git clone git@gitlab.com:autonlab/ngautonml.git
As the project leaves Alpha stage, a Pypi package will be posted for easy installation.
It is recommended to create a virtual environment to run ngautonml. To do so with conda, run:
conda create -n env-name python=3.9
conda activate env-name
ngAutonML is designed to run on Python 3.9 and above.
A requirements.txt
file is provided to install necessary libraries. Use:
pip install -r requirements.txt
In addition, if you are a developer, install requirements-dev.txt
, which contains linters that code must be fully compliant with:
pip install -r requirements-dev.txt
Usage
To use the current ngAutonML, you need a Problem Definition(explained later) to activate with:
python picard.py wrangle -d <path to problem definition>
Problem Definitions
The problem definition file is a JSON file that describes the dataset being used, and how it should be handled, in a simplified level that requires only minimal knowledge of machine learning. Examples can be found in the examples/problem_definition
directory under the project, but in brief the important aspects are the following fields:
- metrics: This field defines the scoring metric(s) that the Auto ML will use to determine fitness. At this time there is little documentation on what all the metrics require for parameters.
- dataset: This field is required, with the two major subfields being the
train_path
pointing to the training dataset(in csv format) and thetarget
field defining which column will be predicted. For those datasets that have a test to run, thetest_path
is also provided here - problem_type: This uses the subfields of
data_type
to tell the type of dataset, and thetask
to determine how the dataset will be handled, such as a classification or forecasting problem. The full documentation will have a set of possible tasks and how to identify which problem your data falls into.
Support
Currently all issues should be generated via the GitLab Issue Tracker.
Roadmap
In Development:
- Support for external models such as Docker Containers or LLM services
- Code generation for insertion into your own projects or for low-level customization
- API support
Contributing
If forking and wanting to contribute, please ensure PEP8 Compliance. The current project uses flake8, mypy, and pylint for code compliance, with the exception of setting the maximum line length to 100 characters.
Authors and acknowledgment
The CMU AutonML Development Team:
Piggy Yarroll (programmer/architect)
Andrew Williams (programmer)
Merritt Kowaleski (programmer)
Mujing Wang (programmer)
Carter Weaver (programmer)
Jeishi Chen (data scientist)
License
This project is currently licensed under the Apache 2.0 license.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file ngautonml_image-0.4.1b2.tar.gz
.
File metadata
- Download URL: ngautonml_image-0.4.1b2.tar.gz
- Upload date:
- Size: 12.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5e21db92bc7d1649bc0a0fe30389fe7cf517a4168e9a03a078c69bcf98b8f5e8 |
|
MD5 | 9204ebf1e9e17fbc87ca70628450e9fb |
|
BLAKE2b-256 | d75ff7b121d2e0276e317d14e8da6bf1737df05d9198f346678097605a9e19b1 |
File details
Details for the file ngautonml_image-0.4.1b2-py3-none-any.whl
.
File metadata
- Download URL: ngautonml_image-0.4.1b2-py3-none-any.whl
- Upload date:
- Size: 15.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.9.19
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 323dcc2562e42176649e82bf77743e45735a1053a9df211b75b510f90741262d |
|
MD5 | 9d8569e9f9fe8539ef203a5c340b33f1 |
|
BLAKE2b-256 | 03779b2adf6accadea5afab607201f5b46120dfc6dc459226942c9fd82934e16 |