Skip to main content

a plugin that installs forecasting tools for ngAutonML

Project description

ngAutonML

Forecasting-specific docs coming soon to a repo near you!

ngAutonML

The ngAutonML project is an Automated Machine Learning solution intended to make it much easier to find good solutions to common machine learning problems, or to aid in prototyping a more complex solution. It aims to be simple to use for the bulk of machine learning problems, but maintains a high level of customizability for those situations that require more specific setups by more experiences machine learning professionals.

This project is the result of research performed as part of the D3M project run by DARPA. It is a reimplementation of many concepts used in that project, and is currently under heavy development.

Installation

The preferred method for installing the plugin is pip:

pip install ngautonml_forecasting

Installation via git clone (for developers)

You can clone the repository on GitLab:

git clone git@gitlab.com:autonlab/ngautonml.git

It is recommended to create a virtual environment to run ngautonml. To do so with conda, run:

conda create -n env-name python=3.10
conda activate env-name

ngAutonML is designed to run on Python 3.9 and above.

A requirements.txt file is provided to install necessary libraries. Use:

pip install -r plugins/ngautonml_forecasting/ngautonml_forecasting/requirements.txt
pip install -r plugins/ngautonml_forecasting/ngautonml_forecasting/requirements-dev.txt

Usage

To use the current ngAutonML, you need a Problem Definition(explained later) to activate with:

python picard.py wrangle -d <path to problem definition>

Problem Definitions

The problem definition file is a JSON file that describes the dataset being used, and how it should be handled, in a simplified level that requires only minimal knowledge of machine learning. Examples can be found in the examples/problem_definition directory under the project, but in brief the important aspects are the following fields:

  • metrics: This field defines the scoring metric(s) that the Auto ML will use to determine fitness. At this time there is little documentation on what all the metrics require for parameters.
  • dataset: This field is required, with the two major subfields being the train_path pointing to the training dataset(in csv format) and the target field defining which column will be predicted. For those datasets that have a test to run, the test_path is also provided here
  • problem_type: This uses the subfields of data_type to tell the type of dataset, and the task to determine how the dataset will be handled, such as a classification or forecasting problem. The full documentation will have a set of possible tasks and how to identify which problem your data falls into.

Support

Currently all issues should be generated via the GitLab Issue Tracker.

Roadmap

In Development:

  • Support for external models such as Docker Containers or LLM services
  • Code generation for insertion into your own projects or for low-level customization
  • API support

Contributing

If forking and wanting to contribute, please ensure PEP8 Compliance. The current project uses flake8, mypy, and pylint for code compliance, with the exception of setting the maximum line length to 100 characters.

Authors and acknowledgment

The CMU AutonML Development Team:

Piggy Yarroll (programmer/architect)
Andrew Williams (programmer)
Merritt Kowaleski (programmer)
Mujing Wang (programmer)
Carter Weaver (programmer)
Jeishi Chen (data scientist)
Nick Gisolfi (researcher)
Dan Howarth (programmer)

License

This project is currently licensed under the Apache 2.0 license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ngautonml_forecasting-0.5.0b8.tar.gz (22.0 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

ngautonml_forecasting-0.5.0b8-py3-none-any.whl (37.4 kB view details)

Uploaded Python 3

ngautonml_forecasting-0.5.0b8-py2.py3-none-any.whl (37.6 kB view details)

Uploaded Python 2Python 3

File details

Details for the file ngautonml_forecasting-0.5.0b8.tar.gz.

File metadata

  • Download URL: ngautonml_forecasting-0.5.0b8.tar.gz
  • Upload date:
  • Size: 22.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.17

File hashes

Hashes for ngautonml_forecasting-0.5.0b8.tar.gz
Algorithm Hash digest
SHA256 60a9762b53d79294df24b01f3dcaecb523e30d0400e9627489cab077a534fbbd
MD5 83b838252e57b3e3a0e0bef36aaceecf
BLAKE2b-256 ee16cd25ce9166196d0e9987b9142c7eb84ad7d86ba147a711bbbc782e4de919

See more details on using hashes here.

File details

Details for the file ngautonml_forecasting-0.5.0b8-py3-none-any.whl.

File metadata

File hashes

Hashes for ngautonml_forecasting-0.5.0b8-py3-none-any.whl
Algorithm Hash digest
SHA256 200dbd5e60492d5a6f56a74827f19f097f49ca326aa6f3c6d65e0ec85417552d
MD5 02ea1beb22eabb5e774aae4b7d08a0c4
BLAKE2b-256 8adfd5edb88ae9c656459d5ff744dd755df37e4f35c33ddda5d3f8c9727f1cdf

See more details on using hashes here.

File details

Details for the file ngautonml_forecasting-0.5.0b8-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for ngautonml_forecasting-0.5.0b8-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 b54f6ce87b275816ac7ccbda29a11130b9a067e47ec12b6ae4bec6673776eb85
MD5 20126f3200c9c9616ee28109f1ed4e80
BLAKE2b-256 a0422d3332eca5ddc5b1dc00690b7b163f2e9a2eb0f5652f615f61d7f438ebdf

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page