Skip to main content

An Open Source tool for Feature Engineering in Machine Learning

Project description

anovos-dark-horizontal

Anovos

Release Latest Docs Latest License twitter Slack

Anovos is an open source library for feature engineering at scale. Built by data scientists & ML Engineers for the data science community, it provides all the capabilities required for data ingestion, data analysis, data drift & data stability analysis, feature recommendation and feature composition. In addition, it automatically produces easily interpretable professional data reports that help users understand the nature of data at first sight and further enable data scientists to identify and engineer features.

Leveraging the power of Apache Spark behind the scenes, Anovos improves data scientists' productivity and helps them build more resilient and better performing models.

Quick Start

The easiest way to try out Anovos and explore its capabilities is through the provided examples that you can run via Docker without the need to install anything on your local machine.

# Launch an anovos-examples Docker container
sudo docker run -p 8888:8888 anovos/anovos-examples-3.2.2:latest

To reach the Jupyter environment, open the link to http://127.0.0.1:8888/?token... generated by the Jupyter NotebookApp.

If you're not familiar with Anovos or feature engineering, the Getting Started with Anovos guide is a good place to begin your journey. You can find it in the /guides folder within the Jupyter environment.

For more detailed instructions on how to install Docker and how to troubleshoot potential issues, see the examples README.

Using Anovos

Requirements

To use Anovos, you need compatible versions of Apache Spark, Java and Python.

Currently, we officially support the following combinations:

  • Apache Spark 2.4.x on Java 8 with Python 3.7.x
  • Apache Spark 3.1.x on Java 11 with Python 3.9.x
  • Apache Spark 3.2.x on Java 11 with Python 3.10.x

To see what we're currently testing, see this configuration.

Installation

You can install the latest release of Anovos directly through PyPI:

pip install anovos

Documentation

We provide a comprehensive documentation at docs.anovos.ai that includes user guides as well as a detailed API documentation.

For usage examples, see the provided interactive guides and Jupyter notebooks as well as the Spark demo.

Overview

Anovos Architecture Diagram

Roadmap

Anovos has designed for to support any feature engineering tasks in a scalable form. To see what's planned for the upcoming releases, see our roadmap.

Development Version

To try out the latest additions to Anovos, you can install it directly from GitHub:

pip install git+https://github.com/anovos/anovos.git

Please note that this version is frequently updated and might not be fully compatible with the documentation available at docs.anovos.ai.

Contribute

We're always happy to discuss and accept improvements to Anovos. To get started, please refer to our Contributing to Anovos page in the documentation.

To start coding, clone this repository, install both the regular and development requirements, and set up the pre-commit hooks:

git clone https://github.com/anovos/anovos.git
cd anovos/
pip install -r requirements.txt
pip install -r dev_requirements.txt
pre-commit install

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

anovos-1.1.0.tar.gz (208.1 kB view details)

Uploaded Source

Built Distribution

anovos-1.1.0-py3-none-any.whl (226.0 kB view details)

Uploaded Python 3

File details

Details for the file anovos-1.1.0.tar.gz.

File metadata

  • Download URL: anovos-1.1.0.tar.gz
  • Upload date:
  • Size: 208.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.15

File hashes

Hashes for anovos-1.1.0.tar.gz
Algorithm Hash digest
SHA256 4dc8c2d13f05a9982a707f434f484368ee7053d0b1646b23ddc2e5b7bd852f91
MD5 090941b251c25996578b1b288e054868
BLAKE2b-256 d7be77c4a5577c9dd7f4978bac6323f1c811d5f57eb3061869b8278f337e48ef

See more details on using hashes here.

File details

Details for the file anovos-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: anovos-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 226.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.15

File hashes

Hashes for anovos-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d47e3fe59ffac539576170ea0754c4e4317c91fd2336131bd25b311cb36d79fd
MD5 81d809d6a3f04b3b7c279e1811e5b35e
BLAKE2b-256 a53d426beecf32ab7051231935f554c47008e2b6354fe159b8bfa709ebb82649

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page