Skip to main content

Hierarchical Classification Library.

Project description

HiClass

HiClass is an open-source python library for hierarchical classification compatible with scikit-learn

Deploy PyPI Documentation Status codecov Downloads Conda Downloads pypi License

✨ Here are a couple of demos that show HiClass in action on hierarchical datasets:

  • Classify a consumer complaints dataset from the consumer financial protection bureau: consumer-complaints
  • Classify a 16S rRNA dataset from the TAXXI benchmark: 16s-rrna

Quick Links

Install

Option 1: Conda

HiClass and its dependencies can be easily installed with conda:

conda install -c conda-forge hiclass

Option 2: Pip

Alternatively, HiClass and its dependencies can also be installed with pip:

pip install hiclass

Quick start

Here's a quick example showcasing how you can train and predict using a local classifier per node.

from hiclass import LocalClassifierPerNode
from sklearn.ensemble import RandomForestClassifier

# define data
X_train, X_test = get_some_train_data()  # (n, num_features)
Y_train = get_some_labels()  # (n, num_largest_hierarchy)
# Use random forest classifiers for every node and run a classification
rf = RandomForestClassifier()
lcpn = LocalClassifierPerNode(local_classifier=rf)
lcpn.fit(X_train, Y_train)
predictions = lcpn.predict(X_test)

Step-by-step walk-through

A step-by-step walk-through is available on our interactive notebook hosted on Google Colab.

This will guide you through the process of installing hiclass with conda, training and predicting a small dataset.

API Documentation

Here's our official API documentation, available on Read the Docs.

If you notice any issues with the documentation or walk-through, please let us know by opening an issue here: https://github.com/mirand863/hiclass/issues.

Contributing

We are a small team on a mission to democratize hierarchical classification, and we'll take all the help we can get! If you'd like to get involved, here's information on where we could use your help: Contributing.md

Getting Latest Updates

If you'd like to get updates when we release new versions, please click on the "Watch" button on the top and select "Releases only". Github will then send you notifications along with a changelog with each new release.

Citation

If you use HiClass, please cite:

Miranda, Fábio M., Niklas Köehnecke, and Bernhard Y. Renard. "HiClass: a Python library for local hierarchical classification compatible with scikit-learn." arXiv preprint arXiv:2112.06560 (2021).

@article{miranda2021hiclass,
  title={HiClass: a Python library for local hierarchical classification compatible with scikit-learn},
  author={Miranda, F{\'a}bio M and K{\"o}ehnecke, Niklas and Renard, Bernhard Y},
  journal={arXiv preprint arXiv:2112.06560},
  year={2021}
}

In addition, we would like to list publications that use our software on our repository. Please email the reference, the name of your lab, department and institution to fabio.malchermiranda@hpi.de

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hiclass-3.1.2.tar.gz (41.3 kB view hashes)

Uploaded Source

Built Distribution

hiclass-3.1.2-py3-none-any.whl (30.3 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page