Skip to main content

Generalized Isolation Forest

Project description

Generalized Isolation Forest Read Manual PyPI PyPI - Format

This repository provides an Python implementation of the "Generalized Isolation Forest" (GIF) algorithm for unsupervised detection of outliers in data. GIF has originally been proposed in:

Buschjäger, S., Honysz, PJ. & Morik, K. Randomized outlier detection with trees. International Journal of Data Science and Analytics (2020). https://doi.org/10.1007/s41060-020-00238-w

More information on this package, including a quick start guide, examples and how to use this within C++, is given here.

Install from the Python Package Index (PyPI, recommended)

We provide Linux wheel packages for various Python versions, which can be installed like this:

pip install genif

Windows or macOS builds are currently not provided. Please resort to installation from source, if you are either using Windows or macOS.

Install from source

Requirements:

  • GCC >= 5.4.0 (older versions or other compilers such as Clang or ICC may work, but have not been tested yet.)
  • CMake >= 3.5.1
  • OpenMP

Build steps

  • Recursively clone this repository by issueing git clone --recurse-submodules git@github.com:philippjh/genif.git
  • Change your working directory to the root of the repository. Run pip3 install .
  • The Python package manager will now build and install the package.

Acknowledgments

Part of the work on this paper has been supported by Deutsche Forschungsgemeinschaft (DFG) within the Collaborative Research Center SFB 876 "Providing Information by Resource-Constrained Analysis", project A1, http://sfb876.tu-dortmund.de and by the German Competence Center for Machine Learning Rhine Ruhr (ML2R, https://www.ml2r.de, 01IS18038A), funded by the German Federal Ministry for Education and Research.

Third Party Licenses

This open source project would not have been possible without other open source projects, which we want to acknowledge here.

Eigen

This library uses the MPL2-licensed features of Eigen, a C++ template library for linear algebra. A copy of the MPL2 license is available at https://www.mozilla.org/en-US/MPL/2.0/. The source code of the Eigen library can be obtained at http://eigen.tuxfamily.org/ or https://gitlab.com/libeigen/eigen.

nanoflann

Software License Agreement (BSD License)

Copyright 2008-2009 Marius Muja (mariusm@cs.ubc.ca). All rights reserved. Copyright 2008-2009 David G. Lowe (lowe@cs.ubc.ca). All rights reserved. Copyright 2011 Jose L. Blanco ( joseluisblancoc@gmail.com). All rights reserved.

THE BSD LICENSE

Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:

  1. Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer.
  2. Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution.

THIS SOFTWARE IS PROVIDED BY THE AUTHOR ``AS IS'' AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE AUTHOR BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

pybind11

Copyright (c) 2016 Wenzel Jakob wenzel.jakob@epfl.ch, All rights reserved.

Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:

  1. Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer.

  2. Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution.

  3. Neither the name of the copyright holder nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission.

THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

Please also refer to the file .github/CONTRIBUTING.md, which clarifies licensing of external contributions to this project including patches, pull requests, etc.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

genif-1.0.1-cp39-cp39-manylinux2014_x86_64.whl (184.6 kB view details)

Uploaded CPython 3.9

genif-1.0.1-cp38-cp38-manylinux2014_x86_64.whl (184.6 kB view details)

Uploaded CPython 3.8

genif-1.0.1-cp37-cp37m-manylinux2014_x86_64.whl (185.8 kB view details)

Uploaded CPython 3.7m

genif-1.0.1-cp36-cp36m-manylinux2014_x86_64.whl (185.8 kB view details)

Uploaded CPython 3.6m

genif-1.0.1-cp35-cp35m-manylinux2014_x86_64.whl (185.8 kB view details)

Uploaded CPython 3.5m

File details

Details for the file genif-1.0.1-cp39-cp39-manylinux2014_x86_64.whl.

File metadata

  • Download URL: genif-1.0.1-cp39-cp39-manylinux2014_x86_64.whl
  • Upload date:
  • Size: 184.6 kB
  • Tags: CPython 3.9
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.25.1 setuptools/51.1.1 requests-toolbelt/0.9.1 tqdm/4.55.1 CPython/3.9.1

File hashes

Hashes for genif-1.0.1-cp39-cp39-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 0ea54e70e3cfd9e74a92a43eca3ab37d025bb68200c2063d61faf838c2e6710e
MD5 c0ff534768dc7561a7c5efb452998121
BLAKE2b-256 b8877df59fd3ba9df7490fcb5c24dfc877fc10cf50bc7ff4f815e54d6d3ee82b

See more details on using hashes here.

File details

Details for the file genif-1.0.1-cp38-cp38-manylinux2014_x86_64.whl.

File metadata

  • Download URL: genif-1.0.1-cp38-cp38-manylinux2014_x86_64.whl
  • Upload date:
  • Size: 184.6 kB
  • Tags: CPython 3.8
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.25.1 setuptools/51.1.1 requests-toolbelt/0.9.1 tqdm/4.55.1 CPython/3.9.1

File hashes

Hashes for genif-1.0.1-cp38-cp38-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 583ad7cbb8214aa24ea6422c943a5a6c44ac34e31e3edf2f24dbff25217f76f6
MD5 02d1d3a46d22d662ae8cece896e9c83a
BLAKE2b-256 ccf7c290253ecfde0f2a97a4659200de59a6e9b873b62915f7dc774efbe5b073

See more details on using hashes here.

File details

Details for the file genif-1.0.1-cp37-cp37m-manylinux2014_x86_64.whl.

File metadata

  • Download URL: genif-1.0.1-cp37-cp37m-manylinux2014_x86_64.whl
  • Upload date:
  • Size: 185.8 kB
  • Tags: CPython 3.7m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.25.1 setuptools/51.1.1 requests-toolbelt/0.9.1 tqdm/4.55.1 CPython/3.9.1

File hashes

Hashes for genif-1.0.1-cp37-cp37m-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 48e6c37b104c9b0d7845db6f34d3092b2af2ba3a2e429b8c10dd63ff6c55a4d3
MD5 af5410ebd43ce67f7ee232848e9159c4
BLAKE2b-256 428ea53ef9fe78ce8b34c80d74f51759ae5159b0cb9781b6d9afc4a4d6d826e3

See more details on using hashes here.

File details

Details for the file genif-1.0.1-cp36-cp36m-manylinux2014_x86_64.whl.

File metadata

  • Download URL: genif-1.0.1-cp36-cp36m-manylinux2014_x86_64.whl
  • Upload date:
  • Size: 185.8 kB
  • Tags: CPython 3.6m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.25.1 setuptools/51.1.1 requests-toolbelt/0.9.1 tqdm/4.55.1 CPython/3.9.1

File hashes

Hashes for genif-1.0.1-cp36-cp36m-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 5aa8c8bea1b39aa77615fb2e62cd87409eddcc49305f8226710a425d7ac5c49d
MD5 a096bfb301fc01f069e49e2f8f49d915
BLAKE2b-256 d4fa0b0141f9cbf2272d2f01a06cfae75883830aa51d7fb5c8da6dd876354bf9

See more details on using hashes here.

File details

Details for the file genif-1.0.1-cp35-cp35m-manylinux2014_x86_64.whl.

File metadata

  • Download URL: genif-1.0.1-cp35-cp35m-manylinux2014_x86_64.whl
  • Upload date:
  • Size: 185.8 kB
  • Tags: CPython 3.5m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.25.1 setuptools/51.1.1 requests-toolbelt/0.9.1 tqdm/4.55.1 CPython/3.9.1

File hashes

Hashes for genif-1.0.1-cp35-cp35m-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 ad7a54354e8370f5c57ad5be1f88993b52feeeb323d6fda0d1512bf50311ea6c
MD5 3fb5742156e6fabbcd985a58aa22d98e
BLAKE2b-256 0cbf1c5f2d0122f5ca427dec13057f51971d521cc99f6339738150f6fe8c43a4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page