Generalized Isolation Forest

This repository provides a Python implementation of the "Generalized Isolation Forest" (GIF) algorithm for unsupervised outlier detection. GIF was originally proposed in:
Buschjäger, S., Honysz, PJ. & Morik, K. Randomized outlier detection with trees. International Journal of Data Science and Analytics (2020). https://doi.org/10.1007/s41060-020-00238-w
More information on this package, including a quick start guide, examples and how to use this within C++, is given here.
Install from the Python Package Index (PyPI, recommended)
We provide Linux wheel packages for various Python versions, which can be installed like this:
pip install genif
Windows and macOS builds are currently not provided. If you are using Windows or macOS, please install from source.
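The choice between the two installation routes can be sketched in a few lines of Python. This is only an illustration of the rule stated above: the helper name `recommended_install` is hypothetical, and the check looks at the operating system only, so it says nothing about whether a wheel exists for your particular Python version or CPU architecture.

```python
import sys


def recommended_install(platform_name: str = sys.platform) -> str:
    """Return the install route suggested by this README for a platform string.

    Hypothetical helper for illustration; it is not part of the genif package.
    """
    # Prebuilt wheels are only published for Linux.
    if platform_name.startswith("linux"):
        return "pip install genif"
    # Windows ("win32"), macOS ("darwin") and anything else: build from source.
    return "install from source"


if __name__ == "__main__":
    print(recommended_install())
```

If `pip install genif` finds no matching wheel on a Linux system, the source route described below remains the fallback.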
Install from source
Requirements:
- GCC >= 5.4.0 (older versions or other compilers such as Clang or ICC may work, but have not been tested yet.)
- CMake >= 3.5.1
- OpenMP
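Before building, it can help to confirm that the installed tools meet the minimum versions listed above. The following is a minimal sketch, not part of this project: the helper names (`parse_version`, `meets_minimum`, `check_tool`) are hypothetical, the version is taken heuristically from the first `X.Y[.Z]` pattern in the first line of the tool's `--version` output, and OpenMP support is not checked at all.

```python
import re
import shutil
import subprocess

# Minimum versions as stated in the requirements list.
MINIMUMS = {"gcc": (5, 4, 0), "cmake": (3, 5, 1)}


def parse_version(text):
    """Extract the first X.Y or X.Y.Z version found in text as a 3-tuple of ints."""
    match = re.search(r"(\d+)\.(\d+)(?:\.(\d+))?", text)
    if match is None:
        return None
    # A missing patch component is treated as 0.
    return tuple(int(part) if part else 0 for part in match.groups())


def meets_minimum(text, minimum):
    """Return True if the version found in text is at least the given minimum."""
    version = parse_version(text)
    return version is not None and version >= minimum


def check_tool(name):
    """Return True if the tool is on PATH and reports a sufficient version."""
    path = shutil.which(name)
    if path is None:
        return False
    result = subprocess.run([path, "--version"], capture_output=True, text=True)
    lines = result.stdout.splitlines()
    first_line = lines[0] if lines else ""
    return meets_minimum(first_line, MINIMUMS[name])


if __name__ == "__main__":
    for tool in MINIMUMS:
        print(tool, "ok" if check_tool(tool) else "missing or too old")
```

A tool reported as "missing or too old" may still work if its version string has an unusual layout, so treat the result as a hint rather than a verdict.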
Build steps
- Recursively clone this repository by issuing
git clone --recurse-submodules git@github.com:philippjh/genif.git
- Change your working directory to the root of the repository. Run
pip3 install .
- The Python package manager will now build and install the package.
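After the build finishes, a quick way to confirm that the package is visible to your interpreter is to look it up without importing it. The sketch below uses only the standard library; it assumes that the import name is `genif`, matching the name used with pip, and the helper `is_importable` is hypothetical.

```python
import importlib.util


def is_importable(module_name: str) -> bool:
    """Return True if a top-level module with this name can be located."""
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    # "genif" is assumed to be the import name of the installed package.
    if is_importable("genif"):
        print("genif is installed")
    else:
        print("genif was not found in this environment")
```

If the module is not found, check that `pip3` and the Python interpreter you are running belong to the same environment.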
Acknowledgments
Part of the work on this paper has been supported by Deutsche Forschungsgemeinschaft (DFG) within the Collaborative Research Center SFB 876 "Providing Information by Resource-Constrained Analysis", project A1, http://sfb876.tu-dortmund.de and by the German Competence Center for Machine Learning Rhine Ruhr (ML2R, https://www.ml2r.de, 01IS18038A), funded by the German Federal Ministry for Education and Research.
Third Party Licenses
This open source project would not have been possible without other open source projects, which we want to acknowledge here.
Eigen
This library uses the MPL2-licensed features of Eigen, a C++ template library for linear algebra. A copy of the MPL2 license is available at https://www.mozilla.org/en-US/MPL/2.0/. The source code of the Eigen library can be obtained at http://eigen.tuxfamily.org/ or https://gitlab.com/libeigen/eigen.
nanoflann
Software License Agreement (BSD License)
Copyright 2008-2009 Marius Muja (mariusm@cs.ubc.ca). All rights reserved. Copyright 2008-2009 David G. Lowe (lowe@cs.ubc.ca). All rights reserved. Copyright 2011 Jose L. Blanco ( joseluisblancoc@gmail.com). All rights reserved.
THE BSD LICENSE
Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:
- Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer.
- Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution.
THIS SOFTWARE IS PROVIDED BY THE AUTHOR ``AS IS'' AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE AUTHOR BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
pybind11
Copyright (c) 2016 Wenzel Jakob wenzel.jakob@epfl.ch, All rights reserved.
Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:
- Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer.
- Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution.
- Neither the name of the copyright holder nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission.
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
Please also refer to the file .github/CONTRIBUTING.md, which clarifies licensing of external contributions to this project including patches, pull requests, etc.