Skip to main content

dbscan1d is a package for DBSCAN on 1D arrays

Project description

DBSCAN1D

dbscan1d is a 1D implementation of the DBSCAN algorithm. It was created to efficiently preform clustering on large 1D arrays.

Sci-kit Learn's DBSCAN implementation does not have a special case for 1D, where calculating the full distance matrix is wasteful. It is much better to simply sort the input array and performing efficient bisects for finding closest points. Here are the results of running the simple profile script included with the package. In every case DBSCAN1D is much faster than scikit learn's implementation.

image

Installation

Simply use pip to install dbscan1d:

pip install dbscan1d

It only requires numpy.

Quickstart

dbscan1d is designed to be interchangable with sklearn's implementation in alnmost all cases. The exception is that the weights parameter is not yet supported.

from sklearn.datasets import make_blobs

from dbscan1d.core import DBSCAN1D

# make blobs to test clustering
X = make_blobs(1_000_000, centers=2, n_features=1)[0]

# init dbscan object
dbs = DBSCAN1D(eps=.5, min_samples=4)

# get labels for each point
labels = dbs.fit_predict(X)

# show core point indices
dbs.core_sample_indices_

# get values of core points
dbs.components_

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dbscan1d-0.1.4.tar.gz (4.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dbscan1d-0.1.4-py3-none-any.whl (7.0 kB view details)

Uploaded Python 3

File details

Details for the file dbscan1d-0.1.4.tar.gz.

File metadata

  • Download URL: dbscan1d-0.1.4.tar.gz
  • Upload date:
  • Size: 4.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.8

File hashes

Hashes for dbscan1d-0.1.4.tar.gz
Algorithm Hash digest
SHA256 143719e043b94f0da9c917d628ccd269b1bccd40cf6c8cfacb42c1dce544b438
MD5 29a9f3d51da8f55a27cbb5fafad60aa8
BLAKE2b-256 498371ab6e03d78030503b53428bab26612cce6f982a626ac7e03a1725c6c1c5

See more details on using hashes here.

File details

Details for the file dbscan1d-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: dbscan1d-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 7.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.8

File hashes

Hashes for dbscan1d-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 905e489182bb53c710602a0f23093800d902167fd8cd6d8020d8c2b084a76dcc
MD5 14db28df8940731753ba9cfec6a238aa
BLAKE2b-256 824e9a474f15f7070a8a03953bbfab5e28f47e9aa25f84acbea57a968f9c7d68

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page