Python library for simplifying data science

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Atlantis

Atlantis is a Python library for simplifying programming with Python for data science.

Installation

You can just use pip to install Atlantis:

pip install atlantis

Modules

collections helps with working with collections.
colour simplifies using colours.
ds (datascience) provides tools for:
- data wrangling,
- validation,
- tuning,
- sampling,
- evaluation,
- clustering, and
- parallel processing of machine learning models.
functions manages higher order functions.
hash simplifies and standardizes hashing.
text makes working with texts and strings easy.
time
- provides methods for interacting with time and date as well as
- progress bars

collections

This module of the package atlantis helps with working with collections.

`flatten`

from atlantis.collections import flatten
flatten([1, 2, [3, 4, [5, 6], 7], 8])

returns: [1, 2, 3, 4, 5, 6, 7, 8]

`List`

This class inherits from Python's list class but implements a few additional functionalities.

from atlantis.collections import List
l = List(1, 2, 3, 4, 2, [1, 2], [1, 2])

Flattening:

l.flatten()
>>> List: [1, 2, 3, 4, 2, 1, 2, 1, 2]

Finding duplicates:

l.get_duplicates()
>>> List: [2, List: [1, 2]]

Note: the list elements of a List automatically get converted to Lists, recursively.

ds (Data Science)

This module provides data science tools for:

data wrangling,
validation,
tuning,
sampling,
evaluation,
clustering, and
parallel processing of machine learning models.

KMeans Clustering

I have used the KMeans class from both sklearn and that of pyspark and was frustrated by two problems: (a) even though the two classes do exactly the same thing their interfaces are vastly different and (b) some of the simplest operations are very hard to do with both classes. I solved this problem by creating my own KMeans class that is a wrapper aroung both of those classes and uses the appropriate one automatically without complicating it for the data scientist programmer.

Usage

from atlantis.ds.clustering import KMeans

kmeans = KMeans(n_clusters=3, n_jobs=10)
kmeans.fit(X=X)

predictions = kmeans.predict(X=X)
transformed_x = kmeans.transform(X=X)

Clustering Optimization

Usage

from atlantis.ds.clustering import ClusteringOptimizer

clustering_optimizer = ClusteringOptimizer(min_k=2, max_k=16, n_jobs=10)
clustering_optimizer.fit(X=X)
print(f'best number of clusters: {clustering_optimizer.optimal_number_of_clusters}')

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

2023.6.21

Jun 21, 2023

2022.12.19

Dec 19, 2022

2022.9.7.1

Sep 7, 2022

2022.9.7

Sep 7, 2022

2022.3.22

Mar 22, 2022

2022.3.17

Mar 17, 2022

2022.3.13

Mar 14, 2022

2022.3.8

Mar 9, 2022

2022.3.7

Mar 7, 2022

2022.2.26

Feb 27, 2022

2021.11.19

Nov 19, 2021

2021.9.22.1

Sep 22, 2021

2021.9.22

Sep 22, 2021

2021.9.13.1

Sep 13, 2021

2021.9.13

Sep 13, 2021

2021.9.10.1

Sep 10, 2021

2021.9.10

Sep 10, 2021

2021.9.9

Sep 9, 2021

2021.9.8.3

Sep 8, 2021

2021.9.8.2

Sep 8, 2021

2021.9.8.1

Sep 8, 2021

2021.9.8

Sep 8, 2021

2021.9.7.1

Sep 8, 2021

2021.9.7

Sep 8, 2021

2021.9.2

Sep 2, 2021

2021.8.16

Aug 16, 2021

2021.8.6.2

Aug 6, 2021

2021.8.6.1

Aug 6, 2021

2021.8.6

Aug 6, 2021

2021.7.30.2

Jul 30, 2021

2021.7.30.1

Jul 30, 2021

2021.7.30

Jul 30, 2021

2021.7.29.2

Jul 29, 2021

2021.7.29.1

Jul 29, 2021

2021.7.29

Jul 29, 2021

2021.7.25

Jul 26, 2021

2021.7.21

Jul 21, 2021

2021.7.20.3

Jul 20, 2021

2021.7.20.2

Jul 20, 2021

2021.7.20.1

Jul 20, 2021

2021.7.20

Jul 20, 2021

2021.7.19.1

Jul 20, 2021

2021.7.19

Jul 19, 2021

2021.7.17

Jul 17, 2021

2021.7.16.1

Jul 17, 2021

2021.7.16

Jul 17, 2021

2021.7.9.3

Jul 9, 2021

2021.7.9.2

Jul 9, 2021

2021.7.9.1

Jul 9, 2021

2021.7.9

Jul 9, 2021

2021.7.5.13

Jul 5, 2021

2021.7.5.12

Jul 5, 2021

2021.7.5.11

Jul 5, 2021

2021.7.5.10

Jul 5, 2021

2021.7.5.9

Jul 5, 2021

2021.7.5.8

Jul 5, 2021

2021.7.5.7

Jul 5, 2021

2021.7.5.6

Jul 5, 2021

2021.7.5.5

Jul 5, 2021

2021.7.5.4

Jul 5, 2021

2021.7.5.3

Jul 5, 2021

2021.7.5.2

Jul 5, 2021

2021.7.5.1

Jul 5, 2021

2021.7.5

Jul 5, 2021

0.4

Feb 12, 2019

0.3

Feb 12, 2019

0.2

Feb 12, 2019

0.1

Feb 12, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

atlantis-2023.6.21.tar.gz (131.8 kB view hashes)

Uploaded Jun 21, 2023 Source

Built Distribution

atlantis-2023.6.21-py3-none-any.whl (199.8 kB view hashes)

Uploaded Jun 21, 2023 Python 3

Hashes for atlantis-2023.6.21.tar.gz

Hashes for atlantis-2023.6.21.tar.gz
Algorithm	Hash digest
SHA256	`359d5cfb205a6af69f5b03e5437f5ab89e1af1bf08c8afee03cce8f4886cf316`
MD5	`f5144afcfa6cd9952845366b26ab9745`
BLAKE2b-256	`0c2635e2b5338f6a185075821572ab1d7c450cbf43e65fbcf7f316ffcd6c8013`

Hashes for atlantis-2023.6.21-py3-none-any.whl

Hashes for atlantis-2023.6.21-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9fba22cf704e9a9915f1e982029db01e2c37e844374b4ddaf0b0f91cf7470387`
MD5	`20ca09150bb234826a49401fea12af43`
BLAKE2b-256	`202f20b28369cacfef33bdf7f812511c846c31cf842b7787110fea125f3ebe68`