discovery-transition-ds

Advanced data cleaning, data wrangling and feature extraction tools for ML engineers

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

1 Filling the Gap - Project Hadron

Project Hadron has been built to bridge the gap between data scientists and data engineers. More specifically between machine learning business outcomes and the final product.

Project Hadron is a core set of abstractions that are the foundation of the three key elements that represent data science, those being: (1) feature engineering, (2) the construction of synthetic data with simulators, and generators (3) and statistics and machine learning algorithms for discovery and creating models. Project Hadron uniquely sees data as ‘all the same’ (lazyprogrammer (2020) https://lazyprogrammer.me/all-data-is-the-same/) , by which we mean its origin, shape and size stay independent throughout the disciplines so its content, form and structure can be removed as a factor in the design and implementation of the components built.

Project Hadron has been designed to place data scientists in the familiar environment of machine learning and statistical tools, extracting their ideas and translating them automagicially into production ready solutions familiar to data engineers and Subject Matter Experts (SME’s).

Project Hadron provides a clear Separation of Concerns, whilst maintaining the original intentions of the data scientist, that can be passed to a production team. It offers trust between the data scientists teams and product teams. It brings with it transparency and traceability, dealing with bias, fairness, and knowledge. The resulting outcome provides the product engineers with adaptability, robustness, and reuse; fitting seamlessly into a microservices solution that can be language agnostic.

At the heart of Project Hardon is a multi-tenant, NoSQL, singleton, in memory data store that has minimal code and functionality and has been custom built specifically for Hadron tasks in mind. Abstracted from this is the component store which allows us to build a reusable set of methods that define each tenanted component that sits separately from the store itself. In addition, a dynamic key value class provides labeling so that each tenant is not tied to a fixed set of reference values unless by specificity. Each of the classes, the data store, the component property manager, and the key value pairs that make up the component are all independent, giving complete flexibility and minimum code footprint to the build process of new components.

This is what gives us the Domain Contract for each tennant which sits at the heart of what makes the contracts reusable, translatable, transferable and brings the data scientist closer to the production engineer along with building a production ready component solution.

2 Main features

Data Preparation
Feature Selection
Feature Engineering
Feature Cataloguing
Augmented Knowledge
Synthetic Feature Build

3 Background

Born out of the frustration of time constraints and the inability to show business value within a business expectation, this project aims to provide a set of tools to quickly build production ready data science disciplines within a component based solution demonstrating coupling and cohesion between each disipline, providing a separation of concerns between components.

It also aims to improve the communication outputs needed by ML delivery to talk to Pre-Sales, Stakholders, Business SME’s, Data SME’s product coders and tooling engineers while still remaining within familiar code paradigms.

4 Getting Started

The discovery-transition-ds package is a set of python components that are focussed on Data Science. They are a concrete implementation of the Project Hadron abstract core. It is build to be very light weight in terms of package dependencies requiring nothing beyond what would be found in an basic Data Science environment. Its designed to be used easily within multiple python based interfaces such as Jupyter, IDE or command-line python.

5 Installation

package install

The best way to install AI-STAC component packages is directly from the Python Package Index repository using pip. All AI-STAC components are based on a pure python foundation package aistac-foundation

$ pip install aistac-foundation

The AI-STAC component package for the Transition is discovery-transition-ds and pip installed with:

$ pip install discovery-transition-ds

if you want to upgrade your current version then using pip install upgrade with:

$ pip install --upgrade discovery-transition-ds

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

4.18.14

Jul 3, 2023

4.18.13

Jun 23, 2023

4.18.12

Jun 23, 2023

4.18.11

Jun 23, 2023

4.18.10

Jun 15, 2023

4.18.9

Jun 14, 2023

4.18.8

Jun 14, 2023

4.18.7

Jun 14, 2023

4.18.6

Jun 14, 2023

4.18.5

Jun 14, 2023

4.18.4

Jun 14, 2023

4.18.2

Jun 14, 2023

4.18.1

Jun 12, 2023

4.18.0

Jun 11, 2023

4.17.6

Jun 8, 2023

4.17.5

Jun 8, 2023

4.17.4

Jun 8, 2023

4.17.3

Jun 1, 2023

4.17.2

Jun 1, 2023

4.17.1

May 30, 2023

4.17.0

May 30, 2023

4.16.14

May 30, 2023

4.16.13

May 30, 2023

4.16.12

May 26, 2023

4.16.11

May 21, 2023

4.16.10

May 20, 2023

4.16.9

May 20, 2023

4.16.8

May 20, 2023

4.16.7

May 13, 2023

4.16.6

May 13, 2023

4.16.5

May 13, 2023

4.16.4

May 12, 2023

4.16.3

May 12, 2023

4.16.2

May 12, 2023

4.16.1

May 12, 2023

4.16.0

May 12, 2023

4.15.9

May 11, 2023

4.15.8

May 11, 2023

4.15.7

May 10, 2023

4.15.6

May 9, 2023

4.15.5

May 9, 2023

4.15.4

May 8, 2023

4.15.3

May 8, 2023

4.15.2

May 8, 2023

4.15.1

May 7, 2023

4.15.0

May 7, 2023

4.14.3

May 3, 2023

4.14.2

May 3, 2023

4.14.1

Apr 30, 2023

4.14.0

Apr 29, 2023

4.13.13

Apr 29, 2023

4.13.12

Apr 29, 2023

4.13.11

Apr 26, 2023

4.13.10

Apr 26, 2023

4.13.9

Apr 25, 2023

4.13.8

Apr 25, 2023

4.13.7

Apr 24, 2023

4.13.6

Apr 24, 2023

4.13.5

Apr 24, 2023

4.13.4

Apr 24, 2023

4.13.2

Apr 20, 2023

4.13.1

Apr 20, 2023

4.13.0

Apr 19, 2023

4.12.12

Apr 19, 2023

4.12.11

Apr 17, 2023

4.12.10

Apr 13, 2023

4.12.9

Apr 12, 2023

4.12.8

Apr 11, 2023

4.12.7

Apr 10, 2023

4.12.6

Apr 10, 2023

4.12.5

Apr 10, 2023

4.12.4

Apr 10, 2023

4.12.3

Apr 10, 2023

4.12.2

Apr 9, 2023

4.12.1

Apr 9, 2023

4.12.0

Apr 9, 2023

4.11.14

Apr 2, 2023

4.11.13

Apr 2, 2023

4.11.12

Mar 30, 2023

4.11.11

Mar 29, 2023

4.11.10

Mar 29, 2023

4.11.8

Mar 28, 2023

4.11.7

Mar 27, 2023

4.11.6

Mar 27, 2023

4.11.5

Mar 27, 2023

4.11.4

Mar 27, 2023

4.11.3

Mar 27, 2023

4.11.2

Mar 27, 2023

4.11.1

Mar 26, 2023

4.11.0

Mar 25, 2023

4.10.11

Mar 25, 2023

4.10.10

Mar 25, 2023

4.10.9

Mar 23, 2023

4.10.8

Mar 23, 2023

4.10.7

Mar 23, 2023

4.10.4

Mar 23, 2023

4.10.2

Mar 23, 2023

4.10.1

Mar 22, 2023

4.10.0

Mar 18, 2023

4.9.9

Mar 18, 2023

4.9.8

Mar 16, 2023

4.9.7

Mar 16, 2023

4.9.6

Mar 16, 2023

4.9.5

Mar 15, 2023

4.9.4

Mar 15, 2023

4.9.3

Mar 12, 2023

4.9.2

Mar 12, 2023

4.9.1

Mar 11, 2023

4.8.4

Mar 10, 2023

4.8.2

Mar 8, 2023

4.8.1

Mar 8, 2023

4.8.0

Mar 8, 2023

4.7.2

Mar 8, 2023

4.7.1

Mar 8, 2023

4.6.1

Feb 13, 2023

4.6.0

Feb 9, 2023

4.5.5

Feb 9, 2023

4.5.4

Feb 6, 2023

4.5.3

Jan 26, 2023

4.5.2

Jan 25, 2023

4.5.1

Jan 25, 2023

4.5.0

Jan 25, 2023

4.4.1

Jan 23, 2023

4.4.0

Jan 22, 2023

4.3.0

Jan 19, 2023

4.2.1

Jan 13, 2023

4.2.0

Jan 5, 2023

4.1.9

Jan 5, 2023

4.1.8

Jan 5, 2023

4.1.6

Jan 5, 2023

4.1.5

Jan 5, 2023

4.1.4

Jan 5, 2023

4.1.3

Jan 5, 2023

4.1.2

Jan 5, 2023

4.1.1

Jan 5, 2023

4.0.1

Jan 4, 2023

3.5.27

Jan 2, 2023

3.5.26

Dec 31, 2022

3.5.25

Dec 31, 2022

3.5.24

Dec 31, 2022

3.5.23

Dec 31, 2022

3.5.22

Dec 28, 2022

3.5.21

Dec 21, 2022

3.5.20

Dec 21, 2022

3.5.19

Dec 21, 2022

3.5.18

Dec 17, 2022

3.5.17

Dec 15, 2022

3.5.16

Dec 15, 2022

3.5.15

Dec 15, 2022

3.5.14

Dec 15, 2022

3.5.13

Dec 14, 2022

3.5.12

Dec 14, 2022

3.5.11

Dec 12, 2022

3.5.10

Dec 12, 2022

3.5.9

Dec 12, 2022

3.5.8

Dec 12, 2022

3.5.7

Dec 12, 2022

3.5.6

Dec 11, 2022

3.5.5

Dec 10, 2022

3.5.4

Dec 10, 2022

3.5.3

Dec 10, 2022

3.5.2

Dec 10, 2022

3.5.1

Dec 8, 2022

3.5.0

Dec 8, 2022

3.4.62

Dec 8, 2022

3.4.61

Dec 6, 2022

3.4.60

Dec 6, 2022

3.4.59

Dec 6, 2022

3.4.58

Dec 6, 2022

3.4.57

Dec 6, 2022

3.4.56

Dec 5, 2022

3.4.54

Dec 4, 2022

3.4.53

Dec 4, 2022

3.4.52

Dec 2, 2022

3.4.51

Nov 29, 2022

3.4.50

Nov 29, 2022

3.4.49

Nov 28, 2022

3.4.48

Nov 28, 2022

3.4.47

Nov 28, 2022

3.4.46

Nov 28, 2022

3.4.44

Nov 27, 2022

3.4.43

Nov 27, 2022

3.4.42

Nov 27, 2022

3.4.41

Nov 26, 2022

3.4.40

Nov 26, 2022

3.4.39

Nov 26, 2022

3.4.38

Nov 24, 2022

3.4.37

Nov 23, 2022

3.4.36

Nov 22, 2022

3.4.35

Nov 20, 2022

3.4.34

Nov 20, 2022

3.4.33

Nov 18, 2022

3.4.32

Nov 18, 2022

3.4.31

Nov 18, 2022

This version

3.4.30

Nov 18, 2022

3.4.28

Nov 18, 2022

3.4.27

Nov 18, 2022

3.4.20

Nov 3, 2022

3.4.19

Nov 3, 2022

3.4.18

Nov 2, 2022

3.4.17

Nov 2, 2022

3.4.16

Nov 1, 2022

3.4.15

Oct 29, 2022

3.4.14

Oct 28, 2022

3.4.13

Oct 19, 2022

3.4.12

Oct 4, 2022

3.4.11

Sep 29, 2022

3.4.9

Sep 19, 2022

3.4.8

Sep 14, 2022

3.4.7

Sep 13, 2022

3.4.6

Sep 11, 2022

3.4.5

Sep 7, 2022

3.4.4

Sep 7, 2022

3.4.3

Sep 6, 2022

3.4.2

Aug 29, 2022

3.4.1

Aug 27, 2022

3.4.0

Aug 24, 2022

3.3.48

Aug 24, 2022

3.3.47

Aug 23, 2022

3.3.46

Aug 22, 2022

3.3.45

Aug 22, 2022

3.3.44

Aug 22, 2022

3.3.43

Aug 18, 2022

3.3.41

Aug 17, 2022

3.3.40

Aug 16, 2022

3.3.38

Aug 10, 2022

3.3.37

Aug 7, 2022

3.3.36

Jul 21, 2022

3.3.35

Jul 21, 2022

3.3.34

Jul 20, 2022

3.3.33

Jul 20, 2022

3.3.32

Jul 19, 2022

3.3.31

Jul 18, 2022

3.3.30

Jul 16, 2022

3.3.29

Jul 15, 2022

3.3.25

Jul 8, 2022

3.3.24

Jul 7, 2022

3.3.23

Jul 7, 2022

3.3.22

Jul 7, 2022

3.3.21

Jul 6, 2022

3.3.20

Jul 6, 2022

3.3.19

Jul 2, 2022

3.3.18

Jul 1, 2022

3.3.17

Jun 22, 2022

3.3.16

Jun 21, 2022

3.3.15

May 30, 2022

3.3.13

May 23, 2022

3.3.12

May 6, 2022

3.2.85

Jun 29, 2021

3.2.84

Jun 29, 2021

3.2.83

Jun 29, 2021

3.2.82

Jun 24, 2021

3.2.81

Jun 24, 2021

3.2.80

Jun 24, 2021

3.2.79

Jun 24, 2021

3.2.77

Jun 24, 2021

3.2.76

Jun 23, 2021

3.2.75

Jun 15, 2021

3.2.74

Jun 3, 2021

3.2.73

May 26, 2021

3.2.71

May 21, 2021

3.2.70

May 21, 2021

3.2.69

May 19, 2021

3.2.68

May 16, 2021

3.2.67

May 15, 2021

3.2.66

May 15, 2021

3.2.65

May 12, 2021

3.2.64

May 11, 2021

3.2.63

May 11, 2021

3.2.62

May 5, 2021

3.2.61

Apr 29, 2021

3.2.60

Apr 28, 2021

3.2.59

Apr 28, 2021

3.2.58

Apr 28, 2021

3.2.57

Apr 28, 2021

3.2.56

Apr 28, 2021

3.2.55

Apr 28, 2021

3.2.54

Apr 28, 2021

3.2.53

Apr 28, 2021

3.2.52

Apr 28, 2021

3.2.51

Apr 27, 2021

3.2.50

Apr 27, 2021

3.2.49

Apr 22, 2021

3.2.48

Apr 22, 2021

3.2.47

Apr 22, 2021

3.2.43

Mar 28, 2021

3.2.42

Mar 24, 2021

3.2.41

Mar 23, 2021

3.2.40

Mar 22, 2021

3.2.39

Mar 22, 2021

3.2.38

Mar 17, 2021

3.2.37

Mar 17, 2021

3.2.36

Mar 16, 2021

3.2.35

Mar 15, 2021

3.2.34

Mar 12, 2021

3.2.33

Mar 12, 2021

3.2.30

Mar 9, 2021

3.2.29

Mar 8, 2021

3.2.28

Mar 8, 2021

3.2.27

Mar 8, 2021

3.2.26

Mar 5, 2021

3.2.25

Mar 5, 2021

3.2.24

Mar 5, 2021

3.2.23

Mar 5, 2021

3.2.22

Mar 5, 2021

3.2.21

Mar 3, 2021

3.2.20

Mar 3, 2021

3.2.19

Mar 3, 2021

3.2.18

Mar 2, 2021

3.2.17

Mar 1, 2021

3.2.15

Feb 27, 2021

3.2.14

Feb 26, 2021

3.2.12

Feb 26, 2021

3.2.11

Feb 26, 2021

3.2.10

Feb 26, 2021

3.2.9

Feb 25, 2021

3.2.8

Feb 24, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

discovery-transition-ds-3.4.30.tar.gz (6.7 MB view hashes)

Uploaded Nov 18, 2022 Source

Built Distribution

discovery_transition_ds-3.4.30-py38-none-any.whl (6.9 MB view hashes)

Uploaded Nov 18, 2022 Python 3.8

Hashes for discovery-transition-ds-3.4.30.tar.gz

Hashes for discovery-transition-ds-3.4.30.tar.gz
Algorithm	Hash digest
SHA256	`3e4fc5994e94157e9b685f9b3fff0b4c441f1c3fb2eb313506eb714ab18b61ed`
MD5	`27b8c52bbd07439c46668c12bd97e05e`
BLAKE2b-256	`dde8a206c49368bba2dfd3e43570d93b41c6e135aac764e25599ad1796b1603b`

Hashes for discovery_transition_ds-3.4.30-py38-none-any.whl

Hashes for discovery_transition_ds-3.4.30-py38-none-any.whl
Algorithm	Hash digest
SHA256	`97a0f1b44a91a3567225130f635afa33e6985bc421277a46bcf76d76df56d6a1`
MD5	`16778c5dcf48d9fe63e07f855d27c608`
BLAKE2b-256	`b4e9865b11b3f02c7e2918d3595c8224c1c4a9b2ce5bfa00215f85399b92807a`