Skip to main content

Personal side project to streamline the most common tasks of data science solutions in an efficient manner. This project is based on my experience as a lead data scientist in the industry and financial services sectors, where I have gained expertise in delivering effective data-driven insights and solutions

Project description

smart_data_science

Personal side project to streamline the most common tasks of data science solutions in an efficient manner. This project is based on my experience as a lead data scientist in the industry and financial services sectors, where I have gained expertise in delivering effective data-driven insights and solutions

Code style: black

Installation in Dev / Editor mode

Note: A Debian/Ubuntu Machine, VM or container is highly recommended

Step 0: One-time Machine setup only valid for all Data Science Projects

Create or use a Machine with Conda, Git and Poetry as closely as defined in .devcontainer/Dockerfile:

  • This Dockerfile contains a non-root user so the same configuration can be applied to a WSL Ubuntu Machine and any Debian/Ubuntu CLoud Machine (Vertex AI workbench, Azure VM ...)
  • In case of having an Ubuntu/Debian machine with non-root user (e.g.: Ubuntu in WSL, Vertex AI VM ...), just install the tools from "non-root user" (no sudo) section of the Dockerfile (sudo apt-get install <software> may be required)
  • A pre-configured Cloud VM usually has Git and Conda pre-installed, those steps can be skipped
  • The development container defined in .devcontainer/Dockerfile can be directly used for a fast setup (Docker required). With Visual Studio Code, just open the root folder of this repo, press F1 and select the option Dev Containers: Open Workspace in Container. The container will open the same workspace after the Docker Image is built.

Step 1. Enter to the root path of the repo and use or create a new conda environment for development:

$ conda create -n dev python=3.10 -y && conda activate dev

Step 2. Install all the Dependencies and the package in editor mode:

$ make setup

Installation for Production/Usage (Not published in PyPi yet)

$ conda create -n smart python=3.10 -y && conda activate smart
$ pip install dist/smart-data-science-0.1.1-py3-none-any.whl

Installation for Production/Usage (after the package is published in PyPi)

$ pip install smart_data_science

Usage

  • Still under development. Please refer to the notebooks and examples folders for usage examples

Contributing

Check out the contributing guidelines

License

smart_data_science was created by Angel Martinez-Tenor. It is licensed under the terms of the MIT license.

Credits

smart_data_science was created from a Data Science Template developed by Angel Martinez-Tenor. The template was built upon py-pkgs-cookiecutter [template] (https://github.com/py-pkgs/py-pkgs-cookiecutter)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

smart_data_science-0.1.2.tar.gz (9.7 MB view details)

Uploaded Source

Built Distribution

smart_data_science-0.1.2-py3-none-any.whl (9.7 MB view details)

Uploaded Python 3

File details

Details for the file smart_data_science-0.1.2.tar.gz.

File metadata

  • Download URL: smart_data_science-0.1.2.tar.gz
  • Upload date:
  • Size: 9.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.2.2 CPython/3.10.6 Linux/5.15.90.1-microsoft-standard-WSL2

File hashes

Hashes for smart_data_science-0.1.2.tar.gz
Algorithm Hash digest
SHA256 00f1bf08641219b6748144fe9a40adfb2f5051bc0e70f600e16db2e8bbe91ca6
MD5 eeecaa6012a9a9da303d396ee1db836b
BLAKE2b-256 4afe75ae3eb44767c57ccdccf0090048b520fb94b9a88080a8db20f230699af8

See more details on using hashes here.

File details

Details for the file smart_data_science-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: smart_data_science-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 9.7 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.2.2 CPython/3.10.6 Linux/5.15.90.1-microsoft-standard-WSL2

File hashes

Hashes for smart_data_science-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 13e4159143ac9b202019092042e1a508c28ba0db527198fb216efbbe30f4c11a
MD5 da5e5fea94f229461a3fee0ea396930e
BLAKE2b-256 27a43a02e80d84df38d0dd6ece6852690fe3e06e3880dd8f25b3d0d6e818ddfb

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page