Personal side project to streamline the most common tasks of data science solutions in an efficient manner. This project is based on my experience as a lead data scientist in the industry and financial services sectors, where I have gained expertise in delivering effective data-driven insights and solutions
Project description
smart_data_science
Personal side project to streamline the most common tasks of data science solutions in an efficient manner. This project is based on my experience as a lead data scientist in the industry and financial services sectors, where I have gained expertise in delivering effective data-driven insights and solutions
Installation in Dev / Editor mode
Note: A Debian/Ubuntu Machine, VM or container is highly recommended
Step 0: One-time Machine setup only valid for all Data Science Projects
Create or use a Machine with Conda, Git and Poetry as closely as defined in .devcontainer/Dockerfile
:
- This Dockerfile contains a non-root user so the same configuration can be applied to a WSL Ubuntu Machine and any Debian/Ubuntu CLoud Machine (Vertex AI workbench, Azure VM ...)
- In case of having an Ubuntu/Debian machine with non-root user (e.g.: Ubuntu in WSL, Vertex AI VM ...), just install the tools from "non-root user" (no sudo) section of the Dockerfile (sudo apt-get install <software> may be required)
- A pre-configured Cloud VM usually has Git and Conda pre-installed, those steps can be skipped
- The development container defined in
.devcontainer/Dockerfile
can be directly used for a fast setup (Docker required). With Visual Studio Code, just open the root folder of this repo, pressF1
and select the option Dev Containers: Open Workspace in Container. The container will open the same workspace after the Docker Image is built.
Step 1. Enter to the root path of the repo and use or create a new conda environment for development:
$ conda create -n dev python=3.10 -y && conda activate dev
Step 2. Install all the Dependencies and the package in editor mode:
$ make setup
Installation for Production/Usage (Not published in PyPi yet)
$ conda create -n smart python=3.10 -y && conda activate smart
$ pip install dist/smart-data-science-0.1.1-py3-none-any.whl
Installation for Production/Usage (after the package is published in PyPi)
$ pip install smart_data_science
Usage
- Still under development. Please refer to the notebooks and examples folders for usage examples
Contributing
Check out the contributing guidelines
License
smart_data_science
was created by Angel Martinez-Tenor. It is licensed under the terms of the MIT license.
Credits
smart_data_science
was created from a Data Science Template developed by Angel Martinez-Tenor. The template was built upon py-pkgs-cookiecutter
[template] (https://github.com/py-pkgs/py-pkgs-cookiecutter)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file smart_data_science-0.1.2.tar.gz
.
File metadata
- Download URL: smart_data_science-0.1.2.tar.gz
- Upload date:
- Size: 9.7 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.2.2 CPython/3.10.6 Linux/5.15.90.1-microsoft-standard-WSL2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 00f1bf08641219b6748144fe9a40adfb2f5051bc0e70f600e16db2e8bbe91ca6 |
|
MD5 | eeecaa6012a9a9da303d396ee1db836b |
|
BLAKE2b-256 | 4afe75ae3eb44767c57ccdccf0090048b520fb94b9a88080a8db20f230699af8 |
File details
Details for the file smart_data_science-0.1.2-py3-none-any.whl
.
File metadata
- Download URL: smart_data_science-0.1.2-py3-none-any.whl
- Upload date:
- Size: 9.7 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.2.2 CPython/3.10.6 Linux/5.15.90.1-microsoft-standard-WSL2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 13e4159143ac9b202019092042e1a508c28ba0db527198fb216efbbe30f4c11a |
|
MD5 | da5e5fea94f229461a3fee0ea396930e |
|
BLAKE2b-256 | 27a43a02e80d84df38d0dd6ece6852690fe3e06e3880dd8f25b3d0d6e818ddfb |