An interactive data science tutor that helps guide sklearn projects.

Project description

Data Science Tutor

I've been learning data science for a while now. I've noticed that in different projects I might delve deep into a particular step of the data science workflow, but the learnings are not carried over to the next project.

There's a misalignment between "2nd brain" tools like Obsidian and Notion and my needs. Having the notes and mathematics in one place and executable code in another place is annoying.

This project is a kind of 2nd brain where I can gather robust data science practices, data visualisation aids and mathematical reminders in one place.

Goal

Create a Python package that I can import into other projects, bringing with it everything I've learned and built. At its core is a base tutor class that acts as a kind of practical note on a subject. For example, a MissingDataTutor class contains the code to create missing data visualisations, perhaps a reminder of which estimators don't work with missing data, and the other bits of information that a tutor teaching about missing data would mention.
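To make the idea concrete, here is a minimal sketch of what a base tutor and a MissingDataTutor might look like. The names `BaseTutor`, `notes`, `inspect` and `remind` are assumptions made for illustration, not the package's actual API, and the data is a plain list of dicts so the sketch runs without any dependencies (a real version would more likely work on a pandas DataFrame and draw plots).

```python
from abc import ABC, abstractmethod


class BaseTutor(ABC):
    """Hypothetical base class: a practical note that can also run checks."""

    # Short reminders a human tutor would mention on this topic.
    notes: tuple = ()

    @abstractmethod
    def inspect(self, rows: list) -> dict:
        """Return findings for a dataset given as a list of row dicts."""

    def remind(self) -> str:
        """Render the notes as a flashcard-style bullet list."""
        return "\n".join(f"- {note}" for note in self.notes)


class MissingDataTutor(BaseTutor):
    # Illustrative reminders only; the real tutor's notes are not shown
    # in the project description.
    notes = (
        "Many estimators raise an error on missing values; impute or drop first.",
        "Ask why the values are missing before choosing how to impute them.",
    )

    def inspect(self, rows: list) -> dict:
        """Return the fraction of missing (None) values per column."""
        if not rows:
            return {}
        columns = sorted({key for row in rows for key in row})
        return {
            col: sum(row.get(col) is None for row in rows) / len(rows)
            for col in columns
        }


rows = [
    {"age": 31, "income": 42000},
    {"age": None, "income": 51000},
    {"age": 45, "income": None},
    {"age": None, "income": 38000},
]
tutor = MissingDataTutor()
missing = tutor.inspect(rows)
print(missing)
print(tutor.remind())
```

Keeping the notes next to the check is the point of the design: the reminder and the executable code travel together into the next project.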

The tutor classes are then orchestrated by higher-level experiment and EDA classes.
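One way the orchestration could work is sketched below. This is an assumption about the shape of the design, not the package's implementation: each tutor is reduced to a plain function that returns a dict of findings, and a hypothetical `EDA` class simply runs them in turn and collects the results under each tutor's name.

```python
from typing import Callable

# Assumption: a tutor is modelled as any callable that takes the rows and
# returns a dict of findings. The real package uses tutor classes instead.
Tutor = Callable[[list], dict]


def shape_tutor(rows: list) -> dict:
    """Report the basic size of the dataset."""
    return {"n_rows": len(rows), "n_columns": len(rows[0]) if rows else 0}


def missing_tutor(rows: list) -> dict:
    """Count missing (None) values per column."""
    counts: dict = {}
    for row in rows:
        for column, value in row.items():
            counts[column] = counts.get(column, 0) + (value is None)
    return counts


class EDA:
    """Hypothetical orchestrator: runs each tutor and gathers the findings."""

    def __init__(self, tutors: dict):
        self.tutors = tutors

    def run(self, rows: list) -> dict:
        return {name: tutor(rows) for name, tutor in self.tutors.items()}


rows = [{"age": 31, "city": "Leeds"}, {"age": None, "city": "York"}]
report = EDA({"shape": shape_tutor, "missing": missing_tutor}).run(rows)
print(report)
```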

The package should help me to complete data science projects whilst reminding me of core mathematical concepts and ensuring I'm not making wrong assumptions or falling for any gotchas.

It needs to operationalise my knowledge, enforce rigorous standards and act as a flashcard for me, without hiding the implementation details.

Code

Outcomes

Project Context Class

  • Creates train-test split

EDA Class

  • Runs Mutual Information
  • Flags missing values
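The three outcomes above can be sketched in plain Python. This is a dependency-free illustration under stated assumptions, not the package's code: the function names are hypothetical, the mutual information is the textbook estimate for discrete variables computed from empirical counts, and a real sklearn project would presumably delegate to `train_test_split` and `mutual_info_classif` instead. Writing the sum out by hand keeps the formula I(X;Y) = Σ p(x,y) log[p(x,y) / (p(x) p(y))] visible, in the flashcard spirit of the project.

```python
import math
import random
from collections import Counter


def train_test_split_rows(rows, test_size=0.25, seed=0):
    """Shuffle a copy of the rows and split off a held-out test set."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, round(len(shuffled) * test_size))
    return shuffled[n_test:], shuffled[:n_test]


def mutual_information(xs, ys):
    """I(X;Y) in nats for two discrete sequences, from empirical counts."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px, py = Counter(xs), Counter(ys)
    return sum(
        (c / n) * math.log((c / n) / ((px[x] / n) * (py[y] / n)))
        for (x, y), c in joint.items()
    )


def flag_missing(rows):
    """Return the columns that contain at least one None value."""
    return sorted({k for row in rows for k, v in row.items() if v is None})


# Toy data: "wet" copies the target "rain", "coin" is independent of it,
# and "temp" has a single missing value.
rows = [
    {"wet": i % 2, "coin": (i // 2) % 2, "rain": i % 2,
     "temp": None if i == 3 else 20 + i}
    for i in range(8)
]

# Split first, then explore only the training rows to avoid leakage.
train, test = train_test_split_rows(rows, test_size=0.25, seed=0)
target = [r["rain"] for r in train]
for feature in ("wet", "coin"):
    score = mutual_information([r[feature] for r in train], target)
    print(feature, round(score, 3))
print("missing in train:", flag_missing(train))
```

On the full toy dataset the copied feature scores ln 2 (the entropy of a balanced binary target) and the independent feature scores 0, which is a handy sanity check for the formula.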

Source Distribution

ds_tutor-0.1.0.tar.gz (4.2 kB)

Uploaded Source

Built Distribution

ds_tutor-0.1.0-py3-none-any.whl (6.0 kB)

Uploaded Python 3

File details

Details for the file ds_tutor-0.1.0.tar.gz.

File metadata

  • Download URL: ds_tutor-0.1.0.tar.gz
  • Upload date:
  • Size: 4.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.2

File hashes

Hashes for ds_tutor-0.1.0.tar.gz
Algorithm Hash digest
SHA256 d03aed2df9bced7a3d55cd11d0373f5292fe6f6fe82919e95ff429890ff4eb4e
MD5 8403d75699fdfc1e00745b00919d933b
BLAKE2b-256 522d379d608666eb5ecfe1a30512287037f9769aa4a9400f8cd2679dd2d00f4f

File details

Details for the file ds_tutor-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: ds_tutor-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 6.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.2

File hashes

Hashes for ds_tutor-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 da82197ab65da34d2a6220c5b3f61d21f4a6d7e814b2455b9274ca252136f654
MD5 d6c804a39d3d356f5dc4f266d258727c
BLAKE2b-256 02a19a25e6df33aa0bdbd2a56e2efa506ea981f1a237bbc1af9c8cc8945f218a
