Skip to main content

sscu-budapest utilities for scientific data engineering

Project description

datazimmer

Documentation Status codeclimate codecov pypi

Some utility function to help with

  • setting up data environments
  • simplified dvc pipeline registry

these are used in the project-template

Make sure that python points to python>=3.8 and you have pip and git

Functions

Tinker

check out a table or few, with a notebook and some basic analysis to help

Engineer Research

Lookahead

  • overlapping names convention
  • resolve naming confusion with colassigner, colaccessor and table feature / composite type / index base classes
  • abstract composite type + subclass of entity class
    • import ACT, inherit from it and specify
    • importing composite type is impossible now if it contains foreign key :(
  • automatic filter for env creation based on foreign key metadata
  • add option to infer data type of assigned feature
    • can be problematic b/c pandas int/float/nan issue
  • sharing functions among projects
    • functions specific to processing certain composite / named types
    • e.g. function dealing with fitting into a limit in dogshow project 1
  • create similar sets of features in a dry way
  • detecting reliance of composite type given by assigner
    • can wait, as initial import is just the assigner transformed to accessor
  • overlapping in entities
    • detect / signal the same type of entity
  • properly assert importing

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datazimmer-0.2.5.tar.gz (67.0 kB view hashes)

Uploaded Source

Built Distribution

datazimmer-0.2.5-py3-none-any.whl (53.5 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page