Skip to main content

sscu-budapest utilities for scientific data engineering

Project description

sscutils

Documentation Status codeclimate codecov pypi

Some utility function to help with

  • setting up data subsets with invoke
  • simplified dvc pipeline registry

these are used in dataset-template and research-project-template

Make sure that python points to python>=3.8

Lookahead

  • overlapping names convention
  • resolve naming confusion with colassigner, colaccessor and table feature / composite type / index base classes
  • abstract composite type + subclass of entity class
    • import ACT, inherit from it and specify
    • importing composite type is impossible now if it contains foreign key :(
  • automatic filter for env creation based on foreign key metadata
  • add option to infer data type of assigned feature
    • can be problematic b/c pandas int/float/nan issue
  • metadata created dry, dynamically, but imported static, wet
  • sharing functions among projects
    • functions specific to processing certain composite / named types
    • e.g. function dealing with fitting into a limit in dogshow project 1
  • detecting reliance of composite type given by assigner
    • can wait, as initial import is just the assigner transformed to accessor
  • overlapping in entities
    • detect / signal the same type of entity
  • properly assert importing

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sscutils-0.3.3.tar.gz (67.8 kB view details)

Uploaded Source

File details

Details for the file sscutils-0.3.3.tar.gz.

File metadata

  • Download URL: sscutils-0.3.3.tar.gz
  • Upload date:
  • Size: 67.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.10.1 pkginfo/1.8.2 requests/2.27.1 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.10.2

File hashes

Hashes for sscutils-0.3.3.tar.gz
Algorithm Hash digest
SHA256 c14901726492c6597dc475e2828eecd101d527d8a1b8bcb51c69a57fc3398137
MD5 2409dd1787ad31cee06384103266e4dd
BLAKE2b-256 6c5af6b54652d54ef7e42ed81e15dabaa4257f761b3835ad24eafbdf104c9c28

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page