Skip to main content

A lightweight and extensible Python package for managing data, tailored for researchers working with structured data.

Project description

📦 DWrappr

A lightweight and extensible Python package for managing data, tailored for researchers working with structured data. In addition to general data management features, the package introduces a data structure specifically optimized for ML research. This common format enables researchers to efficiently test new algorithms and methods, streamlining collaboration and ensuring consistency in data management across projects.

For more details, please study the Documentation.

🧩 Features

  • 🗃️ Consistent dataset object structure for handling structured data in ML use cases
  • 🔄 Support for building a file-based internal dataset collaboration platform for researchers
  • 🧰 General utilities for managing data like saving and loading

🏗️ Structure & Functionality

DataSet Structure

to be added

🚀 Quickstart Example

show available datasets in directory

# examples/loading_datasets.py
from dwrappr import DataSet

# Adjust the path according to the location of the datasets.
PATH_TO_DATA_DIR = "./datasets/"
available_datasets = DataSet.get_available_datasets_in_folder(path=PATH_TO_DATA_DIR)
print(available_datasets)

Example output:

                 name  ...                      local_filepath
0   example_dataset_1  ...  ./datasets/example_dataset_1.joblib
1   example_dataset_2  ...  ./datasets/example_dataset_2.joblib

load dataset from available datasets

# examples/loading_datasets.py
row_of_dataset_to_load = 0
ds = DataSet.load(available_datasets.iloc[row_of_dataset_to_load]['local_filepath'])
print(ds.as_df.head(2))

Example output:

[12 rows x 12 columns]
   UDI Product ID Type  Air temperature [K]  ...    HDF    PWF    OSF    RNF
0    1     M14860    M                298.1  ...  False  False  False  False
1    2     L47181    L                298.2  ...  False  False  False  False

load dataset directly

# examples/loading_datasets.py
PATH_TO_DATASET = "local path to your dataset"
ds = DataSet.load(PATH_TO_DATASET)

🛠️ Package Installation

pip install dwrappr

©️ License

This project is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dwrappr-0.0.3.tar.gz (17.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dwrappr-0.0.3-py3-none-any.whl (17.6 kB view details)

Uploaded Python 3

File details

Details for the file dwrappr-0.0.3.tar.gz.

File metadata

  • Download URL: dwrappr-0.0.3.tar.gz
  • Upload date:
  • Size: 17.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.12

File hashes

Hashes for dwrappr-0.0.3.tar.gz
Algorithm Hash digest
SHA256 054709bcc26cf3c2325f67eb7eb7e58b0687d1ece2e2c68a0e0fd0866b925dba
MD5 9a55d0c968c953e2dff5a9abe2921b21
BLAKE2b-256 71a5b372192e6f681ded084e59bfb06a54d8687ad5dfa1f59083fad1c95c9e3a

See more details on using hashes here.

File details

Details for the file dwrappr-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: dwrappr-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 17.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.12

File hashes

Hashes for dwrappr-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 e2b1f71b7377077716b911ebeab3f012c56dc62504ac59f0488dfa3884675a43
MD5 2909ba53173ee2db22643753fd268bd0
BLAKE2b-256 a552563c0c3fc7d94e3dc0f23df64a419432a3a3013ab7bcfb038cbe8c05f8b4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page