Skip to main content

Memory frugal torch dataset from a csv collection

Project description

csvsdataset

csvsdataset is a Python library designed to simplify the process of working with multiple CSV files as a single dataset. The primary functionality is provided by the CsvsDataset class in the csvsdataset.py module.

Installation

To install the csvsdataset library, simply run:

pip install csvsdataset

Usage

    from csvsdataset.csvsdataset import CsvsDataset
    
    # Initialize the CsvsDataset instance
    dataset = CsvsDataset(folder_path="path/to/your/csv/folder",
                          file_pattern="*.csv",
                          x_columns=["column1", "column2"],
                          y_column="target_column")
    
    # Iterate over the dataset
    for x_data, y_data in dataset:
        # Your processing code here
        pass
    
    # Access a specific item in the dataset
    x_data, y_data = dataset[42]

Memory frugality

Only data from a small number of csv files are maintained in memory. The rest is discarded on a LRU basis. This class is intended for use when a very large number of data files exist which cannot be loaded into memory conveniently.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

csvsdataset-0.0.6.tar.gz (35.0 MB view details)

Uploaded Source

Built Distribution

csvsdataset-0.0.6-py3-none-any.whl (35.3 MB view details)

Uploaded Python 3

File details

Details for the file csvsdataset-0.0.6.tar.gz.

File metadata

  • Download URL: csvsdataset-0.0.6.tar.gz
  • Upload date:
  • Size: 35.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.2

File hashes

Hashes for csvsdataset-0.0.6.tar.gz
Algorithm Hash digest
SHA256 1f02f335df1bb2bdf17eaa8151dd6ae0fffb04ddde77ecb7d8eb8b5f1a68dd81
MD5 d80dad54fe6151d4ed01a68194f32330
BLAKE2b-256 2f685783c252f382958b0e6024998d49d32b256b07d3f6baa6d426b131f2c42a

See more details on using hashes here.

File details

Details for the file csvsdataset-0.0.6-py3-none-any.whl.

File metadata

  • Download URL: csvsdataset-0.0.6-py3-none-any.whl
  • Upload date:
  • Size: 35.3 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.2

File hashes

Hashes for csvsdataset-0.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 be4eae39c38376dfe6c2c9f82982a5fcb6e3f00ae472e237ff10c07a59ba2e52
MD5 9df1b9ec6c04a397df1a74c79fb5045c
BLAKE2b-256 261d38cb0be9745ab3dd6d27c644dd255efa61eebdbd3a7fba7c8ab79bd8afa8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page