Generate csv files with descriptions of the data in a header

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

c3d - create & comment `csv` data files

A small library for post-processing of data (currently only csv) files: generating metadata and documentation of included data sets

Usage example

A minimal working example looks like that:

import os
import cccd as cdf

ds = cdf.DataSet("Data", "The description of the data set.")
    ds.append("Value_1", "The description of the value nr. 1.", 42)
    ds.append("Value_2", "The description of the second value.", 69)

# we decide to save all of our results here
results_f = "example_results"
if not os.path.exists(results_f):
    os.mkdir(results_f)

cdf.write_ds(ds, f"{results_f}/ex1.csv")

Here, we have a dataset called Data with a description. We also have two values that belong to the dataset. We also provide a description of the values. Finally, we save it all in ex1.csv file that looks as follows:

# Data:
#    The description of the data set.
#
# Value_1:
#    The description of the value nr. 1.
#
# Value_2:
#    The description of the second value.
#
Value_1,Value_2
42,69

A more involved example (including metadata and some customisation) below:

import os
import cccd as cdf
import polars as pl

if __name__ == "__main__":

    # ingest the raw data
    test_data = pl.read_csv("examples/test.csv")

    data_set = cdf.DataSet(
        "This is the dataset nr. 1",
        "The description of the data set and each field can be long. It will"
        " be wrapped anyway. I could even put Lorem Ipsum here.",
    )

    # lets describe the raw data that we have
    data_set.append(
        "f1",  # name
        "This data describes f1 and is very meaningful!",  # description
        test_data[
            "first"
        ],  # value, currently it should be a Series or something like that
    )

    # We can create the DataDescription here and pass it to our dataset. This
    # is not very convienent as we have to also make sure that the second arg
    # is a dataframe.
    data_set.append_data_raw(
        cdf.DataDescription("f2", "Second data set. Not very meanigful."),
        pl.DataFrame({"f2": test_data["second"]}),
    )

    # we decide to save all of our results here
    results_f = "example_results"
    if not os.path.exists(results_f):
        os.mkdir(results_f)

    # We will write the described dataset, including metadata.
    mdata = cdf.metadata(
        author="James Bond", time_stamp=True, count_lines=True
    )
    # We will also overwrite the comment sign to `--` like in `lua`.
    cdf.write_ds(
        data_set,
        f"{results_f}/ex2.csv",
        meta_data=mdata,
        comment_text="--",
    )

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

0.1.1

Jan 29, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cccd-0.1.1.tar.gz (7.5 kB view hashes)

Uploaded Jan 29, 2024 Source

Built Distribution

cccd-0.1.1-py3-none-any.whl (7.3 kB view hashes)

Uploaded Jan 29, 2024 Python 3

Hashes for cccd-0.1.1.tar.gz

Hashes for cccd-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`62972864aa99f7a8a9d35d156874ebd3b49f2d018d7339d76a5350f6aec31356`
MD5	`5f885d0a57dc996ad6b4e542b78bd7d2`
BLAKE2b-256	`6b1dd5193a90149622b5c04bb5c4507d5afbc9058f422f8563d5d789f7e6c8a5`

Hashes for cccd-0.1.1-py3-none-any.whl

Hashes for cccd-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c67c68e80aaf485a5e364d90a35618fb3d9f24a8a471bfc4350284e5e9abd2e8`
MD5	`bbbabbe2bc113d6222a848c725c41ab8`
BLAKE2b-256	`6c8e1f8f0250f165a419acbab88647fe20197dcdaed267edb243387aae336461`

cccd 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Project description

c3d - create & comment `csv` data files

Usage example

Project details

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

cccd 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Project description

c3d - create & comment csv data files

Usage example

Project details

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

c3d - create & comment `csv` data files