Python implementation of the Header-Included Comma-Separated Values (hicsv) file format.

These details have not been verified by PyPI

Project links

Homepage

Project description

hicsv-python

Python implementation of the I/O interface for the Header-Included Comma-Separated Values (hicsv) file format.

Documentation

The API documentation is available here.

What is hicsv?

Header-included comma-separated values (hicsv or hi-csv in short) is a text file format consisting of a header (metadata) and a tabulated values, both in a human-readable and editable form.

CSV is a great universal format that can store a table of values in a human-readable way. Many instruments and softwares can export data in CSV. We can then avoid dealing with the proprietary binary formats whose specifications are often not disclosed to consumers.

However, there is no commonly accepted method to store metadata along with a table of values. Metadata such as experimental conditions, author, etc. are often essential in science. People can easily lose track of these metadata when the CSV file only contains the table part.

There are some formats or conventions that (try to) treat a table and metadata nicely (e.g. Tabular Data Package https://specs.frictionlessdata.io/tabular-data-package/#introduction). But, to the best of the author's knowledge, there is no text-based generic format that combines a table and metadata in a single file.

It is very important to combine the metadata and the table because it would be difficult to force everyone to always keep two separate files together.

Binary formats such as HDF5 can store metadata in addition to image or array data. But again, there is no text-based, human-readable format with embedded metadata.

hicsv format has been developed as a solution to the above problem. It is basically just a CSV table with a JSON header part attached at the top. The rough, tentative specification of hicsv is as follows.

It should be a Unicode text file.
It starts with consequtive lines starting with #.
The last one of the #-capped lines represents the "keys" of each column in the table.
The rest of the #-capped lines are an indented JSON which represents the dictionary of metadata.
The lines not starting with # are the CSV table. The CSV part follows the generally accepted conventions (mostly RFC 4180 but may not be strict) except for following additional restrictions.
The columns in the CSV table should always be of the same length.
Each column must consist of only one type.

Example of hicsv format

#{
#    "some metadata entry": "some metadata",
#    "it can be anything": {
#        "that": "can be made into a JSON file",
#        "for example": "a dictionary"
#    },
#    "もちろん": "ユニコード文字もOK",
#    "hicsv-python version": "0.0.0",
#    "hicsv version": "20220812"
#}
#"first column"     ,"2nd column","column 3","四番目のカラム"
0.391006442374058   ,0           ,"string"  ,"いち"     
0.6943031206081407  ,1           ,"is"      ,"に"      
0.03433735164688634 ,2           ,"also"    ,"さん"     
0.015028457129899642,3           ,"OK"      ,"よん"     
0.5900201005940872  ,4           ,"!"       ,"ご"

Scope of this package

The hicsv package is a minimal implementation of the I/O interface to the hicsv-formatted files. It, therefore, does not have (or aim for) any advanced functionalities. Please use numpy and/or other useful packages for anything beyond I/O.

Example

To read a hicsv-formatted text file,

import hicsv
d = hicsv.hicsv("foo.txt")

Or you can just create an empty object to which you can add columns and header info.

e = hicsv.hicsv()

You can get one or more columns.

c1, c2 = d.ga("column 1", "column 2")

The header info is stored as a dictionary. You can set and get information at any time::

d.h["new header entry"] = "some value"
print(d.h["new header entry"])

If you need to create a hicsv-formatted file from scratch::

out = hicsv()

To add a column::

out.append_column("column 1", arr)

Here arr should be a 1-d numpy array. Note that all columns must be of the same length, so adding an array with different length will cause error. Then you can write into a hicsv-formatted text file like::

out.save("new hicsv file.txt")

Note

Some users may find the following behavior of this package unexpected.

The parser deletes the spaces at the beginning and at the end of each cell. This is true even if the cell content is double-quoted. For example, the two cells
```
 "spam", "  spam  "
```
results in identical strings.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

1.1.0

Mar 9, 2025

1.0.1

Dec 25, 2022

1.0.0

Dec 25, 2022

0.0.0

Aug 18, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hicsv_python-1.1.0.tar.gz (253.3 kB view details)

Uploaded Mar 9, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

hicsv_python-1.1.0-py3-none-any.whl (11.6 kB view details)

Uploaded Mar 9, 2025 Python 3

File details

Details for the file hicsv_python-1.1.0.tar.gz.

File metadata

Download URL: hicsv_python-1.1.0.tar.gz
Upload date: Mar 9, 2025
Size: 253.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.6.3

File hashes

Hashes for hicsv_python-1.1.0.tar.gz
Algorithm	Hash digest
SHA256	`647abc9f93886ca5a61fafb8a843f3f39f27d2aed61969a89cc01368cd54784e`
MD5	`4f174e0da185f411e6a374088fab74b5`
BLAKE2b-256	`814f8693c8a2f7d984541ecdab20cb60f8b5babd12d9d1a553705edbd59feebe`

See more details on using hashes here.

File details

Details for the file hicsv_python-1.1.0-py3-none-any.whl.

File metadata

Download URL: hicsv_python-1.1.0-py3-none-any.whl
Upload date: Mar 9, 2025
Size: 11.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.6.3

File hashes

Hashes for hicsv_python-1.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`66b9b319bc69b2de8d788bd5a6f5862ae796d9bcc26caad97615e7a1664346c6`
MD5	`da00879aba78360036a8ab26f1a22cf5`
BLAKE2b-256	`9fbc18a21532cc45a640cb92256bf44403859e9a9e4e9210f273f23791f08eb2`

See more details on using hashes here.

hicsv-python 1.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

hicsv-python

Documentation

What is hicsv?

Example of hicsv format

Scope of this package

Example

Note

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes