Simple dataset convertor in Python
ConvertMe – Ayoyoyo Wololo
Simple dataset convertor in Python. Memory use is currently high for large datasets.
Installation
Install package via pip:
$ pip install convertme
Use the provided CLI:
$ convertme --help
Usage: convertme [OPTIONS]

Options:
  -if, --input-format [csv|fimi|cxt|mat|cex]   [required]
  -of, --output-format [csv|fimi|cxt|mat|cex]  [required]
  -i, --input TEXT             Input file, skip it for stdin.
  -o, --output TEXT            Output file, skip it for stdout.
  --input-delimiter TEXT       (CSV) Delimiter of input. [default: ,]
  --output-delimiter TEXT      (CSV) Delimiter of output. [default: ,]
  --objects-col INTEGER        (CSV) Index of column with object labels,
                               typically 0, ignored on default.
  --attributes-row INTEGER     (CSV) Index of row with attribute labels,
                               typically 0, ignored on default.
  --true-values TEXT           (CSV) Values which will be count as True,
                               comma separated.
  --help                       Show this message and exit.
Basic usage:
Convert a simple CSV file to the FIMI format:
$ convertme -i dataset.csv -if=csv -o dataset.fimi -of=fimi
Content of dataset.csv:
1,0,1,0
0,1,0,1
Content of dataset.fimi:
0 2
1 3
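The mapping in this example can be sketched in a few lines of Python: each CSV row becomes one FIMI line listing the zero-based indices of the columns that hold a true value. This is only an illustration of the example above, not the package's own code. The helper name rows_to_fimi and its parameters are hypothetical, and treating only "1" as true is an assumption.

```python
def rows_to_fimi(csv_text, delimiter=",", true_values=("1",)):
    """Hypothetical helper (not part of convertme's API).

    Turns each CSV row into a FIMI line containing the zero-based
    indices of the cells whose value is in true_values.
    """
    fimi_lines = []
    for row in csv_text.strip().splitlines():
        cells = [cell.strip() for cell in row.split(delimiter)]
        # Keep the index of every column that holds a "true" value.
        indices = [str(i) for i, cell in enumerate(cells) if cell in true_values]
        fimi_lines.append(" ".join(indices))
    return "\n".join(fimi_lines)


# Same content as dataset.csv in the example above.
example = "1,0,1,0\n0,1,0,1\n"
print(rows_to_fimi(example))
```

Run on the two rows of dataset.csv, the sketch prints the same two lines shown for dataset.fimi.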
Supported formats
csv
burmeister (.cxt)
fimi
matlab (version<=7.3)
conexp (.cex)
Development
Clone this repository into a local folder, then:
# create virtualenv (optional)
$ mkvirtualenv convertme -p python3
# activate it if it is not already active (optional)
$ workon convertme
$ pip install -e .
$ python setup.py test
History
0.1.0 (2020-03-24)
Supported formats (csv, burmeister, fimi, matlab (version<=7))
Basic CLI implemented
0.1.1 (2020-04-07)
Added Conexp format support
Added support of Python 3.5, 3.6
0.1.2 (2020-04-17)
Rewritten to bitarray under the hood (less memory needed)
0.1.3 (2022-08-10)
Fimi: attributes are not required to start from zero
Added support for Python 3.9, 3.10
Dropped support for Python 3.5
0.1.4 (2022-08-10)
Fixed installation error