Read ODS files into a pandas DataFrame while preserving direct formatting (bold, italic, underline, superscript, subscript, and anchors) as markup (AsciiDoc, Markdown, HTML).

direct-formatting-pandas-ods-reader

Inspired by pandas-ods-reader, but able to read direct formatting. pandas-ods-reader relies on ezodf to extract cell content, and ezodf ignores direct formatting, so this package reads the file at a lower level.
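To illustrate what "a lower level" can mean, here is a minimal sketch, not this package's actual code. It assumes only the standard ODF layout: an ODS file is a ZIP archive whose content.xml stores partially formatted cell text as text:span elements pointing at automatic styles. It uses the standard library's xml.etree rather than lxml, handles bold only, and the helper names (q, bold_style_names, paragraph_runs) are hypothetical.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# Standard OpenDocument namespace URIs.
NS = {
    "office": "urn:oasis:names:tc:opendocument:xmlns:office:1.0",
    "style": "urn:oasis:names:tc:opendocument:xmlns:style:1.0",
    "text": "urn:oasis:names:tc:opendocument:xmlns:text:1.0",
    "table": "urn:oasis:names:tc:opendocument:xmlns:table:1.0",
    "fo": "urn:oasis:names:tc:opendocument:xmlns:xsl-fo-compatible:1.0",
}


def q(prefix, local):
    """Build a Clark-notation name such as '{uri}span' (hypothetical helper)."""
    return "{" + NS[prefix] + "}" + local


def bold_style_names(root):
    """Collect the names of styles whose text properties set a bold weight."""
    names = set()
    for style in root.iter(q("style", "style")):
        props = style.find(q("style", "text-properties"))
        if props is not None and props.get(q("fo", "font-weight")) == "bold":
            names.add(style.get(q("style", "name")))
    return names


def paragraph_runs(ods_bytes):
    """Return, for each text:p element, a list of (text, is_bold) runs."""
    with zipfile.ZipFile(io.BytesIO(ods_bytes)) as archive:
        root = ET.fromstring(archive.read("content.xml"))
    bold = bold_style_names(root)
    paragraphs = []
    for p in root.iter(q("text", "p")):
        runs = [(p.text, False)] if p.text else []
        for span in p.findall(q("text", "span")):
            if span.text:
                is_bold = span.get(q("text", "style-name")) in bold
                runs.append((span.text, is_bold))
            if span.tail:
                # Text following the span belongs to the paragraph itself.
                runs.append((span.tail, False))
        paragraphs.append(runs)
    return paragraphs


# Build a tiny in-memory archive so the sketch runs without a real .ods file.
xmlns = " ".join(f'xmlns:{k}="{v}"' for k, v in NS.items())
SAMPLE = (
    f"<office:document-content {xmlns}>"
    "<office:automatic-styles>"
    '<style:style style:name="T1" style:family="text">'
    '<style:text-properties fo:font-weight="bold"/>'
    "</style:style></office:automatic-styles>"
    "<office:body><office:spreadsheet><table:table><table:table-row>"
    "<table:table-cell><text:p>plain "
    '<text:span text:style-name="T1">bold</text:span> tail</text:p>'
    "</table:table-cell></table:table-row></table:table>"
    "</office:spreadsheet></office:body></office:document-content>"
)
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as archive:
    archive.writestr("content.xml", SAMPLE)

print(paragraph_runs(buffer.getvalue()))
# → [[('plain ', False), ('bold', True), (' tail', False)]]
```

A real reader would also need to handle nested spans, repeated cells, and the other formatting properties. The sketch only shows where the formatting information lives.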

Direct formatting (italic, bold, underline, subscript, superscript, and anchors) is converted to markup in the resulting pandas DataFrame.
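As an illustration of the idea, the sketch below maps formatted text runs to the three markup flavours. It is an assumption about what such a conversion could look like, not the package's implementation: the Run class, the WRAPPERS table, and the exact delimiters chosen (for example `<u>` for underline in Markdown, which has no native underline syntax) are all hypothetical, and the package's real output may differ.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Run:
    """A piece of cell text with its direct formatting (hypothetical type)."""
    text: str
    bold: bool = False
    italic: bool = False
    underline: bool = False
    superscript: bool = False
    subscript: bool = False
    href: Optional[str] = None


# Opening and closing delimiters per output format (illustrative choices).
WRAPPERS = {
    "html": {
        "bold": ("<b>", "</b>"),
        "italic": ("<i>", "</i>"),
        "underline": ("<u>", "</u>"),
        "superscript": ("<sup>", "</sup>"),
        "subscript": ("<sub>", "</sub>"),
    },
    "markdown": {
        "bold": ("**", "**"),
        "italic": ("*", "*"),
        "underline": ("<u>", "</u>"),
        "superscript": ("<sup>", "</sup>"),
        "subscript": ("<sub>", "</sub>"),
    },
    "asciidoc": {
        "bold": ("*", "*"),
        "italic": ("_", "_"),
        "underline": ("[.underline]#", "#"),
        "superscript": ("^", "^"),
        "subscript": ("~", "~"),
    },
}


def render_run(run, fmt):
    """Wrap one run in the delimiters of every formatting flag it carries."""
    out = run.text  # Escaping of special characters is omitted in this sketch.
    for flag, (opening, closing) in WRAPPERS[fmt].items():
        if getattr(run, flag):
            out = opening + out + closing
    if run.href:
        if fmt == "html":
            out = f'<a href="{run.href}">{out}</a>'
        elif fmt == "markdown":
            out = f"[{out}]({run.href})"
        else:
            out = f"{run.href}[{out}]"
    return out


def render_cell(runs, fmt="asciidoc"):
    """Concatenate the rendered runs of one cell."""
    return "".join(render_run(run, fmt) for run in runs)


cell = [Run("H"), Run("2", subscript=True), Run("O is "), Run("wet", bold=True)]
print(render_cell(cell, "html"))      # → H<sub>2</sub>O is <b>wet</b>
print(render_cell(cell, "markdown"))  # → H<sub>2</sub>O is **wet**
print(render_cell(cell))              # → H~2~O is *wet*
```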

Dependencies

  • lxml
  • pandas

Usage

In a Python project

from direct_formatting_pandas_ods_reader import read_ods

read_ods("test/test.ods")

read_ods("test/test.ods", format="markdown")

read_ods("test/test.ods", format="html", sheet=0)

The format argument accepts html, markdown, or asciidoc and defaults to asciidoc. The sheet argument is the index of the sheet to read and defaults to 0, the first sheet of the file.

Command-line usage

You can use this package as a CLI tool:

python -m direct_formatting_pandas_ods_reader input.ods -o output.csv -t html
  • input.ods: path to the ODS file (required).
  • -o, --output: output CSV file (if omitted, writes to standard output).
  • -t, --type: output format for formatting marks (asciidoc, markdown, html). Defaults to asciidoc.
  • -s, --sheet: index of the sheet to read (0-based, default: 0).
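For readers who want to see how the options listed above fit together, here is a hypothetical argparse sketch that mirrors them. It is reconstructed from the option list only and is not the package's actual command-line code; build_parser is an illustrative name.

```python
import argparse


def build_parser():
    """Build a parser mirroring the documented options (illustrative only)."""
    parser = argparse.ArgumentParser(
        prog="direct-formatting-ods-to-csv",
        description="Convert an ODS sheet to CSV, keeping direct formatting as markup.",
    )
    parser.add_argument("input", help="path to the ODS file")
    parser.add_argument(
        "-o", "--output", default=None,
        help="output CSV file (standard output if omitted)",
    )
    parser.add_argument(
        "-t", "--type", choices=["asciidoc", "markdown", "html"],
        default="asciidoc", help="markup used for formatting marks",
    )
    parser.add_argument(
        "-s", "--sheet", type=int, default=0,
        help="0-based index of the sheet to read",
    )
    return parser


# Parse the same arguments as the command shown above.
args = build_parser().parse_args(["input.ods", "-o", "output.csv", "-t", "html"])
print(args.input, args.output, args.type, args.sheet)
# → input.ods output.csv html 0
```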

Install as a global CLI tool

pip install -e .

Run from the root of a source checkout, this installs the package in editable mode and creates a global direct-formatting-ods-to-csv command:

direct-formatting-ods-to-csv input.ods -t markdown

License

Licensed under the GNU GPL v3.0, Yann Trividic.
