Skip to main content

A simple converter of MARCXML/PICAXML to CSV/TSV

Project description

marcxml2csv

A simple converter of (possibly gzipped) MARCXML/PICAXML to (possibly gzipped) CSV/TSV.

The resulting CSV/TSV has been designed to be easy to use as a data table, but also to retain all ordering informaation in the original when such is needed. The format is as follows: record_number,field_number,subfield_number,field_code,subfield_code,value

Here, record_number identifies the MARC/PICA+ record, while field_number and subfield_number can be used for more exact filtering / reconstructing the original field flow if needed.

For the MARC leader and control fields, subfield_number will be empty.

For MARC data fields, ind1 and ind2 values are reported as separate rows with the subfield_code being ind1 or ind2, but only when non-empty. The also have an empty subfield_number.

Installation

Install from pypi with e.g. pipx install marcxml2csv.

Usage

Usage: marcxml2csv [OPTIONS] [INPUT]...

  Convert from MARCXML (gz) input files into (gzipped) CSV/TSV

Options:
  -o, --output TEXT  Output CSV/TSV (gz) file  [required]
  --help             Show this message and exit.
Usage: picaxml2csv [OPTIONS] [INPUT]...

  Convert from PICAXML (gz) input files into (gzipped) CSV/TSV

Options:
  -o, --output TEXT  Output CSV/TSV (gz) file  [required]
  --help             Show this message and exit.

Files will be read/written using gzip if the filename ends with .gz. TSV format will be used if the output filename contains .tsv, otherwise CSV will be used.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

marcxml2csv-1.0.5.tar.gz (3.6 kB view details)

Uploaded Source

Built Distribution

marcxml2csv-1.0.5-py3-none-any.whl (4.9 kB view details)

Uploaded Python 3

File details

Details for the file marcxml2csv-1.0.5.tar.gz.

File metadata

  • Download URL: marcxml2csv-1.0.5.tar.gz
  • Upload date:
  • Size: 3.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.15 CPython/3.10.6 Darwin/21.6.0

File hashes

Hashes for marcxml2csv-1.0.5.tar.gz
Algorithm Hash digest
SHA256 0907d529917560812c123c11cd951013ae7b880f1e108ef7db3e4e8f79fad720
MD5 24964fe797b410c85a76aa21efe323f0
BLAKE2b-256 a9c5d551f16342e84b534e85b07bdb9ede177ac04f3da6ae64ca2d5d8b10a5bc

See more details on using hashes here.

File details

Details for the file marcxml2csv-1.0.5-py3-none-any.whl.

File metadata

  • Download URL: marcxml2csv-1.0.5-py3-none-any.whl
  • Upload date:
  • Size: 4.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.15 CPython/3.10.6 Darwin/21.6.0

File hashes

Hashes for marcxml2csv-1.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 0a6b64608c0647501b9cc29811d63ae17e59ba0f9ba6772b7ce438356616cba7
MD5 179266a61aaeb99d89cab9e12426ce5b
BLAKE2b-256 e802399cf048e86feef9a510eeea21b77991ab2cf3937c4a5a8b8edad2c5f565

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page