Skip to main content

A simple converter of MARCXML/PICAXML to CSV/TSV

Project description

marcxml2csv

A simple converter of (possibly gzipped) MARCXML/PICAXML to (possibly gzipped) CSV/TSV.

The resulting CSV/TSV has been designed to be easy to use as a data table, but also to retain all ordering informaation in the original when such is needed. The format is as follows: record_number,field_number,subfield_number,field_code,subfield_code,value

Here, record_number identifies the MARC/PICA+ record, while field_number and subfield_number can be used for more exact filtering / reconstructing the original field flow if needed.

For the MARC leader and control fields, subfield_number will be empty.

For MARC data fields, ind1 and ind2 values are reported as separate rows with the subfield_code being ind1 or ind2, but only when non-empty. The also have an empty subfield_number.

Installation

Install from pypi with e.g. pipx install marcxml2csv.

Usage

Usage: marcxml2csv [OPTIONS] [INPUT]...

  Convert from MARCXML (gz) input files into (gzipped) CSV/TSV

Options:
  -o, --output TEXT  Output CSV/TSV (gz) file  [required]
  --help             Show this message and exit.
Usage: picaxml2csv [OPTIONS] [INPUT]...

  Convert from PICAXML (gz) input files into (gzipped) CSV/TSV

Options:
  -o, --output TEXT  Output CSV/TSV (gz) file  [required]
  --help             Show this message and exit.

Files will be read/written using gzip if the filename ends with .gz. TSV format will be used if the output filename contains .tsv, otherwise CSV will be used.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

marcxml2csv-1.0.3.tar.gz (3.3 kB view details)

Uploaded Source

Built Distribution

marcxml2csv-1.0.3-py3-none-any.whl (4.4 kB view details)

Uploaded Python 3

File details

Details for the file marcxml2csv-1.0.3.tar.gz.

File metadata

  • Download URL: marcxml2csv-1.0.3.tar.gz
  • Upload date:
  • Size: 3.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.13 CPython/3.10.4 Darwin/21.5.0

File hashes

Hashes for marcxml2csv-1.0.3.tar.gz
Algorithm Hash digest
SHA256 61a5962b051bb73b357c31d0c68d23aafd6e751c17fc95df43cfd37d2896d22a
MD5 1387f0f32ca70dacff0a513157093582
BLAKE2b-256 144749cc51c2cde7b35bdc1fea8d74c19ea7fcbe0f6bd08e8ab6ad7d0edfae7e

See more details on using hashes here.

File details

Details for the file marcxml2csv-1.0.3-py3-none-any.whl.

File metadata

  • Download URL: marcxml2csv-1.0.3-py3-none-any.whl
  • Upload date:
  • Size: 4.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.13 CPython/3.10.4 Darwin/21.5.0

File hashes

Hashes for marcxml2csv-1.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 e76d9b37d1566437f7811bdfa4a62e0b22203758f5042b010e505676c4363ea3
MD5 f398e8d2c580aaf0907270b3225280b6
BLAKE2b-256 55256229e0844deec6e396d19f2cc30481f4692202d1cd57ee174b17ec13f663

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page