Skip to main content

A simple converter of MARCXML/PICAXML to CSV/TSV

Project description

marcxml2csv

A simple converter of (possibly gzipped) MARCXML/PICAXML to (possibly gzipped) CSV/TSV.

The resulting CSV/TSV has been designed to be easy to use as a data table, but also to retain all ordering informaation in the original when such is needed. The format is as follows: record_number,field_number,subfield_number,field_code,subfield_code,value

Here, record_number identifies the MARC/PICA+ record, while field_number and subfield_number can be used for more exact filtering / reconstructing the original field flow if needed.

For the MARC leader and control fields, subfield_number will be empty.

For MARC data fields, ind1 and ind2 values are reported as separate rows with the subfield_code being ind1 or ind2, but only when non-empty. The also have an empty subfield_number.

Installation

Install from pypi with e.g. pipx install marcxml2csv.

Usage

Usage: marcxml2csv [OPTIONS] [INPUT]...

  Convert from MARCXML (gz) input files into (gzipped) CSV/TSV

Options:
  -o, --output TEXT  Output CSV/TSV (gz) file  [required]
  --help             Show this message and exit.
Usage: picaxml2csv [OPTIONS] [INPUT]...

  Convert from PICAXML (gz) input files into (gzipped) CSV/TSV

Options:
  -o, --output TEXT  Output CSV/TSV (gz) file  [required]
  --help             Show this message and exit.

Files will be read/written using gzip if the filename ends with .gz. TSV format will be used if the output filename contains .tsv, otherwise CSV will be used.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

marcxml2csv-1.0.2.tar.gz (3.1 kB view details)

Uploaded Source

Built Distribution

marcxml2csv-1.0.2-py3-none-any.whl (3.2 kB view details)

Uploaded Python 3

File details

Details for the file marcxml2csv-1.0.2.tar.gz.

File metadata

  • Download URL: marcxml2csv-1.0.2.tar.gz
  • Upload date:
  • Size: 3.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.13 CPython/3.10.4 Darwin/21.5.0

File hashes

Hashes for marcxml2csv-1.0.2.tar.gz
Algorithm Hash digest
SHA256 e8a31b1fd84546aaf4af9ab8ec7b79c06a7c05e269e48d52f2f6a4139c743484
MD5 f553e8fe13bad129c3f0ccfa8a936c5a
BLAKE2b-256 879548885bb981732a492b1b870b4ca726e707bf1abe4e6affa51f4ef2e11aa8

See more details on using hashes here.

File details

Details for the file marcxml2csv-1.0.2-py3-none-any.whl.

File metadata

  • Download URL: marcxml2csv-1.0.2-py3-none-any.whl
  • Upload date:
  • Size: 3.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.13 CPython/3.10.4 Darwin/21.5.0

File hashes

Hashes for marcxml2csv-1.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 a8e06e9ec42003bd02fc5c0936c0b551df40e0ae9a4b74be28f7f63a94083fe1
MD5 1f7a5a122d0906b27e0f1be343906348
BLAKE2b-256 be6bb49c039cacca72123f551d7135724dbda570a26cd07a5bad832f85cab24c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page