smc.bibencodings

ISO-5426 (MAB2) and MARC (USMARC, ANSEL) encodings for Python.

These details have not been verified by PyPI

Project links

Homepage

Project description

ISO-5426 (MAB2) and MARC (USMARC, ANSEL) encodings for Python. The encodings are used in bibliographic information systems and Z.3950 interfaces of OPACs.

Background

ISO-5426 and USMARC encodings are historic encodings that are still in use by bibliographic information systems. They are extensions to Latin codec family and implement a variable length encoding someward similar to UTF-8 and decomposed unicode (NFD) forms.

Non-ASCII chars are expressed as combining chars. For example umlaut a (LATIN SMALL LETTER U WITH DIAERESIS) is written in MAB2 encoding as ‘xc8u’, where ‘xc8’ is the sign for combining diaeresis. Contrary to unicode the combining chars are writte as prefix while unicode has combinig suffixes.

Both MAB2 and MARC encoding support up to two combining chars in front of a letter, for example ‘xc5xc8u’ for LATIN SMALL LETTER U WITH DIAERESIS AND MACRON.

smc.bibencodings implements small deviations from the standards as it supports all chars from 0x00 to 0x7e as well as more combining chars. The code has been tested against several German and Swiss OPACs.

Usage

You just have to import “smc.bibencodings” somehwere in your code to enable the codecs

iso-5426, iso5426, mab2::: standard ISO-5426 encoding
iso-5426-xe0, iso5426-xe0, mab2-xe0::: special ISO-5426 encoding with special identity mapping for 0xa4, 0xe0-0xff
marc, usmarc, ansel::: MARC encoding

>>> import smc.bibencodings
>>> b"Abr\xc2eg\xc2e Historique De L'Origine".decode("mab2")
"Abrégé Historique De L'Origine"

Data source

The encoding tables are extracted from Thomas Berger’s excellent page http://www.gymel.com/charsets/chartab.html and linked pages. Thank you very much!

Authors

Christian Heimes

Copyright

semantics
Kommunikationsmanagement GmbH
Viktoriaallee 45
D-52066 Aachen
Germany

Tel.: +49 241 89 49 89 29
eMail: info(at)semantics.de
http://www.semantics.de/

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.1

Jul 15, 2012

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

smc.bibencodings-0.1.zip (63.3 kB view details)

Uploaded Jul 15, 2012 Source

File details

Details for the file smc.bibencodings-0.1.zip.

File metadata

Download URL: smc.bibencodings-0.1.zip
Upload date: Jul 15, 2012
Size: 63.3 kB
Tags: Source
Uploaded using Trusted Publishing? No

File hashes

Hashes for smc.bibencodings-0.1.zip
Algorithm	Hash digest
SHA256	`6a0a4d6cc688cbe9d1f2ac1163ffa7d2c1d74b97827f39ab3b60b8c0ddd13225`
MD5	`6691cac912381a5409e89dda78aaab71`
BLAKE2b-256	`4a7dbce1cc39b308a0e48ceb9e13a572f193f6bc496979deb92ed5ce517e802c`

See more details on using hashes here.

smc.bibencodings 0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Background

Usage

Data source

Authors

Copyright

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes