Additional EBCDIC codecs
Project description
Ebcdic is a Python package adding additional EBCDIC codecs for data exchange with legacy system. It works with Python 2.6+ and Python 3.1+.
EBCDIC is short for Extended Binary Coded Decimal Interchange Code and is a family of character encodings that is mainly used on mainframe computers. There is no real point in using it unless you have to exchange data with legacy systems that still require it. To learn more about EBCDIC, you can start at <https://en.wikipedia.org/wiki/EBCDIC>.
Installation
Ebcdic is available from https://pypi.python.org/pypi/ebcdic and can be installed using pip:
pip install ebcdic
Example usage
To encode ‘hello world’ on EBCDIC systems in German speaking countries, use:
>>> import ebcdic >>> 'hello world'.encode('cp1141') b'\x88\x85\x93\x93\x96@\xa6\x96\x99\x93\x84O'
Supported codecs
The ebcdic package includes EBCDIC codecs for the following regions:
cp1140 - Australia, Brazil, Canada, New Zealand, Portugal, South Africa, USA
cp1141 - Austria, Germany, Switzerland
cp1142 - Denmark, Norway
cp1143 - Finland, Sweden
cp1144 - Italy
cp1145 - Latin America, Spain
cp1146 - Great Britain, Ireland, North Ireland
cp1147 - France
cp1148 - international
cp1149 - Iceland
It also includes legacy codecs:
cp037 - Australia, Brazil, Canada, New Zealand, Portugal, South Africa; similar to cp1140 but without Euro sign
cp273 - Austria, Germany, Switzerland; similar to cp1141 but without Euro sign
cp277 - Denmark, Norway; similar to cp1142 but without Euro sign
cp278 - Finland, Sweden; similar to cp1143 but without Euro sign
cp280 - Italy; similar to cp1141 but without Euro sign
cp284 - Latin America, Spain; similar to cp1145 but without Euro sign
cp285 - Great Britain, Ireland, North Ireland; similar to cp1146 but without Euro sign
cp297 - France; similar to cp1147 but without Euro sign
cp500 - international; similar to cp1140 but without Euro sign
cp871 - Iceland; similar to cp1149 but without Euro sign
cp1047 - Open Systems (MVS C compiler)
Codecs in the standard library overrule some of these codecs. At the time of this writing this concerns cp037, cp273 (since 3.4), cp500 and cp1140.
To see get a list of EBCDIC codecs that are already provided by different sources, use ebcdic.ignored_code_names(). For example, with Python 3.4:
>>> ebcdic.ignored_codec_names() ['cp037', 'cp1140', 'cp273', 'cp500']
Source code
These codecs have been generated using CodecMapper available from https://github.com/roskakori/CodecMapper. Read the README in order to to build the ebcdic package from source.
To add another 8 bit EBCDIC codec just extend the ant target ebcdic in build.xml using a line like:
<arg value="cpXXX" />
License
Copyright (c) 2014, Thomas Aglassinger All rights reserved.
Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:
Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer.
Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution.
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS “AS IS” AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
Changes
Version 0.7, 2014-11-17
Clarified which codecs are already part of the standard library and that these codecs overrule the ebcdic package. Also added a function ebcdic.ignored_code_names() that returns the name of the EBCDIC codecs provided by other means. To obtain access to ebcdic codecs overruled by the standard library, use ebcdic.lookup().
Cleaned up (PEP8, __all__, typos, …).
Version 0.6, 2014-11-15
Added support for Python 2.6+ and 3.1+ (#1).
Included a modified version of gencodec.py that still builds maps instead of tables so the generated codecs work with Python version earlier than 3.3. It also does a from __future__ import unicode_literals so the codecs even work with Python 2.6+ using the same source code. As a side effect, this simplifies building the codecs because it removes the the need for a local copy of the cpython source code.
Version 0.5, 2014-11-13
Initial public release
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.