Skip to main content

A Python library to convert COBOL ebcdic file to CSV format based on copybook

Project description

pycobol2csv

pycobol2csv is a Python library to convert COBOL ebcdic file to CSV format. The package is built to cater for advanced features in COBOL copybooks such as OCCURES x TIMES, BINARY, COMP.

The CSV file is RDBMS friendly and all headers are ready to be used as database column names. CSV conversion is controlled by config file in csv_config.json

  • Update in version 1.0.6

Recently Microsoft upgraded Spark in Synapse from version 3.2 to 3.4, which upgraded the included Python version from 3.8 to 3.10, which is a version with known issues on csv writer.

Any users on Python 3.10 should upgrade to pycobol2csv 1.0.6 and above ASAP, otherwise there might get an error _csv.Error: need to escape, but no escapechar set.

Other Python versions (such as 3.8, 3.11) are safe so far.

  • Update in version 1.0.5

Added more enhancements for the outdated REDEFINE and PIC syntax for a new client.

Install the python module:

pip install pycobol2csv

To use the module:

from pycobol2csv import convert_cobol_file, decode_copybook_file

row_length, cobol_struc = decode_copybook_file(copybook_file)

convert_cobol_file(copybook_file, data_file, output_file, config_file, codepage, debug=False)

  • copybook_file: copybook filename
  • data_file: data filename
  • output_file: output csv filename
  • config_file: csv configuration filename, refer to csv_config.json
  • codepage: codepage for edibic, refer to https://docs.python.org/3.7/library/codecs.html#standard-encodings for details
  • debug: enable for more debug information, default is OFF

test

2 sets of test data have been created from scratch. Each set includes a copybook and an EBCDIC data file.

To test:

python convert_cobol_test_main.py --copybook [COPYBOOK_FILE] --data [DATA_FILE] --output [CSV_FILE]

known issues and limitations

  • Be aware of the resources available in your runtime environment and make sure the Cobol file size is not beyond the limit or cause any performance issue.

To handle large Cobol files, you can split the files into smaller chunks and then process the chunks in parallel. Please refer to the medium post for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pycobol2csv-1.0.6-py3-none-any.whl (9.3 kB view details)

Uploaded Python 3

File details

Details for the file pycobol2csv-1.0.6-py3-none-any.whl.

File metadata

  • Download URL: pycobol2csv-1.0.6-py3-none-any.whl
  • Upload date:
  • Size: 9.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.11.9

File hashes

Hashes for pycobol2csv-1.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 b8d336c752245dbd6fc85252ba52cd04ed2af83cc61a04a7952722521211be60
MD5 98c1e043a92189e5d1d3a87d0fb05014
BLAKE2b-256 df8664fbaf8ab886e2e0f8eaccfb38f7f36926fcab9c1720d6236496e44e5f7b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page