Pydifact is a library that aims to provide complete support for reading and writing EDIFACT files. These file format, despite being old, is still a standard in many business cases. In Austria e.g., it is used for the transfer of medical reports between medical doctors.
Project description
pydifact
A Python library to parse and serialize UN/EDIFACT interchanges.
Preamble
This is a port of metroplex-systems/edifact to Python. Thanks here at the start to Craig Duncan for this cool piece of software. Porting was like a breeze due to the high code quality there. All credits for the initial code here go to him, I just did the translation to Python(3), some "pythonifications" of the code and little improvements.
Why another EDIFACT library?
Because I did not find a decent UN/EDIFACT library for Python, so I decided to port one of the available good PHP libraries to Python. Here is the result.
ATM this is a Work In Progress, the API is not stable yet. Feel free to help.
Install
uv pip install pydifact
# or
python -m pip install pydifact
However, it is not marked as stable yet, so the pypi version, including documentation and code examples, could differ from the latest git code. If in doubt, use the git version. You should use uv for the development process:
git clone https://github.com/nerdocs/pydifact.git
cd pydifact
uv sync
Usage
To read a full Interchange from a file or string, take the Interchange class and
iter over the messages and segments:
from pydifact.segmentcollection import Interchange
interchange = Interchange.from_file("./tests/data/wikipedia.edi")
interchange = Interchange.from_str(
"UNA:+,? '"
"UNB+UNOC:1+1234+3333+200102:2212+42'"
"UNH+42z42+PAORES:93:1:IA'"
"MSG+1:45'"
"IFT+3+XYZCOMPANY AVAILABILITY'"
"ERC+A7V:1:AMD'"
"UNT+5+42z42'UNZ+2+42'"
)
for message in interchange.get_messages():
for segment in message.segments:
print(f"Segment tag: {segment.tag}, content: {segment.elements}")
You may also want to iterate directly on segments :
from pydifact.segmentcollection import Interchange
interchange = Interchange.from_str(
"UNA:+,? '"
"UNB+UNOC:1+1234+3333+200102:2212+42'"
"UNH+42z42+PAORES:93:1:IA'"
"MSG+1:45'"
"IFT+3+XYZCOMPANY AVAILABILITY'"
"ERC+A7V:1:AMD'"
"UNT+5+42z42'UNZ+2+42'"
)
for segment in interchange.segments:
print(f"Segment tag: {segment.tags}, content: {segment.elements}")
Or you can create an EDI interchange on the fly:
from pydifact.segmentcollection import Interchange
from pydifact.segments import Segment
interchange = Interchange(syntax_identifier=("IBMA",1),
sender="MeMyselfAndIrene",
recipient="TheOtherOne",
control_reference="KLuzs7c6")
interchange.add_segment(Segment("QTY", ["12", "3"]))
print(interchange.serialize())
To include or override the Service String Advice segment (UNA), just specify it as a regular segment:
interchange.add_segment(Segment("UNA", ":+.? '"))
You may also want to parse a « raw » segment bunch which is not an interchange:
from pydifact.segmentcollection import RawSegmentCollection
collection = RawSegmentCollection.from_str("UNH+1+ORDERS:D:96A:UN:EAN008'")
for segment in collection.segments:
print(f"Segment tag: {segment.tags}, content: {segment.elements}")
Limitations
- No support of optional functional groups (
UNG→UNE),
Alternatives
In the Python ecosystem:
- python-edifact - simpler, IMHO less clean code, less flexible. may be faster though (not tested). Seems unmaintained.
- bots - huge, with webinterface (bots-monitor), webserver, bots-engine.
- edicat - simple, only for separating lines/segments for CLI-piping.
Development
Setup
To develop pydifact, clone the repository and install the dev requirements:
uv sync
This installs all the python packages needed for development and testing.
Code formatting
Format all python files using black before committing:
uv run black .
Happy coding, PR are more than welcome to make this library better, or to add a feature that matches your needs. Nevertheless, don't forget adding tests for every aspect you add in code.
Testing
pydifact uses pytest for testing. There is a shortcut in the Makefile for your convenience:
make test
This is recommended for faster testing.
There are some additional tests to check the performance of parsing huge files - you can include that tests by calling
make test-extended
Credits
The official formats for UN/EDIFAT provided by UN Secretariat are raw text files located here and html (.htm) files located here, similarly for service codes as text and html files located at https://www.gefeg.com/jswg/.
There are many other helpful projects and people who made this possible:
- untid-parser by Stefano - thanks for your help.
For a contributors' list, see Contributors.
License
This library is licensed under the MIT license, see the LICENSE file.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pydifact-0.2.3.tar.gz.
File metadata
- Download URL: pydifact-0.2.3.tar.gz
- Upload date:
- Size: 43.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fb68b9aa4c95f3ee360849d762ae662f3355b88a7781229d875976d1509bcda3
|
|
| MD5 |
7d132ef4fc597e4f36c9afde65c28da8
|
|
| BLAKE2b-256 |
331a9759525a838990f3a34420a741bdb6f2e69796003f345a23140b2e2ad1f5
|
File details
Details for the file pydifact-0.2.3-py3-none-any.whl.
File metadata
- Download URL: pydifact-0.2.3-py3-none-any.whl
- Upload date:
- Size: 42.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6a4f00dbeba9906808ee29fe54473cc2acc891ecab18d7e3abfbbe7c8f6231fc
|
|
| MD5 |
becdd5794dcd63b1fc4f24a2d6d1bca5
|
|
| BLAKE2b-256 |
03134728fafa1d8314c9421f3a1299ebc33aa971ab69377a13e14c8949d7daba
|