CaseCat dictionary file support for Plover
Project description
Add support for reading Stenograph CaseCATalyst saved dictionaries to Plover.
CaseCATalyst dictionaries have the extension “.sgdct”. There is often a corresponding “.sgxml” file, but this contains no dictionary data.
Much of the detail of the file format remains unknown. Contributions and corrections are very welcome. Thanks are due to Sooty, who provided example files for dissection.
The files begin with a 640-byte header, which begins with the magic number SGCAT32. Nothing is known about header fields at present.
One or more records follow the header. Each record gives a single translation from steno to text.
The record header is 21 bytes. header[18] contains the number of strokes, and header[19] gives the number of letters in the text. Each is an unsigned byte. The purpose of all other fields in the record header is unknown at present.
The stroke follows, as a sequence of four-byte unsigned integers. Each integer is a bitmap of keys in the standard steno order, with the first “S” as the most significant bit.
Then the text follows, as ordinary ASCII text. Nothing is currently known about coding of text outside the ASCII range. Various non-ASCII characters crop up, apparently as control codes.
Finally, there are zero to three padding bytes, in order to bring us up to a four-byte boundary.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for plover_casecat_dictionary-0.1.0.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7636fa32788f8f4bff39e3c12b6a9816f6fd6fb0108307fcc9cd4933624ebba3 |
|
MD5 | 42f57c1466b33d88f0b469943c00b540 |
|
BLAKE2b-256 | a24241bbc0dcf9c7af767a56e428384a838cb5a67047ef03bba6016fe89b3856 |
Hashes for plover_casecat_dictionary-0.1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | e93929215d1873564d3be25a9f1d6f3b61d7522d7340ddc9c645b1eb148ff27a |
|
MD5 | 1c996a0e5c21ec3cc093c767c50176de |
|
BLAKE2b-256 | 38fef7dafb97f11f290a19ef11bfe86211c5a438c0ba8d4d57184d24b7c5ee46 |