bit-packing and unpacking code generator for C, C#, Go, Rust, Swift, TypeScript, and Zig
Project description
Beschi
This is a custom bit-packing and unpacking code generator for C, C#, Go, Rust, Swift, TypeScript, and Zig. You feed it a data description and it generates source files for writing/reading buffers of that data, along the lines of FlatBuffers or Cap'n Proto, but with much less functionality for much simpler use cases. It was initially written for a larger project that was passing data back and forth between a Unity game, a Go server, and a web client, but I extracted it into its own thing. If all you need is a simple way to pack a data structure into a compact, portable binary form, this might be useful for you.
I go into more explanation for why this exists in the documentation, but I'll be honest, too: it was kind of fun to write a code generator. 😝
Documentation
Language-Specific Documentation:
C | C# | Go | Rust | Swift | TypeScript | Zig |
---|
Installation
If you use Homebrew, you can install it directly and simply:
brew install sjml/sjml/beschi
Otherwise, you need a Python setup and can then install from PyPI:
pip install beschi
It installs an executable that you can run directly, so long as however you invoked pip
put it on your path. (This will work in an activated virtual environment, for instance.)
Example:
beschi --lang csharp --protocol ./messages.toml
Basic Usage
By default, it prints to standard output, but you can also write to a file with an output flag like --output ./Messages.cs
.
From the input protocol file (detailed below), you get a code file that you can integrate to a project allowing you encode messages as compact and portable binary buffers.
Protocols
The protocol files are written in TOML. There's a fuller example in the test suite and a more through explanation in the documentation, but here's an annotated sample.
[meta]
# The namespace is optional; not all languages
# do something useful with it, but it's handy
# to have to help avoid clashes
namespace = "AppMessages"
# The size types specify what kind of number to
# use when recording the length of lists and
# strings. If not specified, they both default
# to uint32 (four bytes).
list_size_type = "uint16"
string_size_type = "byte"
# Messages are defined by a name and their data
# members. This will become a class or struct in
# the target languages with these pieces of data
# as accessible members.
[[messages]]
_name = "Vector3Message"
x = "float"
y = "float"
z = "float"
# There are a variety of different data member
# types that can be defined.
[[messages]]
_name = "NewCharacterMessage"
id = "uint64"
characterName = "string"
job = "CharacterClass" # an enum! (see below)
strength = "uint16"
intelligence = "uint16"
dexterity = "uint16"
wisdom = "uint16"
goldInWallet = "uint32"
nicknames = "[string]" # brackets indicate a list/array
# You can also define enumerated values which will be
# translated into the target language's enum / integer
# types as appropriate.
[[enums]]
_name = "CharacterClass"
_values = [
"Fighter",
"Wizard",
"Rogue",
"Cleric"
]
# There are also structs, collections of data that go
# together, but are not themselves a message.
[[structs]]
_name = "Color"
red = "float"
green = "float"
blue = "float"
alpha = "float"
# Structs can contain other structs, and even lists of them.
[[structs]]
_name = "Spectrum"
defaultColor = "Color"
colors = "[Color]"
# Structs and enums can then be used in messages
# (which can also have lists of structs, of course).
[[messages]]
_name = "CharacterJoinedTeam"
characterID = "uint64"
teamName = "string"
teamColors = "[Color]"
role = "TeamRole"
# Enums can also be non-sequential if you need
[[enums]]
_name = "TeamRole"
_values = [
# in a non-sequential enum, the first
# listed value will be used as the default
{ _name = "Minion", _value = 256 },
{ _name = "Ally", _value = 512 },
{ _name = "Leader", _value = 1024 },
# values can even be negative
{ _name = "Traitor", _value = -1 }
]
Data Members
These are the base types from which you can build up whatever structures and messages you need to, along with what they correspond to in the various languages.
Protocol Type | C | C# | Go | Rust | Swift | TypeScript | Zig |
---|---|---|---|---|---|---|---|
byte |
uint8_t |
byte |
byte |
u8 |
UInt8 |
number |
u8 |
bool |
bool |
bool |
bool |
bool |
Bool |
boolean |
bool |
int16 |
uint16_t |
short |
int16 |
i16 |
Int16 |
number |
i16 |
uint16 |
int16_t |
ushort |
uint16 |
u16 |
UInt16 |
number |
u16 |
int32 |
uint32_t |
int |
int32 |
i32 |
Int32 |
number |
i32 |
uint32 |
int32_t |
uint |
uint32 |
u32 |
UInt32 |
number |
u32 |
int64 |
uint64_t |
long |
int64 |
i64 |
Int64 |
bigint |
i64 |
uint64 |
int64_t |
ulong |
uint64 |
u64 |
UInt64 |
bigint |
u64 |
float |
float |
float |
float32 |
f32 |
Float32 |
number |
f32 |
double |
double |
double |
float64 |
f64 |
Float64 |
number |
f64 |
string |
char* |
string |
string |
String |
String |
string |
[] const u8 |
All the numbers are stored as little-endian in the buffer, if that matters for you.
Usage in code
Beschi does not generate any code to handle writing or reading from disk, pushing data across a network, or anything like that — it will turn a message into bytes and read that same message back from the bytes, but you are responsible for what you do with them otherwise.
With the given protocol, though, you could create a message in C# and write it to a file:
var msg = new AppMessages.Vector3Message();
msg.x = 1.0f;
msg.y = 4096.1234f;
msg.z = -42.56f;
var fs = new FileStream("./vec3.msg", FileMode.Create);
var bw = new BinaryWriter(fs);
msg.WriteBytes(bw, false);
And then read it back in Go:
dat, _ := os.Open("./vec3.msg")
defer dat.Close()
msg := AppMessages.Vector3MessageFromBytes(dat)
if msg.X == 1.0 && msg.Y == 4096.1234 && msg.Z < 0.0 {
print("Ready to go!\n")
}
Or TypeScript:
const data = fs.readFileSync("./vec3.msg");
const dv = new DataView(new Uint8Array(data).buffer);
const msg = AppMessages.Vector3Message.fromBytes(dv, 0).val;
if (msg.y == Math.fround(4096.1234)) {
console.log("Ready to... typescript?");
}
For the most part, Beschi tries to keep behavior and structures consistent across the languages, but there are a few points of difference outlined on the various language pages. Notice in the example above, for instance, that in TypeScript you have to make a call to Math.fround
if you want to do a straight comparison of float values because of how the underlying JavaScript engine treats all numbers as double-width floats. (Doing equality comparisons on floats is usually a bad idea, but in this instance we want to check that they are actually bitwise identical.) Similarly, see how the data members are upper-cased in Go to match that language's export conventions, and the byte reading function is part of the namespace because Go doesn't have static functions for data types. The goal is to make working across languages feel seamless, but there are some differences that we adapt to as much as possible.
There are more extensive examples in the test harnesses.
Future
Beschi makes code that I use "in production" on personal projects. It could probably stand to be better optimized, there are probably edge cases and bugs, etc. But for the most part, this is mature enough that I stopped thinking of it as a project in itself and now it's just a tool that I use.
I will admit that part of me wants to make new writers, but that's not likely to get done until I have a specific project motivating it. If someone loves this system, though, and really wants to see a generator for Haskell or Lua or whatever, go for it. The existing writers should be decent starting points — they aren't terribly clever (no AST or interesting data structures), just iterating over the protocol and writing out serialization/deserialization code.
Beschi?
Constanzo Giuseppe Beschi was an Italian Jesuit who worked in southern India during the early 18th century. He was noted as a talented linguist, able to tie concepts from multiple languages into a single form. At the same time, he was adept at the Jesuit principle of "inculturation," where foreign concepts are adapted for a new culture and the foreigner attempting the adaptation also respectfully adopts habits and ways of proceeding from the host culture.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file beschi-0.2.2.tar.gz
.
File metadata
- Download URL: beschi-0.2.2.tar.gz
- Upload date:
- Size: 39.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | bbbf1f668eb7b5a90f1a18cbd8b4908d6d85af784e485bc099843b06a2b0f8ac |
|
MD5 | 569be88d3d177951ff669a5d6988a2d1 |
|
BLAKE2b-256 | c0eedd76b169b27317c1a00f01d8425dcda30f44ae57bc764428e6fb09a21936 |
Provenance
The following attestation bundles were made for beschi-0.2.2.tar.gz
:
Publisher:
ci.yml
on sjml/beschi
-
Statement type:
https://in-toto.io/Statement/v1
- Predicate type:
https://docs.pypi.org/attestations/publish/v1
- Subject name:
beschi-0.2.2.tar.gz
- Subject digest:
bbbf1f668eb7b5a90f1a18cbd8b4908d6d85af784e485bc099843b06a2b0f8ac
- Sigstore transparency entry: 148036384
- Sigstore integration time:
- Predicate type:
File details
Details for the file beschi-0.2.2-py3-none-any.whl
.
File metadata
- Download URL: beschi-0.2.2-py3-none-any.whl
- Upload date:
- Size: 46.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b5bbddf96067e580daac9212092b295acf77196a421678294c4f7eab5098ad6c |
|
MD5 | d712c8d85dbd0428606aad4e2bbf7b52 |
|
BLAKE2b-256 | 57f8f5d85148076210695e2ce1b6f06b0726538942cad68cacbf21b404693d36 |
Provenance
The following attestation bundles were made for beschi-0.2.2-py3-none-any.whl
:
Publisher:
ci.yml
on sjml/beschi
-
Statement type:
https://in-toto.io/Statement/v1
- Predicate type:
https://docs.pypi.org/attestations/publish/v1
- Subject name:
beschi-0.2.2-py3-none-any.whl
- Subject digest:
b5bbddf96067e580daac9212092b295acf77196a421678294c4f7eab5098ad6c
- Sigstore transparency entry: 148036385
- Sigstore integration time:
- Predicate type: