Defining and constructing production-grade prompts via an expressive templating engine.

Project description

Hermes

Defining and constructing production-grade LLM prompts via rich structured templates.

Goals & Requirements

Centralized: By centralizing the prompt construction into one place Hermes aims to simplify prompt the prompt construction mechanics.
Extensible: Make it as easy as possible to create new prompts for new use cases.
Experimentation: Must be easy to experiment with new prompt data and placements.

Fundamentally, Hermes is split into two layers -- the Templating Layer and the Logical Layer. We aim to keep a clear separation between these layers such that the Templating layer exclusively handles the representation of the prompt and the Logical Layer handles the mechanics of constructing the prompt.

Templating Layer

Templates

Perhaps the most important part of this design, prompt templates are an expressive yet human readable files that define the prompt structure, data placements and formatting of the final prompt. The templating engine aims to strike a balance between being readable and explicit with no magic. As such, we have chosen to use a combination of YAML and Jinja syntax to represent prompt templates.

Fundamentally, prompt templates are YAML files that when fully compiled contain a list of parts. Each part contains a human readable name, a raw_string and a truncation_priority. We concatenate these parts together to form the final prompt and do our best effort in following the truncation policy laid out by the truncation_priority field. In the future they may support other fields to help express even more rich prompts.

We use Jinja to express the raw_string and more complex notation such as for loops, conditionals and indexing. By relying on a well known templating language such as Jinja we ensure templates remain extensible as new use cases emerge. We do not aim to reinvent the wheel - many open source projects have adopted this same templating language of combining Jinja and YAML such as Ansible and SaltStack. This means there are already tools at our disposal for validating, previewing and otherwise manipulating our template files (e.g. https://ansible.sivel.net/test/).

Alternatives considered:

Langchain: PromptTemplate, AIMessage, SystemMessage and HumanMessage abstractions. Basically just f-strings wrapped in a python class, not very readable or expressive enough.
LMQL: not very readable, non-trivial to reason about what the final interpolated prompt would look like.
Raw python f-strings: better readability but not very expressive.
Jinja: probably the best standalone bet I've found so far but leaves several things to be desired. See an example here.
YAML: also could work by rolling our own basic interpreter. See an example here.
Several OSS “prompt management” solutions: Pezzo, Agenta, PromptHub (paid), Langflow. These all miss the mark in terms of extensibility of the core templating language and infrastructure and focus on using external APIs rather than needing to truncate and tokenize which is crucial for us as we host our own models.

Template Registry

TODO

Logical Layer

The logical layer contains the necessary logic for rendering templates and performing tokenization and truncation.

Project details

Release history Release notifications | RSS feed

0.0.82

Sep 7, 2024

0.0.81

Sep 3, 2024

0.0.80

Aug 18, 2024

0.0.79

Aug 18, 2024

0.0.78

Aug 18, 2024

0.0.77

Aug 18, 2024

0.0.76

Aug 2, 2024

0.0.75

Aug 2, 2024

0.0.74

Jul 22, 2024

0.0.73

Jul 18, 2024

0.0.72

Jul 18, 2024

0.0.71

Jul 18, 2024

0.0.70

Jul 18, 2024

0.0.69

Jul 17, 2024

0.0.68

Jul 17, 2024

0.0.67

Jul 12, 2024

0.0.66

Jul 10, 2024

0.0.65

Jul 10, 2024

0.0.64

Jul 9, 2024

0.0.63

Jul 3, 2024

0.0.62

Jul 3, 2024

0.0.61

Jul 2, 2024

0.0.60

Jul 2, 2024

0.0.59

Jul 2, 2024

0.0.58

Jul 2, 2024

0.0.57

Jul 1, 2024

0.0.56

Jul 1, 2024

0.0.55

Jun 30, 2024

0.0.54

Jun 30, 2024

0.0.53

Jun 30, 2024

0.0.52

Jun 29, 2024

0.0.51

Jun 29, 2024

0.0.50

Jun 29, 2024

0.0.49

Jun 29, 2024

0.0.48

Jun 29, 2024

0.0.47

Jun 29, 2024

0.0.46

Jun 29, 2024

0.0.45

Jun 26, 2024

0.0.44

Jun 18, 2024

0.0.43

Jun 17, 2024

0.0.42

Jun 12, 2024

0.0.41

Jun 12, 2024

0.0.40

Jun 11, 2024

0.0.39

Jun 10, 2024

0.0.38

Jun 7, 2024

0.0.37

Jun 7, 2024

0.0.36

Jun 7, 2024

0.0.35

Jun 7, 2024

0.0.34

Jun 5, 2024

0.0.33

Jun 5, 2024

0.0.32

Jun 5, 2024

0.0.31

Jun 3, 2024

0.0.30

Jun 3, 2024

0.0.29

May 31, 2024

0.0.28

May 30, 2024

0.0.25

May 30, 2024

0.0.24

May 30, 2024

0.0.23

May 30, 2024

0.0.22

May 30, 2024

0.0.21

May 30, 2024

0.0.20

May 30, 2024

0.0.19

May 30, 2024

0.0.18

May 30, 2024

0.0.17

May 30, 2024

0.0.16

May 30, 2024

0.0.15

May 30, 2024

0.0.13

May 22, 2024

This version

0.0.12

May 22, 2024

0.0.11

May 22, 2024

0.0.10

May 21, 2024

0.0.9

May 20, 2024

0.0.8

May 20, 2024

0.0.7

May 20, 2024

0.0.6

May 20, 2024

0.0.5

May 20, 2024

0.0.4

May 20, 2024

0.0.3

May 20, 2024

0.0.2

May 20, 2024

0.0.1

May 20, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hermes_cai-0.0.12.tar.gz (3.8 kB view hashes)

Uploaded May 22, 2024 Source

Built Distribution

hermes_cai-0.0.12-py3-none-any.whl (3.8 kB view hashes)

Uploaded May 22, 2024 Python 3

Hashes for hermes_cai-0.0.12.tar.gz

Hashes for hermes_cai-0.0.12.tar.gz
Algorithm	Hash digest
SHA256	`bf90805cf263a79d14d3258a304bb2b70eada96591b2d36ef69f29f2ba9fcec6`
MD5	`d690cf21d0ee67c4c475c2262b411277`
BLAKE2b-256	`a4f9bf26352b2cdf1e22c0b280006e91e5f1b2e5e6426c4f9a8a1964b92398f4`

Hashes for hermes_cai-0.0.12-py3-none-any.whl

Hashes for hermes_cai-0.0.12-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b772c22e2832e5a5fd426d0db4b785188b567f36ae11754ad4c7796f07d146a4`
MD5	`01ddb2b0fc62b89895caad8ff69518d7`
BLAKE2b-256	`ee532fbddcabe8e721a113604db0379bc60d0d379691a075800e514508ba6e6d`