Peer-to-peer distributed inference for open-source language models

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

 _                                                   ____   _
| |                                                 |  __`\(_)                
| |     __ _  ___   ___  _   _  __ _  __ _  ___     | |__) | |_ __   ___  ___ 
| |    / _` |/ _ \ / _ `| | | |/ _` |/ _` |/ _ \    |  ___/| | '_ \ / _ \/ __|
| |___| (_| | | | | (_| | |_| | (_| | (_| |  __/    | |    | | |_) |  __/\__ \
|______\__,_|_| |_|\__, |\__,_|\__,_|\__, |\___|    |_|    |_| .__/ \___||___/
                    __/ |            __/ |                   | |              
                   |___/            |___/                    |_|

Peer-to-peer distributed inference for open-source language models

PyPI - Downloads

Documentation & site: https://languagepipes.com

Language Pipes is an open-source distributed inference system built on the transformers library that splits large language model computation across multiple machines. By separating the model's text-handling components (embedding and output head) from its intermediate transformer layers, Language Pipes enables peer-to-peer inference.

Features

OpenAI-compatible API
Automatic model download by HuggingFace ID
Interactive TUI for configuration, monitoring, and control
Decentralized peer-to-peer network with optional AES encryption

How It Works

Language models process input through a sequence of transformer layers. Each layer performs matrix multiplications between learned weights and a hidden state tensor, passing the result to the next layer. Language Pipes distributes these layers across machines, splitting the memory cost across the network while keeping the text-handling components on the origin node.

The architecture provides architectural separation: layer models operate on continuous-valued tensors rather than discrete text while the end models keep text data on trusted systems. The privacy documentation provides a probabilistic threat model that quantifies the difficulty of known inversion attacks under various mitigation configurations.

Installation

Requires Python 3.10+. For GPU support, install the appropriate PyTorch version for your CUDA configuration:
https://pytorch.org/get-started/locally/

Install from pip:

pip install language-pipes

Quick Start

Launch the interactive TUI:

language-pipes

From the main menu, select New Configuration and give it a name to create a TOML config and open the dashboard (or Load Configuration to reopen one you've created before).

The dashboard is organized into tabs along the top: Home, Network, Models, Pipes, and Jobs. A fresh configuration has no node ID yet, so the only option on Home is Configure Network Server. Set a Node ID under Network > Configure, then return to Home and select Start Network Server. Once the network is running, the dashboard exposes the rest of setup: load models under Models > Layer Models / End Models, and configure and start the OpenAI-compatible API under Jobs > Server.

Configuration can also be edited directly as TOML files and run headlessly. See the CLI reference for details on running a saved configuration from the command line.

Two Node Example

This example distributes Qwen/Qwen3-1.7B across two computers. Node 1 hosts the End Model, so prompts and responses stay on Node 1, plus enough layers to fit in its memory. Node 2 hosts the remaining layers.

Node 1 (First Computer)

language-pipes

Select New Configuration and name it (e.g. node-1).

Network > Configure: set Node ID to node-1 and ensure Network IP is set to this machine's local IP address. Leave Network Key empty to disable encryption for this example. Peer Port defaults to 5000.
Back on Home, select Start Network Server.
Models > Installed: select Install New Model and enter Qwen/Qwen3-1.7B to download it.
Models > Layer Models: select Add Layer Model, choose Qwen/Qwen3-1.7B, a device (cpu or cuda:0), and a memory budget in GB (e.g. 2), then Save Model. Confirm to load it now.
Models > End Models: select Add End Model, choose Qwen/Qwen3-1.7B, and confirm to load it now.
Jobs > Server: ensure the Port is set to 8000 and select Start Server.

Node 2 (Second Computer)

language-pipes

Select New Configuration and name it (e.g. node-2).

Network > Configure: set Node ID to node-2. Under Bootstrap Nodes, add an entry with node-1's IP address and peer port (5000) so this node joins node-1's network.
Back on Home, select Start Network Server.
Models > Installed: install Qwen/Qwen3-1.7B as on Node 1.
Models > Layer Models: add Qwen/Qwen3-1.7B with a device and memory budget covering the remaining layers (e.g. 2 on cpu).

Once both nodes have loaded their layers, Pipes > Complete shows a completed pipe for Qwen/Qwen3-1.7B, and the model is ready for inference via node-1's Job Port.

Test the API

The model is accessible via the OpenAI-compatible API.

Example using the OpenAI Python library:

from openai import OpenAI

client = OpenAI(
    base_url="http://127.0.0.1:8000/v1",  # node-1 IP address and Job Port
    api_key="not-needed"  # only required if api_keys is set in the config
)

response = client.chat.completions.create(
    model="Qwen/Qwen3-1.7B",
    max_completion_tokens=100,
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Write a haiku about distributed systems."}
    ]
)

print(response.choices[0].message.content)

Install the OpenAI library with: pip install openai

See the OpenAI-compatible API documentation for the full endpoint reference and sampling parameter descriptions.

Supported Models

Language Pipes currently supports a few model families including Qwen3, Phi, Meta Llama 3.1/3.2, and Gemma 3. View all tested models here

Planned Improvements

Additional model architectures
INT8 and INT4 quantization (currently all inference uses fp16)
GGUF format support (currently requires safetensors)

Dependencies

Documentation

The docs are published as a website at https://erinclemmer.github.io/language-pipes (built from this folder by website/). The Markdown source of truth lives in documentation/:

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

erinclemmer

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

2.1.0

Jul 5, 2026

2.0.0

Jun 18, 2026

1.2.0

Feb 22, 2026

1.1.0

Feb 15, 2026

1.0.0

Feb 8, 2026

0.19.7

Jan 31, 2026

0.19.6

Jan 31, 2026

0.19.5

Jan 31, 2026

0.19.4

Jan 31, 2026

0.19.3

Jan 31, 2026

0.19.2

Jan 31, 2026

0.19.1

Jan 31, 2026

0.19.0

Jan 30, 2026

0.18.1

Jan 25, 2026

0.18.0

Jan 25, 2026

0.17.0

Jan 18, 2026

0.16.0

Jan 3, 2026

0.15.0

Jan 1, 2026

0.13.1

Dec 24, 2025

0.13.0

Dec 24, 2025

0.12.4

Dec 21, 2025

0.12.3

Dec 21, 2025

0.12.2

Dec 13, 2025

0.12.1

Dec 11, 2025

0.12.0

Dec 11, 2025

0.11.3

Dec 7, 2025

0.11.2

Dec 7, 2025

0.11.1

Dec 7, 2025

0.11.0

Dec 6, 2025

0.10.0

Dec 2, 2025

0.9.0

Nov 30, 2025

0.8.0

Nov 30, 2025

0.7.0

Nov 30, 2025

0.6.1

Nov 24, 2025

0.6.0

Nov 11, 2025

0.5.2

Sep 30, 2025

0.5.1

Sep 30, 2025

0.5.0

Sep 29, 2025

0.4.4

Sep 25, 2025

0.4.3

Sep 24, 2025

0.4.2

Sep 24, 2025

0.4.1

Sep 22, 2025

0.4.0

Sep 22, 2025

0.3.1

Sep 21, 2025

0.3.0

Sep 21, 2025

0.2.0

Sep 14, 2025

0.1.0

Sep 14, 2025

0.0.4

Sep 6, 2025

0.0.3

Sep 3, 2025

0.0.2

Sep 3, 2025

0.0.1

Sep 1, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

language_pipes-2.1.0.tar.gz (257.2 kB view details)

Uploaded Jul 5, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

language_pipes-2.1.0-py3-none-any.whl (164.4 kB view details)

Uploaded Jul 5, 2026 Python 3

File details

Details for the file language_pipes-2.1.0.tar.gz.

File metadata

Download URL: language_pipes-2.1.0.tar.gz
Upload date: Jul 5, 2026
Size: 257.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for language_pipes-2.1.0.tar.gz
Algorithm	Hash digest
SHA256	`a13897f837d28727c4a00f2b26f913ae4eb3f011d8d370985010214e3a5bc752`
MD5	`fde8a5984b312b9bbe9fbf8562d25eb2`
BLAKE2b-256	`cd159fa6116d47ac6f5b3232465a1af788c2385778dbcde6ef9731e90c55ea21`

See more details on using hashes here.

Provenance

The following attestation bundles were made for language_pipes-2.1.0.tar.gz:

Publisher: publish.yml on erinclemmer/language-pipes

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: language_pipes-2.1.0.tar.gz
- Subject digest: a13897f837d28727c4a00f2b26f913ae4eb3f011d8d370985010214e3a5bc752
- Sigstore transparency entry: 2083009364
- Sigstore integration time: Jul 5, 2026
Source repository:
- Permalink: erinclemmer/language-pipes@50d8d2790d939febeaa950d2e4fed08f3d9defdb
- Branch / Tag: refs/tags/2.1.0
- Owner: https://github.com/erinclemmer
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@50d8d2790d939febeaa950d2e4fed08f3d9defdb
- Trigger Event: release

File details

Details for the file language_pipes-2.1.0-py3-none-any.whl.

File metadata

Download URL: language_pipes-2.1.0-py3-none-any.whl
Upload date: Jul 5, 2026
Size: 164.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for language_pipes-2.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`da88e310e9e57972a48a1205ff7d0ee5ec18fbd4da43e8ccdeda098ed614b170`
MD5	`1d800206550a2e513e5424180893eeb4`
BLAKE2b-256	`d500a5321d6360060f32badebd20d2e81568a32c108c5a2a958b50a4a77a6389`

See more details on using hashes here.

Provenance

The following attestation bundles were made for language_pipes-2.1.0-py3-none-any.whl:

Publisher: publish.yml on erinclemmer/language-pipes

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: language_pipes-2.1.0-py3-none-any.whl
- Subject digest: da88e310e9e57972a48a1205ff7d0ee5ec18fbd4da43e8ccdeda098ed614b170
- Sigstore transparency entry: 2083009427
- Sigstore integration time: Jul 5, 2026
Source repository:
- Permalink: erinclemmer/language-pipes@50d8d2790d939febeaa950d2e4fed08f3d9defdb
- Branch / Tag: refs/tags/2.1.0
- Owner: https://github.com/erinclemmer
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@50d8d2790d939febeaa950d2e4fed08f3d9defdb
- Trigger Event: release

language-pipes 2.1.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Features

How It Works

Installation

Quick Start

Two Node Example

Node 1 (First Computer)

Node 2 (Second Computer)

Test the API

Supported Models

Planned Improvements

Dependencies

Documentation

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance