Sinapsis templates for llama.cpp text generation.

These details have not been verified by PyPI

Project links

Project description

Sinapsis LLaMA CPP

Sinapsis templates for local GGUF-backed text completion, streaming, and MCP with llama-cpp-python.

🐍 Installation • 🚀 Features • 📚 Usage example • 📙 Documentation • 🔍 License

The sinapsis-llama-cpp package provides Sinapsis templates built on top of llama-cpp-python for running local or Hugging Face-hosted GGUF models through LLMConversationPacket.

🐍 Installation

Install using your preferred package manager. We strongly recommend using uv.

Install the base package:

uv pip install sinapsis-llama-cpp --extra-index-url https://pypi.sinapsis.tech

Or with raw pip:

pip install sinapsis-llama-cpp --extra-index-url https://pypi.sinapsis.tech

[!IMPORTANT] If you also want the upstream llama-cpp-python HTTP server, install the optional server extra:
uv pip install sinapsis-llama-cpp[server] --extra-index-url https://pypi.sinapsis.tech
Or install all optional dependencies:
uv pip install sinapsis-llama-cpp[all] --extra-index-url https://pypi.sinapsis.tech

🚀 Features

Templates Supported

LLaMACPPTextCompletion: Standard llama.cpp chat completion using LLMConversationPacket.
LLaMACPPStreamingTextCompletion: Async streaming variant that yields partial packets during generation.
LLaMACPPTextCompletionWithMCP: llama.cpp chat completion with packet-native MCP tool state.

🧩 Common Attributes

init_args (LLaMACPPInitArgs, required): llama.cpp runtime arguments.
- llm_model_name (str, required): Hugging Face repo id or local directory containing the GGUF file.
- llm_model_file (str, required): GGUF file name to load.
- Additional runtime controls include n_ctx, n_threads, n_gpu_layers, flash_attn_type, tensor_split, use_mmap, use_mlock, seed, and chat_format.
completion_args (LLaMACPPCompletionArgs, required): Request-time generation parameters such as max_tokens, temperature, top_p, top_k, min_p, penalties, stop sequences, and structured-output settings.
reasoning_start_tag / reasoning_end_tag (str | None, optional): Tags used to extract reasoning into LLMConversationPacket.reasoning before the final response is cleaned.

[!TIP] Use CLI command sinapsis info --all-template-names to show a list with all the available Template names installed with Sinapsis LLaMA CPP.

📚 Usage example

The following agent runs one local llama.cpp text-completion step using LLMConversationInput.

Config

agent:
  name: text_completion
  description: Single-shot llama-cpp text completion for Q&A and text generation.

templates:
  - template_name: InputTemplate
    class_name: InputTemplate
    attributes: {}

  - template_name: LLMConversationInput
    class_name: LLMConversationInput
    template_input: InputTemplate
    attributes:
      prompt: Give three short tips for staying organized during a busy week.
      system_prompt: You are a helpful assistant.

  - template_name: LLaMACPPTextCompletion
    class_name: LLaMACPPTextCompletion
    template_input: LLMConversationInput
    attributes:
      init_args:
        llm_model_name: unsloth/Qwen3.5-9B-GGUF
        llm_model_file: Qwen3.5-9B-Q4_K_M.gguf
        n_ctx: 8192
        n_threads: 8
        n_gpu_layers: -1
        flash_attn_type: -1
        seed: 10
      completion_args:
        max_tokens: 4096
        temperature: 0.2
        seed: 10

📙 Documentation

Documentation for this and other sinapsis packages is available on the sinapsis website

Tutorials for different projects within sinapsis are available at sinapsis tutorials page

🔍 License

This project is licensed under the AGPLv3 license, which encourages open collaboration and sharing. For more details, please refer to the LICENSE file.

For commercial use, please refer to our official Sinapsis website for information on obtaining a commercial license.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.5.1

Jun 11, 2026

0.5.0

Mar 25, 2026

0.4.4

Mar 3, 2026

0.4.3

Feb 27, 2026

0.4.2

Feb 26, 2026

0.4.1

Feb 25, 2026

0.4.0

Feb 19, 2026

0.3.14

Jan 15, 2026

0.3.13

Dec 9, 2025

0.3.12

Nov 10, 2025

0.3.11

Nov 3, 2025

0.3.10

Sep 8, 2025

0.3.9

Aug 29, 2025

0.3.8

Aug 19, 2025

0.3.7

Aug 5, 2025

0.3.6

Jul 28, 2025

0.3.5

Jun 3, 2025

0.3.4

May 2, 2025

0.3.3

Apr 30, 2025

0.3.2

Apr 30, 2025

0.3.1

Apr 29, 2025

0.3.0

Apr 9, 2025

0.2.0

Apr 1, 2025

0.1.0

Mar 26, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sinapsis_llama_cpp-0.5.1.tar.gz (29.6 kB view details)

Uploaded Jun 11, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

sinapsis_llama_cpp-0.5.1-py3-none-any.whl (32.4 kB view details)

Uploaded Jun 11, 2026 Python 3

File details

Details for the file sinapsis_llama_cpp-0.5.1.tar.gz.

File metadata

Download URL: sinapsis_llama_cpp-0.5.1.tar.gz
Upload date: Jun 11, 2026
Size: 29.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.5.16

File hashes

Hashes for sinapsis_llama_cpp-0.5.1.tar.gz
Algorithm	Hash digest
SHA256	`f5b282fc6cd364b838b6dde51705c84be345d53f4536773473855a31b4848b71`
MD5	`dd1a16b6480be0629c0eb596d61196ae`
BLAKE2b-256	`b0118830cd8224165b69fc53e9eda94bd02250a670be8cd62389e760f96ae918`

See more details on using hashes here.

File details

Details for the file sinapsis_llama_cpp-0.5.1-py3-none-any.whl.

File metadata

Download URL: sinapsis_llama_cpp-0.5.1-py3-none-any.whl
Upload date: Jun 11, 2026
Size: 32.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.5.16

File hashes

Hashes for sinapsis_llama_cpp-0.5.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`696d8b62da7cfae87d239406088819b1fb1fc5ae05b82cc32b7379bb0e117e95`
MD5	`ef847b462ed193d58b0ce9e07366a637`
BLAKE2b-256	`fd6bbd2cc93d21dce6fc9b8de77960e0d64c71cd25e7cb07a91ad4fc92055299`

See more details on using hashes here.

sinapsis-llama-cpp 0.5.1

Navigation

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Project description

Sinapsis LLaMA CPP

Sinapsis templates for local GGUF-backed text completion, streaming, and MCP with llama-cpp-python.

🐍 Installation

🚀 Features

Templates Supported

📚 Usage example

📙 Documentation

🔍 License

Project details

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes