Gemma Model for DashAI

These details have not been verified by PyPI

Project links

Project description

Gemma Model Plugin for DashAI

This plugin integrates Google's Gemma 3 language models into the DashAI framework using the llama.cpp backend. It enables efficient and flexible text generation with GGUF quantized models and supports private access using a Hugging Face API token.

Included Models

1. Gemma 3 1B It QAT

Lightweight instruction-tuned model with 1.3B parameters
Quantized and optimized for local inference (q4_0 format)
Based on google/gemma-3-1b-it-qat-q4_0-gguf

2. Gemma 3 4B It QAT

Instruction-tuned model with 4B parameters
Balanced size and capability for local or cloud deployment
Based on google/gemma-3-4b-it-qat-q4_0-gguf

Both models are instruction-tuned, designed for high-quality generation and compatibility with CPU or GPU inference using llama.cpp.

About Gemma

Gemma is a family of lightweight, state-of-the-art open models from Google, developed with the same technology as the Gemini models.
Key features of Gemma 3 models:

Multimodal: support text and image input (in general; this plugin currently handles text-only generation)
Large context window: up to 128K tokens
Instruction-tuned variants available
Multilingual: over 140 languages supported
Open weights with access control via Hugging Face

Gemma is designed for deployment on laptops, desktops, and cloud infrastructure, making advanced AI more accessible.

Features

Text generation via chat-style prompt completion
GGUF format for optimized performance and memory usage
Configurable generation parameters:
- max_tokens: Output length
- temperature: Output randomness
- frequency_penalty: Controls repetition
- context_window: Number of tokens per forward pass
- device: "gpu" or "cpu"
Automatic login to Hugging Face to access gated models

Model Parameters

Parameter	Description	Default
`model_name`	Model ID from Hugging Face	`"google/gemma-3-4b-it-qat-q4_0-gguf"`
`huggingface_key`	Hugging Face API token to access restricted models	Required
`max_tokens`	Maximum number of tokens to generate	100
`temperature`	Sampling temperature (higher = more random)	0.7
`frequency_penalty`	Penalizes repeated tokens to encourage diversity	0.1
`context_window`	Maximum context window (tokens in prompt)	512
`device`	Inference device (`"gpu"` or `"cpu"`)	`"gpu"` if available

Requirements

DashAI
llama-cpp-python
Valid Hugging Face Access Token
Model files from Hugging Face:
- google/gemma-3-1b-it-qat-q4_0-gguf
- google/gemma-3-4b-it-qat-q4_0-gguf

⚠️ Access Notice: You must accept the model terms on Hugging Face and use a valid Hugging Face token.
This repository is publicly accessible, but gated. You need to agree to share your contact information to access the model files.

Notes

This plugin uses the GGUF format, developed by the llama.cpp team for fast inference and low memory consumption.

The model is pretrained and instruction-tuned for inference and is not designed for fine-tuning.
Currently, this plugin supports only text generation (not image inputs).

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.0.1

Jul 10, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dashai_gemma_model_package-0.0.1.tar.gz (19.5 kB view details)

Uploaded Jul 10, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

dashai_gemma_model_package-0.0.1-py3-none-any.whl (4.6 kB view details)

Uploaded Jul 10, 2025 Python 3

File details

Details for the file dashai_gemma_model_package-0.0.1.tar.gz.

File metadata

Download URL: dashai_gemma_model_package-0.0.1.tar.gz
Upload date: Jul 10, 2025
Size: 19.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.9.6

File hashes

Hashes for dashai_gemma_model_package-0.0.1.tar.gz
Algorithm	Hash digest
SHA256	`8ebca2e11a8674704c2cb7baeed45abb54e9d8675f3261bd3d3b86f851003423`
MD5	`c2056db798851f9562ffa411e5015143`
BLAKE2b-256	`9bae0750e2806a89f9ab22728ee1cf7bda67b5f64fc7cfe5e1efea09fe60da1e`

See more details on using hashes here.

File details

Details for the file dashai_gemma_model_package-0.0.1-py3-none-any.whl.

File metadata

Download URL: dashai_gemma_model_package-0.0.1-py3-none-any.whl
Upload date: Jul 10, 2025
Size: 4.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.9.6

File hashes

Hashes for dashai_gemma_model_package-0.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`953907b4a9664b7c0c4385d529fd124b23d83ce9dc31e903c1102bc73b6c7e5d`
MD5	`cc655695933efa442bec58a94ba68d45`
BLAKE2b-256	`78cb62b8be783cf740c999a2879b59a6ac9340077eb8596bd03199ac01aecba9`

See more details on using hashes here.

dashai-gemma-model-package 0.0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Gemma Model Plugin for DashAI

Included Models

1. Gemma 3 1B It QAT

2. Gemma 3 4B It QAT

About Gemma

Features

Model Parameters

Requirements

Notes

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes