Gemma Model for DashAI
Project description
Gemma Model Plugin for DashAI
This plugin integrates Google's Gemma 3 language models into the DashAI framework using the llama.cpp backend. It enables efficient and flexible text generation with GGUF quantized models and supports private access using a Hugging Face API token.
Included Models
1. Gemma 3 1B It QAT
- Lightweight instruction-tuned model with 1.3B parameters
- Quantized and optimized for local inference (
q4_0format) - Based on
google/gemma-3-1b-it-qat-q4_0-gguf
2. Gemma 3 4B It QAT
- Instruction-tuned model with 4B parameters
- Balanced size and capability for local or cloud deployment
- Based on
google/gemma-3-4b-it-qat-q4_0-gguf
Both models are instruction-tuned, designed for high-quality generation and compatibility with CPU or GPU inference using llama.cpp.
About Gemma
Gemma is a family of lightweight, state-of-the-art open models from Google, developed with the same technology as the Gemini models.
Key features of Gemma 3 models:
- Multimodal: support text and image input (in general; this plugin currently handles text-only generation)
- Large context window: up to 128K tokens
- Instruction-tuned variants available
- Multilingual: over 140 languages supported
- Open weights with access control via Hugging Face
Gemma is designed for deployment on laptops, desktops, and cloud infrastructure, making advanced AI more accessible.
Features
- Text generation via chat-style prompt completion
- GGUF format for optimized performance and memory usage
- Configurable generation parameters:
max_tokens: Output lengthtemperature: Output randomnessfrequency_penalty: Controls repetitioncontext_window: Number of tokens per forward passdevice:"gpu"or"cpu"
- Automatic login to Hugging Face to access gated models
Model Parameters
| Parameter | Description | Default |
|---|---|---|
model_name |
Model ID from Hugging Face | "google/gemma-3-4b-it-qat-q4_0-gguf" |
huggingface_key |
Hugging Face API token to access restricted models | Required |
max_tokens |
Maximum number of tokens to generate | 100 |
temperature |
Sampling temperature (higher = more random) | 0.7 |
frequency_penalty |
Penalizes repeated tokens to encourage diversity | 0.1 |
context_window |
Maximum context window (tokens in prompt) | 512 |
device |
Inference device ("gpu" or "cpu") |
"gpu" if available |
Requirements
DashAIllama-cpp-python- Valid Hugging Face Access Token
- Model files from Hugging Face:
⚠️ Access Notice: You must accept the model terms on Hugging Face and use a valid Hugging Face token.
This repository is publicly accessible, but gated. You need to agree to share your contact information to access the model files.
Notes
This plugin uses the GGUF format, developed by the llama.cpp team for fast inference and low memory consumption.
The model is pretrained and instruction-tuned for inference and is not designed for fine-tuning.
Currently, this plugin supports only text generation (not image inputs).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file dashai_gemma_model_package-0.0.1.tar.gz.
File metadata
- Download URL: dashai_gemma_model_package-0.0.1.tar.gz
- Upload date:
- Size: 19.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8ebca2e11a8674704c2cb7baeed45abb54e9d8675f3261bd3d3b86f851003423
|
|
| MD5 |
c2056db798851f9562ffa411e5015143
|
|
| BLAKE2b-256 |
9bae0750e2806a89f9ab22728ee1cf7bda67b5f64fc7cfe5e1efea09fe60da1e
|
File details
Details for the file dashai_gemma_model_package-0.0.1-py3-none-any.whl.
File metadata
- Download URL: dashai_gemma_model_package-0.0.1-py3-none-any.whl
- Upload date:
- Size: 4.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
953907b4a9664b7c0c4385d529fd124b23d83ce9dc31e903c1102bc73b6c7e5d
|
|
| MD5 |
cc655695933efa442bec58a94ba68d45
|
|
| BLAKE2b-256 |
78cb62b8be783cf740c999a2879b59a6ac9340077eb8596bd03199ac01aecba9
|