Skip to main content

Gemma Model for DashAI

Project description

Gemma Model Plugin for DashAI

This plugin integrates Google's Gemma 3 language models into the DashAI framework using the llama.cpp backend. It enables efficient and flexible text generation with GGUF quantized models and supports private access using a Hugging Face API token.

Included Models

1. Gemma 3 1B It QAT

2. Gemma 3 4B It QAT

Both models are instruction-tuned, designed for high-quality generation and compatibility with CPU or GPU inference using llama.cpp.

About Gemma

Gemma is a family of lightweight, state-of-the-art open models from Google, developed with the same technology as the Gemini models.
Key features of Gemma 3 models:

  • Multimodal: support text and image input (in general; this plugin currently handles text-only generation)
  • Large context window: up to 128K tokens
  • Instruction-tuned variants available
  • Multilingual: over 140 languages supported
  • Open weights with access control via Hugging Face

Gemma is designed for deployment on laptops, desktops, and cloud infrastructure, making advanced AI more accessible.

Features

  • Text generation via chat-style prompt completion
  • GGUF format for optimized performance and memory usage
  • Configurable generation parameters:
    • max_tokens: Output length
    • temperature: Output randomness
    • frequency_penalty: Controls repetition
    • context_window: Number of tokens per forward pass
    • device: "gpu" or "cpu"
  • Automatic login to Hugging Face to access gated models

Model Parameters

Parameter Description Default
model_name Model ID from Hugging Face "google/gemma-3-4b-it-qat-q4_0-gguf"
huggingface_key Hugging Face API token to access restricted models Required
max_tokens Maximum number of tokens to generate 100
temperature Sampling temperature (higher = more random) 0.7
frequency_penalty Penalizes repeated tokens to encourage diversity 0.1
context_window Maximum context window (tokens in prompt) 512
device Inference device ("gpu" or "cpu") "gpu" if available

Requirements

⚠️ Access Notice: You must accept the model terms on Hugging Face and use a valid Hugging Face token.
This repository is publicly accessible, but gated. You need to agree to share your contact information to access the model files.

Notes

This plugin uses the GGUF format, developed by the llama.cpp team for fast inference and low memory consumption.

The model is pretrained and instruction-tuned for inference and is not designed for fine-tuning.
Currently, this plugin supports only text generation (not image inputs).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dashai_gemma_model_package-0.0.1.tar.gz (19.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dashai_gemma_model_package-0.0.1-py3-none-any.whl (4.6 kB view details)

Uploaded Python 3

File details

Details for the file dashai_gemma_model_package-0.0.1.tar.gz.

File metadata

File hashes

Hashes for dashai_gemma_model_package-0.0.1.tar.gz
Algorithm Hash digest
SHA256 8ebca2e11a8674704c2cb7baeed45abb54e9d8675f3261bd3d3b86f851003423
MD5 c2056db798851f9562ffa411e5015143
BLAKE2b-256 9bae0750e2806a89f9ab22728ee1cf7bda67b5f64fc7cfe5e1efea09fe60da1e

See more details on using hashes here.

File details

Details for the file dashai_gemma_model_package-0.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for dashai_gemma_model_package-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 953907b4a9664b7c0c4385d529fd124b23d83ce9dc31e903c1102bc73b6c7e5d
MD5 cc655695933efa442bec58a94ba68d45
BLAKE2b-256 78cb62b8be783cf740c999a2879b59a6ac9340077eb8596bd03199ac01aecba9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page