DeepSeek Model for DashAI
Project description
Phi Model Plugin for DashAI
This plugin integrates Microsoft's Phi language models into the DashAI framework using the llama.cpp backend. It provides a lightweight, efficient text generation system with support for quantized GGUF models.
Included Models
1. Phi-3 Mini 4K Instruct
- 3.8B parameter lightweight model from the Phi-3 family
- Designed for high-quality output with strong reasoning abilities
- Trained on synthetic and filtered public datasets
- Fine-tuned with supervised techniques and direct preference optimization
- Based on
microsoft/Phi-3-mini-4k-instruct-gguf - Uses GGUF file:
Phi-3-mini-4k-instruct-q4.gguf
2. Phi-4
- State-of-the-art open model developed by Microsoft Research
- Trained on high-quality public domain content, academic books, and Q&A datasets
- Emphasizes precise instruction-following and strong safety alignment
- Based on
microsoft/phi-4-gguf - Uses GGUF file:
phi-4-IQ3_M.gguf
Both models use the GGUF format and are compatible with CPU and GPU inference.
Components
PhiModel
- Implements the
TextToTextGenerationTaskModelinterface from DashAI - Uses the
llama.cppbackend with GGUF support - Automatically loads the correct quantized model file based on the selected model
- Performs chat-style completion with system/user/assistant messages
Features
-
Configurable text generation with:
max_tokens: Number of tokens to generatetemperature: Controls output randomnessfrequency_penalty: Reduces repetitioncontext_window: Max tokens per forward passdevice:"cpu"or"gpu"(auto-detected)
-
Efficient memory usage with quantized GGUF format
-
Automatic model loading from Hugging Face
-
Compatible with chat-style prompts (role-based message format)
Model Parameters
| Parameter | Description | Default |
|---|---|---|
model_name |
Model ID from Hugging Face | "microsoft/Phi-3-mini-4k-instruct-gguf" |
max_tokens |
Maximum number of tokens to generate | 100 |
temperature |
Sampling temperature (higher = more random) | 0.7 |
frequency_penalty |
Penalizes repeated tokens to encourage diversity | 0.1 |
context_window |
Maximum context window (tokens in prompt) | 512 |
device |
Device for inference ("gpu" or "cpu") |
Auto-detected |
Requirements
DashAIllama-cpp-python- Model files from Hugging Face:
Notes
This plugin uses the GGUF format, introduced by the llama.cpp team in August 2023.
GGUF replaces the older GGML format and is optimized for fast inference and low memory usage.
Both Phi-3 Mini and Phi-4 models have undergone supervised fine-tuning and preference optimization to improve instruction adherence and safety.
⚠️ These models are designed for inference only and are not intended for fine-tuning.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file dashai_phi_model_package-0.0.1.tar.gz.
File metadata
- Download URL: dashai_phi_model_package-0.0.1.tar.gz
- Upload date:
- Size: 19.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2dbb6e9ee6b63bba393cfe3916a8ea0c2c214e9ff66deec6520845917b99f431
|
|
| MD5 |
62ef7fa4725840440697eecc4ce8ac5b
|
|
| BLAKE2b-256 |
1f934eeedf5eb783fcdd90d07355c704ddf08ea539befff63914bda5026fcb2a
|
File details
Details for the file dashai_phi_model_package-0.0.1-py3-none-any.whl.
File metadata
- Download URL: dashai_phi_model_package-0.0.1-py3-none-any.whl
- Upload date:
- Size: 4.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8d9d239e8c1d1fd516173c5d6d90d16df3d10d661eeac07e8bc39e5efb407e6d
|
|
| MD5 |
80fcb116e64f13716a1c273a74a977bd
|
|
| BLAKE2b-256 |
06f02d948405c2d8386be0ee8658752f7b9be1e59dea75b68978329b721169e4
|