LLM access to DeepSeek's API
Project description
llm-deepseek
LLM access to DeepSeek's API
Installation
Install this plugin in the same environment as LLM.
llm install llm-deepseek-xtreme
Usage
First, set an API key for DeepSeek:
llm keys set deepseek
# Paste key here
Models (DeepSeek-V3.2)
deepseek-chat: V3.2 non-thinking mode, 128K context, json output + tool calls + chat prefix completiondeepseek-reasoner: V3.2 thinking mode, 128K context, json output + tool calls + chat prefix completion; reasoning always showndeepseek-reasoner-speciale: V3.2-Speciale thinking-only endpoint athttps://api.deepseek.com/v3.2_speciale_expires_on_20251215, up to 128K output, no tools/json/prefix completion (auto-registered)
Chat/reasoner calls use https://api.deepseek.com (Speciale uses its own base automatically). Text completions/FIM automatically use the beta endpoint https://api.deepseek.com/beta.
Pricing (from https://api-docs.deepseek.com/quick_start/pricing): $0.028/M input tokens (cache hit), $0.28/M input tokens (cache miss), $0.42/M output tokens.
Run llm models to list the models, and llm models --options to include a list of their options.
Running Prompts
Run prompts like this:
llm -m deepseek-chat "Describe a futuristic city on Mars"
llm -m deepseek-chat-completion "The AI began to dream, and in its dreams," -o echo true
llm -m deepseek-reasoner "Write a Python function to sort a list of numbers"
Note: The DeepSeek Reasoner model only supports the chat endpoint, not the completion endpoint.
DeepSeek Reasoner Model
The DeepSeek Reasoner model uses a Chain of Thought (CoT) approach to solve complex problems, showing its reasoning process before providing the final answer.
The plugin always shows the model's chain of thought reasoning in non-streaming mode. The reasoning feature is currently only supported in non-streaming mode.
# Normal usage - will show reasoning by default
llm -m deepseek-reasoner "What is 537 * 943?"
New Features
Prefill
The prefill option allows you to provide initial text for the model's response. This is useful for guiding the model's output.
Example:
llm -m deepseek-chat "What are some wild and crazy activities for a holiday party?" -o prefill "Here are some off-the-wall ideas to make your holiday party unforgettable [warning: these may not be suitable for work holiday parties]:"
You can also load prefill text from a file:
# Create a file with your prefill text
echo "Here are some unique holiday party ideas:" > prefill.txt
# Use the file path as the prefill value
llm -m deepseek-chat "What are some fun activities for a holiday party?" -o prefill prefill.txt
This is especially useful for longer prefill text that would be unwieldy on the command line.
JSON Response Format
The response_format option allows you to specify that the model should output its response in JSON format. To ensure the model outputs valid JSON, include the word "json" in the system or user prompt. Optionally, you can provide an example of the desired JSON format to guide the model.
Example:
llm -m deepseek-chat "What are some fun activities for a holiday party?" -o response_format json_object --system "json"
To guide the model further, you can provide an example JSON structure:
llm -m deepseek-chat "What are some way to tell if a holiday party is fun?" -o response_format json_object --system 'EXAMPLE JSON OUTPUT: {"event": "holiday_party_fun", "success_metric": ["..."]}'
Development
To set up this plugin locally, first checkout the code. Then create a new virtual environment:
cd llm-deepseek
python3 -m venv venv
source venv/bin/activate
Publish to PyPI after bumping the version:
pip install -e '.[test]' build twine
python -m build
twine upload dist/*
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file llm_deepseek_xtreme-0.2.2.tar.gz.
File metadata
- Download URL: llm_deepseek_xtreme-0.2.2.tar.gz
- Upload date:
- Size: 7.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
55d87eed6d370fec30c1312626c9738d5e12bd5ef59bd6758e2952c992c2f68c
|
|
| MD5 |
892b2840554f0039ed40e2dc0b617561
|
|
| BLAKE2b-256 |
5d0ac71b6a37903dcadc1226a67a77f05b5ec532c16ea31819f0898a0f29ad5e
|
File details
Details for the file llm_deepseek_xtreme-0.2.2-py3-none-any.whl.
File metadata
- Download URL: llm_deepseek_xtreme-0.2.2-py3-none-any.whl
- Upload date:
- Size: 8.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f593b5eaee222707f343380751bb08722b1c145d1eb95a292b80df546cf85ed5
|
|
| MD5 |
232349f2e6609f3b90fe4628da37d1b9
|
|
| BLAKE2b-256 |
4d15ba25a099ebea80eb8209d5a813e1f93d1aa542aae3c6ab865390d326f513
|