Skip to main content

llama-index llms siliconflow integration

Project description

LlamaIndex Llms Integration: SiliconFlow

1. Product Introduction

SiliconCloud provides cost-effective GenAI services based on an excellent open-source foundation model. introduction: https://docs.siliconflow.cn/introduction

2. Product features

  • As a one-stop cloud service platform that integrates top large models, SiliconCloud is committed to providing developers with faster, cheaper, more comprehensive, and smoother model APIs.

    • SiliconCloud has been listed on Qwen2.5-72B, DeepSeek-V2.5, Qwen2, InternLM2.5-20B-Chat, BCE, BGE, SenseVoice-Small, Llama-3.1, FLUX.1, DeepSeek-Coder-V2, SD3 Medium, GLM-4-9B-Chat, A variety of open-source large language models, image generation models, code generation models, vector and reordering models, and multimodal large models, including InstantID.

    • Among them, Qwen 2.5 (7B), Llama 3.1 (8B) and other large model APIs are free to use, so that developers and product managers do not need to worry about the computing power costs caused by the R&D stage and large-scale promotion, and realize "token freedom".

  • Provide out-of-the-box large model inference acceleration services to bring a more efficient user experience to your GenAI applications.

3. Installation

pip install llama-index-llms-siliconflow

4. Usage

Complete/Chat

import asyncio
import os
from llama_index.core.llms import ChatMessage
from llama_index.llms.siliconflow import SiliconFlow

llm = SiliconFlow(
    api_key=os.getenv("SILICONFLOW_API_KEY"),
)

response = llm.complete("...")
print(response)

response = asyncio.run(llm.acomplete("..."))
print(response)

messages = [ChatMessage(role="user", content="...")]

response = llm.chat(messages)
print(response)

response = asyncio.run(llm.achat(messages))
print(response)

Function Calling

from llama_index.llms.siliconflow import SiliconFlow

llm = SiliconFlow(
    api_key=os.getenv("SILICONFLOW_API_KEY"),
)
tools = [
    {
        "type": "function",
        "function": {
            "name": "add",
            "description": "Compute the sum of two numbers",
            "parameters": {
                "type": "object",
                "properties": {
                    "a": {
                        "type": "int",
                        "description": "A number",
                    },
                    "b": {
                        "type": "int",
                        "description": "A number",
                    },
                },
                "required": ["a", "b"],
            },
        },
    },
    ...,
]
response = llm.complete("...", tools=tools)
print(llm.get_tool_calls_from_response(response))

# output
# [ToolSelection(tool_id='...', tool_name='add', tool_kwargs={'a': x, 'b': x})]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_llms_siliconflow-0.1.0.tar.gz (5.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

llama_index_llms_siliconflow-0.1.0-py3-none-any.whl (6.3 kB view details)

Uploaded Python 3

File details

Details for the file llama_index_llms_siliconflow-0.1.0.tar.gz.

File metadata

  • Download URL: llama_index_llms_siliconflow-0.1.0.tar.gz
  • Upload date:
  • Size: 5.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.10.12 Linux/6.5.0-1025-azure

File hashes

Hashes for llama_index_llms_siliconflow-0.1.0.tar.gz
Algorithm Hash digest
SHA256 a8659183e86411990109c03bdd85e1781ace36a642cd08a27fe150e9b6eaeefd
MD5 5e10f2f60345bcd304e5c4fa1be7983c
BLAKE2b-256 d020157daebddc646eb5e6f21ad8aa8e15703d9c5272edcd0aa0cd3e084fa0ca

See more details on using hashes here.

File details

Details for the file llama_index_llms_siliconflow-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_index_llms_siliconflow-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 76bceea0ac8dd03daa7711553b9d132684a3e8acac2edb111e1e8c3d892d197f
MD5 9cb978c74345548924f7ffff1ea958e0
BLAKE2b-256 44ac183253fd199a1da34dd66dc80ba37cd934ab0ce9ae30cd1c533ea1d92f93

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page