Skip to main content

llama-index llms siliconflow integration

Project description

LlamaIndex Llms Integration: SiliconFlow

1. Product Introduction

SiliconCloud provides cost-effective GenAI services based on an excellent open-source foundation model. introduction: https://docs.siliconflow.cn/introduction

2. Product features

  • As a one-stop cloud service platform that integrates top large models, SiliconCloud is committed to providing developers with faster, cheaper, more comprehensive, and smoother model APIs.

    • SiliconCloud has been listed on Qwen2.5-72B, DeepSeek-V2.5, Qwen2, InternLM2.5-20B-Chat, BCE, BGE, SenseVoice-Small, Llama-3.1, FLUX.1, DeepSeek-Coder-V2, SD3 Medium, GLM-4-9B-Chat, A variety of open-source large language models, image generation models, code generation models, vector and reordering models, and multimodal large models, including InstantID.

    • Among them, Qwen 2.5 (7B), Llama 3.1 (8B) and other large model APIs are free to use, so that developers and product managers do not need to worry about the computing power costs caused by the R&D stage and large-scale promotion, and realize "token freedom".

  • Provide out-of-the-box large model inference acceleration services to bring a more efficient user experience to your GenAI applications.

3. Installation

pip install llama-index-llms-siliconflow

4. Usage

Complete/Chat

import asyncio
import os
from llama_index.core.llms import ChatMessage
from llama_index.llms.siliconflow import SiliconFlow

llm = SiliconFlow(
    api_key=os.getenv("SILICONFLOW_API_KEY"),
)

response = llm.complete("...")
print(response)

response = asyncio.run(llm.acomplete("..."))
print(response)

messages = [ChatMessage(role="user", content="...")]

response = llm.chat(messages)
print(response)

response = asyncio.run(llm.achat(messages))
print(response)

Function Calling

from llama_index.llms.siliconflow import SiliconFlow

llm = SiliconFlow(
    api_key=os.getenv("SILICONFLOW_API_KEY"),
)
tools = [
    {
        "type": "function",
        "function": {
            "name": "add",
            "description": "Compute the sum of two numbers",
            "parameters": {
                "type": "object",
                "properties": {
                    "a": {
                        "type": "int",
                        "description": "A number",
                    },
                    "b": {
                        "type": "int",
                        "description": "A number",
                    },
                },
                "required": ["a", "b"],
            },
        },
    },
    ...,
]
response = llm.complete("...", tools=tools)
print(llm.get_tool_calls_from_response(response))

# output
# [ToolSelection(tool_id='...', tool_name='add', tool_kwargs={'a': x, 'b': x})]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_llms_siliconflow-0.2.1.tar.gz (6.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

llama_index_llms_siliconflow-0.2.1-py3-none-any.whl (6.8 kB view details)

Uploaded Python 3

File details

Details for the file llama_index_llms_siliconflow-0.2.1.tar.gz.

File metadata

  • Download URL: llama_index_llms_siliconflow-0.2.1.tar.gz
  • Upload date:
  • Size: 6.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.12.3 Linux/6.8.0-1021-azure

File hashes

Hashes for llama_index_llms_siliconflow-0.2.1.tar.gz
Algorithm Hash digest
SHA256 cb7fdb30bb5034077b8585e11971bdd62c85cf9c9c30659c41c460eddcd5ef63
MD5 91e0b6a149379bfad2ab603f83eae637
BLAKE2b-256 2dab18033a5296e33cdf92a9563245aeec50cddf35f594c5e4fd97cad1a0bf51

See more details on using hashes here.

File details

Details for the file llama_index_llms_siliconflow-0.2.1-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_index_llms_siliconflow-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b611ec3a58e3bc30d81e3385c91a1535267d98a6334744d4c4d924dc3be545a8
MD5 f4c82dede09208f374981d7ae06694c6
BLAKE2b-256 f57d6f6994162f89d0b760c64bdcfb03f8acc2896d90906e55f801e8725df8d5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page