llama-index llms ipex-llm integration
Project description
LlamaIndex Llms Integration: IPEX-LLM
IPEX-LLM is a PyTorch library for running LLM on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) with very low latency. This module enables the use of LLMs optimized with ipex-llm
in LlamaIndex pipelines.
Installation
On CPU
pip install llama-index-llms-ipex-llm
On GPU
pip install llama-index-llms-ipex-llm[xpu] --extra-index-url https://pytorch-extension.intel.com/release-whl/stable/xpu/us/
Usage
from llama_index.llms.ipex_llm import IpexLLM
Examples
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for llama_index_llms_ipex_llm-0.2.1.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | c5106fe9fc29920a3dd164a8a8828ccb1602ada19b8864d13b061159697a753f |
|
MD5 | 9dc4a0e9f2dcb24ed535fba25c888e7f |
|
BLAKE2b-256 | 34dedd2b45699ae92b75262349ab502823b2dc63b01802207ae43c202ad71be3 |
Close
Hashes for llama_index_llms_ipex_llm-0.2.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f940fcb89cb60e1d08c55aa6c05404fa434fb39edddda75182d936bc8243331c |
|
MD5 | f90385036985ca0438e6b19ab5516b10 |
|
BLAKE2b-256 | 8192b14615388d3a403e568be50f12177f7061a4e83322ec3e328c654f0a6f64 |