llama-index llms gaudi integration

LlamaIndex LLMs Integration with Intel Gaudi

Installation

pip install --upgrade-strategy eager "optimum[habana]"
pip install llama-index-llms-gaudi
pip install llama-index-llms-huggingface
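
To confirm the packages landed in the active environment, a quick standard-library check can help. This is a minimal sketch, not part of the package: the helper name `installed_versions` is hypothetical, and the distribution names are taken from the pip commands above, with `optimum-habana` assumed to be what the `habana` extra pulls in.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(distributions):
    """Map each distribution name to its installed version, or None if missing."""
    found = {}
    for name in distributions:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found


if __name__ == "__main__":
    # Distribution names assumed from the install commands above.
    wanted = [
        "optimum",
        "optimum-habana",
        "llama-index-llms-gaudi",
        "llama-index-llms-huggingface",
    ]
    for name, ver in installed_versions(wanted).items():
        print(f"{name}: {ver or 'not installed'}")
```

Any entry reported as not installed points at the pip command to rerun.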

Usage

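import argparse
import logging

from llama_index.core.prompts import PromptTemplate
from llama_index.llms.gaudi import GaudiLLM

logger = logging.getLogger(__name__)


# Assumed helper (not defined in the original snippet): builds a
# zephyr-style prompt from a list of chat messages.
def messages_to_prompt(messages):
    prompt = ""
    for message in messages:
        if message.role == "system":
            prompt += f"<|system|>\n{message.content}</s>\n"
        elif message.role == "user":
            prompt += f"<|user|>\n{message.content}</s>\n"
        elif message.role == "assistant":
            prompt += f"<|assistant|>\n{message.content}</s>\n"

    # Start with a (possibly empty) system block, matching query_wrapper_prompt.
    if not prompt.startswith("<|system|>\n"):
        prompt = "<|system|>\n</s>\n" + prompt

    # End with the assistant tag so the model generates the reply.
    return prompt + "<|assistant|>\n"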


def setup_parser(parser):
    parser.add_argument(...)
    args = parser.parse_args()
    return args


if __name__ == "__main__":
    parser = argparse.ArgumentParser(
        description="GaudiLLM Basic Usage Example"
    )
    args = setup_parser(parser)
    args.model_name_or_path = "HuggingFaceH4/zephyr-7b-alpha"

    llm = GaudiLLM(
        args=args,
        logger=logger,
        model_name="HuggingFaceH4/zephyr-7b-alpha",
        tokenizer_name="HuggingFaceH4/zephyr-7b-alpha",
        query_wrapper_prompt=PromptTemplate(
            "<|system|>\n</s>\n<|user|>\n{query_str}</s>\n<|assistant|>\n"
        ),
        context_window=3900,
        max_new_tokens=256,
        generate_kwargs={"temperature": 0.7, "top_k": 50, "top_p": 0.95},
        messages_to_prompt=messages_to_prompt,
        device_map="auto",
    )

    query = "Is the ocean blue?"
    print("\n----------------- Complete ------------------")
    completion_response = llm.complete(query)
    print(completion_response.text)
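
The `query_wrapper_prompt` in the usage example wraps each query in zephyr-style chat tags before it reaches the model. The sketch below reproduces that substitution with plain `str.format`, so the resulting prompt text can be inspected without Gaudi hardware. It assumes `PromptTemplate` fills `{query_str}` the same way; `wrap_query` is a hypothetical helper, not part of the package.

```python
# Same template string as the query_wrapper_prompt in the usage example.
ZEPHYR_TEMPLATE = "<|system|>\n</s>\n<|user|>\n{query_str}</s>\n<|assistant|>\n"


def wrap_query(query_str, template=ZEPHYR_TEMPLATE):
    """Return the prompt text produced by filling {query_str} in the template."""
    return template.format(query_str=query_str)


if __name__ == "__main__":
    # Show the exact text the model would receive for the example query.
    print(wrap_query("Is the ocean blue?"))
```

The prompt ends with the assistant tag, which is what cues the model to generate the reply rather than continue the user turn.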

Examples

