LLM utilities for the Llama project

These details have not been verified by PyPI

Project description

llama-utils

LlamaIndex utility package

PyPI - Python Version

GitHub last commit GitHub forks GitHub Repo stars

GitHub commits since latest release (by SemVer including pre-releases) GitHub last commit

Current release info

Name	Downloads	Version	Platforms

llama-utils - Large Language Model Utility Package

llama-utils is a large language model utility package

Main Features

llama-index

Package Overview

graph TB
    Package[llama-utils]
    Package --> SubPackage1[Indexing]
    Package --> SubPackage3[Storage]
    SubPackage1 --> Module1[index_manager.py]
    SubPackage1 --> Module2[custom_index.py]
    SubPackage3 --> Module5[storage.py]
    SubPackage3 --> Module6[config_loader.py]

complete overview of the design and architecture here

Installing llama-utils

Installing llama-utils from the conda-forge channel can be achieved by:

conda install -c conda-forge llama-utils=0.3.0

It is possible to list all the versions of llama-utils available on your platform with:

conda search llama-utils --channel conda-forge

Install from GitHub

to install the last development to time, you can install the library from GitHub

pip install git+https://github.com/Serapieum-of-alex/llama-utils

pip

to install the last release, you can easily use pip

pip install llama-utils==0.3.0

Quick start

First download ollama from here ollama and install it.
Then run the following command to pull the llama3 model

ollama pull llama3

Then run ollama server (if you get an error, check the errors section below to solve it)

ollama serve

Now you can use the llama-utils package to interact with the ollama server

from llama_utils.retrieval.storage import Storage
STORAGE_DIR= "examples/data/llama3"
storage = Storage.create()
data_path = "examples/data/essay"
docs = storage.read_documents(data_path)
storage.add_documents(docs)
storage.save(STORAGE_DIR)

Errors

You might face the following error when you run the ollama serve command

Error: listen tcp 127.0.0.1:11434: bind: Only one usage of each socket address (protocol/network address/port) is normally permitted.

This error is due to the port 11434 is already in use, to solve this error, you can check which process is using this port by running the following command

netstat -ano | findstr :11434

You will get the following output

    TCP    127.0.0.1:11434        0.0.0.0:0              LISTENING       20796

Then you can kill the process by running the following command

taskkill /F /PID 20796

you will gee the following output

SUCCESS: The process with PID 20796 has been terminated.

Then you can run the ollama serve command again, you should see the following output

2024/11/22 23:20:04 routes.go:1189: INFO server config env="map[CUDA_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_DEBUG:false OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://127.0.0.1:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE:5m0s OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:C:\\Users\\eng_m\\.ollama\\models OLLAMA_MULTIUSER_CACHE:false OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:0 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://*] OLLAMA_SCHED_SPREAD:false OLLAMA_TMPDIR: ROCR_VISIBLE_DEVICES:]"
time=2024-11-22T23:20:04.393+01:00 level=INFO source=images.go:755 msg="total blobs: 28"
time=2024-11-22T23:20:04.395+01:00 level=INFO source=images.go:762 msg="total unused blobs removed: 0"
time=2024-11-22T23:20:04.397+01:00 level=INFO source=routes.go:1240 msg="Listening on 127.0.0.1:11434 (version 0.4.1)"
time=2024-11-22T23:20:04.400+01:00 level=INFO source=common.go:49 msg="Dynamic LLM libraries" runners="[cpu cpu_avx cpu_avx2 cuda_v11 cuda_v12 rocm]"
time=2024-11-22T23:20:04.400+01:00 level=INFO source=gpu.go:221 msg="looking for compatible GPUs"
time=2024-11-22T23:20:04.400+01:00 level=INFO source=gpu_windows.go:167 msg=packages count=1
time=2024-11-22T23:20:04.400+01:00 level=INFO source=gpu_windows.go:214 msg="" package=0 cores=8 efficiency=0 threads=16
time=2024-11-22T23:20:04.592+01:00 level=INFO source=types.go:123 msg="inference compute" id=GPU-04f76f9a-be0a-544b-9a6f-8607b8d0a9ab library=cuda variant=v12 compute=8.6 driver=12.6 name="NVIDIA GeForce RTX 3060 Ti" total="8.0 GiB" available="7.0 GiB"

you can change the port by running the following command ollama serve --port 11435

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.3.0

Feb 5, 2025

0.2.0

Jan 23, 2025

0.1.0

Nov 12, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_utils-0.3.0.tar.gz (21.7 kB view details)

Uploaded Feb 5, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llama_utils-0.3.0-py3-none-any.whl (22.7 kB view details)

Uploaded Feb 5, 2025 Python 3

File details

Details for the file llama_utils-0.3.0.tar.gz.

File metadata

Download URL: llama_utils-0.3.0.tar.gz
Upload date: Feb 5, 2025
Size: 21.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.1

File hashes

Hashes for llama_utils-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`60760d9e32d01181678d821c8097d34634deed10c036261e3766cdfad4e150af`
MD5	`1e885c0850eed8ed814b5fd1c6c3faee`
BLAKE2b-256	`305ea90d1dc13c9ed88b926e1c74c8e598c85c60f67e07223a1c7cef7f7d655a`

See more details on using hashes here.

File details

Details for the file llama_utils-0.3.0-py3-none-any.whl.

File metadata

Download URL: llama_utils-0.3.0-py3-none-any.whl
Upload date: Feb 5, 2025
Size: 22.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.1

File hashes

Hashes for llama_utils-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`075640dbded99b51f477dccc86622283b33e653eca428bc10483bb396958c5cb`
MD5	`e397cdd9f63b57db7658355a2e812400`
BLAKE2b-256	`759fe33a243e1a5daf0218aa47b7990f8912b2bf096d2b4e326174be4ce7cd53`

See more details on using hashes here.

llama-utils 0.3.0

Navigation

Verified details

Maintainers

Meta

Unverified details

Meta

Classifiers

Project description

llama-utils

Current release info

llama-utils - Large Language Model Utility Package

Main Features

Package Overview

Installing llama-utils

Install from GitHub

pip

Quick start

Errors

Project details

Verified details

Maintainers

Meta

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes