Large Language Models Tools
Project description
Environment
To create a Python virtual environment, use the command:
conda env create -f environment.yml
Installation
pip install lmtools
Supported Models
The following LLM models are supported:
- CohereForAI/c4ai-command-r-08-2024
- google/gemma-2-27b-it
- mistralai/Mistral-Large-Instruct-2407
LLM backends
The following LLM backends are supported:
- Llama.cpp Server API
Run Llama.CPP Server backend
llama.cpp/build/bin/llama-server -m model_q5_k_m.gguf -ngl 99 -fa -c 4096 --host 0.0.0.0 --port 8000
Install CUDA toolkit for Llama.cpp compilation
Please note that the toolkit version must match the driver version. The driver version can be found using the nvidia-smi command. To install toolkit for CUDA 12.5 you need to run the following commands:
CUDA_TOOLKIT_VERSION=12-5
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64/cuda-keyring_1.1-1_all.deb
sudo dpkg -i cuda-keyring_1.1-1_all.deb
sudo apt update
sudo apt -y install cuda-toolkit-${CUDA_TOOLKIT_VERSION}
echo -e '
export CUDA_HOME=/usr/local/cuda
export PATH=${CUDA_HOME}/bin:${PATH}
export LD_LIBRARY_PATH=${CUDA_HOME}/lib64:$LD_LIBRARY_PATH
' >> ~/.bashrc
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
smart_context-0.0.4.tar.gz
(5.6 kB
view details)
Built Distribution
File details
Details for the file smart_context-0.0.4.tar.gz
.
File metadata
- Download URL: smart_context-0.0.4.tar.gz
- Upload date:
- Size: 5.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 81ab1ba0d0a00f8393cae4c643201f0951c1acf340028df53e45dea4a5c2d7c4 |
|
MD5 | 03eddf48700e285b83017414c357a7fc |
|
BLAKE2b-256 | 9a9c74f0a1d081f3dd3864e9f596cac88dd188d61fbadfc399987e16cfd21509 |
File details
Details for the file smart_context-0.0.4-py3-none-any.whl
.
File metadata
- Download URL: smart_context-0.0.4-py3-none-any.whl
- Upload date:
- Size: 6.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8c123ed6a7fca85fe1463dfd813a7b5c98bd7a2836642125afb8d1a0ec3b21f8 |
|
MD5 | d84925ff13b7ab1a7ec70176c61ac9c4 |
|
BLAKE2b-256 | 5a303fb0a91be87be70918a5608eb91d7a024ab71a15241c597595ff37823470 |