Exa - PyTorch

Project description

Exa

An ultra-optimized inference library for running exascale LLMs locally on modern consumer-class GPUs.

Principles

  • Radical Simplicity: use super-powerful LLMs with as little code as possible.
  • Ultra-Optimized: high-performance classes that extract all the power from these LLMs.
  • Fluidity & Shapelessness: plug and play, and re-architect as you please.

🤝 Schedule a 1-on-1 Session

Book a 1-on-1 session with Kye, the creator, to discuss any issues, provide feedback, or explore how we can improve Exa for you.


📦 Installation 📦

You can install the package using pip:

pip install exxa
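
Note that the package installs as exxa but imports as exa (as the usage examples below assume), so a quick sanity check after installing is:

python -c "import exa"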

Usage

Inference

from exa import Inference

model = Inference(
    model_id="georgesung/llama2_7b_chat_uncensored",
    quantized=True,  # load the model with quantized weights
)

# run() takes a prompt string and returns the generated text
print(model.run("What is your name?"))
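
Since run() takes a single prompt string, several prompts can be handled with a plain loop. This sketch relies only on the Inference API shown above; the prompts themselves are illustrative:

from exa import Inference

model = Inference(
    model_id="georgesung/llama2_7b_chat_uncensored",
    quantized=True,
)

# Reuse one loaded model across prompts; each run() call returns generated text.
prompts = [
    "What is your name?",
    "Summarize the plot of Hamlet in one sentence.",
]
for prompt in prompts:
    print(model.run(prompt))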

GPTQ Inference

from exa import GPTQInference

model_id = "facebook/opt-125m"
# load a GPTQ-quantized model, capping sequence length at 400 tokens
model = GPTQInference(model_id=model_id, max_length=400)

prompt = "in a land far far away"
result = model.run(prompt)
print(result)
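
GPTQ is post-training weight quantization, typically to 4 bits, calibrated on a small dataset. For context, here is a rough sketch of the same idea using the Hugging Face transformers API directly; this illustrates GPTQ, not Exa's internals, and requires the optimum and auto-gptq packages:

from transformers import AutoModelForCausalLM, AutoTokenizer, GPTQConfig

model_id = "facebook/opt-125m"
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Quantize the weights to 4 bits, calibrating on the C4 dataset.
gptq_config = GPTQConfig(bits=4, dataset="c4", tokenizer=tokenizer)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",
    quantization_config=gptq_config,
)

inputs = tokenizer("in a land far far away", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_length=400)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))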

Quantize

from exa import Quantize

# quantize a model to 8 bits, offloading fp32 modules to the CPU when they don't fit on the GPU
quantize = Quantize(
    model_id="bigscience/bloom-1b7",
    bits=8,
    enable_fp32_cpu_offload=True,
)

quantize.load_model()
# Hugging Face repo IDs cannot contain spaces
quantize.push_to_hub("my-model")
quantize.load_from_hub("my-model")
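
The enable_fp32_cpu_offload flag suggests that Quantize wraps bitsandbytes-style 8-bit loading. As a sketch of the equivalent direct transformers call (an assumption about the internals, not Exa's actual code):

from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# 8-bit weights on the GPU; modules that don't fit are kept on the CPU in fp32.
bnb_config = BitsAndBytesConfig(
    load_in_8bit=True,
    llm_int8_enable_fp32_cpu_offload=True,
)
model = AutoModelForCausalLM.from_pretrained(
    "bigscience/bloom-1b7",
    quantization_config=bnb_config,
    device_map="auto",
)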

🎉 Features 🎉

  • World-Class Quantization: get the most out of your models with top-tier performance and preserved accuracy! 🏋️‍♂️

  • Automated PEFT: simplify your workflow and let the toolkit handle the optimizations! 🛠️

  • LoRA Configuration: flexible low-rank adapter configurations, a game-changer for performance; see the sketch after this list! 🌌

  • Seamless Integration: designed to work seamlessly with popular models like LLaMA, Falcon, and more! 🤖
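
For reference, a minimal LoRA setup with the Hugging Face peft library looks like the following. This illustrates LoRA itself, not Exa's own configuration API (which the README does not show); the rank and target modules are typical example values:

from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")

# Attach low-rank adapters to the attention projections; only these
# small matrices are trained while the base weights stay frozen.
lora_config = LoraConfig(
    r=8,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()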


💌 Feedback & Contributions 💌

We're excited about the journey ahead and would love to have you with us! For feedback, suggestions, or contributions, feel free to open an issue or a pull request. Let's shape the future of fine-tuning together! 🌱


License

MIT


Download files

Download the file for your platform.

Source Distribution

exxa-0.0.8.tar.gz (7.9 kB)

Built Distribution

exxa-0.0.8-py3-none-any.whl (8.0 kB)

File details

Details for the file exxa-0.0.8.tar.gz.

File metadata

  • Download URL: exxa-0.0.8.tar.gz
  • Upload date:
  • Size: 7.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.2 CPython/3.11.0 Darwin/22.4.0

File hashes

Hashes for exxa-0.0.8.tar.gz

  • SHA256: 43d74106fc27cec9f61f933dd4e9752b1e9323c35ed092465dc9540ebd0a0fd7
  • MD5: 901a58c2ae0a201e4d4a62481295eee1
  • BLAKE2b-256: 71c34637d2e4fcb10ba4e6e9c71368e4901d9b308df1cbb773217fa9547592f4


File details

Details for the file exxa-0.0.8-py3-none-any.whl.

File metadata

  • Download URL: exxa-0.0.8-py3-none-any.whl
  • Upload date:
  • Size: 8.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.2 CPython/3.11.0 Darwin/22.4.0

File hashes

Hashes for exxa-0.0.8-py3-none-any.whl

  • SHA256: b6ebfc014cf7b3af1003993716c6399c29fc88f63ff966b57b5f389cb90c19e5
  • MD5: c15ce1db9cca702c6876af60aff2f54e
  • BLAKE2b-256: 3f038174c55fd2ff35bcb64e97d9b830429663d29ba2d1682f51b2ffebaafb5f

