
AI Optimization AppStore

Nebullvm is an ecosystem of open-source Apps that boost the performance of your AI systems. The optimization Apps are stack-agnostic and work with any library.

Data. Models. Hardware. These are not independent factors, and making optimal choices on all fronts is hard. Our open-source Apps help you combine these three factors seamlessly, bringing incredibly fast and efficient AI systems to your fingertips. Four App categories to push the boundaries of AI efficiency. Dozens of Apps.

If you like the idea, give us a star to show your support for the project ⭐

Accelerate Apps

Achieve sub-10 ms response times for any AI application, including generative and language models. Improve customer experience by serving near real-time inference.

  • Speedster: Automatically apply SOTA optimization techniques to achieve the maximum inference speed-up on your hardware (see the sketch after this list).
  • OptiMate: Interactive tool that guides savvy users toward the best inference performance for a given model/hardware setup.
  • LargeSpeedster: Automatically apply SOTA optimization techniques on large AI models to achieve the maximum acceleration on your hardware.
  • CloudSurfer: Discover the optimal inference hardware and cloud platform to run an optimized version of your AI model.
  • MatrixMaster: Boost your DL model's performance with MatrixMaster's custom-generated matrix multiplication algorithms (an open-source take on AlphaTensor).
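
To make the Accelerate workflow concrete, below is a minimal sketch of one optimization run with nebullvm's `optimize_model` entry point. The function path and argument names are assumptions based on this release's documentation, so check the docs linked below for the exact signature in your version.

```python
# A minimal sketch of an automatic optimization run with nebullvm.
# The `optimize_model` entry point and its arguments are assumptions
# based on this release's docs; verify the signature for your version.
import torch
import torchvision.models as models
from nebullvm.api.functions import optimize_model

model = models.resnet50()
# ~100 sample batches in the ((inputs,), label) format used by the API.
input_data = [((torch.randn(1, 3, 224, 224),), torch.tensor([0]))
              for _ in range(100)]

# nebullvm benchmarks the available backends (e.g. ONNX Runtime,
# TensorRT, OpenVINO) and returns the fastest model for this hardware.
optimized_model = optimize_model(
    model,
    input_data=input_data,
    optimization_time="constrained",  # skip the slowest search techniques
)

prediction = optimized_model(torch.randn(1, 3, 224, 224))
```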

Maximize Apps

Make your Kubernetes GPU infrastructure efficient. Simplify cluster management, maximize hardware utilization and minimize costs.

  • GPU Partitioner: Effortlessly maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning.
  • GPUs Elasticity: Maximize Kubernetes GPU resource utilization with flexible and efficient elastic quotas.

Extract Apps

Don't settle for generic AI models. Extract domain-specific knowledge from large foundation models to create portable, highly efficient AI models tailored to your use case.

  • Promptify: Effortlessly fine-tune large language and multi-modal models with minimal data and hardware requirements using p-tuning.
  • LargeOracle Distillation: Leverage advanced knowledge distillation to extract a small, efficient model from a larger one (illustrated in the sketch below).
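
As a rough illustration of the technique behind LargeOracle Distillation, here is a generic sketch of soft-target knowledge distillation in PyTorch. It shows the idea only; nothing below is the App's actual API, and the tiny linear models are placeholders.

```python
# A generic PyTorch sketch of soft-target knowledge distillation
# (Hinton et al., 2015), not the App's actual API: the student learns
# to match the teacher's temperature-softened output distribution.
import torch
import torch.nn as nn
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.7):
    # Soft targets: KL divergence between softened distributions,
    # rescaled by T^2 so gradient magnitudes stay comparable.
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2
    # Hard targets: ordinary cross-entropy on the true labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

teacher = nn.Linear(16, 10).eval()  # stand-in for the large model
student = nn.Linear(16, 10)         # the small model being extracted
optimizer = torch.optim.Adam(student.parameters(), lr=1e-3)

x, y = torch.randn(32, 16), torch.randint(0, 10, (32,))
with torch.no_grad():
    teacher_logits = teacher(x)
optimizer.zero_grad()
loss = distillation_loss(student(x), teacher_logits, y)
loss.backward()
optimizer.step()
```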

Simulate Apps

The time for trial and error is over. Simulate the performance of large models on different computing architectures to reduce time-to-market, maximize accuracy, and minimize costs.

  • Simulinf: Simulate the inference performance of your AI model on different hardware and cloud platforms (see the sketch after this list).
  • TrainingSim: Easily simulate and optimize the training of large AI models on distributed infrastructure.
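
To show the kind of reasoning such a simulator automates, here is a back-of-the-envelope latency estimate based on the classic roofline model. The `Hardware` type, device names, and all workload and hardware numbers are hypothetical placeholders, not measured specs.

```python
# A back-of-the-envelope sketch of the kind of estimate an inference
# simulator produces, using the classic roofline model. The hardware
# and workload figures below are illustrative, not measured.
from dataclasses import dataclass

@dataclass
class Hardware:
    name: str
    peak_tflops: float        # peak compute, TFLOP/s
    mem_bandwidth_gbs: float  # memory bandwidth, GB/s

def roofline_latency_ms(flops: float, bytes_moved: float, hw: Hardware) -> float:
    # Latency is bounded by whichever resource saturates first:
    # compute (FLOPs / peak) or memory traffic (bytes / bandwidth).
    compute_s = flops / (hw.peak_tflops * 1e12)
    memory_s = bytes_moved / (hw.mem_bandwidth_gbs * 1e9)
    return max(compute_s, memory_s) * 1e3

# A ResNet-50-scale forward pass: ~8 GFLOPs, ~100 MB of traffic (rough).
for hw in [Hardware("gpu-a", 125, 900), Hardware("gpu-b", 65, 600)]:
    print(f"{hw.name}: {roofline_latency_ms(8e9, 1e8, hw):.2f} ms")
```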

Couldn't find the optimization App you were looking for? Please open an issue or contact us at info@nebuly.ai, and we'll be happy to develop it together.


Join the community | Contribute to the library

Installation | Get started | Notebooks | Benchmarks

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nebullvm-0.6.0.tar.gz (90.0 kB, Source)

Built Distribution

nebullvm-0.6.0-py3-none-any.whl (157.0 kB, Python 3)
