Skip to main content

Run next-generation LLMs and VLMs locally, natively, and at top speed on any hardware.

Project description

nexaai

Run next-generation LLMs and VLMs locally, natively, and at top speed on any hardware.

nexaai is the official Python SDK for NexaSDK/NexaML:
An NPU‑first cross‑platform inference engine that powers LLMs, VLMs, embeddings, audio, and vision models across NPU, GPU, and CPU—on desktop, mobile, automotive, and IoT.


Features

  • 🚀 NPU‑First Performance — Accelerated AI with automatic device selection and backend optimizations.
  • 🔮 Day‑0 Architecture Support — Run new LLMs, VLMs, ASR, and CV models immediately (GGUF, MLX, .nexa, and more).
  • 🧩 True Multimodality — Seamless pipelines for text, vision, and audio.
  • 🤖 OpenAI-Compatible Server — Supports serving, chat, and function calling.
  • 🏆 Cross‑Platform — macOS (Apple Silicon), Windows (x64, ARM64), Linux, mobile, embedded.

What is it?

nexaai offers a clean Python API over NexaML’s runtime. Build modern AI applications—chatbots, copilots, vision tools—that run fully on-device with maximal hardware utilization. Designed for rapid adoption of new models and formats, especially with NPU acceleration.


Installation & Quickstart

Please refer to our official docs for up-to-date install steps, environment notes, and code examples:

The docs include supported Python versions, backend requirements (NPU, MLX, GPU, CPU), and ready-to-run examples for LLMs, VLMs, embeddings, and audio.


Links


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nexaai-1.0.37rc5.tar.gz (53.9 kB view details)

Uploaded Source

File details

Details for the file nexaai-1.0.37rc5.tar.gz.

File metadata

  • Download URL: nexaai-1.0.37rc5.tar.gz
  • Upload date:
  • Size: 53.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.19

File hashes

Hashes for nexaai-1.0.37rc5.tar.gz
Algorithm Hash digest
SHA256 b297e83a844b2b5c5a9798de5bdc1a25a7719c5a299b77e6d0a86df75378239b
MD5 f5dfe2e7be19a6a97f8e319d5b2413ed
BLAKE2b-256 7ca8dbada6e4b7e3da3eb6df2dce24e0a6f319faac72a6355fa53721d8769dd5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page