Skip to main content

Run next-generation LLMs and VLMs locally, natively, and at top speed on any hardware.

Project description

nexaai

Run next-generation LLMs and VLMs locally, natively, and at top speed on any hardware.

nexaai is the official Python SDK for NexaSDK/NexaML:
An NPU‑first cross‑platform inference engine that powers LLMs, VLMs, embeddings, audio, and vision models across NPU, GPU, and CPU—on desktop, mobile, automotive, and IoT.


Features

  • 🚀 NPU‑First Performance — Accelerated AI with automatic device selection and backend optimizations.
  • 🔮 Day‑0 Architecture Support — Run new LLMs, VLMs, ASR, and CV models immediately (GGUF, MLX, .nexa, and more).
  • 🧩 True Multimodality — Seamless pipelines for text, vision, and audio.
  • 🤖 OpenAI-Compatible Server — Supports serving, chat, and function calling.
  • 🏆 Cross‑Platform — macOS (Apple Silicon), Windows (x64, ARM64), Linux, mobile, embedded.

What is it?

nexaai offers a clean Python API over NexaML’s runtime. Build modern AI applications—chatbots, copilots, vision tools—that run fully on-device with maximal hardware utilization. Designed for rapid adoption of new models and formats, especially with NPU acceleration.


Installation & Quickstart

Please refer to our official docs for up-to-date install steps, environment notes, and code examples:

The docs include supported Python versions, backend requirements (NPU, MLX, GPU, CPU), and ready-to-run examples for LLMs, VLMs, embeddings, and audio.


Links


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nexaai-1.0.37rc2.tar.gz (52.4 kB view details)

Uploaded Source

File details

Details for the file nexaai-1.0.37rc2.tar.gz.

File metadata

  • Download URL: nexaai-1.0.37rc2.tar.gz
  • Upload date:
  • Size: 52.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.19

File hashes

Hashes for nexaai-1.0.37rc2.tar.gz
Algorithm Hash digest
SHA256 0c64ecc4c0804a9300a58853ca7d4092da19ba5ac325436a016a39e1828566c3
MD5 cf2b224dbbe8c3946fd61e787459b50d
BLAKE2b-256 1256249a47f78f09abd42541ac4234eff05f5d1a73eee2bce9f271222041cade

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page