Skip to main content

Run next-generation LLMs and VLMs locally, natively, and at top speed on any hardware.

Project description

nexaai

Run next-generation LLMs and VLMs locally, natively, and at top speed on any hardware.

nexaai is the official Python SDK for NexaSDK/NexaML:
An NPU‑first cross‑platform inference engine that powers LLMs, VLMs, embeddings, audio, and vision models across NPU, GPU, and CPU—on desktop, mobile, automotive, and IoT.


Features

  • 🚀 NPU‑First Performance — Accelerated AI with automatic device selection and backend optimizations.
  • 🔮 Day‑0 Architecture Support — Run new LLMs, VLMs, ASR, and CV models immediately (GGUF, MLX, .nexa, and more).
  • 🧩 True Multimodality — Seamless pipelines for text, vision, and audio.
  • 🤖 OpenAI-Compatible Server — Supports serving, chat, and function calling.
  • 🏆 Cross‑Platform — macOS (Apple Silicon), Windows (x64, ARM64), Linux, mobile, embedded.

What is it?

nexaai offers a clean Python API over NexaML’s runtime. Build modern AI applications—chatbots, copilots, vision tools—that run fully on-device with maximal hardware utilization. Designed for rapid adoption of new models and formats, especially with NPU acceleration.


Installation & Quickstart

Please refer to our official docs for up-to-date install steps, environment notes, and code examples:

The docs include supported Python versions, backend requirements (NPU, MLX, GPU, CPU), and ready-to-run examples for LLMs, VLMs, embeddings, and audio.


Links


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nexaai-1.0.38rc5.tar.gz (61.4 kB view details)

Uploaded Source

File details

Details for the file nexaai-1.0.38rc5.tar.gz.

File metadata

  • Download URL: nexaai-1.0.38rc5.tar.gz
  • Upload date:
  • Size: 61.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.19

File hashes

Hashes for nexaai-1.0.38rc5.tar.gz
Algorithm Hash digest
SHA256 c64a7d3e5a769cf148232088d9d36a8f3acd8f1609d7dd6f3b02cbfc1014319d
MD5 2fe9c737644d9a33236a57d4f1e29678
BLAKE2b-256 eb780ffa614d3eb59c0ab01f354d2b10d999229f05ed3896054ad5fa67d839a3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page