Skip to main content

Run next-generation LLMs and VLMs locally, natively, and at top speed on any hardware.

Project description

nexaai

Run next-generation LLMs and VLMs locally, natively, and at top speed on any hardware.

nexaai is the official Python SDK for NexaSDK/NexaML:
An NPU‑first cross‑platform inference engine that powers LLMs, VLMs, embeddings, audio, and vision models across NPU, GPU, and CPU—on desktop, mobile, automotive, and IoT.


Features

  • 🚀 NPU‑First Performance — Accelerated AI with automatic device selection and backend optimizations.
  • 🔮 Day‑0 Architecture Support — Run new LLMs, VLMs, ASR, and CV models immediately (GGUF, MLX, .nexa, and more).
  • 🧩 True Multimodality — Seamless pipelines for text, vision, and audio.
  • 🤖 OpenAI-Compatible Server — Supports serving, chat, and function calling.
  • 🏆 Cross‑Platform — macOS (Apple Silicon), Windows (x64, ARM64), Linux, mobile, embedded.

What is it?

nexaai offers a clean Python API over NexaML’s runtime. Build modern AI applications—chatbots, copilots, vision tools—that run fully on-device with maximal hardware utilization. Designed for rapid adoption of new models and formats, especially with NPU acceleration.


Installation & Quickstart

Please refer to our official docs for up-to-date install steps, environment notes, and code examples:

The docs include supported Python versions, backend requirements (NPU, MLX, GPU, CPU), and ready-to-run examples for LLMs, VLMs, embeddings, and audio.


Links


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nexaai-1.0.41.tar.gz (65.3 kB view details)

Uploaded Source

File details

Details for the file nexaai-1.0.41.tar.gz.

File metadata

  • Download URL: nexaai-1.0.41.tar.gz
  • Upload date:
  • Size: 65.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.19

File hashes

Hashes for nexaai-1.0.41.tar.gz
Algorithm Hash digest
SHA256 4b4b4b1c2f47820b4d40be72ac64c40e35af10365991b0f2eabe09f98dc6154e
MD5 d37ce589ccf5b9abed64e3e792eadc27
BLAKE2b-256 707991720a8c25eba601b18a83bd20f81a2000781ba83460710b8f401d807a0b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page