Skip to main content

Run next-generation LLMs and VLMs locally, natively, and at top speed on any hardware.

Project description

nexaai

Run next-generation LLMs and VLMs locally, natively, and at top speed on any hardware.

nexaai is the official Python SDK for NexaSDK/NexaML:
An NPU‑first cross‑platform inference engine that powers LLMs, VLMs, embeddings, audio, and vision models across NPU, GPU, and CPU—on desktop, mobile, automotive, and IoT.


Features

  • 🚀 NPU‑First Performance — Accelerated AI with automatic device selection and backend optimizations.
  • 🔮 Day‑0 Architecture Support — Run new LLMs, VLMs, ASR, and CV models immediately (GGUF, MLX, .nexa, and more).
  • 🧩 True Multimodality — Seamless pipelines for text, vision, and audio.
  • 🤖 OpenAI-Compatible Server — Supports serving, chat, and function calling.
  • 🏆 Cross‑Platform — macOS (Apple Silicon), Windows (x64, ARM64), Linux, mobile, embedded.

What is it?

nexaai offers a clean Python API over NexaML’s runtime. Build modern AI applications—chatbots, copilots, vision tools—that run fully on-device with maximal hardware utilization. Designed for rapid adoption of new models and formats, especially with NPU acceleration.


Installation & Quickstart

Please refer to our official docs for up-to-date install steps, environment notes, and code examples:

The docs include supported Python versions, backend requirements (NPU, MLX, GPU, CPU), and ready-to-run examples for LLMs, VLMs, embeddings, and audio.


Links


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nexaai-1.0.44rc3.tar.gz (65.3 kB view details)

Uploaded Source

File details

Details for the file nexaai-1.0.44rc3.tar.gz.

File metadata

  • Download URL: nexaai-1.0.44rc3.tar.gz
  • Upload date:
  • Size: 65.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.19

File hashes

Hashes for nexaai-1.0.44rc3.tar.gz
Algorithm Hash digest
SHA256 3a50ef68870e81a9e99b8da2ffa466aa5f39ffd1b282461e6ad4e3887e179ceb
MD5 25703b67e5ff4bba6866adeea4a9fbb6
BLAKE2b-256 82b5b265ec1c43cdb7c5d5be2ce08615795cae7bed52db2c09aa2ce85ee9f639

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page