First open-source AI agent for Northeast Indian languages

NE-Agent

Open-source AI agent for Northeast Indian languages. Built on the NE-Stack by MWire Labs, Shillong.


Quickstart

pip install ne-agent

Install Ollama and pull a model:

ollama pull qwen2.5:1.5b
ollama serve

Then run:

ne-agent
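
If the agent starts but cannot produce answers, a quick check that the local Ollama server is up and the model has been pulled can help. The sketch below is not part of NE-Agent; it assumes Ollama is listening on its default address (http://localhost:11434) and uses Ollama's /api/tags endpoint, which lists locally available models. The function name model_is_available is made up for illustration.

```python
import json
import urllib.error
import urllib.request

# Assumption: Ollama's default address. Adjust if your server runs elsewhere.
OLLAMA_URL = "http://localhost:11434"


def model_is_available(model: str = "qwen2.5:1.5b",
                       base_url: str = OLLAMA_URL,
                       timeout: float = 2.0) -> bool:
    """Return True if the Ollama server answers and lists `model` as pulled."""
    try:
        with urllib.request.urlopen(f"{base_url}/api/tags", timeout=timeout) as resp:
            payload = json.load(resp)
    except (urllib.error.URLError, OSError, ValueError):
        # Server not running, connection refused, timeout, or malformed JSON.
        return False
    if not isinstance(payload, dict):
        return False
    names = [entry.get("name", "") for entry in payload.get("models", [])]
    return model in names


if __name__ == "__main__":
    if model_is_available():
        print("Ollama is running and qwen2.5:1.5b is pulled")
    else:
        print("Ollama not reachable or model not pulled")
```

A False result means either `ollama serve` is not running or `ollama pull qwen2.5:1.5b` has not completed.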

What is NE-Agent?

NE-Agent is a terminal-based agentic assistant that speaks Northeast India's indigenous languages. It automatically detects the input language, then routes each query to the right tool — retrieval-augmented search, Khasi-English translation, or speech transcription — and generates responses, all running locally with no API keys required.
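
To make the routing step concrete, here is a minimal sketch of how a router's free-form reply could be mapped onto one of the three tools. It is an illustration only and does not reproduce NE-Agent's actual code: the names TOOLS, parse_tool_choice, and dispatch, the fallback to search, and the stub handlers are all hypothetical.

```python
from typing import Callable, Dict

# The three tool names mirror the ones listed for the Tool Router;
# everything else in this sketch is a hypothetical illustration.
TOOLS = ("search", "translate", "transcribe")


def parse_tool_choice(llm_reply: str, default: str = "search") -> str:
    """Map a free-form router reply to a known tool name, else fall back."""
    reply = llm_reply.strip().lower()
    for tool in TOOLS:
        if tool in reply:
            return tool
    return default


def dispatch(query: str, llm_reply: str,
             handlers: Dict[str, Callable[[str], str]]) -> str:
    """Run the handler for whichever tool the router reply names."""
    return handlers[parse_tool_choice(llm_reply)](query)


# Stub handlers standing in for the real retrieval, translation, and ASR tools.
handlers = {
    "search": lambda q: f"[search] {q}",
    "translate": lambda q: f"[translate] {q}",
    "transcribe": lambda q: f"[transcribe] {q}",
}

if __name__ == "__main__":
    print(dispatch("hello", "Tool: translate", handlers))
```

Falling back to search when the reply names no known tool is a design choice of this sketch: a small model can return unexpected text, and retrieval is the cheapest tool to run by default.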


Powered by NE-Stack

Component | Role | Model
NE-LID | Language identification | fastText, 11 languages, 99.09% accuracy
Tool Router | Dynamic tool selection | qwen2.5:1.5b (search / translate / transcribe)
NE-Embed | Multilingual embeddings | LaBSE fine-tuned, 768-dim
FAISS | Vector retrieval | IndexFlatIP, cosine similarity
NLLB (Khasi) | Khasi-English translation | Fine-tuned NLLB
NE-ASR | Speech transcription | Whisper-medium, 8 languages
Ollama | Local LLM | qwen2.5:1.5b (default)
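
A note on the FAISS entry: IndexFlatIP ranks by exact inner product, which equals cosine similarity only when the vectors are L2-normalized. The pure-Python sketch below shows that relationship on toy 2-dimensional vectors. It is a stand-in for illustration, not FAISS and not NE-Agent's code, and it assumes (the README does not say) that embeddings are normalized before indexing. All function names are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def l2_normalize(vec: Sequence[float]) -> List[float]:
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        return list(vec)
    return [x / norm for x in vec]


def inner_product(a: Sequence[float], b: Sequence[float]) -> float:
    return sum(x * y for x, y in zip(a, b))


def top_k(query: Sequence[float],
          corpus: Sequence[Sequence[float]],
          k: int = 2) -> List[Tuple[int, float]]:
    """Return (index, score) pairs for the k highest inner products.

    Both sides are normalized first, so each score is a cosine similarity.
    """
    q = l2_normalize(query)
    scored = [(inner_product(q, l2_normalize(doc)), i)
              for i, doc in enumerate(corpus)]
    scored.sort(reverse=True)
    return [(i, score) for score, i in scored[:k]]


if __name__ == "__main__":
    corpus = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]]
    print(top_k([3.0, 3.0], corpus, k=2))
```

Without the normalization step, longer vectors would score higher regardless of direction, so the ranking would no longer reflect cosine similarity.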

Supported Languages

Assamese · Khasi · Garo · Mizo · Meitei · Bodo · Kokborok · Nyishi · Nagamese · English

Translation currently supports Khasi. Transcription supports Khasi, Garo, Mizo, Nagamese, Kokborok, Assamese, Chakma, and Wancho.


Bring Your Own Corpus

The default corpus ships with 500 monolingual sentences each for Assamese, Khasi, Mizo, and Garo. To use your own documents, replace the .txt files in ne_agent/data/ with your own, formatted one sentence per line.
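
As a rough sketch of preparing a file in that one-sentence-per-line layout, the helper below collapses stray whitespace, drops blank lines, and removes exact duplicates before writing UTF-8 text. It is not shipped with NE-Agent: the function names and the output file name my_corpus.txt are hypothetical, and the file names the package actually expects inside ne_agent/data/ should be checked against the installed package.

```python
from pathlib import Path
from typing import Iterable, List


def clean_sentences(lines: Iterable[str]) -> List[str]:
    """Collapse whitespace, drop empty lines, and remove exact duplicates."""
    seen = set()
    cleaned = []
    for line in lines:
        sentence = " ".join(line.split())
        if sentence and sentence not in seen:
            seen.add(sentence)
            cleaned.append(sentence)
    return cleaned


def write_corpus(sentences: List[str], path: Path) -> int:
    """Write one sentence per line as UTF-8 and return the sentence count."""
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text("\n".join(sentences) + "\n", encoding="utf-8")
    return len(sentences)


if __name__ == "__main__":
    raw = ["  sentence   one ", "", "sentence one", "sentence two"]
    # Hypothetical output name; use the file names found in ne_agent/data/.
    count = write_corpus(clean_sentences(raw), Path("my_corpus.txt"))
    print(f"wrote {count} sentences")
```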


Requirements

  • Python 3.9+
  • Ollama with qwen2.5:1.5b pulled
  • GPU recommended for translation/transcription tools; search works on CPU

License

CC-BY-4.0 — MWire Labs, Shillong, Meghalaya.
