Skip to main content

Agentic AI framework for Nepal government service automation — offline-capable, Nepali-language aware, with RAG and MLOps infra built in.

Project description

Nepal GovAgent

Open-source agentic AI framework for Nepal's government service layer — RAG, multi-step task automation, and MLOps infra designed for offline deployment and Nepali language workflows.

License: MIT PyPI version PyPI Downloads Python Status


Why this exists

Nepal has a digital government layer. The Nagarik App has 1.5M+ registered citizens. Diyo AI has deployed Nepali-language chatbots for Lalitpur and Butwal municipalities. Paaila Technology has built Nepali NLU and TTS. Fusemachines is training hundreds of AI fellows.

The conversational layer exists. What doesn't exist is the intelligence layer above it.

Today, Nepal's government AI systems can answer questions. They cannot:

  • Retrieve and cite specific clauses from Nepal's legal and policy corpus with source tracing
  • Run reliably without internet connectivity in areas with poor network coverage
  • Handle Nepali and English queries against the same document corpus
  • Monitor and audit themselves at production scale

Nepal's National AI Policy 2082 explicitly identifies infrastructure, data, and sovereignty as its foundational pillars. The AI Association of Nepal (AIAN) has framed national AI readiness around four pillars: Data, Infrastructure, Policy, Resources.

This project is the infrastructure answer.


Install

pip install nepal-gov-agent

For answer generation with citations (optional):

pip install nepal-gov-agent[mistral]   # Mistral API
pip install nepal-gov-agent[ollama]    # Local Ollama

Quick start

from nepal_gov_agent import GovRAG

rag = GovRAG(corpus_dir="Data/")
result = rag.ask("What is the vision of Nepal's National AI Policy?")

print(result.answer)
print(result.confidence)   # "high" | "medium" | "low"
for src in result.sources:
    print(src["doc"], "page", src["page"])

Real output:

[National AI Policy-Final_uxc94vg.pdf, p.1] Introduction
The vision of Nepal's National AI Policy is to build an ethical, safe, and
human-centric AI ecosystem that promotes inclusive and sustainable socio-economic
growth through responsible use of artificial intelligence...

---

[National AI Policy-Final_uxc94vg.pdf, p.2] Objectives
The mission focuses on maximising AI use for Nepal's socio-economic development...

high
National AI Policy-Final_uxc94vg.pdf page 1
National AI Policy-Final_uxc94vg.pdf page 2
dnf_jbji8eb.pdf page 4

Use cases

1. Query the Nepal AI Policy (English)

from nepal_gov_agent import GovRAG

rag = GovRAG(corpus_dir="Data/")

result = rag.ask("What is the role of the National AI Centre?")
print(result.answer)
print("Confidence:", result.confidence)

Real output:

[National AI Policy-Final_uxc94vg.pdf, p.18] National AI Centre
The National AI Centre shall serve as the Secretariat of the AI Regulation
Council. It will promote, encourage, and regulate AI study, research,
development, and application. The Centre is tasked with conducting research
and development in the AI technology sector and supporting the capacity
development of local researchers and institutions...

Confidence: high

2. Nepali language query

result = rag.ask("नेपालको राष्ट्रिय AI नीतिको उद्देश्य के हो?")

print(result.answer)
print("Confidence:", result.confidence)
print("Fallback triggered:", result.fallback_triggered)

Real output:

[National AI Policy-Final_uxc94vg.pdf, p.1] Introduction
Nepal's AI Policy aims to build a reliable AI ecosystem, boost economic
productivity, improve public services, and strengthen governance. The
objectives include building infrastructure, promoting research, and
ensuring ethical and secure AI deployment...

Confidence: medium
Fallback triggered: False

3. Constitution query

result = rag.ask("What does the Constitution say about fundamental rights?")

print(result.answer[:600])
print("\nSources:")
for src in result.sources[:3]:
    print(f"  {src['doc']} — page {src['page']}")
    if src["heading"]:
        print(f"  Section: {src['heading']}")

Real output:

[Constitution of Nepal (2nd amd. English)_xf33zb3.pdf, p.12] Part 3 — Fundamental Rights
Every citizen shall have the following rights: Right to equality before
law. No discrimination shall be made against any citizen in the application
of general laws on grounds of origin, religion, race, caste, tribe, sex,
economic condition, language, region, ideology or on similar other grounds...

Sources:
  Constitution of Nepal (2nd amd. English)_xf33zb3.pdf — page 12
  Section: Part 3 — Fundamental Rights
  Constitution of Nepal (2nd amd. English)_xf33zb3.pdf — page 13
  Constitution of Nepal (2nd amd. English)_xf33zb3.pdf — page 14

4. Raw search (no answer generation)

blocks = rag.search("National AI Centre secretariat", k=5)

for b in blocks:
    print(b["doc_id"])
    print(b["title"])
    print(b["content"][:200])
    print("---")

Real output:

pdf:National AI Policy-Final_uxc94vg.pdf
None
10.2 National AI Centre
The National AI Centre shall serve as the Secretariat of the AI Regulation
Council. It will promote, encourage, and regulate AI study, research,
development and application...
---
pdf:National AI Policy-Final_uxc94vg.pdf
None
10.1 AI Regulation Council
The AI Regulation Council shall meet at least twice a year. The National
AI Centre shall serve as its Secretariat...
---

5. With LLM for cited answers (Mistral)

import os
from ragnav.llm.mistral import MistralClient
from nepal_gov_agent import GovRAG

os.environ["MISTRAL_API_KEY"] = "your_key_here"  # or load from .env

rag = GovRAG(corpus_dir="Data/")
llm = MistralClient()

result = rag.ask(
    "How many AI professionals does Nepal aim to train?",
    llm=llm,
    with_citations=True,
)
print(result.answer)

Real output (with citations):

Nepal aims to train at least 5,000 skilled AI professionals within five
years [[pdf:National AI Policy-Final_uxc94vg.pdf#p3]]. This target is
part of a broader capacity-building initiative that includes integrating
AI curricula from school to university level [[pdf:National AI Policy-Final_uxc94vg.pdf#p4]].

6. Run the benchmark

from nepal_gov_agent import GovRAG, run_benchmark

rag = GovRAG(corpus_dir="Data/")
results = run_benchmark(rag, verbose=True)
print(results.report())

Real output:

  ✓ [english] What is the vision of Nepal's National AI Policy?...
  ✓ [english] What are the four pillars of Nepal's AI readiness?...
  ✓ [english] What is the role of the National AI Centre?...
  ✓ [english] How many AI professionals does Nepal aim to train?...
  ✓ [english] What does the Constitution say about fundamental rights...
  ✓ [english] What is Digital Nepal Framework 2.0?...
  ✓ [nepali] नेपालको राष्ट्रिय AI नीतिको उद्देश्य के हो?...

============================================================
Nepal GovAgent Benchmark Results
============================================================
Total queries:      7
Recall@1:           0.571
Recall@3:           0.857
Recall@5:           1.000
Keyword hit rate:   1.000
Doc hit rate:       1.000
Nepali recall@3:    1.000
English recall@3:   0.833
============================================================

Note: These numbers measure retrieval, not generated answer quality. Recall@3 = 0.857 means the expected keywords appeared in the top 3 retrieved blocks for 6 of 7 queries. Doc hit rate = 1.000 means the expected source PDF appeared in the top‑5 hits for every query (harness normalizes the pdf: doc id prefix). See Benchmark: retrieval not answer quality below.


7. CLI

# Ask in English
nepal-gov-agent ask "What is Nepal's National AI Policy?"

# Ask in Nepali
nepal-gov-agent ask "नेपालको राष्ट्रिय AI नीतिको उद्देश्य के हो?"

# Custom corpus folder
nepal-gov-agent ask "What are fundamental rights?" --corpus /path/to/docs/

# Retrieve more blocks
nepal-gov-agent ask "AI infrastructure" --k 10

# Run benchmark
nepal-gov-agent benchmark --corpus Data/

# Show corpus stats
nepal-gov-agent stats

Real CLI output (ask):

============================================================
ANSWER
============================================================
[National AI Policy-Final_uxc94vg.pdf, p.1] Introduction
The vision of Nepal's National AI Policy is to build an ethical, safe,
and human-centric AI ecosystem...

============================================================
SOURCES
============================================================
1. National AI Policy-Final_uxc94vg.pdf (page 1)
   Excerpt: The vision of Nepal's National AI Policy is to build an ethical...
2. National AI Policy-Final_uxc94vg.pdf (page 2)
   Excerpt: The mission focuses on maximising AI use for Nepal's socio-econ...
3. dnf_jbji8eb.pdf (page 4)
   Excerpt: Digital Nepal Framework 2.0 builds upon the comprehensive vision...

Confidence: high

8. Corpus statistics

import logging

from nepal_gov_agent import GovRAG

logging.basicConfig(level=logging.WARNING)

rag = GovRAG(corpus_dir="Data/")
print(rag.stats)

Real output:

{
    "documents": 6,
    "blocks": 856,
    "corpus_dir": "Data/",
    "offline": True
}

9. Custom config

from nepal_gov_agent import GovRAG, GovRAGConfig

config = GovRAGConfig(
    w_bm25=0.7,              # More weight on keyword search
    w_vec=0.3,               # Less weight on semantic search
    k_final=12,              # Retrieve more blocks
    max_fallback_attempts=5, # More retries on low confidence
    cache_dir=".my_cache",
    embedding_model="all-MiniLM-L6-v2",
)

rag = GovRAG(corpus_dir="Data/", config=config)
result = rag.ask("What is the Digital Nepal Framework?")

What's in the corpus (seed)

Document Language
National AI Policy 2082 English
Constitution of Nepal (2nd amendment) English
Digital Nepal Framework 2.0 English
प्रतिनिधि सभा सदस्य निर्वाचन अध्यादेश २०८२ Nepali
मानव अधिकार पुरस्कार कोष सञ्चालन नियमावली २०७५ Nepali

Add your own PDFs to Data/ — they are automatically ingested on next run. Only PDFs in the root of Data/ are picked up (not subdirectories).


Under the hood

Nepal GovAgent
│
├── RAG Layer          →  ragnav==0.3.0
│   ├── PDF ingestion (PyMuPDF, page-level blocks)
│   ├── BM25 index (rank-bm25)
│   ├── Vector index (sentence-transformers, all-MiniLM-L6-v2)
│   ├── Hybrid retrieval (BM25 0.6 + vector 0.4, RRF fusion)
│   ├── Structure expansion (parent/child block hierarchy)
│   ├── QueryFallback for low-confidence query retries
│   ├── Inline citation enforcement
│   └── SQLite embedding + retrieval cache (offline)
│
├── Reliability Layer  →  ragfallback==2.0.2
│   └── Metrics and adaptive utilities (PyPI dependency; GovRAG benchmark uses keyword / Recall@k on retrieved blocks)
│
└── Agent Layer        →  agentensemble==0.3.0 (Phase 2)
    └── Multi-step workflows, SQLite session memory (planned)

Key design decisions:

  • BM25 weight 0.6 > vector 0.4: Nepal gov docs use specific legal terminology that keyword search handles better than embeddings
  • Offline by default: all-MiniLM-L6-v2 runs on CPU with no API key needed
  • SQLite cache: embeddings cached locally — second run is instant
  • Fallback only on ConfidenceLevel.LOW: avoids unnecessary LLM calls on already-confident retrievals

Benchmark: retrieval, not answer quality

The built-in benchmark measures retrieval quality — whether the right content appears in the top-k retrieved blocks. It does not evaluate answer accuracy, fluency, or factual correctness.

When presenting results to stakeholders:

"Recall@3 = 0.857 means that for 6 out of 7 test queries, the relevant section of the government document appeared in the top 3 retrieved chunks."

This is a meaningful signal for infrastructure quality. Answer quality evaluation (LLM-as-judge, BLEU/ROUGE) is planned for Phase 2.


The gap this fills

Layer Who's doing it What's missing
Nepali NLP / NLU Paaila Technology, Diyo AI
Government chatbots Diyo AI (Muna), Fusemachines
AI training / fellowships Fusemachines
Legal RAG with citation Nobody ← This project
Adaptive retrieval fallback Nobody ← This project
Offline-capable full stack Nobody ← This project

Nepal context


Roadmap

Phase 1 — RAG core ✅

  • GovRAG class: hybrid BM25 + vector retrieval over Nepal gov corpus
  • Offline embedding with sentence-transformers
  • Adaptive retrieval fallback on low confidence
  • Inline citation support (with external LLM)
  • nepal-gov-agent ask / benchmark / stats CLI
  • Nepal-specific benchmark harness (Recall@k, keyword hit rate)
  • 9 tests passing

Phase 2 — Agent capabilities

  • GovAgent class: multi-step task execution via AgentEnsemble
  • Common government workflow templates (citizenship, permits, licenses)
  • Nagarik App integration layer
  • Corpus expansion: ministry circulars 2080–2082

Phase 3 — Production infra

  • MLOps monitoring dashboard
  • Audit trail: every agent action logged and explainable
  • Deployment guide for municipal servers (low-spec hardware)
  • Bhashini + Bolna integration for voice workflows

Contributing

See CONTRIBUTING.md for guidelines.

Highest priority: Nepal government document corpus. If you have publicly available PDFs — ministry circulars, SOPs, municipality guidelines — open an issue or PR adding them to Data/.


Built on

Library Role Version
RAGNav Retrieval, citations, PDF ingestion >=0.3.0
ragfallback Reliability, fallback, diagnostics >=2.0.2
AgentEnsemble Orchestration, memory, planning >=0.3.0

All three are open-source, MIT licensed, and independently usable on PyPI.


About

Built by Irfan AliGitHub | Founder, DataCortex IQ | 7+ years in LLM engineering, RAG pipelines, and agentic AI | 11 open-source Python libraries on PyPI | M.Sc. Data Science & AI, IISER Tirupati.

Built in the spirit of AIAN's four pillars: Data. Infrastructure. Policy. Resources.


License

MIT — free for government, private sector, and academic use. No strings.


Working on Nepal's AI infrastructure layer? Open an issue or reach out directly.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nepal_gov_agent-0.1.0.tar.gz (22.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nepal_gov_agent-0.1.0-py3-none-any.whl (17.8 kB view details)

Uploaded Python 3

File details

Details for the file nepal_gov_agent-0.1.0.tar.gz.

File metadata

  • Download URL: nepal_gov_agent-0.1.0.tar.gz
  • Upload date:
  • Size: 22.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.1

File hashes

Hashes for nepal_gov_agent-0.1.0.tar.gz
Algorithm Hash digest
SHA256 b88a5214924e0589f388bd1b7dce2cb74b830ab6be7119d70ea498f3c2317053
MD5 f4bf6518f87ae51fa059c8292480b7ec
BLAKE2b-256 f52bb0e4c4d620fd6e481a09cfbf61b56451621c27d79731d26dbeac40049c90

See more details on using hashes here.

File details

Details for the file nepal_gov_agent-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for nepal_gov_agent-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 9a1a8a6eebf5e36668d425705ec44d5998d723821fd124d54d77932973c52edf
MD5 52b3b910c8e5047797fc672b9d8fb851
BLAKE2b-256 1c7a2186cb19126a147151e4726416e7d8f2d823790c7cff86982ca206b9ccf3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page