semantic-router

Super fast semantic router for AI decision making

These details have not been verified by PyPI

Project description

Semantic Router is a superfast decision-making layer for your LLMs and agents. Rather than waiting for slow LLM generations to make tool-use decisions, we use the magic of semantic vector space to make those decisions — routing our requests using semantic meaning.
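The idea of routing by semantic meaning can be sketched in a few lines. This is only an illustration under stated assumptions, not the library's implementation: the helper names cosine and route, the 0.8 threshold, and the 3-dimensional toy vectors are all hypothetical, and a real encoder would return much longer vectors.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def route(query_vec, route_vecs, threshold=0.8):
    """Return the best-matching route name, or None if no route clears the threshold."""
    best_name, best_score = None, threshold
    for name, vecs in route_vecs.items():
        # A route scores as well as its closest utterance vector.
        score = max(cosine(query_vec, v) for v in vecs)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name


# Hypothetical toy "embeddings", made up for illustration only.
toy_routes = {
    "politics": [[0.9, 0.1, 0.0], [0.8, 0.2, 0.1]],
    "chitchat": [[0.1, 0.9, 0.0], [0.0, 0.8, 0.2]],
}

print(route([0.85, 0.15, 0.05], toy_routes))  # query vector close to the politics vectors
print(route([0.0, 0.1, 0.9], toy_routes))     # query vector far from both routes
```

Because the decision is a handful of vector comparisons rather than a text generation, it can be made without waiting on an LLM call.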

Read the Docs

Quickstart

To get started with semantic-router we install it like so:

pip install -qU semantic-router

❗️ To run a fully local version of semantic router you can use HuggingFaceEncoder and LlamaCppLLM (pip install -qU "semantic-router[local]"). To use the HybridRouteLayer you must pip install -qU "semantic-router[hybrid]".

We begin by defining a set of Route objects. These are the decision paths that the semantic router can choose between. Let's try two simple routes for now, one for talk on politics and another for chitchat:

from semantic_router import Route

# we could use this as a guide for our chatbot to avoid political conversations
politics = Route(
    name="politics",
    utterances=[
        "isn't politics the best thing ever",
        "why don't you tell me about your political opinions",
        "don't you just love the president",
        "they're going to destroy this country!",
        "they will save the country!",
    ],
)

# this could be used as an indicator to our chatbot to switch to a more
# conversational prompt
chitchat = Route(
    name="chitchat",
    utterances=[
        "how's the weather today?",
        "how are things going?",
        "lovely weather today",
        "the weather is horrendous",
        "let's go to the chippy",
    ],
)

# we place both of our decisions together into a single list
routes = [politics, chitchat]

With our routes ready, we now initialize an embedding / encoder model. This example uses CohereEncoder or OpenAIEncoder; further encoders are listed under Integrations. To initialize them we do:

import os
from semantic_router.encoders import CohereEncoder, OpenAIEncoder

# for Cohere
os.environ["COHERE_API_KEY"] = "<YOUR_API_KEY>"
encoder = CohereEncoder()

# or for OpenAI
os.environ["OPENAI_API_KEY"] = "<YOUR_API_KEY>"
encoder = OpenAIEncoder()
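Since only one of the two encoders is needed, the choice can be driven by whichever API key is already set. The sketch below assumes that convention; choose_encoder_name and build_encoder are hypothetical helpers, not part of semantic-router, and the only library names used are the CohereEncoder and OpenAIEncoder classes shown above.

```python
import os


def choose_encoder_name(env):
    """Pick an encoder class name based on which API key is present in env."""
    if env.get("COHERE_API_KEY"):
        return "CohereEncoder"
    if env.get("OPENAI_API_KEY"):
        return "OpenAIEncoder"
    raise RuntimeError("Set COHERE_API_KEY or OPENAI_API_KEY first")


def build_encoder(env=os.environ):
    """Instantiate the chosen encoder; semantic_router is imported only when called."""
    import semantic_router.encoders as encoders

    return getattr(encoders, choose_encoder_name(env))()
```

Calling encoder = build_encoder() then replaces the two hard-coded branches, and it fails early with a clear message when neither key is configured.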

With our routes and encoder defined we now create a SemanticRouter. This is our route layer, and it handles our semantic decision making.

from semantic_router.routers import SemanticRouter

rl = SemanticRouter(encoder=encoder, routes=routes, auto_sync="local")

We can now use our route layer to make super fast decisions based on user queries. Let's try with two queries that should trigger our route decisions:

rl("don't you love politics?").name

[Out]: 'politics'

Correct decision, let's try another:

rl("how's the weather today?").name

[Out]: 'chitchat'

We get both decisions correct! Now let's try sending an unrelated query:

rl("I'm interested in learning about llama 2").name

[Out]:

In this case, no decision could be made as we had no matches — so our route layer returned None!
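In an application, the no-match case usually needs an explicit fallback. A minimal sketch of that pattern follows, assuming the router is any callable whose result exposes a name attribute as in the calls above. The dispatch helper, the handlers, and the StubChoice / stub_router stand-ins are hypothetical; the stub replaces a real route layer so the example runs without API keys.

```python
def dispatch(router, query, handlers, fallback):
    """Call the handler for the matched route, or the fallback when nothing matched."""
    choice = router(query)
    # Covers both a None result and a result object whose name is None.
    name = getattr(choice, "name", None)
    handler = handlers.get(name, fallback)
    return handler(query)


class StubChoice:
    """Stand-in for the object a route layer returns."""

    def __init__(self, name):
        self.name = name


def stub_router(query):
    """Keyword-based stand-in for a real route layer."""
    if "politic" in query:
        return StubChoice("politics")
    if "weather" in query:
        return StubChoice("chitchat")
    return StubChoice(None)


handlers = {
    "politics": lambda q: "I'd rather not discuss politics.",
    "chitchat": lambda q: "Happy to chat!",
}


def fallback(query):
    """No route matched: hand the query to the default path."""
    return "no route matched"


print(dispatch(stub_router, "don't you love politics?", handlers, fallback))
print(dispatch(stub_router, "I'm interested in learning about llama 2", handlers, fallback))
```

Swapping stub_router for the rl object created earlier should leave the rest unchanged, since dispatch only relies on the name attribute.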

Integrations

The encoders of semantic router include easy-to-use integrations with Cohere, OpenAI, Hugging Face, FastEmbed, and more. We even support multi-modality!

Our utterance vector space also integrates with Pinecone and Qdrant!

📚 Resources

Docs

Notebook	Description
Introduction	Introduction to Semantic Router and static routes
Dynamic Routes	Dynamic routes for parameter generation and function calls
Save/Load Layers	How to save and load `RouteLayer` from file
LangChain Integration	How to integrate Semantic Router with LangChain Agents
Local Execution	Fully local Semantic Router with dynamic routes — local models such as Mistral 7B outperform GPT-3.5 in most tests
Route Optimization	How to train route layer thresholds to optimize performance
Multi-Modal Routes	Using multi-modal routes to identify Shrek vs. not-Shrek pictures

Online Course

Community

Dimitrios Manias, Ali Chouman, Abdallah Shami, Semantic Routing for Enhanced Performance of LLM-Assisted Intent-Based 5G Core Network Management and Orchestration, IEEE GlobeCom 2024
Julian Horsey, Semantic Router superfast decision layer for LLMs and AI agents, Geeky Gadgets
azhar, Beyond Basic Chatbots: How Semantic Router is Changing the Game, AI Insights @ Medium
Daniel Avila, Semantic Router: Enhancing Control in LLM Conversations, CodeGPT @ Medium
Yogendra Sisodia, Stop Chat-GPT From Going Rogue In Production With Semantic Router, Medium
Aniket Hingane, LLM Apps: Why you Must Know Semantic Router in 2024: Part 1, Medium
Adrien Sales, 🔀 Semantic Router w. ollama/gemma2 : real life 10ms hotline challenge 🤯
Adrien Sales, Kaggle Notebook 🔀 Semantic Router: ollama/ gemma2:9b hotline

Project details

Release history

This version

0.1.15

May 23, 2026

0.1.14 yanked

May 18, 2026

Reason this release was yanked:

CVE-2026-42208: unbounded litellm pin can resolve to a compromised wheel (litellm==1.82.8) that exfiltrates credentials on Python startup. Upgrade to semantic-router>=0.1.15.

0.1.13 yanked

May 14, 2026

Reason this release was yanked:

CVE-2026-42208: unbounded litellm pin can resolve to a compromised wheel (litellm==1.82.8) that exfiltrates credentials on Python startup. Upgrade to semantic-router>=0.1.15.

0.1.12 yanked

Nov 18, 2025

Reason this release was yanked:

CVE-2026-42208: unbounded litellm pin can resolve to a compromised wheel (litellm==1.82.8) that exfiltrates credentials on Python startup. Upgrade to semantic-router>=0.1.15.

0.1.11 yanked

Aug 11, 2025

Reason this release was yanked:

CVE-2026-42208: unbounded litellm pin can resolve to a compromised wheel (litellm==1.82.8) that exfiltrates credentials on Python startup. Upgrade to semantic-router>=0.1.15.

0.1.10 yanked

Jul 15, 2025

Reason this release was yanked:

CVE-2026-42208: unbounded litellm pin can resolve to a compromised wheel (litellm==1.82.8) that exfiltrates credentials on Python startup. Upgrade to semantic-router>=0.1.15.

0.1.9 yanked

Jun 30, 2025

Reason this release was yanked:

CVE-2026-42208: unbounded litellm pin can resolve to a compromised wheel (litellm==1.82.8) that exfiltrates credentials on Python startup. Upgrade to semantic-router>=0.1.15.

0.1.8 yanked

May 9, 2025

Reason this release was yanked:

CVE-2026-42208: unbounded litellm pin can resolve to a compromised wheel (litellm==1.82.8) that exfiltrates credentials on Python startup. Upgrade to semantic-router>=0.1.15.

0.1.7

Apr 3, 2025

0.1.6

Mar 25, 2025

0.1.5

Mar 22, 2025

0.1.4

Mar 21, 2025

0.1.3

Mar 21, 2025

0.1.2

Mar 20, 2025

0.1.1

Mar 11, 2025

0.1.0

Mar 11, 2025

0.1.0.dev10 pre-release

Feb 12, 2025

0.1.0.dev9 pre-release

Feb 3, 2025

0.1.0.dev8 pre-release

Jan 31, 2025

0.1.0.dev7 pre-release

Jan 24, 2025

0.1.0.dev6 pre-release

Jan 13, 2025

0.1.0.dev5 pre-release

Dec 20, 2024

0.1.0.dev4 pre-release

Dec 15, 2024

0.1.0.dev3 pre-release

Dec 1, 2024

0.1.0.dev2 pre-release

Nov 29, 2024

0.1.0.dev1 pre-release

Nov 20, 2024

0.1.0.dev0 pre-release

Nov 16, 2024

0.0.72

Oct 10, 2024

0.0.71

Oct 8, 2024

0.0.70

Oct 3, 2024

0.0.69 yanked

Oct 2, 2024

Reason this release was yanked:

Incorrect dependencies for pinecone package

0.0.68

Sep 23, 2024

0.0.67 yanked

Sep 23, 2024

Reason this release was yanked:

Issue with version

0.0.66

Sep 22, 2024

0.0.65

Sep 6, 2024

0.0.64

Sep 5, 2024

0.0.63

Sep 2, 2024

0.0.62

Aug 29, 2024

0.0.61

Aug 23, 2024

0.0.60

Aug 19, 2024

0.0.59

Aug 14, 2024

0.0.58

Aug 9, 2024

0.0.57

Aug 8, 2024

0.0.56

Aug 8, 2024

0.0.55

Jul 31, 2024

0.0.54

Jul 19, 2024

0.0.53

Jul 16, 2024

0.0.52

Jul 15, 2024

0.0.51

Jul 12, 2024

0.0.50

Jul 5, 2024

0.0.49

Jul 3, 2024

0.0.48

Jun 20, 2024

0.0.47

Jun 13, 2024

0.0.46

Jun 2, 2024

0.0.45

May 31, 2024

0.0.44

May 21, 2024

0.0.43

May 16, 2024

0.0.42

May 15, 2024

0.0.41

May 15, 2024

0.0.40

May 14, 2024

0.0.39

May 11, 2024

0.0.38

May 3, 2024

0.0.37

Apr 28, 2024

0.0.36

Apr 28, 2024

0.0.35

Apr 28, 2024

0.0.34

Apr 17, 2024

0.0.33

Apr 17, 2024

0.0.32

Apr 8, 2024

0.0.31

Apr 7, 2024

0.0.30

Apr 7, 2024

0.0.29

Mar 25, 2024

0.0.28

Mar 15, 2024

0.0.27

Mar 2, 2024

0.0.26

Feb 27, 2024

0.0.25

Feb 27, 2024

0.0.24

Feb 23, 2024

0.0.23

Feb 23, 2024

0.0.22

Feb 18, 2024

0.0.21

Feb 18, 2024

0.0.20

Jan 28, 2024

0.0.19

Jan 28, 2024

0.0.18

Jan 24, 2024

0.0.17

Jan 17, 2024

0.0.16

Jan 14, 2024

0.0.15

Jan 7, 2024

0.0.14

Dec 28, 2023

0.0.13

Dec 28, 2023

0.0.12

Dec 20, 2023

0.0.11

Dec 18, 2023

0.0.10

Dec 18, 2023

0.0.9

Dec 15, 2023

0.0.8

Dec 14, 2023

0.0.7

Dec 13, 2023

0.0.5

Dec 11, 2023

0.0.3

Nov 15, 2023

0.0.2

Nov 13, 2023

0.0.1

Nov 9, 2023
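The yank notices above cover releases 0.1.8 through 0.1.14 and recommend upgrading to semantic-router>=0.1.15. A small check for whether an environment has one of those releases installed could look like the sketch below. The helper names are hypothetical and the version parsing is deliberately simplified: it compares only the leading numeric components, so it is not a full PEP 440 parser.

```python
from importlib.metadata import PackageNotFoundError, version


def numeric_prefix(ver):
    """Return the leading dot-separated integers of a version string as a tuple."""
    parts = []
    for piece in ver.split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


def is_affected(ver):
    """True for the releases listed above as yanked for the litellm pin (0.1.8 to 0.1.14)."""
    return (0, 1, 8) <= numeric_prefix(ver) <= (0, 1, 14)


def installed_is_affected(package="semantic-router"):
    """Check the installed distribution, if any; a missing package is not affected."""
    try:
        return is_affected(version(package))
    except PackageNotFoundError:
        return False


if installed_is_affected():
    print("Installed semantic-router is a yanked release; upgrade to >=0.1.15")
```

Tuple comparison is used rather than string comparison because "0.1.10" sorts before "0.1.8" as text but after it as a version.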

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

semantic_router-0.1.15.tar.gz (95.6 kB)

Uploaded May 23, 2026 Source

Built Distribution

semantic_router-0.1.15-py3-none-any.whl (128.1 kB)

Uploaded May 23, 2026 Python 3

File details

Details for the file semantic_router-0.1.15.tar.gz.

File metadata

Download URL: semantic_router-0.1.15.tar.gz
Upload date: May 23, 2026
Size: 95.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.16

File hashes

Hashes for semantic_router-0.1.15.tar.gz
Algorithm	Hash digest
SHA256	`328256ddc3c2b713101ec69561d6585aecbf1198ea3461e1486289d8c3a35288`
MD5	`4f863337b878fa8453a499d8ac198a9f`
BLAKE2b-256	`dca91a689e916e8b280f1fd8fb335cc059be626a22fe4533baa045d32fcd6de5`

File details

Details for the file semantic_router-0.1.15-py3-none-any.whl.

File metadata

Download URL: semantic_router-0.1.15-py3-none-any.whl
Upload date: May 23, 2026
Size: 128.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.16

File hashes

Hashes for semantic_router-0.1.15-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c08978584c73c5ff8e75005202007ac8ee6593d77deaf8c7ec53f71e01e7f757`
MD5	`6fe9a6e4ab38bdfb40127f6164536997`
BLAKE2b-256	`1ec7f4a20292aef9badd277efbb24d697c5b934693fb21e8e490d3ecb0fc83f0`
