AI-powered voice command assistant for desktop automation

These details have not been verified by PyPI

Project links

Project description

Sipho AI - Voice Command Assistant

An AI-powered voice command assistant for desktop automation that uses natural language processing to execute commands on your PC. Built "alternative_message": "Message when app is missing"

Command Examplesth Flask API and OpenAI integration for intelligent conversation handling.

Features

AI-Powered Conversations: Natural language processing with OpenAI integration
Smart Command Detection: Automatically distinguishes between commands and conversations
REST API: Flask-based API for web and mobile integration
JSON-Based Commands: All commands stored in commands.json for easy management
Voice Confirmation: Sensitive commands require confirmation for safety
Third-Party App Detection: Automatically checks if required applications are installed
System Checker: Comprehensive system scanning and installation suggestions
Web Integration: Open websites, perform searches, launch applications
System Control: Shutdown, restart, lock, volume control, and more

Quick Start

Installation

pip install siphoai

Basic Usage

# Start the API server
siphoai

# View available commands
siphoai help-commands

# Test a command
siphoai test "open youtube"

The server will start on http://localhost:5000 with the following endpoints:

POST /api/command - Execute commands & handle conversations
GET /api/help - Get available commands
GET /api/status - Server status

Configuration

Create a .env file for AI features:

OPENROUTER_API_KEY=your_openrouter_api_key_here
FLASK_HOST=0.0.0.0
FLASK_PORT=5000
FLASK_DEBUG=false

Get your free API key from OpenRouter.

Enhanced Features

AI Integration

Powered by OpenAI through OpenRouter API
Natural conversation handling
Smart command vs conversation detection
Context-aware responses
Enhanced command feedback

Voice Confirmation

Sensitive commands (shutdown, restart, lock) require confirmation:

Server asks "Are you sure you want to [action]?"
Respond with confirmation to proceed
Automatic timeout for safety

Third-Party App Detection

Automatically detects if required applications are installed
Provides download links for missing apps
Supports environment variable expansion
Installation instructions included

System Checker

Built-in utility to scan your system:

Detects installed applications
Checks voice command requirements
Generates installation suggestions
Comprehensive system reports

Package Structure

siphoai/
├── __init__.py           # Package initialization
├── app.py               # Main Flask application
├── cli.py               # Command-line interface
├── data/
│   └── commands.json    # Command configurations
└── utils/
    ├── command_manager.py # Command management tools
    └── system_checker.py  # System scanning utilities

Supported Commands

The package comes with pre-configured commands in multiple categories:

Web Commands

Social Media: "Open YouTube", "Open Facebook", "Open Twitter", "Open Instagram"
Professional: "Open LinkedIn", "Open GitHub"
Search: "Open Google", "Search for [term]"
Entertainment: "Open Netflix"

Application Commands

System Tools: "Open Calculator", "Open Notepad", "Open Paint"
File Management: "Open File Explorer", "Open Command Prompt"
System: "Open Task Manager", "Open Control Panel", "Open Settings"
Development: "Open Visual Studio Code", "Open PowerShell"

System Commands

Power: "Shutdown computer", "Restart computer", "Sleep computer"
Security: "Lock computer"
Control: "Cancel shutdown", "Minimize all windows"

Information Commands

Time: "What time is it?"
Date: "What date is it?" or "Today"

Volume Control (requires nircmd)

Audio: "Volume up", "Volume down", "Mute"

Media Commands

Players: "Open music player", "Open photos"

Vision Commands

Screenshot: "Take screenshot", "Analyze screen"
AI Analysis: "What's on my screen?", "Describe my screen"

Adding New Commands

Method 1: Using Command Manager (Recommended)

Run launcher.bat and select option 2
Choose the type of command to add
Follow the interactive prompts
Save your changes

Method 2: Edit JSON Directly

Edit commands.json to add new commands. Each command has this structure:

{
  "triggers": ["phrase1", "phrase2"],
  "action": "action_type",
  "response": "What the app will say",
  "url": "https://example.com",  // For web commands
  "command": ["executable", "arg1", "arg2"]  // For app/system commands
}

Available Action Types:

web_open - Opens a URL in the browser
run_application - Runs an application/executable
system_command - Executes a system command
volume_control - Controls system volume
get_time - Gets current time
get_date - Gets current date
web_search - Performs a web search
show_help - Shows help information
exit_app - Exits the application

Additional Properties:

requires_confirmation - Set to true for sensitive commands
confirmation_message - Custom confirmation question
requires_third_party - Object with third-party app information:
- app_name - Display name of the application
- executable - Path or name of executable to check
- download_url - Where to download the app
- install_instructions - How to install the app
- alternative_message - Message when app is missing

Command Examples

Basic Command

{
  "triggers": ["open notepad", "notepad"],
  "action": "run_application",
  "command": ["notepad.exe"],
  "response": "Opening Notepad"
}

Command with Confirmation

{
  "triggers": ["shutdown computer"],
  "action": "system_command",
  "command": ["shutdown", "/s", "/t", "30"],
  "response": "Shutting down computer",
  "requires_confirmation": true,
  "confirmation_message": "Are you sure you want to shutdown?"
}

Third-Party App Command

{
  "triggers": ["open discord"],
  "action": "run_application",
  "command": ["C:\\Users\\%USERNAME%\\AppData\\Local\\Discord\\Discord.exe"],
  "response": "Opening Discord",
  "requires_third_party": {
    "app_name": "Discord",
    "executable": "C:\\Users\\%USERNAME%\\AppData\\Local\\Discord\\Discord.exe",
    "download_url": "https://discord.com/download",
    "install_instructions": "Download and install Discord from the official website.",
    "alternative_message": "Discord is not installed."
  }
}

API Usage

Command Execution

import requests

# Execute a command
response = requests.post('http://localhost:5000/api/command', 
                        json={'command': 'open youtube'})
print(response.json())

# Handle conversation
response = requests.post('http://localhost:5000/api/command',
                        json={'command': 'how are you today?'})
print(response.json())

JavaScript Integration

// Execute command from web app
async function executeCommand(command) {
    const response = await fetch('http://localhost:5000/api/command', {
        method: 'POST',
        headers: {'Content-Type': 'application/json'},
        body: JSON.stringify({command: command})
    });
    return await response.json();
}

// Usage
executeCommand('open calculator').then(result => console.log(result));

Response Format

{
  "success": true,
  "message": "Opening YouTube",
  "action": "web_open", 
  "url": "https://www.youtube.com",
  "ai_message": "I've opened YouTube for you!",
  "processing_time": 0.123
}

Requirements

Python: 3.7 or higher
OS: Windows, macOS, or Linux (some commands are OS-specific)
Internet: Required for AI features and web commands
Optional: Microphone for voice input (when integrated with speech recognition)

Command Management Tools

Command Manager (`command_manager.py`)

Add new commands interactively
List all current commands
Save/reload command configurations
User-friendly interface for command management

Backup Manager (`backup_manager.py`)

Create timestamped backups of your commands
Restore from previous backups
Export commands to readable text format
List all available backups

Examples

Adding a New Web Command

{
  "triggers": ["open stackoverflow", "stack overflow"],
  "action": "web_open",
  "url": "https://stackoverflow.com",
  "response": "Opening Stack Overflow"
}

Adding a New Application Command

{
  "triggers": ["open discord", "discord"],
  "action": "run_application",
  "command": ["C:\\Users\\YourName\\AppData\\Local\\Discord\\Discord.exe"],
  "response": "Opening Discord"
}

Notes

The app uses Google's speech recognition service, so an internet connection is required
For volume control commands, you may need to install nircmd (optional)
Speak clearly and wait for the app to process your command before giving the next one
Say "help" to hear the available commands
Press Ctrl+C to force quit the application

Troubleshooting

If you get audio-related errors, make sure your microphone is working and properly configured
If speech recognition isn't working, check your internet connection
The app adjusts for ambient noise when it starts, so wait for the ready message before speaking

Security Note

This app can execute system commands like shutdown and restart. Use with caution and only run commands you understand.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.0.4

Sep 14, 2025

1.0.2

Sep 14, 2025

1.0.1

Sep 14, 2025

1.0.0

Sep 14, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

siphoai-1.0.4.tar.gz (43.2 kB view details)

Uploaded Sep 14, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

siphoai-1.0.4-py3-none-any.whl (39.2 kB view details)

Uploaded Sep 14, 2025 Python 3

File details

Details for the file siphoai-1.0.4.tar.gz.

File metadata

Download URL: siphoai-1.0.4.tar.gz
Upload date: Sep 14, 2025
Size: 43.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for siphoai-1.0.4.tar.gz
Algorithm	Hash digest
SHA256	`b02f57e48142090e1e054e139b4241eb34dc178666cf9f403bfaebdc81b11dd7`
MD5	`d95f81fc775106c53cd3426e8a458567`
BLAKE2b-256	`f79c196c6fe817f04dcfd199600512a5891471bc3bd6e82497808c950324d77b`

See more details on using hashes here.

File details

Details for the file siphoai-1.0.4-py3-none-any.whl.

File metadata

Download URL: siphoai-1.0.4-py3-none-any.whl
Upload date: Sep 14, 2025
Size: 39.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for siphoai-1.0.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0de72a0f6a19e7ec6e296d0c56effd6ed9c5a3444b590ab09a2736dec4e9ed69`
MD5	`644227a710df2b25e614173303c6c70d`
BLAKE2b-256	`a0aa73ee478694a4bcff30f6a8a3325551a123142246ed532a5b5f7045799b7f`

See more details on using hashes here.

siphoai 1.0.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Sipho AI - Voice Command Assistant

Command Examplesth Flask API and OpenAI integration for intelligent conversation handling.

Features

Quick Start

Installation

Basic Usage

Configuration

Enhanced Features

AI Integration

Voice Confirmation

Third-Party App Detection

System Checker

Package Structure

Supported Commands

Web Commands

Application Commands

System Commands

Information Commands

Volume Control (requires nircmd)

Media Commands

Vision Commands

Adding New Commands

Method 1: Using Command Manager (Recommended)

Method 2: Edit JSON Directly

Available Action Types:

Additional Properties:

Command Examples

Basic Command

Command with Confirmation

Third-Party App Command

API Usage

Command Execution

JavaScript Integration

Response Format

Requirements

Command Management Tools

Command Manager (command_manager.py)

Backup Manager (backup_manager.py)

Examples

Adding a New Web Command

Adding a New Application Command

Notes

Troubleshooting

Security Note

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Command Manager (`command_manager.py`)

Backup Manager (`backup_manager.py`)