AI Voice Detection System

Project description

VoiceAuth - AI Voice Detection System

A desktop application that uses machine learning to differentiate between AI-generated and human voices based on audio samples. VoiceAuth employs logistic regression and advanced audio feature extraction to provide accurate classification of voice samples.

Features

✅ Audio Processing: Extract features from various audio formats (WAV, MP3, OGG, FLAC)
🧠 Machine Learning: Logistic regression model with hyperparameter optimization
🔍 Real-time Classification: Analyze and classify voice samples instantly
⚡ Performance: Hardware acceleration through parallel processing
📊 Visualization: Feature importance and model performance metrics
🔄 Feedback System: Continuously improve model accuracy with user feedback
💾 Cross-Platform Support: Works on Windows, macOS, and Linux

Screenshots

[Screenshots would be included here]

Installation

Option 1: Install as a Package (Recommended)

VoiceAuth is now available as an installable Python package:

# Install directly from PyPI
pip install voiceauth

# Or install from GitHub
pip install git+https://github.com/zohaiblazuli/VoiceAuth.git

After installation, you can start the application by simply running:

voiceauth

For detailed installation instructions, see the INSTALL.md file.

Option 2: Manual Setup

Prerequisites

Python 3.8 or higher

Setup

Clone the repository:

git clone https://github.com/zohaiblazuli/VoiceAuth.git
cd VoiceAuth

Install dependencies:
```
pip install -r requirements.txt
```
Run the application:
```
python run_voiceauth.py
```

Usage

Running the Application

VoiceAuth features a user-friendly interface with several tabs:

Import Sample Tab: Import audio files for classification
Record Sample Tab: Record your voice directly for classification
Feedback Tab: Provide feedback on classification results to improve the model
Information Tab: View model statistics and performance metrics

Sample Dataset Structure

For training the model, organize your dataset as follows:

dataset_folder/
├── ai_generated/  (folder containing AI-generated voice samples)
│   ├── sample1.wav
│   ├── sample2.wav
│   └── ...
└── human/  (folder containing human voice samples)
    ├── sample1.wav
    ├── sample2.wav
    └── ...

Technical Details

Path Management System

VoiceAuth uses a robust path management system to ensure compatibility across different environments, particularly when shared via GitHub. The system:

Automatically determines the base directory: Works whether running from source, as a compiled executable, or from any relative directory
Standardizes path access: All file paths are accessed through utility functions, not hardcoded strings
Creates necessary directories: Output and model directories are automatically created if they don't exist
Cross-platform compatibility: Paths are normalized for the operating system in use

Key path utility functions:

get_base_dir(): Gets the base application directory
get_resource_path(relative_path): Gets the absolute path to any resource
get_media_path(filename): Gets the path to media files
get_output_path(filename): Gets the path to output files or directories
get_model_path(filename): Gets the path to model files

Feature Extraction

VoiceAuth extracts various audio features using the librosa library:

MFCCs (Mel-Frequency Cepstral Coefficients)
Spectral Centroid, Contrast, Rolloff
Zero Crossing Rate
Chroma Features
Spectral Bandwidth
Tempo and Beat Features
Mel Spectrogram

Machine Learning Model

Preprocessing: Standard scaling for feature normalization
Feature Selection: Optional using SelectFromModel
Model: Logistic regression with hyperparameter tuning
Evaluation: Uses accuracy, precision, recall, and F1-score

Feedback System

The application includes an adaptive learning system that:

Collects user feedback on classification results
Stores correctly labeled samples
Automatically retrains the model when sufficient feedback data is collected
Updates the model in real-time

Development

Project Structure

voiceauth.py: Main application module
ui_components.py: UI components and widgets
simple_model.py: Machine learning model implementation
audio_processor.py: Audio processing and feature extraction
batch_process.py: Batch processing of audio samples
tabs.py: Implementation of application tabs
utils.py: Utility functions, including path management
media/: Contains graphics and media assets
output/: Contains generated files (features, models)

Extending the Application

To add new features:

For new UI components, add them to ui_components.py
For new tabs, extend the functionality in tabs.py
For new model features, modify simple_model.py
For additional audio processing, update audio_processor.py

Troubleshooting

Common Issues

Media files not found: If you see errors about missing media files, ensure the media directory is in the same location as the application.
Model loading errors: Make sure the model has been trained and the appropriate model files exist in the output directory.
Audio recording issues: Check your microphone permissions and settings.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgements

librosa for audio feature extraction
scikit-learn for machine learning components
PyQt5 for the GUI framework

Project details

Release history Release notifications | RSS feed

1.0.1

Mar 23, 2025

This version

1.0.0

Mar 23, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

voiceauth-1.0.0.tar.gz (924.0 kB view details)

Uploaded Mar 23, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

voiceauth-1.0.0-py3-none-any.whl (925.2 kB view details)

Uploaded Mar 23, 2025 Python 3

File details

Details for the file voiceauth-1.0.0.tar.gz.

File metadata

Download URL: voiceauth-1.0.0.tar.gz
Upload date: Mar 23, 2025
Size: 924.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.9

File hashes

Hashes for voiceauth-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`5aaa457661b34b8d0702f279451ab377781d79aa83a570107ba7b9dfed3657aa`
MD5	`2a2311e02b882b01eaf0b9f9d7bdbd3b`
BLAKE2b-256	`0f5ebdea840ce20d4e94a6fc0149c110fd67c8d0f880ac3dae48addf90816157`

See more details on using hashes here.

File details

Details for the file voiceauth-1.0.0-py3-none-any.whl.

File metadata

Download URL: voiceauth-1.0.0-py3-none-any.whl
Upload date: Mar 23, 2025
Size: 925.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.9

File hashes

Hashes for voiceauth-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1c5131931f28e41126df31a2abb0ae0f4aa70ade1e7eee4225c5009397ee22b4`
MD5	`36255dc2a467554275402fb4ef20020e`
BLAKE2b-256	`50475a5f5ac0159342fe0b70d8d41e4f44d1a3a0954bc9de3e91483e304cd15a`

See more details on using hashes here.

voiceauth 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

VoiceAuth - AI Voice Detection System

Features

Screenshots

Installation

Option 1: Install as a Package (Recommended)

Option 2: Manual Setup

Prerequisites

Setup

Usage

Running the Application

Sample Dataset Structure

Technical Details

Path Management System

Feature Extraction

Machine Learning Model

Feedback System

Development

Project Structure

Extending the Application

Troubleshooting

Common Issues

Contributing

License

Acknowledgements

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes