A Python application for Azure AI Speech to Text service

These details have not been verified by PyPI

Project links

Homepage

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language

Project description

Audio Analyser's logo

Audio Analyser: Speech-to-Text & Analysis 🎙️

Audio Analyser banner

• Website • Report Bug • Request Feature • Contributing Guidelines

divider

Overview

Discover Hidden Insights in Minutes: AI-Powered Audio Analysis for Your Call Recordings

Streamline call recording and audio file transcription, uncover actionable insights in seconds with advanced text analysis, powered by Microsoft Azure AI services

Go beyond simple transcription: Discover sentiment, key information, and gain a multi-faceted understanding of your conversations through in-depth analysis and comprehensive reports.
Audio Analyser leverages the power of Azure's advanced AI services to transform your audio data into valuable insight reports in no time.

divider

Key Features

Speech to Text: Convert spoken language into text using Azure's speech-to-text service.
Text Analysis: Analyze text for various features using Azure's text analytics service.
Instant Transcription:
- Instantly transcribe audio files and recordings into text.
Support for outputting results in different formats, including JSON, TXT and SQLite.
Actionable Insights:
- Analyze text for various features, including Overall Sentiment, Positive/Negative Sentiment Analysis, Identify Key Topics and Entities, Language, Personally Identifiable Information (PII).
- Uncover sentiment and key information within conversations.
Data-Driven Reports:
- Generate detailed reports for easy sharing and analysis.
Web Server: A CherryPy-based web server to handle incoming requests and process them.

divider

Built on a Robust Foundation

Azure-powered technology and a secure CherryPy web server ensure accurate analysis and reliable data management.
Scalable architecture: Adapt seamlessly to your needs, handling large datasets with ease.

Experience the power of Audio Analyser today!

divider

Dependencies

CherryPy
Azure Cognitive Services Speech SDK
Azure AI Text Analytics
Python standard libraries: asyncio, threading, logging, sqlite3, json
Dotenv for environment variable management

divider

Installation

Create a Virtual Environment

We recommend creating a virtual environment to install the Audio Analyser. This will ensure that the package is installed in an isolated environment and will not affect other projects.

python3 -m venv venv
source venv/bin/activate  # On Windows use `venv\Scripts\activate`

Installation and Setup

Install required Python packages:

   pip install cherrypy azure-ai-textanalytics azure-cognitiveservices-speech

Set up Azure services and obtain necessary API keys.
Configure environment variables for Azure services in a .env file.

Getting Started

Install audioanalyser with just one command:

pip install audioanalyser

Usage Instructions

To run the Audio Analyser CLI

Start the CLI using audioanalyser:

python -m audioanalyser

Follow the instructions to utilize speech-to-text and text analysis features.
Access the generated transcript and report files in the resources directory in the root folder.

To run the Audio Analyser server

Start the server using audioanalyser:

python -m audioanalyser -s

Access the server at the specified host and port to utilize speech-to-text and text analysis features.

Usage

To run the application, use the following command:

python server.py

This will start the CherryPy web server, and you can interact with the application through the defined endpoints.

Requirements

The minimum supported Python version is 3.6.

Azure Cognitive Services for speech and text processing.
CherryPy for the web server.
Python's standard libraries including asyncio, sqlite3, and threading.

divider

Configuration

Ensure that your Azure credentials and other configurations are correctly set in a .env file in the root directory. Please refer to the env.example file for the required environment variables.

divider

License

The project is licensed under the terms of both the MIT license and the Apache License (Version 2.0).

divider

Contribution

We welcome contributions to audioanalyser. Please see the contributing instructions for more information.

Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.

divider

Acknowledgements

We would like to extend a big thank you to all the awesome contributors of audioanalyser for their help and support.

Project details

These details have not been verified by PyPI

Project links

Homepage

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language

Release history Release notifications | RSS feed

0.0.6

Feb 8, 2024

0.0.5

Jan 25, 2024

0.0.4

Jan 20, 2024

0.0.3

Jan 11, 2024

0.0.2

Jan 10, 2024

This version

0.0.1

Dec 22, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

audioanalyser-0.0.1.tar.gz (17.6 kB view hashes)

Uploaded Dec 22, 2023 Source

Built Distribution

audioanalyser-0.0.1-py2.py3-none-any.whl (18.5 kB view hashes)

Uploaded Dec 22, 2023 Python 2 Python 3

Hashes for audioanalyser-0.0.1.tar.gz

Hashes for audioanalyser-0.0.1.tar.gz
Algorithm	Hash digest
SHA256	`649fccfed62d49d14a86c1fbf2eccdbd2479fa7f7e59783b1c6c1be0a93a1386`
MD5	`fab86e8a027c3845f2ad64ac6622bcce`
BLAKE2b-256	`cc669f1398bdc49d9b6b937c3280c054bd6ea97fd6ee5bbd3d62a2ef2f678b54`

Hashes for audioanalyser-0.0.1-py2.py3-none-any.whl

Hashes for audioanalyser-0.0.1-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`426a79226467855d902a3e3f87f98b1d527d2db318af724f997c24c280c03920`
MD5	`a74a67743ef24cfa134352453709c877`
BLAKE2b-256	`bf6764f145aaceb34d0af7fef138ca2af6249823ca03a0a1a125cfaa3f374c10`