AudioMind

Overview

AudioMind is a Python-based tool for extracting meaningful insights from audio files. It uses Whisper to transcribe audio and an LLM to summarize the transcript, which makes it easier to turn recordings into actionable information.
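The transcribe-then-summarize flow described above can be sketched as follows. This is a minimal illustration, not AudioMind's actual code: `Insight`, `extract_insight`, and the two callables are hypothetical names, and the speech-to-text and LLM steps are passed in as plain functions so the sketch runs offline.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Insight:
    """Hypothetical container for the two pipeline outputs."""
    transcript: str
    summary: str


def extract_insight(
    audio_path: str,
    transcribe: Callable[[str], str],
    summarize: Callable[[str], str],
) -> Insight:
    """Run speech-to-text first, then hand the transcript to the LLM step."""
    transcript = transcribe(audio_path).strip()
    if not transcript:
        # Summarizing an empty transcript would only waste an LLM call.
        raise ValueError(f"no speech recognized in {audio_path}")
    return Insight(transcript=transcript, summary=summarize(transcript))


if __name__ == "__main__":
    # Stand-ins for Whisper and the LLM (assumptions for the demo only).
    def fake_transcribe(path: str) -> str:
        return "Went for a run, then finished the report."

    def fake_summarize(text: str) -> str:
        return text.split(",")[0] + "."

    print(extract_insight("examples/1.mp3", fake_transcribe, fake_summarize))
```

Keeping the two steps as injected functions means either one can be swapped (for example, API Whisper versus on-device Whisper) without touching the rest of the flow.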

Stack

LLM

OpenAI

Speech to Text

Whisper (OpenAI API) [DEFAULT]
Whisper (On-Device)
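As a rough sketch of how the two speech-to-text options differ in code, the example below picks a backend by name. It is an assumption-laden illustration rather than AudioMind's implementation: the function names and the `BACKENDS` table are hypothetical, the API path assumes version 1 or later of the `openai` package, and the on-device path assumes the `openai-whisper` package with ffmpeg installed. The versions AudioMind pins may differ.

```python
from typing import Callable


def transcribe_with_api(audio_path: str) -> str:
    """Hosted Whisper. Assumes `openai` v1+ and OPENAI_API_KEY in the environment."""
    from openai import OpenAI  # lazy import: not needed for the on-device path

    client = OpenAI()
    with open(audio_path, "rb") as audio_file:
        result = client.audio.transcriptions.create(model="whisper-1", file=audio_file)
    return result.text


def transcribe_on_device(audio_path: str) -> str:
    """Local Whisper. Assumes the `openai-whisper` package and ffmpeg."""
    import whisper  # lazy import: not needed for the API path

    model = whisper.load_model("base")  # model size chosen only for illustration
    return model.transcribe(audio_path)["text"]


# Hypothetical lookup table; "api" mirrors the [DEFAULT] marker in the list above.
BACKENDS = {"api": transcribe_with_api, "on-device": transcribe_on_device}


def get_transcriber(backend: str = "api") -> Callable[[str], str]:
    """Return the transcription function for the requested backend."""
    try:
        return BACKENDS[backend]
    except KeyError:
        options = ", ".join(sorted(BACKENDS))
        raise ValueError(f"unknown backend {backend!r}; choose one of: {options}") from None
```

The lazy imports let a machine that only has one of the two packages installed still use that backend.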

Current Solutions

Create a journal entry from your voice note.

Goals

Transcribe audio files to text.
Summarize the transcribed text.
Easy to integrate and use.
Get insights from any audio file, including podcasts, interviews, lectures, etc.
Solve actual problems.

Installation

Prerequisites

Python 3.x
pip

Use PIP Package

Steps to Install

Clone the Repository

```
git clone https://github.com/onlyoneaman/audiomind.git
cd audiomind
```

Create a Virtual Environment
```
python3 -m venv .venv
```
Activate the virtual environment:
- Unix or macOS
```
source .venv/bin/activate
```
- Windows
```
.\.venv\Scripts\activate
```
Install Dependencies
```
pip install -r requirements.txt
```

Environment Variables

Copy .env.template to .env.

```
cp .env.template .env
```

Open .env and provide your OpenAI API key:

```
OPENAI_API_KEY=your_openai_api_key_here
# optional
DREAMBOAT_API_KEY=your_dreamboat_api_key_here
```
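To show what reading these settings can look like, here is a small standard-library sketch. It is not how AudioMind itself loads configuration (that may well use a package such as python-dotenv); `parse_env` and `load_api_key` are hypothetical helpers written for this example.

```python
import os
from pathlib import Path


def parse_env(text: str) -> dict[str, str]:
    """Parse KEY=value lines, skipping blank lines and # comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop a trailing " # comment" and any surrounding quotes.
        value = value.split(" #", 1)[0].strip().strip('"').strip("'")
        values[key.strip()] = value
    return values


def load_api_key(env_path: str = ".env") -> str:
    """Prefer the process environment, then fall back to the .env file."""
    key = os.environ.get("OPENAI_API_KEY")
    path = Path(env_path)
    if not key and path.is_file():
        key = parse_env(path.read_text(encoding="utf-8")).get("OPENAI_API_KEY")
    if not key or key == "your_openai_api_key_here":
        # The template placeholder counts as "not configured".
        raise RuntimeError("Set OPENAI_API_KEY in .env before running AudioMind")
    return key
```

Failing early on the untouched template value gives a clearer error than an authentication failure from the API later on.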

Run the Application
```
python3 -m audiomind
```

Usage

Place your audio files in the examples folder and run the audio_to_journal.py script. The script transcribes the audio and summarizes it.

```
python3 -m audiomind --file examples/1.mp3
```
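If you have several recordings, the single-file command above can be repeated for each one. The sketch below builds one such command per audio file in a folder. The helper names and the list of file extensions are assumptions for illustration; only the `-m audiomind --file <path>` invocation comes from this README.

```python
import subprocess
import sys
from pathlib import Path

# Assumed extensions; adjust to the formats you actually record.
AUDIO_SUFFIXES = {".mp3", ".wav", ".m4a"}


def build_commands(folder: str) -> list[list[str]]:
    """Return one `python -m audiomind --file <path>` command per audio file."""
    files = sorted(
        p for p in Path(folder).iterdir() if p.suffix.lower() in AUDIO_SUFFIXES
    )
    return [[sys.executable, "-m", "audiomind", "--file", str(p)] for p in files]


def run_all(folder: str = "examples") -> None:
    """Process every audio file in the folder, one after another."""
    for command in build_commands(folder):
        # check=True stops at the first failing file instead of hiding the error.
        subprocess.run(command, check=True)
```

Calling `run_all()` from the repository root, with the virtual environment active, would process everything in the examples folder in alphabetical order.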

You can add some information about yourself to the person.txt file. AudioMind also uses this information when creating the journal entry.
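One plausible way personal context could be combined with a transcript is sketched below. The prompt wording and the `build_journal_prompt` helper are hypothetical and only illustrate the idea; AudioMind's real prompt may look quite different.

```python
from pathlib import Path


def build_journal_prompt(transcript: str, person_path: str = "person.txt") -> str:
    """Combine the optional personal context with the transcript."""
    parts = []
    path = Path(person_path)
    if path.is_file():
        about = path.read_text(encoding="utf-8").strip()
        if about:
            # Context goes first so the model reads it before the transcript.
            parts.append(f"About the author:\n{about}")
    parts.append(f"Voice note transcript:\n{transcript.strip()}")
    parts.append("Write a first-person journal entry based on the transcript.")
    return "\n\n".join(parts)
```

When person.txt is missing or empty, the prompt simply omits the context block, so the file stays optional.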

Roadmap

Transcribe audio files to text.
Summarize the transcribed text.
Easy to integrate and use.
Get insights from any audio file, including podcasts, interviews, lectures, etc.
Create a journal entry from your voice note.
Improve the journal entry.
Create a summary of a podcast episode.
Create a summary of a lecture.
Create a summary of a meeting.

Contributing

Feel free to submit issues and enhancement requests.

License

MIT

Enjoy using AudioMind!

Project details

Release history

0.2.0 (this version): Oct 29, 2023
0.1: Oct 28, 2023
0.0.1: Oct 28, 2023

Download files

Source Distribution

audiomind-0.2.0.tar.gz (14.0 MB)

Uploaded Oct 29, 2023 Source

Built Distribution

audiomind-0.2.0-py3-none-any.whl (14.0 MB)

Uploaded Oct 29, 2023 Python 3

File details

Details for the file audiomind-0.2.0.tar.gz.

File metadata

Download URL: audiomind-0.2.0.tar.gz
Upload date: Oct 29, 2023
Size: 14.0 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for audiomind-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`cd7ebabb1a3b3edf4a85b2120ed9228239365fb323ec9d044a8578e7fe27d077`
MD5	`5b4206f0db78cb14b1cdf1cd5539aad5`
BLAKE2b-256	`b1d75ba4e3d4e41c5f7024fd6d33c5a8820633d4ffbb2f6cad747d72e0f33260`

File details

Details for the file audiomind-0.2.0-py3-none-any.whl.

File metadata

Download URL: audiomind-0.2.0-py3-none-any.whl
Upload date: Oct 29, 2023
Size: 14.0 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for audiomind-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`27f2d43f0abfa023fe2b6128d1f47da5900b199297026ff10e4f76b836f18838`
MD5	`051a6e263c84515d3af6a5ed94f6494d`
BLAKE2b-256	`b1aebc56295770e31d52d09aec97a134903db07802261c9be000190b18aac1ac`
