Skip to main content

No project description provided

Project description

AudioMind

Overview

AudioMind is a Python-based solution designed to extract meaningful insights from audio files. By leveraging whisper and LLMs, the platform transcribes and summarizes audio content, making it easier to derive actionable information.

Stack

LLM

  • OpenAI

Speech to Text

  • Whisper (Openai API) [DEFAULT]
  • Whisper (On-Device)

Current Solutions

  • Create a journal entry from your voice note.

Goals

  • Transcribe audio files to text.
  • Summarize the transcribed text.
  • Easy to integrate and use.
  • Get Insights from any audio file, including podcasts , interviews, lectures, etc.
  • Solve actual problems.

Installation

Prerequisites

  • Python 3.x
  • pip

Use PIP Package

Steps to Install

  1. Clone the Repository

    git clone https://github.com/onlyoneaman/audiomind.git
    cd audiomind
    
  2. Create a Virtual Environment

    python3 -m venv .venv
    

    Activate the virtual environment:

    • Unix or MacOS

      source .venv/bin/activate
      
    • Windows

      .\.venv\Scripts\activate
      
  3. Install Dependencies

    pip install -r requirements.txt
    
  4. Environment Variables

    Copy .env.template to .env.

    cp .env.template .env
    

    Open .env and provide your OpenAI API key:

    OPENAI_API_KEY=your_openai_api_key_here
    DREAMBOAT_API_KEY=your_dreamboat_api_key_here // optional
    
  5. Run the Application

    python3 -m audiomind
    

Usage

Place the audio files in the /exmaples folder and run the audio_to_journal.py script. The script will transcribe the audio and summarize it.

python3 -m audiomind --file examples/1.mp3

You can add some information about yourself in person.txt file. Audiomind will use this information too while creating the journal entry.

Roadmap

  • Transcribe audio files to text.
  • Summarize the transcribed text.
  • Easy to integrate and use.
  • Get Insights from any audio file, including podcasts , interviews, lectures, etc.
  • Create a journal entry from your voice note.
  • Improve the journal entry.
  • Create a summary of a podcast episode.
  • Create a summary of a lecture.
  • Create a summary of a meeting.

Contributing

Feel free to submit issues and enhancement requests.

License

MIT


Enjoy using AudioMind!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

audiomind-0.2.0.tar.gz (14.0 MB view hashes)

Uploaded Source

Built Distribution

audiomind-0.2.0-py3-none-any.whl (14.0 MB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page