Convert audio to phonetic text and practice improving your speech accent.
Project description
pnm
pnm is an audio-to-phoneme conversion tool designed to transform spoken English into phonetic transcriptions. This project is a mini-project derived from a larger, unfinished personal project aimed at creating a tool for English phonetic practice. Although the main project wasn't completed, PNM is being transformed into a Python library for open-source use.
Currently, the tool is a work-in-progress but is functional and offers a simple way to convert audio into phonemes.
It is possible to classify the speech quality of the person training using the pnm tool. By analyzing the phonetic transcriptions generated from the spoken audio (by token confidence). This analysis can help in evaluating the quality of the speaker’s pronunciation and progress over time, allowing for personalized feedback during training.
Installation
To install the required dependencies, use the following command:
For cpu
pip install "pnm[cpu]"
For cuda 11.X
pip install "pnm[gpu]"
For cuda 12.X
pip install "pnm[gpu]" --extra-index-url https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/onnxruntime-cuda-12/pypi/simple/
Usage
Command Line Interface
For get the phonemes of an audio file:
pnm file --file_path path/to/audio.wav
For get the phonemes of an audio recorder (default input device):
pnm recorder
For practice (default input device):
pnm practice
Images
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pnm-0.0.1-py3-none-any.whl.
File metadata
- Download URL: pnm-0.0.1-py3-none-any.whl
- Upload date:
- Size: 39.4 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.9.16
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3131139de268606183d8cadf48bb7474d4f158c382a1197f3425738f29486284
|
|
| MD5 |
6d749f14e0fb511c2bb8a3215afdc78a
|
|
| BLAKE2b-256 |
f19c8e461c6fcfe898451440e6ad6202a69be546dd78c9742172335174bcd294
|