Skip to main content

Convert audio to phonetic text and practice improving your speech accent.

Project description

pnm

pnm is an audio-to-phoneme conversion tool designed to transform spoken English into phonetic transcriptions. This project is a mini-project derived from a larger, unfinished personal project aimed at creating a tool for English phonetic practice. Although the main project wasn't completed, PNM is being transformed into a Python library for open-source use.

Currently, the tool is a work-in-progress but is functional and offers a simple way to convert audio into phonemes.

It is possible to classify the speech quality of the person training using the pnm tool. By analyzing the phonetic transcriptions generated from the spoken audio (by token confidence). This analysis can help in evaluating the quality of the speaker’s pronunciation and progress over time, allowing for personalized feedback during training.

Installation

To install the required dependencies, use the following command:

For cpu

pip install "pnm[cpu]"

For cuda 11.X

pip install "pnm[gpu]"

For cuda 12.X

pip install "pnm[gpu]" --extra-index-url https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/onnxruntime-cuda-12/pypi/simple/

Usage

Command Line Interface

For get the phonemes of an audio file:

pnm file --file_path path/to/audio.wav

For get the phonemes of an audio recorder (default input device):

pnm recorder

For practice (default input device):

pnm practice

Images

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pnm-0.0.1-py3-none-any.whl (39.4 MB view details)

Uploaded Python 3

File details

Details for the file pnm-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: pnm-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 39.4 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.9.16

File hashes

Hashes for pnm-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 3131139de268606183d8cadf48bb7474d4f158c382a1197f3425738f29486284
MD5 6d749f14e0fb511c2bb8a3215afdc78a
BLAKE2b-256 f19c8e461c6fcfe898451440e6ad6202a69be546dd78c9742172335174bcd294

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page