Narizaka

A tool for building a high-quality text-to-speech (TTS) corpus from audiobooks and their texts.

How it works

First, it transcribes the audio with Whisper ASR, saving all word-level timestamps. Then it aligns this transcription with the original text. If the distance between a transcribed fragment and the corresponding text is very small, the fragment is considered a match and added to the dataset.
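The align-and-filter step can be sketched roughly as follows. This is only an illustration and not Narizaka's actual code: the segment shape (`start`, `end`, `text`), the function names, the `max_distance` threshold, and the use of Python's `difflib` as the distance measure are all assumptions made for the example.

```python
from difflib import SequenceMatcher


def normalize(text):
    """Lowercase the text and split it into words, dropping punctuation."""
    cleaned = "".join(ch.lower() if ch.isalnum() else " " for ch in text)
    return cleaned.split()


def align_segments(segments, book_text, max_distance=0.05):
    """Keep only ASR segments that closely match a span of the book text.

    `segments` is a hypothetical list of dicts with "start", "end" (seconds)
    and "text" keys; `max_distance` is an assumed threshold.
    """
    book_words = normalize(book_text)

    # Flatten the transcript, remembering which words belong to which segment.
    asr_words, ranges = [], []
    for seg in segments:
        words = normalize(seg["text"])
        ranges.append((len(asr_words), len(asr_words) + len(words)))
        asr_words.extend(words)

    # Map each ASR word index to the book word index it lines up with.
    mapping = {}
    matcher = SequenceMatcher(None, asr_words, book_words, autojunk=False)
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            mapping[block.a + k] = block.b + k

    dataset = []
    for seg, (lo, hi) in zip(segments, ranges):
        anchors = [mapping[i] for i in range(lo, hi) if i in mapping]
        if not anchors:
            continue  # nothing in this segment lines up with the book
        book_span = " ".join(book_words[anchors[0]:anchors[-1] + 1])
        asr_span = " ".join(asr_words[lo:hi])
        similarity = SequenceMatcher(None, asr_span, book_span, autojunk=False).ratio()
        if 1.0 - similarity <= max_distance:
            # Store the book's wording (normalized here for simplicity).
            dataset.append({"start": seg["start"], "end": seg["end"], "text": book_span})
    return dataset
```

A segment whose transcription differs from the book by more than the threshold is simply dropped, which trades corpus size for label quality.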

Installation

First, install the system dependencies.

On Debian-based Linux:

sudo apt install ffmpeg pandoc

On macOS:

brew install ffmpeg pandoc libmagic

Then you can install narizaka:

pip install narizaka

or if you want to use the latest development version:

pip install git+https://github.com/patriotyk/narizaka.git

If you plan to modify the sources:

git clone https://github.com/patriotyk/narizaka.git
pip install -e narizaka/

The -e flag installs the package in editable mode: changes you make to the source files in the cloned directory take effect the next time you run the narizaka command.

Every tagged commit on the main branch automatically builds an image and pushes it to Docker Hub, so you can also pull the image:

docker pull patriotyk/narizaka:latest

How to use

The application takes a directory containing audio data as input. The audio can be a folder (with or without subfolders) of audio files or a single audio file, and there must also be one text file that represents this audio. The text file can be in any document format that pandoc accepts. Example:

narizaka test_data/farshrutka 

Or

narizaka test_data

to process all books.

The repository contains a test_data directory with two audiobooks and their texts, which you can use for testing.
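To check a directory against the input layout described above before running the tool, a small helper along these lines may be useful. It is a hypothetical pre-check, not part of Narizaka: the function name and both extension sets are assumptions chosen for illustration, and the formats Narizaka actually accepts may differ.

```python
from pathlib import Path

# Assumed extension sets; adjust them to the files you actually have.
AUDIO_EXTS = {".mp3", ".wav", ".flac", ".ogg", ".m4a"}
TEXT_EXTS = {".txt", ".epub", ".fb2", ".docx", ".html", ".md"}


def find_books(root):
    """Return {folder: (text_file, audio_files)} for every folder under
    `root` (including `root` itself) that holds exactly one text file
    directly inside it and at least one audio file at any depth."""
    root = Path(root)
    folders = [root, *sorted(p for p in root.rglob("*") if p.is_dir())]
    books = {}
    for folder in folders:
        texts = [
            p for p in folder.iterdir()
            if p.is_file() and p.suffix.lower() in TEXT_EXTS
        ]
        if len(texts) != 1:
            continue  # no text, or ambiguous: more than one candidate
        audio = sorted(
            p for p in folder.rglob("*")
            if p.is_file() and p.suffix.lower() in AUDIO_EXTS
        )
        if audio:
            books[folder] = (texts[0], audio)
    return books
```

Calling `find_books("test_data")` would list each book folder together with its text file and audio files, which makes it easier to spot a folder that has no text or more than one.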
