Convert youtube urls to text with speech recognition
Project description
Converts Youtube URLs to Text with Speech Recognition
What does the library does?
- Youtube -> Text: Translate youtube urls as text file (csv)
- Youtube -> Audio: Downloads youtube urls as audio file (wav, flac)
- Audio -> Text: Translate audio file (wav, flac) to text file (csv)
Three folders will be created to store the output files.
<Own Path> or <HOME_DIRECTORY>/youtube2text
│
├── audio/
│ └── 2022Jan02_011802.flac
|
├── audio-chunks/
│ └── 2022Jan02_011802
│ ├── chunk1.flac
│ ├── chunk2.flac
│ └── chunk3.flac
│
└── text/
└── 2022Jan02_011802.csv
How to install
Install and update using pip
pip install youtube2text
Build from source
git clone <this_repo>
cd <this_repo>
python setup.py install
How to use
- Using the library requires internet connection for both downloading youtube videos and speech recognition operation
from youtube2text import Youtube2Text
converter = Youtube2Text()
converter.url2text(urlpath="https://www.youtube.com/watch?v=Ad9Q8rM0Am0&t=114s")
Check out more at howtouse.ipynb
Functions
- Support audio output of
- wav
- flac
- Support Automatic Speech Recognition with backend
- Native Python backend
- Huggingface
Youtube -> Text
def url2text(self, urlpath, outfile = None, audioformat = "flac", audiosamplingrate=16000):
'''
Convert youtube url to text
Parameters:
urlpath (str): Youtube url
outfile (str, optional): File path/name of output file (.csv)
audioformat (str, optional): Audioformat supported in self.__audioextension
audiosamplingrate (int, optional): Audio sampling rate
'''
Youtube -> Audio
def url2audio(self, urlpath, audiofile = None, audiosamplingrate=16000):
'''
Convert youtube url to audiofile
Parameters:
urlpath (str): Youtube url
audiofile (str, optional): File path/name to save audio file
audiosamplingrate (int, optional): Audio sampling rate
'''
Audio -> Text
def audio2text(self, audiofile, textfile = None):
'''
Convert audio to csv file
Parameters:
audiofile (str): File path/name of audio file
textfile (str, optional): File path/name of text file (*.csv)
'''
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
youtube2text-0.0.9.tar.gz
(29.5 kB
view hashes)
Built Distribution
Close
Hashes for youtube2text-0.0.9-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4202aeb0e32fa460380071022f2dd7bf308b58003791a41b9bf1de7f42c0ef14 |
|
MD5 | 91da92875bdd1074ded4396eccca7d4b |
|
BLAKE2b-256 | 53bdf31adbb6c7b472b900e554ea69c79afd05e13be22bee7dd45289fe15dfd9 |