The CLI for perform actions over the Open Speech Corpus
Project description
Open Speech Corpus CLI
This repository contains the code required to download audiodata from openspeechcorpus.com
Open Speech Corpus is composed by far for three subcorpuses:
- Tales: A crowdsourced corpus based on reading of latin american short tales
- Aphasia: A crowdsourced corpus based in words categorized in 4 levels of difficulty
- Isolated words: A crowdsourced corpus based in isolated words
To download files from the Tales Project use
ops \
--output_folder tales/ \
--output_file tales.txt \
--corpus tales
To download files from the Isolated Words Project use
ops \
--output_folder isolated_words/ \
--output_file isolated_words.txt \
--corpus words
To download files from the Aphasia Project use
ops \
--output_folder aphasia/ \
--output_file aphasia.txt \
--corpus aphasia
By default the page size is 500, to modify it use the args --from
and --to
i.e:
ops \
--from 500 \
--to 1000 \
--output_folder aphasia/ \
--output_file aphasia.txt \
--corpus aphasia
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
openspeechcorpus-0.0.3.tar.gz
(3.3 kB
view hashes)
Built Distribution
Close
Hashes for openspeechcorpus-0.0.3-py3.7.egg
Algorithm | Hash digest | |
---|---|---|
SHA256 | 57be73722d41607545a969f13993cf7f90cdec2280087de7ee0ba44445ba8ad3 |
|
MD5 | 1e1d0d80e69acb4dbb57053b7c0d0385 |
|
BLAKE2b-256 | 399042bd0995384d126ec08001b1c21aa0352afcf9dc07ee63f9de8db8fa72a5 |