Speaker Annotation for Transcripts using Audio Classification
Project description
speakerbox
Speaker Annotation for Transcripts using Audio Classification
Installation
Stable Release: pip install speakerbox
Development Head: pip install git+https://github.com/CouncilDataProject/speakerbox.git
Documentation
For full package documentation please visit councildataproject.github.io/speakerbox.
Quickstart
Load the 2021 Seattle Prototype Dataset, get summary statistics
about speaker time, finally pull the matching audio file for each annotation file
and store annotation file matched to audio as a pandas.DataFrame
.
from speakerbox import datasets
seattle_2021_ds_dir = datasets.unpack_seattle_2021_proto(clean=True)
seattle_2021_ds_summary_stats = datasets.summarize_annotation_statistics(
seattle_2021_ds_dir / "annotations"
)
seattle_2021_ds = datasets.pull_seattle_2021_proto_audio()
Development
See CONTRIBUTING.md for information related to developing the code.
MIT license
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
speakerbox-0.0.2.tar.gz
(49.3 kB
view hashes)
Built Distribution
Close
Hashes for speakerbox-0.0.2-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 42b97e9b518a515cd45a865fdb0f47195b5dd19b15e9f8f5f7ab1da936bab6e1 |
|
MD5 | 475a564bb270362a537dcb9f441afbfd |
|
BLAKE2b-256 | 9037f837313c1ec765bb0f591d09274df4241dfc519baf3f04f48196685d7ac4 |