Automatic annotation and analysis of audio/video speech recordings.

SPPAS

sppas

The automatic annotation and analysis of speech.

sppas is the Python package of the SPPAS software. It provides an API for annotating, segmenting, and analyzing audio and video speech recordings, as well as tools for working with annotated data.

Homepage: https://sppas.org/
Documentation: https://brigittebigi.github.io/sppas/
Main Repository: https://github.com/brigitte-bigi/SPPAS
Development Repository: https://sourceforge.net/projects/sppas
License: AGPL-3.0-or-later
Copyright (C) 2011-2026 Brigitte Bigi, CNRS, Laboratoire Parole et Langage, Aix-en-Provence, France

How can SPPAS be helpful?

SPPAS is designed to assist researchers and linguists in analyzing speech data by providing a range of tools and functionalities related to phonetics, acoustics, signal processing, video processing, etc.

Among other features, SPPAS allows users to automatically annotate speech data at different phonetic levels. This annotated data can then be used to segment speech into meaningful units (e.g., phonemes, syllables, words) and align it with the original speech signal. SPPAS includes statistical tools for analyzing the data and extracting relevant information from the corpus. The software also offers a manual annotation interface and supports various annotation formats. It can export annotated data in various formats for compatibility with other speech processing and statistical analysis tools.

SPPAS allows users to customize automatic annotations to their own needs and implement custom annotation solutions.

It is a research-oriented tool and may require some familiarity with speech processing concepts and techniques.

SPPAS software features

SPPAS provides 24 automatic annotation and analysis solutions, facilitating the study of various aspects of phonetics and linguistics.

It is primarily used for speech segmentation: the process of taking the orthographic transcription of an audio segment (such as IPUs – Inter-Pausal Units) and determining where particular phonemes or words occur in that segment.
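
As a toy illustration of what a segmentation result looks like, each word of the transcription can be paired with a start and an end time inside the IPU. The `Interval` class, the `unit_at` helper and the time values below are invented for this sketch; they are not part of the sppas API and do not show how SPPAS computes the alignment.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Interval:
    """A labelled time span in seconds (hypothetical structure, not the sppas API)."""
    begin: float
    end: float
    label: str

    @property
    def duration(self) -> float:
        return self.end - self.begin


def unit_at(intervals: List[Interval], time: float) -> Optional[str]:
    """Return the label of the interval that contains `time`, or None."""
    for interval in intervals:
        if interval.begin <= time < interval.end:
            return interval.label
    return None


# Made-up word alignment for the IPU "the cat sleeps" (illustrative times only).
words = [
    Interval(0.00, 0.12, "the"),
    Interval(0.12, 0.45, "cat"),
    Interval(0.45, 1.02, "sleeps"),
]

print(unit_at(words, 0.30))               # word spoken at 0.30 s
print([w.label for w in words if w.duration > 0.3])  # words longer than 0.3 s
```

The same structure would apply to phonemes or syllables: only the labels and the granularity of the intervals change.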

SPPAS is thus a free and open-source speech segmentation tool designed to meet the real needs of researchers in linguistics and phonetics. Unlike most systems, which are evaluated only through technical benchmarks, SPPAS offers a transparent, modular and fully traceable architecture. Each step (text normalization, phonetic transcription, forced alignment) is explicit, documented, published and customizable. This FAIR-compliant approach (Findable, Accessible, Interoperable, Reusable) supports scientific reproducibility, adaptability, and open science practices.

SPPAS challenges the limits of conventional evaluation metrics and proposes an alternative, multi-criteria framework for assessing usability, transparency, and research value.

SPPAS features can be extended in two ways:

Plugins are extra tools that the user chooses and downloads separately. Once downloaded, SPPAS can install them and make them available. A plugin can also be removed later.
Spin-offs are official extensions of SPPAS. They are downloaded and installed automatically by SPPAS during its setup. Spin-offs extend the system permanently and cannot be removed afterward.

Cite

By using SPPAS/sppas, you agree to cite a reference in your publications or product.

Any publication or product resulting from the use of this software, including but not limited to academic journal and conference publications, technical reports and manuals, or software, must cite at least one reference either among the following works or among any of the references listed in the book:

Brigitte Bigi (2015). SPPAS - Multi-lingual Approaches to the Automatic Annotation of Speech. The Phonetician - International Society of Phonetic Sciences, ISSN 0741-6164, Number 111-112 / 2015-I-II, pages 54-69.

'sppas' Quick start

Install

$ pip install sppas

Install optional components (linguistic resources, models, etc.):

$ sppassetup --help
$ sppassetup --eng

API usage

Read an annotated file and convert it to another format:

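from sppas.src.anndata import sppasTrsRW

# Read the annotated file into a transcription object.
parser = sppasTrsRW("recording.TextGrid")
trs = parser.read()

# Point the same parser at a new file name, then write the data in that format.
parser.set_filename("recording.csv")
parser.write(trs)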

For more examples, see the API documentation.
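
The read/write pattern above can be wrapped in a small helper when the converted file should sit next to the source file. This is a minimal sketch: `target_name` and `convert` are hypothetical helpers, and the only sppas calls used are the ones already shown (`sppasTrsRW`, `read`, `set_filename`, `write`). Whether a given target extension is supported depends on the installed sppas version.

```python
from pathlib import Path


def target_name(source: str, extension: str = ".csv") -> str:
    """Return the source file name with its extension replaced by `extension`."""
    return str(Path(source).with_suffix(extension))


def convert(source: str, extension: str = ".csv") -> str:
    """Convert an annotated file and return the name of the file written."""
    # Deferred import: sppas is only needed when a file is actually converted.
    from sppas.src.anndata import sppasTrsRW

    parser = sppasTrsRW(source)
    trs = parser.read()

    destination = target_name(source, extension)
    parser.set_filename(destination)
    parser.write(trs)
    return destination
```

For instance, `convert("recording.TextGrid")` would be expected to write `recording.csv` in the same folder.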

Command-line usage

Automatic annotations are available via the scripts in sppas/bin/. For example, full automatic speech segmentation:

$ sppasseg -w audio.wav -l eng -e .TextGrid
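
To apply the same command to every recording in a folder, the call can be scripted. The sketch below only uses the options that appear in the example above (`-w`, `-l`, `-e`) and assumes that `sppasseg` is on the PATH and is run once per WAV file; `sppasseg_command` and `segment_folder` are hypothetical helper names. By default it only builds the commands without running them.

```python
import subprocess
from pathlib import Path
from typing import List


def sppasseg_command(wav: Path, lang: str = "eng", ext: str = ".TextGrid") -> List[str]:
    """Build the argument list for one recording, using only the options shown above."""
    return ["sppasseg", "-w", str(wav), "-l", lang, "-e", ext]


def segment_folder(folder: str, lang: str = "eng", dry_run: bool = True) -> List[List[str]]:
    """Build, and optionally run, one sppasseg command per WAV file in `folder`."""
    wav_files = sorted(Path(folder).glob("*.wav"))
    commands = [sppasseg_command(wav, lang) for wav in wav_files]
    if not dry_run:
        for command in commands:
            # check=True raises CalledProcessError if sppasseg exits with an error.
            subprocess.run(command, check=True)
    return commands
```

Calling `segment_folder("corpus")` first and inspecting the returned commands is a cheap way to check the file list before launching the actual annotation with `dry_run=False`.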

Additional scripts for data processing and analysis are available in sppas/scripts/.

'sppas' Developer install

$ git clone https://github.com/brigittebigi/sppas.git
$ python -m venv sppas-venv
$ source sppas-venv/bin/activate
$ cd sppas
$ pip install -e .
$ sppassetup --help

The API documentation is available locally in the docs/ folder: open docs/index.html in a browser. It is generated by ClammingPy and uses Whakerexa for accessibility (light/dark mode, dyslexia-friendly font, WCAG spacing). To re-generate it:

$ pip install pygments markdown2 "ClammingPy>=2.1"
$ python makedoc.py

Contributing to 'sppas'

Bug reports and contributions are welcome by e-mail at contact@sppas.org. Please read the code of conduct and the code style guide before contributing. Contributors must agree to the DCO.

License

This program is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

See the accompanying LICENSE file or https://www.gnu.org/licenses/.

Release history

5.0.0: Apr 6, 2026
5.0.0rc2 (pre-release, this version): Mar 30, 2026

Download files

Source distributions: none available for this release.
Built distribution: sppas-5.0.0rc2-py3-none-any.whl (19.2 MB), uploaded Mar 30, 2026, Python 3.

File details

Details for the file sppas-5.0.0rc2-py3-none-any.whl.

File metadata

Download URL: sppas-5.0.0rc2-py3-none-any.whl
Upload date: Mar 30, 2026
Size: 19.2 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for sppas-5.0.0rc2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`08c743b353b87498e5b0cedfe17e4197d8c7b09ab5b537902c425e09d02f17c4`
MD5	`f2fc37356047e7e2796297eb5db21620`
BLAKE2b-256	`648270809a49401cf98502cdcb0ad3736e91d3dd91be1adeede9156c52d10452`
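
A downloaded wheel can be checked against the SHA256 digest listed above with the Python standard library. This is a minimal sketch: `sha256_of` and `is_expected_wheel` are hypothetical helper names, and the file is assumed to have been downloaded to a local path.

```python
import hashlib


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hexadecimal SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as stream:
        for chunk in iter(lambda: stream.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# SHA256 digest listed above for sppas-5.0.0rc2-py3-none-any.whl.
EXPECTED = "08c743b353b87498e5b0cedfe17e4197d8c7b09ab5b537902c425e09d02f17c4"


def is_expected_wheel(path: str) -> bool:
    """Return True if the file at `path` matches the listed SHA256 digest."""
    return sha256_of(path) == EXPECTED
```

A call such as `is_expected_wheel("sppas-5.0.0rc2-py3-none-any.whl")` returning False would indicate a corrupted or different file.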

