sbvoicedb

Saarbruecken Voice Database Downloader and Reader

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

tikuma-lsuhsc

These details have not been verified by PyPI

Development Status
- 4 - Beta
License
- OSI Approved :: GNU General Public License v2 (GPLv2)
Programming Language
- Python :: 3
Topic
- Multimedia :: Sound/Audio

Project description

This Python module downloads and organizes the Saarbrücker Stimmdatenbank (Saarbrücken Voice Database, https://stimmdb.coli.uni-saarland.de/) using SQLAlchemy (sqlalchemy.org).

Features

Auto-downloads the database file from https://stimmdb.coli.uni-saarland.de
Auto-downloads the associated datasets from Zenodo: https://zenodo.org/records/16874898
Supports incremental, on-demand download per-pathology
Stores database information as a local SQLite3 file
Database and datasets are accessed via SQLAlchemy ORM (Object Relational Mapper) classes for ease of use
Acoustic and EGG signals can be retrieved as NumPy arrays directly
Supports filters to specify study conditions on pathologies, speaker’s gender and age, recording types, etc.
Fixes known errors in the dataset (i.e., corrupted files and swapping of acoustic/EGG data)

Install

pip install sbvoicedb

If you prefer to download the full dataset manually from Zenodo (data.zip, 17.9 GB), download the file first and unzip its contents to a directory, making sure that the zip file’s internal structure is preserved. If you place the downloaded database in a folder named my_svd, its directory structure should look like this:

.../my_svd/
└── data/
    ├── 1/
    │   ├── sentences
    │   │   ├── 1-phrase.nsp
    │   │   └── 1-phrase-egg.egg
    │   └── vowels
    │       ├── 1-a_h.nsp
    │       ├── 1-a_h-egg.egg
    │       ⋮
    │       └── 1-u_n-egg.egg
    ├── 2/
    │   ├── sentences
    │   │   ├── 2-phrase.nsp
    │   │   └── 2-phrase-egg.egg
    │   └── vowels
    │       ├── 2-a_h.nsp
    │       ├── 2-a_h-egg.egg
    │       ⋮
    │       └── 2-u_n-egg.egg
    ⋮
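
Before pointing sbvoicedb at a manually extracted copy, it can help to confirm that the folder layout is what the package expects. The following is a minimal sketch using only the standard library. `find_incomplete_sessions` is a hypothetical helper, not part of sbvoicedb, and it assumes that every numbered session folder should contain subfolders named `sentences` and `vowels`; sessions that legitimately lack one of them would be reported too.

```python
from pathlib import Path


def find_incomplete_sessions(root):
    """Return names of session folders under root/data lacking an expected subfolder.

    Hypothetical helper for illustration; not part of sbvoicedb.
    """
    data_dir = Path(root) / "data"
    if not data_dir.is_dir():
        raise FileNotFoundError(f"missing data directory: {data_dir}")

    expected = ("sentences", "vowels")  # assumed subfolder names
    incomplete = []
    for session_dir in sorted(data_dir.iterdir()):
        # session folders are assumed to be named by numeric ID (1, 2, ...)
        if not (session_dir.is_dir() and session_dir.name.isdigit()):
            continue
        if not all((session_dir / sub).is_dir() for sub in expected):
            incomplete.append(session_dir.name)
    return incomplete
```

Calling `find_incomplete_sessions(".../my_svd")` returns an empty list when every session folder has both subfolders.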

Examples

from sbvoicedb import SbVoiceDb

dbpath = '<path to the root directory of the extracted database>'

# to create a database instance
db = SbVoiceDb(dbpath)
# - if no downloaded database data found, it'll automatically download the database (not files)

This creates a new database instance. If dbpath does not contain the SQLite database file, sbvoice.db, it gets populated from the downloaded CSV file.

If any portion of the dataset is already available in the data subdirectory, it further populates the recordings table. These database population processes are shown with progress bars in the console.

By default, no dataset will be downloaded at this point. You can check how much of the dataset is available with:

print(f"{db.number_of_sessions_downloaded}/{db.number_of_all_sessions}")

The db.number_of_all_sessions property should always return 2043.

The SQLite database has four tables: pathologies, speakers, recording_sessions, and recordings. The row counts and contents of these tables can be accessed with:

db.get_pathology_count()
db.get_speaker_count()
db.get_session_count()
db.get_recording_count()

db.iter_pathologies()
db.iter_speakers()
db.iter_sessions()
db.iter_recordings()
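
Because the metadata is stored in an ordinary SQLite file, the table list can also be checked without going through sbvoicedb. The sketch below uses the standard-library sqlite3 module. `list_tables` is a hypothetical helper, and the file location (sbvoice.db inside the database directory) is taken from the description above rather than verified against the package.

```python
import sqlite3
from contextlib import closing


def list_tables(db_file):
    """Return the sorted names of the user tables in an SQLite file.

    Hypothetical helper for illustration; not part of sbvoicedb.
    Note: sqlite3.connect creates an empty file if db_file does not exist.
    """
    with closing(sqlite3.connect(db_file)) as conn:
        rows = conn.execute(
            "SELECT name FROM sqlite_master "
            "WHERE type = 'table' AND name NOT LIKE 'sqlite_%' "
            "ORDER BY name"
        ).fetchall()
    return [name for (name,) in rows]
```

If the description above holds, `list_tables("<dbpath>/sbvoice.db")` should include the four table names listed.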

Your study may not require all the recordings. In such a case, you can set filters on each table when creating the database object. For example, the following creates a subset of the database consisting only of recordings of sustained /a/ or /i/ at normal pitch, uttered by women aged between 50 and 70 with a normal voice or with a diagnosis of Laryngitis:

from sbvoicedb import SbVoiceDb, Pathology, Speaker, RecordingSession, Recording, sql_expr

# dbpath is the database root directory defined in the first example
db_laryngitis = SbVoiceDb(
    dbpath,
    pathology_filter=Pathology.name == "Laryngitis",
    include_healthy=True,
    speaker_filter=Speaker.gender == "w",
    session_filter=RecordingSession.speaker_age.between(50, 70),
    recording_filter=Recording.utterance.in_(("a_n", "i_n")),
)
print(f"number of pathologies found: {db_laryngitis.get_pathology_count()}")
print(f"number of recording sessions found: {db_laryngitis.get_session_count()}")
print(f"number of unique speakers: {db_laryngitis.get_speaker_count()}")
print(f"number of recordings: {db_laryngitis.get_recording_count()}")

number of pathologies found: 1
number of recording sessions found: 45
number of unique speakers: 44
number of recordings: 90
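
To make the combined effect of the four filters concrete, the same selection rule can be written out as a plain Python predicate. This is only an illustrative sketch: the `Row` record and the `keep` function are hypothetical, the actual filtering is performed by SQLAlchemy on the database tables, and it assumes that `include_healthy=True` admits healthy sessions alongside those matching `pathology_filter`.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Row:
    """Hypothetical flattened record; not an sbvoicedb class."""

    pathology: Optional[str]  # None stands for a healthy (normal voice) session
    gender: str               # "w" is the value used in the example
    age: int
    utterance: str


def keep(row):
    """Return True if the row satisfies all four example filters."""
    healthy_or_laryngitis = row.pathology is None or row.pathology == "Laryngitis"
    return (
        healthy_or_laryngitis
        and row.gender == "w"
        and 50 <= row.age <= 70  # SQL BETWEEN is inclusive at both ends
        and row.utterance in ("a_n", "i_n")
    )
```

Each condition narrows a different table in the real database, so a recording is included only when its pathology, speaker, session, and utterance all pass.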

You can iterate over the rows of any of the tables:

# iterate over included pathologies
for patho in db_laryngitis.iter_pathologies():
    print(f'{patho.id}: {patho.name} ({patho.downloaded})')

# iterate over included speakers
for speaker in db_laryngitis.iter_speakers():
    print(f'{speaker.id}: {speaker.gender}')

# iterate over included recording sessions
for session in db_laryngitis.iter_sessions():
    print(f'{session.id}: speaker_id={session.speaker_id}, speaker_age={session.speaker_age}, speaker_health={session.type}')

# iterate over included recordings
for rec in db_laryngitis.iter_recordings():
    print(f'{rec.id}: session_id={rec.session_id}, utterance={rec.utterance}, nspfile={rec.nspfile}, eggfile={rec.eggfile}')

To retrieve the acoustic and EGG data, use Recording.nspdata and Recording.eggdata:

import numpy as np
from matplotlib import pyplot as plt

rec = next(db_laryngitis.iter_recordings())

t = np.arange(rec.length)/rec.rate

fig, axes = plt.subplots(2, 1, sharex=True)
axes[0].plot(t,rec.nspdata)
axes[0].set_ylabel('acoustic data')
axes[1].plot(t,rec.eggdata)
axes[1].set_ylabel('EGG data')
axes[1].set_xlabel('time (s)')
plt.tight_layout()
plt.show()

License

sbvoicedb is released under the GNU General Public License, version 2. See the LICENSE file for details.
The Saarbruecken Voice Database is released under CC BY 4.0 (Creative Commons Attribution 4.0 International). sbvoicedb programmatically downloads the recordings directly from https://zenodo.org, except for the metadata CSV file (summary.csv) in data.zip, which is included in the sbvoicedb package at sbvoicedb/summary.csv.

Release history

- 0.6.1: Feb 15, 2026
- 0.6.0.post0: Feb 6, 2026
- 0.6.0: Feb 6, 2026
- 0.5.0 (this version): Feb 6, 2026
- 0.4.0: Dec 3, 2025
- 0.3.0: Sep 13, 2025
- 0.2.0: Sep 12, 2025
- 0.1.0.dev7 (pre-release): Mar 10, 2023
- 0.1.0.dev6 (pre-release): Mar 10, 2023
- 0.1.0.dev5 (pre-release): Feb 9, 2023
- 0.1.0.dev4 (pre-release): Feb 9, 2023
- 0.1.0.dev3 (pre-release): Feb 8, 2023
- 0.1.0.dev2 (pre-release): Feb 8, 2023
- 0.1.0.dev1 (pre-release): Feb 7, 2023

Download files

Source Distribution

sbvoicedb-0.5.0.tar.gz (85.1 kB)

Uploaded Feb 6, 2026 Source

Built Distribution

sbvoicedb-0.5.0-py3-none-any.whl (84.5 kB)

Uploaded Feb 6, 2026 Python 3

File details

Details for the file sbvoicedb-0.5.0.tar.gz.

File metadata

Download URL: sbvoicedb-0.5.0.tar.gz
Upload date: Feb 6, 2026
Size: 85.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for sbvoicedb-0.5.0.tar.gz
Algorithm	Hash digest
SHA256	`b88b0a7f313b4d19df9b6dce3bcbcc98c3709d978cec0f375f9516dfb960c962`
MD5	`9e1fac8a7f2dbee959aef14745686558`
BLAKE2b-256	`1cc40700a49375fe44e497cfb3fdb3761f41433494517334b0f1a9e57b4cdfbf`
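
The SHA256 digest listed above can be compared against a downloaded copy of the file before installing from it. A minimal sketch using the standard-library hashlib module follows. `sha256_of` is a hypothetical helper name, and the chunk size is an arbitrary choice.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # iter() with a sentinel stops when read() returns empty bytes at EOF
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

The download is intact if `sha256_of("sbvoicedb-0.5.0.tar.gz")` equals the SHA256 value in the table.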

Provenance

The following attestation bundles were made for sbvoicedb-0.5.0.tar.gz:

Publisher: pub.yml on tikuma-lsuhsc/python-sbvoicedb

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: sbvoicedb-0.5.0.tar.gz
- Subject digest: b88b0a7f313b4d19df9b6dce3bcbcc98c3709d978cec0f375f9516dfb960c962
- Sigstore transparency entry: 924474250
- Sigstore integration time: Feb 6, 2026
Source repository:
- Permalink: tikuma-lsuhsc/python-sbvoicedb@ffa1be1dce0819c8929712d615d8fa386a9c46c5
- Branch / Tag: refs/tags/v0.5.0
- Owner: https://github.com/tikuma-lsuhsc
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pub.yml@ffa1be1dce0819c8929712d615d8fa386a9c46c5
- Trigger Event: push

File details

Details for the file sbvoicedb-0.5.0-py3-none-any.whl.

File metadata

Download URL: sbvoicedb-0.5.0-py3-none-any.whl
Upload date: Feb 6, 2026
Size: 84.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for sbvoicedb-0.5.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`41635cc2eb114b75d3e8353b7b3218df2e21780fbc0d40e28a72daf135d70b7e`
MD5	`b0e2c8d677960cc3149600822953794f`
BLAKE2b-256	`9164d0e313cb6e7e41bd95977cd65674ac7ffc77e82b223204c26157850031d1`

Provenance

The following attestation bundles were made for sbvoicedb-0.5.0-py3-none-any.whl:

Publisher: pub.yml on tikuma-lsuhsc/python-sbvoicedb

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: sbvoicedb-0.5.0-py3-none-any.whl
- Subject digest: 41635cc2eb114b75d3e8353b7b3218df2e21780fbc0d40e28a72daf135d70b7e
- Sigstore transparency entry: 924474253
- Sigstore integration time: Feb 6, 2026
Source repository:
- Permalink: tikuma-lsuhsc/python-sbvoicedb@ffa1be1dce0819c8929712d615d8fa386a9c46c5
- Branch / Tag: refs/tags/v0.5.0
- Owner: https://github.com/tikuma-lsuhsc
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pub.yml@ffa1be1dce0819c8929712d615d8fa386a9c46c5
- Trigger Event: push
