Skip to main content

Interactive bioacoustic annotation tool for measuring vocalizations

Project description

YAAAT! Yet Another Audio Annotation Tool

Interactive bioacoustic annotation tool for measuring vocalizations.

Features:

  1. Changepoint Annotator, for marking temporal onset, offset, and changepoints in vocalizations. Useful for describing rapid fluctuations and identifying nonlinear phenomena.
  2. Peak Annotator, for marking dominant frequency peaks on the power spectrum. Useful for describing spectrally complex vocalizations.
  3. Harmonic Annotator, for identifying harmonics and boundaries to describe wavelets
Changepoint Annotator Peak Annotator Harmonic Annotator
Changepoint Annotator Peak Annotator Harmonic Annotator

Installation

Via Pip (probably easiest)

pip install yaaat

From Source

git clone https://github.com/laelume/yaaat.git
cd yaaat
pip install -e .

Usage

Launch the Application

yaaat

Opens a tabbed interface with both annotators. Includes test audio files to get started. For some reason, auto-load is a little buggy, so clicking Load Audio Files and selecting the included test_audio yourself lets the interface work as-intended.

Use in Python Scripts

from yaaat import ChangepointAnnotator, PeakAnnotator
import tkinter as tk

# Launch changepoint annotator
root = tk.Tk()
app = ChangepointAnnotator(root)
root.mainloop()

# Or launch peak annotator
root = tk.Tk()
app = PeakAnnotator(root)
root.mainloop()

Getting Started

  1. Click Load Audio Directory to select files or Load Test Audio to explore test audio
  2. Choose where to save annotations (existing, new, or default directory)
  3. Click on the spectrogram to add annotation points
  4. Click Finish Syllable when done with annotation
  5. Move between files using Next/Previous buttons
  6. Annotations auto-save on file navigation or Finish syllable

Navigation & Features

  • Intuitive real-time interactive visualization with zoom, pan, and keycommand + mousewheel navigation

  • Visualize harmonics with adjustable multipliers and draggable bounding boxes

  • JSON annotations saved per-file to minimize corruption

  • Mark and track unusable files

  • Adjust spectrogram resolution for accuracy comparison

  • TODO: implement ranking system for annotation quality; inject as learning feedback mechanism

Requirements

  • Python ≥3.8 (built using 3.11)
  • numpy
  • matplotlib
  • librosa
  • scipy
  • natsort
  • sounddevice
  • soundfile

License

MIT License - Copyright (c) 2025 laelume

Contributing

Contributions welcome! Please open an issue or submit a pull request. I'm especially interested in talking to people about using this in their existing AI workflows, so please feel free to reach out !!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

yaaat-0.1.8.tar.gz (2.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

yaaat-0.1.8-py3-none-any.whl (2.7 MB view details)

Uploaded Python 3

File details

Details for the file yaaat-0.1.8.tar.gz.

File metadata

  • Download URL: yaaat-0.1.8.tar.gz
  • Upload date:
  • Size: 2.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for yaaat-0.1.8.tar.gz
Algorithm Hash digest
SHA256 7a2e97a3c0d1d263e33c78937434798441d2a2daa88eb14ca9696c77fbe1b0c2
MD5 e6211045a5055909db9b155ad8b10727
BLAKE2b-256 e2dea3e94147564b7ae2bb997f12443ce8102b6eb4e4f980a50459bb157be7c9

See more details on using hashes here.

File details

Details for the file yaaat-0.1.8-py3-none-any.whl.

File metadata

  • Download URL: yaaat-0.1.8-py3-none-any.whl
  • Upload date:
  • Size: 2.7 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for yaaat-0.1.8-py3-none-any.whl
Algorithm Hash digest
SHA256 0b377afb41adfa5f782530b6ab5865c6d9869342ffef9ff28ad4b73e40b9c999
MD5 6c5f29a6f724cbeede02764c2c0e52a0
BLAKE2b-256 3ecd375819924f18e9ff818fdf1824184f1084bf1c89f56790fa0cc641c148d0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page