Skip to main content

Interactive bioacoustic annotation tool for measuring vocalizations

Project description

YAAAT: Yet Another Audio Annotation Tool

Interactive bioacoustic annotation tool for measuring vocalizations.

Features:

  1. Changepoint Annotator, for marking temporal onset, offset, and changepoints in vocalizations.
  2. Peak Annotator, for marking dominant frequency peaks on the power spectrum.
Changepoint Annotator Peak Annotator
Changepoint Annotator Peak Annotator

Installation

Via PyPI (Recommended)

pip install yaaat

From Source

git clone https://github.com/laelume/yaaat.git
cd yaaat
pip install -e .

Usage

Launch the Application

yaaat

Opens a tabbed interface with both annotators. Includes test audio files to get started immediately.

Use in Python Scripts

from yaaat import ChangepointAnnotator, PeakAnnotator
import tkinter as tk

# Launch changepoint annotator
root = tk.Tk()
app = ChangepointAnnotator(root)
root.mainloop()

# Or launch peak annotator
root = tk.Tk()
app = PeakAnnotator(root)
root.mainloop()

Getting Started

  1. Click Load Audio Directory to select files or Load Test Audio to explore test audio
  2. Choose where to save annotations (existing, new, or default directory)
  3. Click on the spectrogram to add annotation points
  4. Click Finish Syllable when done with annotation
  5. Move between files using Next/Previous buttons
  6. Annotations auto-save on file navigation or Finish syllable

Navigation & Features

  • Intuitive real-time interactive visualization with zoom, pan, and keycommand + mousewheel navigation

  • Visualize harmonics with adjustable multipliers and draggable bounding boxes

  • JSON annotations saved per-file to minimize corruption

  • Mark and track unusable files

  • Adjust spectrogram resolution for accuracy comparison

  • TODO: implement ranking system for annotation quality; inject as learning feedback mechanism

Requirements

  • Python ≥3.8 (built using 3.11)
  • numpy
  • matplotlib
  • librosa
  • scipy
  • natsort
  • sounddevice
  • soundfile

License

MIT License - Copyright (c) 2025 laelume

Contributing

Contributions welcome! Please open an issue or submit a pull request. I'm especially interested in talking to people about using this in their existing AI workflows, so please feel free to reach out !!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

yaaat-0.1.3.tar.gz (1.5 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

yaaat-0.1.3-py3-none-any.whl (1.5 MB view details)

Uploaded Python 3

File details

Details for the file yaaat-0.1.3.tar.gz.

File metadata

  • Download URL: yaaat-0.1.3.tar.gz
  • Upload date:
  • Size: 1.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for yaaat-0.1.3.tar.gz
Algorithm Hash digest
SHA256 a7e5d2c592a6ccbc6cf87cb029a6db243746818fac78d163cd93327147cd1082
MD5 7e189566318d9726d5623714f21ed5c1
BLAKE2b-256 3741130317f27382e9649943f63ceb572a0787b2e75126ef1cf3d92720065c03

See more details on using hashes here.

File details

Details for the file yaaat-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: yaaat-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 1.5 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for yaaat-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 6856586554cb2cad922664f7ba57d4d5d8546e124621cd79e17bcc31abca3fcc
MD5 1be3c290eddbab6f4838d8584a50a0c6
BLAKE2b-256 9f3f35af1e79b7c702b0a9ddd3be5ffade6453a83cb876a8a233d532de938940

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page