YAAAT: Yet Another Audio Annotation Tool

Interactive bioacoustic annotation tool for measuring vocalizations.

Features:

  1. Changepoint Annotator, for marking temporal onset, offset, and changepoints in vocalizations.
  2. Peak Annotator, for marking dominant frequency peaks on the power spectrum.

[Screenshots: Changepoint Annotator, Peak Annotator]
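
To give a feel for what "dominant frequency peaks on the power spectrum" means, here is a minimal sketch using only numpy. It is not YAAAT's implementation: the function name `dominant_peaks`, the Hann window, and the synthetic two-tone signal are all assumptions made for illustration.

```python
import numpy as np


def dominant_peaks(signal, sample_rate, n_peaks=2):
    """Return the strongest local maxima of the power spectrum as (Hz, power) pairs.

    Hypothetical helper for illustration; not part of the yaaat package.
    """
    window = np.hanning(len(signal))               # taper to reduce spectral leakage
    spectrum = np.fft.rfft(signal * window)
    power = np.abs(spectrum) ** 2
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)

    # A bin is a peak if it is strictly larger than both of its neighbours.
    is_peak = (power[1:-1] > power[:-2]) & (power[1:-1] > power[2:])
    candidates = np.flatnonzero(is_peak) + 1

    # Keep the n_peaks candidates with the highest power, strongest first.
    strongest = candidates[np.argsort(power[candidates])[::-1][:n_peaks]]
    return [(float(freqs[i]), float(power[i])) for i in strongest]


# Synthetic "vocalization": a 2 kHz tone plus a weaker 4 kHz harmonic.
sample_rate = 22050
t = np.arange(sample_rate) / sample_rate           # one second of samples
tone = np.sin(2 * np.pi * 2000 * t) + 0.3 * np.sin(2 * np.pi * 4000 * t)

peaks = dominant_peaks(tone, sample_rate)
for freq_hz, power in peaks:
    print(f"{freq_hz:.1f} Hz  (power {power:.3g})")
```

In the Peak Annotator you mark such peaks by hand; an automatic pass like this is only a rough starting point on clean signals.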

Installation

Via PyPI (Recommended)

pip install yaaat

From Source

git clone https://github.com/laelume/yaaat.git
cd yaaat
pip install -e .
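
To confirm that either install route worked, you can ask Python which version of the distribution it sees. This is a small sketch using the standard library's `importlib.metadata` (Python 3.8+); the helper name `installed_version` is made up for this example.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name="yaaat"):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


found = installed_version("yaaat")
print(f"yaaat {found}" if found else "yaaat is not installed in this environment")
```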

Usage

Launch the Application

yaaat

This opens a tabbed interface with both annotators. Test audio files are included so you can get started immediately.

Use in Python Scripts

import tkinter as tk

from yaaat import ChangepointAnnotator, PeakAnnotator

# Launch the changepoint annotator in its own Tk window
root = tk.Tk()
app = ChangepointAnnotator(root)
root.mainloop()  # blocks until the window is closed

# Or launch the peak annotator instead
root = tk.Tk()
app = PeakAnnotator(root)
root.mainloop()

Getting Started

  1. Click Load Audio Directory to select your own files, or Load Test Audio to explore the bundled test audio.
  2. Choose where to save annotations (an existing, new, or default directory).
  3. Click on the spectrogram to add annotation points.
  4. Click Finish Syllable when the current syllable is fully annotated.
  5. Move between files with the Next/Previous buttons.
  6. Annotations auto-save whenever you navigate to another file or click Finish Syllable.
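
Annotations are stored as one JSON file per audio file. The sketch below shows one way such per-file saving could work. It is an illustration only: the schema (`audio_file`, `points`, `time_s`, `freq_hz`) and the helpers `annotation_path`, `save_annotations`, and `load_annotations` are hypothetical and are not YAAAT's actual file format or API. Writing to a temporary file and renaming it is one common way to avoid leaving a half-written file behind.

```python
import json
import os
import tempfile
from pathlib import Path


def annotation_path(audio_file, save_dir):
    """Map e.g. 'bird_01.wav' to '<save_dir>/bird_01.json' (hypothetical naming)."""
    return Path(save_dir) / (Path(audio_file).stem + ".json")


def save_annotations(audio_file, points, save_dir):
    """Write one JSON file per audio file, replacing any previous version atomically."""
    target = annotation_path(audio_file, save_dir)
    target.parent.mkdir(parents=True, exist_ok=True)
    payload = {"audio_file": Path(audio_file).name, "points": points}

    # Write to a temp file in the same directory, then rename over the target.
    fd, tmp_name = tempfile.mkstemp(dir=str(target.parent), suffix=".tmp")
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as handle:
            json.dump(payload, handle, indent=2)
        os.replace(tmp_name, target)
    except BaseException:
        if os.path.exists(tmp_name):
            os.remove(tmp_name)
        raise
    return target


def load_annotations(audio_file, save_dir):
    """Return the saved points for an audio file, or an empty list if none exist."""
    target = annotation_path(audio_file, save_dir)
    if not target.exists():
        return []
    with target.open(encoding="utf-8") as handle:
        return json.load(handle)["points"]


# Usage: save two hypothetical annotation points, then read them back.
with tempfile.TemporaryDirectory() as demo_dir:
    demo_points = [
        {"time_s": 0.12, "freq_hz": 2400.0},
        {"time_s": 0.31, "freq_hz": 3150.0},
    ]
    save_annotations("bird_01.wav", demo_points, demo_dir)
    print(load_annotations("bird_01.wav", demo_dir))
```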

Navigation & Features

  • Real-time interactive visualization with zoom, pan, and key-command + mouse-wheel navigation

  • Visualize harmonics with adjustable multipliers and draggable bounding boxes

  • Annotations saved as one JSON file per audio file, to limit the impact of file corruption

  • Mark and track unusable files

  • Adjust spectrogram resolution for accuracy comparison

  • TODO: implement a ranking system for annotation quality and use it as a learning feedback mechanism
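
On the spectrogram resolution point: changing the FFT window size trades frequency resolution against time resolution, which is why comparing annotations at several settings can be useful. The arithmetic below is a sketch under assumptions. The README does not say which parameters YAAAT exposes, so `n_fft` and `hop_length` (the names used by librosa's STFT), the 22050 Hz sample rate, and the helper `spectrogram_resolution` are illustrative choices.

```python
def spectrogram_resolution(sample_rate, n_fft, hop_length):
    """Summarize the time/frequency grid implied by STFT settings (hypothetical helper)."""
    return {
        "freq_bin_hz": sample_rate / n_fft,                 # spacing between frequency bins
        "frame_step_ms": 1000.0 * hop_length / sample_rate, # spacing between time frames
        "window_ms": 1000.0 * n_fft / sample_rate,          # duration each frame averages over
    }


# Larger windows give finer frequency bins but blur events in time.
for n_fft in (256, 1024, 4096):
    res = spectrogram_resolution(sample_rate=22050, n_fft=n_fft, hop_length=n_fft // 4)
    print(
        f"n_fft={n_fft:5d}  "
        f"bin={res['freq_bin_hz']:.2f} Hz  "
        f"step={res['frame_step_ms']:.2f} ms  "
        f"window={res['window_ms']:.2f} ms"
    )
```

A short window suits sharp onsets and offsets, while a long window separates closely spaced frequency peaks more cleanly.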

Requirements

  • Python ≥3.8 (built using 3.11)
  • numpy
  • matplotlib
  • librosa
  • scipy
  • natsort
  • sounddevice
  • soundfile

License

MIT License - Copyright (c) 2025 laelume

Contributing

Contributions welcome! Please open an issue or submit a pull request. I'm especially interested in talking to people about using this in their existing AI workflows, so please feel free to reach out!
