audio search/retrieval library

shira 🔖🎧

A simple audio search/retrieval library. (wip)

This is the audio version of ripple. It's meant to be a neural-encoded version of Shazam, but might only suit small-scale/local usage.

Methodology

It's basically a semantic search library for audio.

Local audio files are indexed: embeddings are generated with CLAP, and a FAISS vector index is built from them.
Files are then retrieved by cosine similarity between the query embedding and the indexed embeddings.
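
The retrieval step can be sketched in plain Python. This is only an illustration of ranking by cosine similarity, not shira's actual code: `CosineIndex` and `l2_normalize` are hypothetical names, the three-dimensional vectors are toy stand-ins for CLAP embeddings, and a simple list scan stands in for the FAISS index. It relies on the fact that, once vectors are scaled to unit length, their inner product equals their cosine similarity.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so that dot product == cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


class CosineIndex:
    """Hypothetical brute-force index; a real setup would use FAISS instead."""

    def __init__(self):
        self._paths = []
        self._vectors = []

    def add(self, path, embedding):
        # Store the unit-length embedding next to the file it came from.
        self._paths.append(path)
        self._vectors.append(l2_normalize(embedding))

    def search(self, query, k=3):
        # Score every stored vector against the normalized query.
        q = l2_normalize(query)
        scored = [
            (sum(a * b for a, b in zip(q, v)), path)
            for path, v in zip(self._paths, self._vectors)
        ]
        # Highest cosine similarity first.
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return [(path, score) for score, path in scored[:k]]


# Toy embeddings (assumed values, purely for illustration).
index = CosineIndex()
index.add("drums.wav", [1.0, 0.0, 0.0])
index.add("piano.wav", [0.0, 1.0, 0.0])
index.add("drum_loop.wav", [0.9, 0.1, 0.0])

results = index.search([1.0, 0.05, 0.0], k=2)
for path, score in results:
    print(f"{path}\t{score:.3f}")
```

Here the query vector points almost in the same direction as `drums.wav`, so that file ranks first and `drum_loop.wav` second, while `piano.wav` falls outside the top two.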

This process uses a contrastively pretrained audio-language model, CLAP (like OpenAI's CLIP, but for audio), specifically LAION's laion/larger_clap_music_and_speech checkpoint.

Acknowledgements
