Skip to main content

Representation Learning for the Automatic Indexing of Sound Effects Libraries (ISMIR 2022): Deep audio embeddings pre-trained on UCS & Non-UCS-compliant datasets.

Project description

aiSFX

Picture

Representation Learning for the Automatic Indexing of Sound Effects Libraries (ISMIR 2022): Deep audio embeddings pre-trained on UCS & Non-UCS-compliant datasets.

This work was inspired by the creation of the Universal Category System (UCS), an industry-proposed public domain initiative initialized by Tim Nielsen, Justin Drury, Kai Paquin, and others. First launching in the fall of 2020, UCS offers a standardized framework for sound effects library metadata designed by and for sound designers and editors.

How To Use

Please refer to this package's documentation for Installation Instructions and Tutorials of how to extract embeddings.

Cite This Work

Please cite the paper below if you use it in your work.

This paper has been accepted at the 23rd International Society for Music Information Retrieval Conference (ISMIR) in Bengaluru, India (December 04-08, 2022). To cite a pre-print, please refer to the following.

[1] Representation Learning for the Automatic Indexing of Sound Effects Libraries

  @article{pp_aisfx},
        title = {Representation Learning for the Automatic Indexing of Sound Effects Libraries},
        author = {Ma, Alison Bernice and Lerch, Alexander},
        journal={arXiv preprint arXiv:xxxx.xxxxx},
        year = {2022},
        archivePrefix = {arXiv},
  }

Acknowledgements

We would like to thank those who provided the data required to conduct this research as well as those who took the time to share their insights and software licenses for tools regarding sound search, query, and retrieval.

Universal Category System (UCS)Alex LaneAll You Can Eat AudioArticulated Sounds • Audio ShadeaXLSoundBig Sound BankBaseHeadBonsonBOOM LibraryFrick & TraaHzandbitsInspectorJKai PaquinKEDR AudioKrotos AudioNikola SimikicPenguin GrenadePro Sound EffectsRick Allen CreativeSononymSound IdeasSoundlySoundminerStoryblocksTim NielsenThomas Rex BeverlyZapSplat

License: Pre-trained Model & Paper

This pre-trained model and paper [1] is made available under a Creative Commons Attribution 4.0 International License (CC BY 4.0).

Visualizations of UCS Classes

Click the above to visualize coarse-level "Category" UCS classes in Pro Sound Effects (PSE), Soundly (SDLY), and UCS Mixed (UMIX).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aisfx-0.1.0.tar.gz (23.1 MB view hashes)

Uploaded source

Built Distribution

aisfx-0.1.0-py2.py3-none-any.whl (23.1 MB view hashes)

Uploaded py2 py3

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page