This package is written for text-to-audio/music generation.

These details have not been verified by PyPI

Project links

Homepage

Project description

AudioSR: Versatile Audio Super-resolution at Scale

Pass your audio in, AudioSR will make it high fidelity!

Work on all types of audio (e.g., music, speech, dog, raining, ...) & all sampling rates.

Share your thoughts/samples/issues in our discord channel: https://discord.gg/HWeBsJryaf

Image Description

Change Log

2023-09-24: Add replicate demo (@nateraw); Fix error on windows, librosa warning etc (@ORI-Muchim).
2023-09-16: Fix DC shift issue. Fix duration padding bug. Update default DDIM steps to 50.

Commandline Usage

Installation

# Optional
conda create -n audiosr python=3.9; conda activate audiosr
# Install AudioLDM
pip3 install audiosr==0.0.7

Usage

Process a list of files. The result will be saved at ./output by default.

audiosr -il batch.lst

Process a single audio file.

audiosr -i example/music.wav

Full usage instruction

> audiosr -h

> usage: audiosr [-h] -i INPUT_AUDIO_FILE [-il INPUT_FILE_LIST] [-s SAVE_PATH] [--model_name {basic,speech}] [-d DEVICE] [--ddim_steps DDIM_STEPS] [-gs GUIDANCE_SCALE] [--seed SEED]

optional arguments:
  -h, --help            show this help message and exit
  -i INPUT_AUDIO_FILE, --input_audio_file INPUT_AUDIO_FILE
                        Input audio file for audio super resolution
  -il INPUT_FILE_LIST, --input_file_list INPUT_FILE_LIST
                        A file that contains all audio files that need to perform audio super resolution
  -s SAVE_PATH, --save_path SAVE_PATH
                        The path to save model output
  --model_name {basic,speech}
                        The checkpoint you gonna use
  -d DEVICE, --device DEVICE
                        The device for computation. If not specified, the script will automatically choose the device based on your environment.
  --ddim_steps DDIM_STEPS
                        The sampling step for DDIM
  -gs GUIDANCE_SCALE, --guidance_scale GUIDANCE_SCALE
                        Guidance scale (Large => better quality and relavancy to text; Small => better diversity)
  --seed SEED           Change this value (any integer number) will lead to a different generation result.
  --suffix SUFFIX       Suffix for the output file

TODO

Add gradio demo.
Optimize the inference speed.

Cite our work

If you find this repo useful, please consider citing:

@article{liu2023audiosr,
  title={{AudioSR}: Versatile Audio Super-resolution at Scale},
  author={Liu, Haohe and Chen, Ke and Tian, Qiao and Wang, Wenwu and Plumbley, Mark D},
  journal={arXiv preprint arXiv:2309.07314},
  year={2023}
}

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.0.7

Feb 5, 2024

0.0.6

Oct 26, 2023

0.0.5

Sep 24, 2023

0.0.4

Sep 24, 2023

0.0.3

Sep 16, 2023

0.0.2

Sep 15, 2023

0.0.1

Sep 15, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

audiosr-0.0.7.tar.gz (2.9 MB view details)

Uploaded Feb 5, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

audiosr-0.0.7-py3-none-any.whl (2.9 MB view details)

Uploaded Feb 5, 2024 Python 3

File details

Details for the file audiosr-0.0.7.tar.gz.

File metadata

Download URL: audiosr-0.0.7.tar.gz
Upload date: Feb 5, 2024
Size: 2.9 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.18

File hashes

Hashes for audiosr-0.0.7.tar.gz
Algorithm	Hash digest
SHA256	`6e272a00f6dfbb4ae7b749b8cc6c90f58e0f4aaf2f96ae7b1e5f58ddb8c75803`
MD5	`39930c0f2f655fe5782219f6ae21d442`
BLAKE2b-256	`64e578b6385ae7156697a3d04cf7e8fbb894f11ab8a95ebf6dd70bd4f49307ee`

See more details on using hashes here.

File details

Details for the file audiosr-0.0.7-py3-none-any.whl.

File metadata

Download URL: audiosr-0.0.7-py3-none-any.whl
Upload date: Feb 5, 2024
Size: 2.9 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.18

File hashes

Hashes for audiosr-0.0.7-py3-none-any.whl
Algorithm	Hash digest
SHA256	`05794f995ab42458216eb302dd9768c1c1e535091ee54e9e09f69a60eaad96e2`
MD5	`5655d2dc5f26279defb604891a1b665d`
BLAKE2b-256	`7e12e53ffbe19d52d544ab7bcc4f1e0a691fd6ed5e4dc342ebc5a30fcbe519bf`

See more details on using hashes here.

audiosr 0.0.7

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

AudioSR: Versatile Audio Super-resolution at Scale

Change Log

Commandline Usage

Installation

Usage

TODO

Cite our work

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes