The script reads a JSON file (or folder) and generates an ELAN file to match.

JSON to ELAN

The script looks in a folder and generates a matching ELAN file for each JSON file it finds.

JSON format

It was written for the JSON output of Hugging Face ASR pipelines. Here is an example of the expected JSON format.

[
    {
        "text": "luanghan",
        "timestamp":
        [
            1.16,
            1.48
        ]
    },
    {
        "text": "ian",
        "timestamp":
        [
            1.56,
            1.7
        ]
    }
]
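
The list above has the same shape as the chunks list that a Hugging Face transformers ASR pipeline returns when timestamps are requested. Here is a minimal sketch of saving such a list to a JSON file, assuming the pipeline result is a dict with a "chunks" key. The helper name save_chunks and the file name example.json are hypothetical and not part of json-to-elan.

```python
import json
from pathlib import Path


def save_chunks(result, json_path):
    """Write the timestamped chunks of an ASR result to a JSON file.

    Assumes `result` looks like:
    {"text": ..., "chunks": [{"text": ..., "timestamp": (start, end)}, ...]}
    """
    # Keep only the two keys shown in the expected format; tuples become lists.
    chunks = [
        {"text": chunk["text"], "timestamp": list(chunk["timestamp"])}
        for chunk in result["chunks"]
    ]
    json_path = Path(json_path)
    json_path.parent.mkdir(parents=True, exist_ok=True)
    json_path.write_text(
        json.dumps(chunks, indent=4, ensure_ascii=False), encoding="utf-8"
    )
    return chunks


# Hand-written stand-in for a pipeline result, using the values from the example.
result = {
    "text": "luanghan ian",
    "chunks": [
        {"text": "luanghan", "timestamp": (1.16, 1.48)},
        {"text": "ian", "timestamp": (1.56, 1.7)},
    ],
}
save_chunks(result, "content/example.json")
```

Writing one JSON file per recording, named after the recording, keeps the JSON and media names aligned.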

Basic usage

Put your JSON files somewhere easily accessible, e.g. in a folder inside your working directory (the example below uses a folder named content). Install the package with pip, then call make_elan with the path to that folder.

pip install json-to-elan

from json_to_elan import make_elan
make_elan(data_dir="content")

Using this in Colab?

To use this in Google Colab, upload your JSON files into the File browser. Then define the data directory as:

data_dir="/content"
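
Before running the conversion, it can help to confirm that the uploads landed where you expect. This is a small sketch using only the standard library; the helper name list_json_files is hypothetical, and it only looks at the top level of the folder (whether json-to-elan also searches subfolders is not stated here).

```python
from pathlib import Path


def list_json_files(data_dir):
    """Return the names of the .json files directly inside data_dir, sorted."""
    root = Path(data_dir)
    if not root.is_dir():
        # A missing folder is reported as "no files" rather than an error.
        return []
    return sorted(p.name for p in root.glob("*.json"))


print(list_json_files("/content"))
```

An empty list means the folder is missing or holds no JSON files.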

Options

You can set a tier name other than the default ("default") with tier_name.

The ELAN file is written with a linked media file. By default, the media file is assumed to be a WAV with the same name as the JSON file. To link an MP3 instead, set audio_format.

Here's an example:

make_elan(data_dir="content", tier_name="Words", audio_format="mp3")
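
Because the linked media name is derived from the JSON file name, a quick check for missing audio can save a broken link in ELAN later. This is a rough sketch, not part of json-to-elan: the helper name missing_media is hypothetical, and it assumes the audio sits in the same folder as the JSON files, which the description above does not state.

```python
from pathlib import Path


def missing_media(data_dir, audio_format="wav"):
    """Return the expected media file names that are absent from data_dir."""
    root = Path(data_dir)
    if not root.is_dir():
        return []
    missing = []
    for json_path in sorted(root.glob("*.json")):
        # Same base name as the JSON file, with the audio extension instead.
        media_path = json_path.with_suffix("." + audio_format)
        if not media_path.exists():
            missing.append(media_path.name)
    return missing


print(missing_media("content", audio_format="mp3"))
```

Any name in the returned list is a media file that the assumed naming scheme expects but that is not present.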

Download files

Source Distribution

json-to-elan-0.1.17.tar.gz (3.1 kB)

Built Distribution

json_to_elan-0.1.17-py3-none-any.whl (3.8 kB, Python 3)
