An open-source NLP framework that offers high-level wrappers designed for effortless launch, enhanced reproducibility, superior control, and unmatched flexibility for your experiments.

Project description

🚀 New Release Available! We're excited to announce a new version of our project! 🎉 Our README.md is currently being updated to reflect all the fantastic changes. In the meantime, please explore the release notes and dive in. We'd love to hear your feedback! ❤️

UrarTU 🦁

Welcome to UrarTU, your go-to NLP framework designed to simplify your work. UrarTU provides easy-to-use abstractions for common pipeline components, making your machine learning journey smoother. With a .yaml file-based configuration system and seamless slurm job submission capabilities on clusters, UrarTU removes the technical hassle so you can focus on making impactful NLP work! 🚀

urartu_schema drawio

Installation

Getting started with UrarTU is super easy! 🌀 Just run:

pip install urartu

Or, if you prefer to install directly from the source:

Clone the repository:

git clone git@github.com:tamohannes/urartu.git`

Navigate to the project directory:
```
cd urartu
```
Execute the magic command:
```
pip install -e .
```

And just like that, you're all set! ✨ Use the following command anywhere in your system to access UrarTU:

urartu --help

Getting started

To jump right in with UrarTU, start with our starter_template. You can copy this to begin your project or check out the steps in our Starter Template Setup. Your setup will mirror what's found in this directory.

Think of UrarTU as the foundational framework for your projects, similar to an abstract class in object-oriented programming (OOP). Your project acts as the concrete implementation, where UrarTU provides the foundational scaffolding. It includes high-level abstractions, configuration through .yaml files powered by Hydra, and slurm job management utilizing the Submitit library. This setup ensures your projects are both flexible and robust, making your machine learning workflow efficient and scalable. It also includes key NLP features such as dataset readers, model loaders, and device handlers.

Here's how to get started:

Extend UrarTU: Inherit the structure of UrarTU and customize it by writing your own actions and configurations, akin to implementing methods from an abstract class in OOP.
Utilize Core Functionalities: Jumpstart your project with pre-defined functionalities:
- Datasets:
  - Load a HF (Hugging Face) dataset from a dictionary, a file, or directly from the HF hub.
- Models:
  - Use a HF model as a causal language model or integrate it into a pipeline.
  - Incorporate the OpenAI API for advanced modeling.
Customize Further: Develop and place your own classes within the corresponding directories of your project to meet your specific needs.

By following these steps, you can efficiently set up and customize your machine learning projects with UrarTU.

Firing Up 🔥

Once you've cloned the starter_template, head over to that directory in your terminal:

cd starter_template

To launch a single run with predefined configurations, execute the following command:

urartu action_config=generate aim=aim slurm=slurm

If you're looking to perform multiple runs, simply use the --multirun flag. To configure multiple runs, add a sweeper at the end of your generate.yaml config file like this:

...

hydra:
  sweeper:
    params:
      action_config.task.model.generate.num_beams: 1,5,10

This setup initiates 3 separate runs, each utilizing different num_beams settings to adjust the model's behavior.

Then, start your multi-run session with the same command:

urartu action_config=generate aim=aim slurm=slurm

With these steps, you can effortlessly kickstart your machine learning experiments with UrarTU, whether for a single test or comprehensive multi-run analyses!

Navigating the UrarTU Architecture

Dive into the structured world of UrarTU, where managing NLP components becomes straightforward and intuitive.

Configs: Tailoring Your Setup

Set up your environment effortlessly with our configuration templates found in the urartu/config directory:

urartu/config/main.yaml: This primary configuration file lays the groundwork with default settings for all system keys.
urartu/config/action_config This space is dedicated to configurations specific to various actions.

Crafting Customizations

Configuring UrarTU to meet your specific needs is straightforward. You have two easy options:

Custom Config Files: Store your custom configuration files in the configs directory to adjust the settings. This directory aligns with urartu/config, allowing you to maintain project-specific settings in files like generate.yaml for your starter_template project.
- Personalized User Configs: For an even more tailored experience, create a configs_{username} directory at the same level as configs, replacing {username} with your system username. This setup automatically loads and overrides default settings without extra steps. ✨

Configuration files are prioritized in the following order: urartu/config, starter_template/configs, starter_template/configs_{username}, ensuring your custom settings take precedence.

CLI Approach: If you prefer using the command-line interface (CLI), UrarTU supports enhancing commands with key-value pairs directly in the CLI, such as:
```
urartu action_config=example action_config.experiment_name=NAME_OF_EXPERIMENT
```

Select the approach that best fits your workflow and enjoy the customizability that UrarTU offers.

Actions: Shaping Functionality

At the heart of UrarTU is the Action class, which orchestrates all operations. This script manages everything from CLI arguments to the execution of the main function based on the action_name parameter.

Logging: Capture Every Detail

UrarTU is equipped with a comprehensive logging system to ensure no detail of your project’s execution is missed. Here’s how it works:

Standard Runs: Every execution is meticulously logged and stored in a structured directory within your current working directory. The path format is: .runs/${action_name}/${now:%Y-%m-%d}_${now:%H-%M-%S}
Debug Mode: If the debug flag is enabled, logs are saved under: .runs/debug/${action_name}/${now:%Y-%m-%d}_${now:%H-%M-%S}
Multi-run Sessions: For runs involving multiple configurations or tests, logs are appended with a .runs/debug/${action_name}/${now:%Y-%m-%d}_${now:%H-%M-%S}_multirun suffix to differentiate them.

Each run directory is organized to contain essential files such as:

output.log: Captures all output from the run.
notes.md: Allows for manual annotations and observations.
cfg.yaml: Stores the configuration used for the run.

Additional files may be included depending on the type of run, ensuring you have all the data you need at your fingertips.

Effortless Launch

Launching with UrarTU is a breeze, offering you two launch options:

Local Marvel: Execute jobs right on your local machine.
Cluster Voyage: Set sail to the slurm cluster by toggling the slurm.use_slurm in config_{username}/slurm/slurm.yaml to switch between local and cluster executions.

Choose your adventure and launch your projects with ease! 🚀

Encountered any issues or have suggestions? Feel free to open an issue for support.

Exploring the Experiments

Unveil insights with ease using UrarTU in partnership with Aim, the intuitive and powerful open-source AI metadata tracker. To access a rich trove of metrics captured by Aim, simply:

Navigate to the directory containing the .aim repository.
Fire up the magic with:

aim up

Watch as Aim brings your experiments into sharp relief, providing the clarity needed to drive informed decisions and pioneering efforts in machine learning. 📈

Project details

Release history Release notifications | RSS feed

5.0.4

Feb 10, 2026

5.0.3

Jan 23, 2026

5.0.2

Jan 22, 2026

5.0.1

Dec 10, 2025

5.0.0

Nov 27, 2025

4.2.1

Nov 26, 2025

4.2.0

Nov 22, 2025

4.1.5

Nov 17, 2025

4.1.4

Nov 7, 2025

4.1.3

Oct 30, 2025

4.1.2

Oct 17, 2025

4.1.1

Oct 6, 2025

4.1.0

Oct 6, 2025

4.0.5

Oct 2, 2025

4.0.4

Oct 2, 2025

4.0.3

Aug 28, 2025

4.0.2

Aug 25, 2025

4.0.1

Aug 22, 2025

4.0.0

Aug 22, 2025

3.2.4

Jul 9, 2025

3.2.3

May 13, 2025

3.2.2

May 12, 2025

3.2.1

Apr 28, 2025

3.1.15

Apr 23, 2025

3.1.14

Apr 23, 2025

3.1.13

Apr 23, 2025

3.1.12

Apr 22, 2025

3.1.11

Apr 22, 2025

3.1.10

Apr 16, 2025

3.1.9

Mar 19, 2025

3.1.8

Mar 19, 2025

3.1.7

Mar 16, 2025

3.1.6

Feb 26, 2025

3.1.5

Feb 21, 2025

3.1.4

Feb 21, 2025

3.1.3

Feb 18, 2025

3.1.2

Feb 18, 2025

3.1.1

Feb 16, 2025

This version

3.1.0

Feb 16, 2025

3.0.11

Feb 16, 2025

3.0.10

Feb 12, 2025

3.0.9

Feb 12, 2025

3.0.8

Feb 11, 2025

3.0.7

Feb 10, 2025

3.0.6

Feb 9, 2025

3.0.5

Feb 3, 2025

3.0.4

Feb 2, 2025

3.0.3

Feb 2, 2025

3.0.2

Jan 27, 2025

3.0.1

Jan 27, 2025

3.0.0

Jan 21, 2025

2.0.28

Jan 21, 2025

2.0.27

Aug 19, 2024

2.0.26

Jul 24, 2024

2.0.25

Jul 24, 2024

2.0.24

Jul 18, 2024

2.0.23

Jul 18, 2024

2.0.22

Jul 15, 2024

2.0.21

Jul 15, 2024

2.0.20

Jul 12, 2024

2.0.19

Jul 12, 2024

2.0.18

Jul 10, 2024

2.0.17

Jul 3, 2024

2.0.16

Jul 2, 2024

2.0.15

Jul 2, 2024

2.0.14

Jul 2, 2024

2.0.13

Jun 27, 2024

2.0.12

Jun 27, 2024

2.0.11

Jun 27, 2024

2.0.10

Jun 27, 2024

2.0.9

Jun 27, 2024

2.0.8

Jun 27, 2024

2.0.7

Jun 27, 2024

2.0.6

Jun 27, 2024

2.0.5

Jun 27, 2024

2.0.4

Jun 27, 2024

2.0.3

Jun 26, 2024

2.0.2

Jun 26, 2024

2.0.1

Jun 26, 2024

2.0

Jun 26, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

urartu-3.1.0.tar.gz (27.5 kB view details)

Uploaded Feb 16, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

urartu-3.1.0-py3-none-any.whl (32.7 kB view details)

Uploaded Feb 16, 2025 Python 3

File details

Details for the file urartu-3.1.0.tar.gz.

File metadata

Download URL: urartu-3.1.0.tar.gz
Upload date: Feb 16, 2025
Size: 27.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for urartu-3.1.0.tar.gz
Algorithm	Hash digest
SHA256	`94a95c1249f29451e499536fdf344075f59d1cd7e7a64cf8f0bfd5c7ce7039ce`
MD5	`df046a55d8f1b0bf6529d7f9d4761297`
BLAKE2b-256	`3c05b3cf5159dd237867146081dd72d4a095cc8fc99a54e7cbf564a18dd62b98`

See more details on using hashes here.

File details

Details for the file urartu-3.1.0-py3-none-any.whl.

File metadata

Download URL: urartu-3.1.0-py3-none-any.whl
Upload date: Feb 16, 2025
Size: 32.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for urartu-3.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c0ed5bcb0e72289c03e340f4577306863e80fcf0b7099ecebfe72ace87d93edf`
MD5	`45169e371a795d351d6ea9aa19f13d9a`
BLAKE2b-256	`23337bab993036dc45164dc184b90423e068b42acea0d631715e78016f6ff3b9`

See more details on using hashes here.

urartu 3.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

UrarTU 🦁

Installation

Getting started

Firing Up 🔥

Navigating the UrarTU Architecture

Configs: Tailoring Your Setup

Crafting Customizations

Actions: Shaping Functionality

Logging: Capture Every Detail

Effortless Launch

Exploring the Experiments

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes