The Argilla python server SDK

Project description

Argilla

Work on data together, make your model outputs better!

Codecov

Argilla is a collaboration tool for AI engineers and domain experts who need to build high-quality datasets for their projects. If you just want to get started, we recommend our UI demo or our free Hugging Face Spaces deployment integration. Curious, and want to know more? Read our documentation.

Why use Argilla?

Whether you are working on monitoring and improving complex generative tasks involving LLM pipelines with RAG, or you are working on a predictive task for things like AB-testing of span- and text-classification models. Our versatile platform helps you ensure your data work pays off.

Improve your AI output quality through data quality

Compute is expensive and output quality is important. We help you focus on data, which tackles the root cause of both of these problems at once. Argilla helps you to achieve and keep high-quality standards for your data. This means you can improve the quality of your AI output.

Take control of your data and models

Most AI tools are black boxes. Argilla is different. We believe that you should be the owner of both your data and your models. That's why we provide you with all the tools your team needs to manage your data and models in a way that suits you best.

Improve efficiency by quickly iterating on the right data and models

Gathering data is a time-consuming process. Argilla helps by providing a platform that allows you to interact with your data in a more engaging way. This means you can quickly and easily label your data with filters, AI feedback suggestions and semantic search. So you can focus on training your models and monitoring their performance.

🏘️ Community

We are an open-source community-driven project and we love to hear from you. Here are some ways to get involved:

Community Meetup: listen in or present during one of our bi-weekly events.
Discord: get direct support from the community in #argilla-distilabel-general and #argilla-distilabel-help.
Roadmap: plans change but we love to discuss those with our community so feel encouraged to participate.

What do people build with Argilla?

Open-source datasets and models

The community uses Argilla to create amazing open-source datasets and models.

Cleaned UltraFeedback dataset used to fine-tune the Notus and Notux models. The original UltraFeedback dataset was curated using Argilla UI filters to find and report a bug in the original data generation code. Based on this data curation process, Argilla built this new version of the UltraFeedback dataset and fine-tuned Notus, outperforming Zephyr on several benchmarks.
distilabeled Intel Orca DPO dataset used to fine-tune the improved OpenHermes model. This dataset was built by combining human curation in Argilla with AI feedback from distilabel, leading to an improved version of the Intel Orca dataset and outperforming models fine-tuned on the original dataset.

Examples Use cases

AI teams from companies like the Red Cross, Loris.ai and Prolific use Argilla to improve the quality and efficiency of AI projects. They shared their experiences in our AI community meetup.

AI for good: the Red Cross presentation showcases how the Red Cross domain experts and AI team collaborated by classifying and redirecting requests from refugees of the Ukrainian crisis to streamline the support processes of the Red Cross.
Customer support: during the Loris meetup they showed how their AI team uses unsupervised and few-shot contrastive learning to help them quickly validate and gain labeled samples for a huge amount of multi-label classifiers.
Research studies: the showcase from Prolific announced their integration with our platform. They use it to actively distribute data collection projects among their annotating workforce. This allows Prolific to quickly and efficiently collect high-quality data for research studies.

👨‍💻 Getting started

Installation

First things first! You can install the SDK with pip as follows:

pip install argilla

After that, you will need to deploy Argilla Server. The easiest way to do this is through our free Hugging Face Spaces deployment integration.

To use the client, you need to import the Argilla class and instantiate it with the API URL and API key.

import argilla as rg

client = rg.Argilla(api_url="https://[your-owner-name]-[your_space_name].hf.space", api_key="owner.apikey")

Create your first dataset

We can now create a dataset with a simple text classification task. First, you need to define the dataset settings.

settings = rg.Settings(
    guidelines="Classify the reviews as positive or negative.",
    fields=[
        rg.TextField(
            name="review",
            title="Text from the review",
            use_markdown=False,
        ),
    ],
    questions=[
        rg.LabelQuestion(
            name="my_label",
            title="In which category does this article fit?",
            labels=["positive", "negative"],
        )
    ],
)
dataset = rg.Dataset(
    name=f"my_first_dataset",
    settings=settings,
    client=client,
)
dataset.create()

Next, we can add records to the dataset.

pip install datasets

from datasets import load_dataset

data = load_dataset("imdb", split="train[:100]").to_list()
dataset.records.log(records=data, mapping={"text": "review"})

🎉 You have successfully created your first dataset with Argilla. You can now access it in the Argilla UI and start annotating the records. Need more info, check out our docs.

🥇 Contributors

To help our community with the creation of contributions, we have created our community docs. Additionally, you can always schedule a meeting with our Developer Advocacy team so they can get you up to speed.

Project details

Release history Release notifications | RSS feed

This version

2.8.0

Mar 10, 2025

2.7.0

Jan 21, 2025

2.6.0

Dec 18, 2024

2.5.0

Nov 29, 2024

2.4.0

Oct 30, 2024

2.3.0

Oct 3, 2024

2.2.2

Sep 25, 2024

2.2.1

Sep 23, 2024

2.2.0

Sep 19, 2024

2.1.0

Sep 5, 2024

2.0.1

Aug 13, 2024

2.0.0

Jul 30, 2024

2.0.0rc2 pre-release

Jul 5, 2024

2.0.0rc1 pre-release

Jun 21, 2024

2.0.0a0 pre-release

Jun 17, 2024

1.29.1

Jul 22, 2024

1.29.0

May 30, 2024

1.28.0

May 9, 2024

1.27.0

Apr 18, 2024

1.26.1

Mar 27, 2024

1.26.0

Mar 22, 2024

1.25.0

Feb 29, 2024

1.24.0

Feb 9, 2024

1.23.1

Feb 8, 2024

1.23.0

Feb 2, 2024

1.22.0

Jan 18, 2024

1.21.0

Dec 21, 2023

1.20.0

Nov 30, 2023

1.19.0

Nov 13, 2023

1.18.0

Oct 25, 2023

1.17.0

Oct 19, 2023

1.16.0

Sep 18, 2023

1.15.1

Sep 8, 2023

1.15.0

Aug 31, 2023

1.14.1

Aug 16, 2023

1.14.0

Aug 11, 2023

1.13.3

Jul 27, 2023

1.13.2

Jul 24, 2023

1.13.1

Jul 21, 2023

1.13.0

Jul 21, 2023

1.12.1

Jul 12, 2023

1.12.0

Jun 29, 2023

1.11.0

Jun 22, 2023

1.10.0

Jun 16, 2023

1.9.0

Jun 9, 2023

1.8.0

May 31, 2023

1.7.0

May 10, 2023

1.6.0

Apr 9, 2023

1.5.1

Mar 31, 2023

1.5.0

Mar 22, 2023

1.4.1

Mar 30, 2023

1.4.0

Mar 9, 2023

1.3.2

Mar 30, 2023

1.3.1

Feb 24, 2023

1.3.0

Feb 9, 2023

1.2.2

Mar 30, 2023

1.2.1

Jan 23, 2023

1.2.0

Jan 12, 2023

1.1.1

Nov 29, 2022

1.1.0

Nov 24, 2022

1.0.1

Nov 4, 2022

1.0.0

Oct 24, 2022

1.0.0a3 pre-release

Oct 24, 2022

1.0.0a2 pre-release

Oct 14, 2022

1.0.0a1 pre-release

Oct 13, 2022

0.0.1

Oct 6, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

argilla-2.8.0.tar.gz (124.0 kB view details)

Uploaded Mar 10, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

argilla-2.8.0-py3-none-any.whl (161.3 kB view details)

Uploaded Mar 10, 2025 Python 3

File details

Details for the file argilla-2.8.0.tar.gz.

File metadata

Download URL: argilla-2.8.0.tar.gz
Upload date: Mar 10, 2025
Size: 124.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: pdm/2.22.4 CPython/3.13.2 Linux/6.8.0-1021-azure

File hashes

Hashes for argilla-2.8.0.tar.gz
Algorithm	Hash digest
SHA256	`4364d58d8bfd51880efefc6ab08eb17983ffbede3807420536f4bbd6662afa37`
MD5	`975b90042111ffac0597b47e8e9618f3`
BLAKE2b-256	`e853b9529fcb586f1d826f05688a59144e33573aa1b6bf6b3a00d5eefe922d31`

See more details on using hashes here.

File details

Details for the file argilla-2.8.0-py3-none-any.whl.

File metadata

Download URL: argilla-2.8.0-py3-none-any.whl
Upload date: Mar 10, 2025
Size: 161.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: pdm/2.22.4 CPython/3.13.2 Linux/6.8.0-1021-azure

File hashes

Hashes for argilla-2.8.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d78264a92c8a7137232ea26042d68aec56bce96c47b26a079ee218182bf89cbb`
MD5	`9a43be0e23f8c9da38e069b93516a01e`
BLAKE2b-256	`1a1aa38a528c8d5c53dd954b1699dda5efe1b6b38ee2000bed38e46e9bf0b102`

See more details on using hashes here.

argilla 2.8.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Argilla

Work on data together, make your model outputs better!

Why use Argilla?

Improve your AI output quality through data quality

Take control of your data and models

Improve efficiency by quickly iterating on the right data and models

🏘️ Community

What do people build with Argilla?

Open-source datasets and models

Examples Use cases

👨‍💻 Getting started

Installation

Create your first dataset

🥇 Contributors

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes