A Toloka provider for Apache Airflow
Project description
Airflow Toloka Provider
This library allows you to run crowdsourcing Toloka processes in Apache Airflow - a widely used workflow management system
Here you can find a collection of ready-made Airflow tasks for the most frequently used actions in Toloka-Kit.
Getting started
$ pip install airflow-provider-toloka
A good way to start is to follow the example in this repo.
TolokaHook
TolokaHook is used for getting toloka OAuth token and creating TolokaClient with it.
You can get TolokaClient from TolokaHook by calling get_conn() method.
To make an appropriate Airflow Connection you need to create it in the Airflow Connections UI with following parameters:
Conn ID:toloka_defaultConn Type:TolokaToken: enter your OAuth token for Toloka. You can learn more about how to get it here.Environment: enterproductionorsandbox
Tasks use the toloka_default connection id by default, but
if needed, you can create additional Airflow Connections and reference them
as the function toloka_conn_id argument.
Tasks and Sensors
There are several tasks and sensors that give you easy way to interact with Toloka from Airflow DAGs.
Creating a project and a pool, adding tasks and getting assignments are among them.
You can easily create your own task using TolokaHook if it is beyond the scope of implemented ones.
And it would be nice to have your pull request with updates.
Check out our example to see tasks and sensors in the battlefield.
Useful Links
Questions and bug reports
- For reporting bugs please use the Toloka/bugreport page.
- Join our English-speaking slack community for both tech and abstract questions.
License
© YANDEX LLC, 2022. Licensed under the Apache License, Version 2.0. See LICENSE file for more details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file airflow-provider-toloka-0.0.8.tar.gz.
File metadata
- Download URL: airflow-provider-toloka-0.0.8.tar.gz
- Upload date:
- Size: 9.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.8.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a22e5c159fe7f4411f5ed358e2a9c3acb930ceeb63d0c3955c1199d3603a2b4b
|
|
| MD5 |
a13cdea4fb3efc167ca291ec964e29b8
|
|
| BLAKE2b-256 |
66c22aa5469803870b07c8bcd80047eab12e64967e9cf6e3cfe21c361b8012f8
|
File details
Details for the file airflow_provider_toloka-0.0.8-py3-none-any.whl.
File metadata
- Download URL: airflow_provider_toloka-0.0.8-py3-none-any.whl
- Upload date:
- Size: 10.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.8.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c9337eb7d991b77fb101bc25ce1025d3619492ef1a0418ece579eaf35c077d0e
|
|
| MD5 |
7ec0251319d01d536483c0ce7d30719f
|
|
| BLAKE2b-256 |
66a8ad09b84ad78b08b5c886272eaef48bc55340f82a9921cd025f4da8284a82
|