A simple data ingestion library to guide data flows from some places to other places.
Project description
Viadot
Documentation: https://viadot.docs.dyvenia.com
Source Code: https://github.com/dyvenia/viadot/tree/2.0
A simple data ingestion library to guide data flows from some places to other places.
Getting Data from a Source
Viadot supports several API and RDBMS sources, private and public. Currently, we support the UK Carbon Intensity public API and base the examples on it.
from viadot.sources.uk_carbon_intensity import UKCarbonIntensity
ukci = UKCarbonIntensity()
ukci.query("/intensity")
df = ukci.to_df()
print(df)
Output:
from | to | forecast | actual | index | |
---|---|---|---|---|---|
0 | 2021-08-10T11:00Z | 2021-08-10T11:30Z | 211 | 216 | moderate |
The above df
is a pandas DataFrame
object. It contains data downloaded by viadot
from the Carbon Intensity UK API.
Loading data to a destination
Depending on the destination, viadot
provides different methods of uploading data. For instance, for databases, this would be bulk inserts. For data lakes, it would be file uploads.
For example:
from viadot.sources import UKCarbonIntensity
from viadot.sources import AzureDataLake
ukci = UKCarbonIntensity()
ukci.query("/intensity")
df = ukci.to_df()
adls = AzureDataLake(config_key="my_adls_creds")
adls.from_df(df, "my_folder/my_file.parquet")
Getting started
Prerequisites
We use Rye. You can install it like so:
curl -sSf https://rye-up.com/get | bash
Installation
pip install viadot2
Configuration
In order to start using sources, you must configure them with required credentials. Credentials can be specified either in the viadot config file (by default, $HOME/.config/viadot/config.yaml
), or passed directly to each source's credentials
parameter.
You can find specific information about each source's credentials in the documentation.
Next steps
Check out the documentation for more information on how to use viadot
.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file viadot2-2.1.16.tar.gz
.
File metadata
- Download URL: viadot2-2.1.16.tar.gz
- Upload date:
- Size: 288.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 436b23db164555a36e7b1d78180448e7261b99e5f7356590c461e315f1e0fa31 |
|
MD5 | 67679ffb6fb0c90198c04e557ec80115 |
|
BLAKE2b-256 | 65333175d23dcaa3a6c68076aacbba2ad4e78e43fabb64dbe4d38f6f128e7b32 |
File details
Details for the file viadot2-2.1.16-py3-none-any.whl
.
File metadata
- Download URL: viadot2-2.1.16-py3-none-any.whl
- Upload date:
- Size: 149.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a5b176a231bb548986d816d612cef72d14693fb3919f426d17a8dba3173a6634 |
|
MD5 | 0039ef4f359d7af673bcbb9b35f82c67 |
|
BLAKE2b-256 | e42cdc419bf9846b6bf4915882aa54d35c02543cf050c825f2ebc9e7fedc0ee5 |