A simple data ingestion library to guide data flows from some places to other places.

Viadot

Documentation: https://viadot.docs.dyvenia.com

Source Code: https://github.com/dyvenia/viadot/tree/2.0


Getting data from a source

Viadot supports several API and RDBMS sources, both private and public. Among public APIs, we currently support the UK Carbon Intensity API, and the examples below are based on it.

from viadot.sources.uk_carbon_intensity import UKCarbonIntensity

# Query the API, then convert the response to a pandas DataFrame.
ukci = UKCarbonIntensity()
ukci.query("/intensity")
df = ukci.to_df()

print(df)

Output:

                from                 to  forecast  actual     index
0  2021-08-10T11:00Z  2021-08-10T11:30Z       211     216  moderate

The df above is a pandas DataFrame object. It contains the data that viadot downloaded from the UK Carbon Intensity API.
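To make the column layout above concrete, here is a minimal sketch of how a response from the /intensity endpoint could be flattened into such rows. It uses only the standard library. The nesting of the payload is an assumption about the public API's JSON format, the values are copied from the output above, and flatten_intensity is a hypothetical helper, not part of viadot.

```python
import json

# Sample payload. The nesting is an assumption about the API's JSON format;
# the values are copied from the output shown above.
SAMPLE_RESPONSE = """
{
  "data": [
    {
      "from": "2021-08-10T11:00Z",
      "to": "2021-08-10T11:30Z",
      "intensity": {"forecast": 211, "actual": 216, "index": "moderate"}
    }
  ]
}
"""


def flatten_intensity(payload: dict) -> list[dict]:
    """Turn each nested record into one flat row (hypothetical helper)."""
    rows = []
    for record in payload["data"]:
        # Keep the time window, then lift the nested intensity fields
        # to the top level so each record becomes a single flat row.
        row = {"from": record["from"], "to": record["to"]}
        row.update(record["intensity"])
        rows.append(row)
    return rows


rows = flatten_intensity(json.loads(SAMPLE_RESPONSE))
print(rows)
```

Passing such rows to pandas.DataFrame(rows) would give a frame with the same columns as the one printed above.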

Loading data to a destination

Viadot uploads data differently depending on the destination: bulk inserts for databases, for instance, and file uploads for data lakes.

For example:

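from viadot.sources import AzureDataLake, UKCarbonIntensity

# Extract: query the API and convert the response to a pandas DataFrame.
ukci = UKCarbonIntensity()
ukci.query("/intensity")
df = ukci.to_df()

# Load: upload the DataFrame to the data lake as a Parquet file.
# The credentials are looked up in the viadot config under "my_adls_creds".
adls = AzureDataLake(config_key="my_adls_creds")
adls.from_df(df, "my_folder/my_file.parquet")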
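The extract-and-load pattern above can be wrapped in a small helper. The sketch below is an illustration only: copy_data and the two stand-in classes are hypothetical and exist so the example runs without viadot, pandas, or credentials. It assumes only what the example above shows, namely that a source exposes to_df() and a destination exposes from_df(df, path).

```python
class FakeSource:
    """Stand-in for a viadot source (hypothetical, for illustration)."""

    def to_df(self):
        # A real source would return a pandas DataFrame; a list of rows
        # keeps this sketch dependency-free.
        return [{"forecast": 211, "actual": 216, "index": "moderate"}]


class FakeDestination:
    """Stand-in for a viadot destination; stores uploads in memory."""

    def __init__(self):
        self.files = {}

    def from_df(self, df, path):
        # Record the data under the target path instead of uploading it.
        self.files[path] = df


def copy_data(source, destination, path):
    """Move data from a source to a destination (hypothetical helper)."""
    df = source.to_df()
    destination.from_df(df, path)
    return df


destination = FakeDestination()
copy_data(FakeSource(), destination, "my_folder/my_file.parquet")
print(sorted(destination.files))
```

With viadot installed and configured, the same helper should work with the ukci object (after calling ukci.query("/intensity")) and the adls object from the example above.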

Getting started

Prerequisites

We use Rye. You can install it like so:

curl -sSf https://rye-up.com/get | bash

Installation

pip install viadot2

Configuration

Before you can use a source, you must configure it with the required credentials. You can specify credentials either in the viadot config file (by default, $HOME/.config/viadot/config.yaml) or pass them directly to the source's credentials parameter.

You can find specific information about each source's credentials in the documentation.
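Before relying on the config file, it can help to check that it exists at the default location. The following standard-library sketch only resolves the default path quoted above and reports whether a file is there. default_config_path is a hypothetical helper, not a viadot function, and it does not read or validate the file's contents.

```python
from pathlib import Path


def default_config_path() -> Path:
    """Return the default viadot config location (hypothetical helper)."""
    # Mirrors the default quoted above: $HOME/.config/viadot/config.yaml
    return Path.home() / ".config" / "viadot" / "config.yaml"


config_path = default_config_path()
if config_path.exists():
    print(f"Found viadot config at {config_path}")
else:
    print(f"No config at {config_path}; pass credentials to the source directly")
```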

Next steps

Check out the documentation for more information on how to use viadot.
