
A simple data ingestion library to guide data flows from some places to other places.


Viadot



Documentation: https://viadot.docs.dyvenia.com

Source Code: https://github.com/dyvenia/viadot/tree/2.0


Getting data from a source

Viadot supports several API and RDBMS sources, both private and public. The examples below use the public UK Carbon Intensity API.

from viadot.sources.uk_carbon_intensity import UKCarbonIntensity

ukci = UKCarbonIntensity()
ukci.query("/intensity")
df = ukci.to_df()

print(df)

Output:

                from                 to  forecast  actual     index
0  2021-08-10T11:00Z  2021-08-10T11:30Z       211     216  moderate

Here, df is a pandas DataFrame containing the data that viadot downloaded from the UK Carbon Intensity API.
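To show where the columns in the output come from, here is a rough sketch of flattening an `/intensity`-style JSON response into rows. It uses only the standard library and does not call the API. The `sample_response` payload is hand-written to mirror the output shown earlier, and `flatten_intensity` is a hypothetical helper, not part of viadot, whose internal implementation may differ.

```python
# Hand-written payload shaped like an /intensity response (illustrative only;
# the values mirror the example output, they were not fetched from the API).
sample_response = {
    "data": [
        {
            "from": "2021-08-10T11:00Z",
            "to": "2021-08-10T11:30Z",
            "intensity": {"forecast": 211, "actual": 216, "index": "moderate"},
        }
    ]
}


def flatten_intensity(response: dict) -> list[dict]:
    """Hypothetical helper: lift the nested 'intensity' fields to top-level columns."""
    rows = []
    for record in response["data"]:
        row = {"from": record["from"], "to": record["to"]}
        row.update(record["intensity"])  # adds forecast, actual, index
        rows.append(row)
    return rows


rows = flatten_intensity(sample_response)
print(rows[0])
```

A list of dictionaries like `rows` can be passed to `pandas.DataFrame(rows)` to get a frame with the same five columns.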

Loading data to a destination

Depending on the destination, viadot provides different methods of uploading data. For instance, for databases, this would be bulk inserts. For data lakes, it would be file uploads.

For example:

from viadot.sources import UKCarbonIntensity
from viadot.sources import AzureDataLake

ukci = UKCarbonIntensity()
ukci.query("/intensity")
df = ukci.to_df()

adls = AzureDataLake(config_key="my_adls_creds")
adls.from_df(df, "my_folder/my_file.parquet")
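The difference between the two loading styles mentioned above (bulk inserts for databases, file uploads for data lakes) can be sketched without any viadot code. The example below is only an analogy built on the standard library: an in-memory SQLite table stands in for a database destination, and a temporary directory stands in for a data lake. The table name, the CSV format, and the paths are assumptions made for illustration.

```python
import csv
import sqlite3
import tempfile
from pathlib import Path

# One sample row, matching the columns of the DataFrame in the earlier output.
rows = [("2021-08-10T11:00Z", "2021-08-10T11:30Z", 211, 216, "moderate")]

# Database-style destination: send all rows in a single bulk insert.
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE intensity '
    '("from" TEXT, "to" TEXT, forecast INTEGER, actual INTEGER, "index" TEXT)'
)
conn.executemany("INSERT INTO intensity VALUES (?, ?, ?, ?, ?)", rows)
conn.commit()
inserted = conn.execute("SELECT COUNT(*) FROM intensity").fetchone()[0]

# Lake-style destination: serialize the rows to a file at a target path.
lake_root = Path(tempfile.mkdtemp())  # stand-in for a data lake root
target = lake_root / "my_folder" / "my_file.csv"
target.parent.mkdir(parents=True, exist_ok=True)
with target.open("w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["from", "to", "forecast", "actual", "index"])
    writer.writerows(rows)

print(inserted, target.name)
```

In viadot, the destination class hides these mechanics behind a method such as `from_df`, as in the `AzureDataLake` example.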

Getting started

Prerequisites

We use Rye. You can install it like so:

curl -sSf https://rye-up.com/get | bash

Installation

pip install viadot2

Configuration

Before you can use a source, you must configure it with the required credentials. You can either specify credentials in the viadot config file (by default, $HOME/.config/viadot/config.yaml) or pass them directly to the source's credentials parameter.

You can find specific information about each source's credentials in the documentation.
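As a quick sanity check before relying on a `config_key`, you may want to confirm that the default config file exists. The sketch below builds the default path stated above and tests for the file. `default_viadot_config_path` is a hypothetical helper written for this example, not a viadot function, and it does not read or validate the file's contents.

```python
from pathlib import Path


def default_viadot_config_path() -> Path:
    """Hypothetical helper: the default location $HOME/.config/viadot/config.yaml."""
    return Path.home() / ".config" / "viadot" / "config.yaml"


config_path = default_viadot_config_path()
if config_path.is_file():
    print(f"Found viadot config at {config_path}")
else:
    print(f"No config file at {config_path}; pass credentials directly to the source")
```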

Next steps

Check out the documentation for more information on how to use viadot.
