AWS Data Wrangler

Pandas on AWS

An AWS Professional Service open source initiative

Source    Installation Command
PyPI      pip install awswrangler
Conda     conda install -c conda-forge awswrangler

Quick Start

Installation command: pip install awswrangler

import awswrangler as wr
import pandas as pd

df = pd.DataFrame({"id": [1, 2], "value": ["foo", "boo"]})

# Store the data in the data lake (Amazon S3) and register the table in the Glue Catalog
wr.s3.to_parquet(
    df=df,
    path="s3://bucket/dataset/",
    dataset=True,
    database="my_db",
    table="my_table"
)

# Retrieving the data directly from Amazon S3
df = wr.s3.read_parquet("s3://bucket/dataset/", dataset=True)

# Retrieving the data from Amazon Athena
df = wr.athena.read_sql_query("SELECT * FROM my_table", database="my_db")

# Get a Redshift connection (SQLAlchemy) from the Glue Catalog and retrieve data from Redshift Spectrum
engine = wr.catalog.get_engine("my-redshift-connection")
df = wr.db.read_sql_query("SELECT * FROM external_schema.my_table", con=engine)

# Get a MySQL connection (SQLAlchemy) from the Glue Catalog and load the data into MySQL
engine = wr.catalog.get_engine("my-mysql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")

# Get a PostgreSQL connection (SQLAlchemy) from the Glue Catalog and load the data into PostgreSQL
engine = wr.catalog.get_engine("my-postgresql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")

Read The Docs

Who uses AWS Data Wrangler?

Knowing which companies are using this library is important to help prioritize the project internally.

If you can, please send a PR with your company name and @githubhandle.

  1. Digio [@afonsomy]
  2. Pier [@flaviomax]
  3. M4U [@Thiago-Dantas]
  4. Serasa Experian [@andre-marcos-perez]

This version

1.8.0

Download files

Source Distribution

awswrangler-1.8.0.tar.gz (99.7 kB)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

awswrangler-1.8.0-py3.6.egg (266.3 kB)

Uploaded Egg

awswrangler-1.8.0-py3-none-any.whl (125.0 kB)

Uploaded Python 3
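
The wheel file name listed above follows the standard wheel naming scheme: distribution, version, an optional build tag, then Python, ABI, and platform tags. As a rough illustration, the sketch below splits such a name into its parts. The `WheelName` type and `parse_wheel_name` helper are hypothetical names introduced here, not part of awswrangler or pip, and the parsing is deliberately simplified.

```python
from typing import NamedTuple, Optional


class WheelName(NamedTuple):
    """Components of a wheel file name (hypothetical helper type)."""
    distribution: str
    version: str
    build: Optional[str]
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_name(filename: str) -> WheelName:
    """Split a wheel file name into its dash-separated components."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        # No build tag present.
        dist, version, py, abi, plat = parts
        build = None
    elif len(parts) == 6:
        # Optional build tag sits between version and Python tag.
        dist, version, build, py, abi, plat = parts
    else:
        raise ValueError(f"unexpected wheel file name: {filename!r}")
    return WheelName(dist, version, build, py, abi, plat)


if __name__ == "__main__":
    print(parse_wheel_name("awswrangler-1.8.0-py3-none-any.whl"))
```

For this release the tags are `py3`, `none`, and `any`: a pure-Python wheel for Python 3 with no ABI or platform restriction.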

File details

Details for the file awswrangler-1.8.0.tar.gz.

File metadata

  • Download URL: awswrangler-1.8.0.tar.gz
  • Size: 99.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.11

File hashes

Hashes for awswrangler-1.8.0.tar.gz
Algorithm Hash digest
SHA256 2f873150621398ba33b620ac635f13f308ab9e2899a3d9bf3857835ac33d9aba
MD5 4f19080e4b1483dafed1b2acd3173128
BLAKE2b-256 09e9efa29a096d3c85361cff155dc03c163bb25981f267ac27e50d8fac4a80e5
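
The digests above can be used to check that a downloaded file is intact. The following is a minimal sketch using Python's standard `hashlib`. It assumes the source archive was saved in the current directory under its published name, and the helper names `sha256_of` and `matches_expected` are illustrative, not part of any library.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for awswrangler-1.8.0.tar.gz
EXPECTED_SHA256 = "2f873150621398ba33b620ac635f13f308ab9e2899a3d9bf3857835ac33d9aba"


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Compare a file's SHA256 digest with the expected hex string."""
    return sha256_of(path) == expected.lower()


if __name__ == "__main__":
    archive = Path("awswrangler-1.8.0.tar.gz")  # assumed download location
    if archive.exists():
        print("OK" if matches_expected(archive) else "MISMATCH")
    else:
        print(f"{archive} not found")
```

pip can also enforce hashes at install time: a requirements file line such as `awswrangler==1.8.0 --hash=sha256:<digest>` makes pip reject any file whose digest differs.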

File details

Details for the file awswrangler-1.8.0-py3.6.egg.

File metadata

  • Download URL: awswrangler-1.8.0-py3.6.egg
  • Size: 266.3 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.11

File hashes

Hashes for awswrangler-1.8.0-py3.6.egg
Algorithm Hash digest
SHA256 73d313fd9c5c918e2b5901dba70fe6c4a3d0e0a59777e78b41ec4117b97f3cf1
MD5 5301a48dbb93208b6f4dceda193a0d6f
BLAKE2b-256 c1fe7b29fb21705e4d885baa8d2ae8a179430b470ab5bcbfec55f7603063b961

File details

Details for the file awswrangler-1.8.0-py3-none-any.whl.

File metadata

  • Download URL: awswrangler-1.8.0-py3-none-any.whl
  • Size: 125.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.11

File hashes

Hashes for awswrangler-1.8.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b297e45f493f6a700b44a323dd1950f47a72102d0b5bacedee397134f3d8960d
MD5 1ea2438ac050d1df16f2730a2da8fbe0
BLAKE2b-256 3480e84a677a20a84f8cc3957f9df7177d7ed03028712d811c6ee65518d7bf26
