
Pandas on AWS.


AWS Data Wrangler

An AWS Professional Services open source initiative | aws-proserve-opensource@amazon.com


Source   Installation Command
PyPI     pip install awswrangler
Conda    conda install -c conda-forge awswrangler


Quick Start

Installation command: pip install awswrangler

import awswrangler as wr
import pandas as pd

df = pd.DataFrame({"id": [1, 2], "value": ["foo", "boo"]})

# Storing the data in the data lake (S3) and registering the table in the Glue Catalog
wr.s3.to_parquet(
    df=df,
    path="s3://bucket/dataset/",
    dataset=True,
    database="my_db",
    table="my_table"
)

# Retrieving the data directly from Amazon S3
df = wr.s3.read_parquet("s3://bucket/dataset/", dataset=True)

# Retrieving the data from Amazon Athena
df = wr.athena.read_sql_query("SELECT * FROM my_table", database="my_db")

# Getting a Redshift connection (SQLAlchemy) from the Glue Catalog and retrieving data from Redshift Spectrum
engine = wr.catalog.get_engine("my-redshift-connection")
df = wr.db.read_sql_query("SELECT * FROM external_schema.my_table", con=engine)

# Getting a MySQL connection (SQLAlchemy) from the Glue Catalog and loading the data into MySQL
engine = wr.catalog.get_engine("my-mysql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")

# Getting a PostgreSQL connection (SQLAlchemy) from the Glue Catalog and loading the data into PostgreSQL
engine = wr.catalog.get_engine("my-postgresql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")
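The dataset write in the Quick Start can be extended with partitioning and an explicit write mode. The following is a minimal sketch, not part of the original README: the helper names `dataset_write_kwargs` and `write_dataset` are hypothetical, and it assumes that `wr.s3.to_parquet` accepts `partition_cols` and `mode` ("append", "overwrite", "overwrite_partitions") when `dataset=True`, as in the 1.x API. The bucket, database, and table names are the placeholders used above.

```python
from typing import Any, Dict, List, Optional

# Write modes assumed to be accepted by wr.s3.to_parquet when dataset=True.
_MODES = ("append", "overwrite", "overwrite_partitions")


def dataset_write_kwargs(
    path: str,
    database: str,
    table: str,
    partition_cols: Optional[List[str]] = None,
    mode: str = "append",
) -> Dict[str, Any]:
    """Build keyword arguments for a dataset write (hypothetical helper)."""
    if not path.startswith("s3://"):
        raise ValueError(f"expected an s3:// path, got {path!r}")
    if mode not in _MODES:
        raise ValueError(f"unsupported mode: {mode!r}")
    kwargs: Dict[str, Any] = {
        # Normalize to a trailing slash, matching the Quick Start path style.
        "path": path if path.endswith("/") else path + "/",
        "dataset": True,
        "mode": mode,
        "database": database,
        "table": table,
    }
    if partition_cols:
        kwargs["partition_cols"] = list(partition_cols)
    return kwargs


def write_dataset(df, **kwargs):
    """Write df to S3 as a Parquet dataset. Requires AWS credentials."""
    # Imported lazily so the helper above can be used without awswrangler installed.
    import awswrangler as wr

    return wr.s3.to_parquet(df=df, **kwargs)


kwargs = dataset_write_kwargs(
    "s3://bucket/dataset", "my_db", "my_table", partition_cols=["id"]
)
# With credentials configured: write_dataset(df, **kwargs)
```

Keeping the argument construction separate from the call makes it possible to check the path and mode before anything touches AWS.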


Who uses AWS Data Wrangler?

Knowing which companies are using this library is important to help prioritize the project internally.

If you can, please send a PR with your company name and @githubhandle.


This version

1.9.3

