AWS Data Wrangler

Pandas on AWS

An AWS Professional Services open source initiative | aws-proserve-opensource@amazon.com

Source   Installation Command
PyPI     pip install awswrangler
Conda    conda install -c conda-forge awswrangler

Quick Start

Installation command: pip install awswrangler

import awswrangler as wr
import pandas as pd

df = pd.DataFrame({"id": [1, 2], "value": ["foo", "boo"]})

# Storing the data in the data lake (Amazon S3) and registering the table in the Glue Catalog
wr.s3.to_parquet(
    df=df,
    path="s3://bucket/dataset/",
    dataset=True,
    database="my_db",
    table="my_table"
)

# Retrieving the data directly from Amazon S3
df = wr.s3.read_parquet("s3://bucket/dataset/", dataset=True)

# Retrieving the data from Amazon Athena
df = wr.athena.read_sql_query("SELECT * FROM my_table", database="my_db")

# Get a Redshift connection (SQLAlchemy) from the Glue Catalog and retrieve data from Redshift Spectrum
engine = wr.catalog.get_engine("my-redshift-connection")
df = wr.db.read_sql_query("SELECT * FROM external_schema.my_table", con=engine)

# Get a MySQL connection (SQLAlchemy) from the Glue Catalog and load the data into MySQL
engine = wr.catalog.get_engine("my-mysql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")

# Get a PostgreSQL connection (SQLAlchemy) from the Glue Catalog and load the data into PostgreSQL
engine = wr.catalog.get_engine("my-postgresql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")

Read The Docs

Who uses AWS Data Wrangler?

Knowing which companies are using this library is important to help prioritize the project internally.

If you can, please send a PR with your company name and @githubhandle.

  1. Digio [@afonsomy]
  2. Pier [@flaviomax]
  3. M4U [@Thiago-Dantas]
  4. Serasa Experian [@andre-marcos-perez]
  5. LINE TV [@bryanyang0528]
  6. OKRA Technologies [@JPFrancoia, @schot]

Download files

All three files were uploaded via twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.11, without Trusted Publishing.

Source Distribution

awswrangler-1.8.1.tar.gz (99.7 kB, Source)

  • SHA256: 81f2bdb664c75bc4369070965ff8f97114c1be1f4c031446fe11958937a29f61
  • MD5: 36a34cb19455bb842e3d86856e60677c
  • BLAKE2b-256: 909d0f662753d0eae611eaa5df3f8f61a3b644b7d686794ee03305a02c244807

Built Distributions

awswrangler-1.8.1-py3.6.egg (266.4 kB, Egg)

  • SHA256: e496fa4dfcdb5862d8741f6b298c2e8013f1d25dd250fb18f640a3f590bc45f9
  • MD5: d693bbac87402958f2386161c95dc98b
  • BLAKE2b-256: e3f16f252936677ab14de05354a0260b67483158c5bc076c9a2a96aca25580f4

awswrangler-1.8.1-py3-none-any.whl (125.1 kB, Python 3)

  • SHA256: dcee84c566bfbd9d3ea7dd923eab7c5b2650edcd1a0748ad04ebf57543131646
  • MD5: d90b2f25a45db2992d23395cf65624bd
  • BLAKE2b-256: b71cc76487c0dd3fe6e81aecc7ed8f6c06f9675424f85b3d1cca66a629f4689e
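
The published digests can be used to check a downloaded file before installing it. The following is a minimal sketch using only the Python standard library, not an official verification tool: the helper names `sha256_of_file` and `matches_expected` and the local file path are assumptions made for illustration, while the expected digest is the SHA256 value listed above for awswrangler-1.8.1.tar.gz.

```python
import hashlib
from pathlib import Path

# SHA256 digest published for awswrangler-1.8.1.tar.gz (copied from the hash listing).
EXPECTED_SHA256 = "81f2bdb664c75bc4369070965ff8f97114c1be1f4c031446fe11958937a29f61"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """Compare the file's digest with a published digest, ignoring case."""
    return sha256_of_file(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Hypothetical local path: adjust to wherever the archive was saved.
    archive = Path("awswrangler-1.8.1.tar.gz")
    if archive.exists():
        ok = matches_expected(archive, EXPECTED_SHA256)
        print("hash OK" if ok else "hash MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

Reading in chunks keeps memory use flat regardless of file size. The same helper works for the egg and wheel files by swapping in their SHA256 values.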
