Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMR


What is the sparksnake library?

The sparksnake library provides an easy, fast, and efficient way to use Spark features inside AWS analytics services. It offers classes, methods, and functions built on pyspark that simplify, as much as possible, building Spark applications around the particularities of AWS services such as Glue and EMR.

Do you want to take your Glue job or your EMR cluster to the next level? Take a look at sparksnake!

Note: The sparksnake library now has official documentation on Read the Docs, covering usability and technical details, practical examples, and more.

Features

  • 🤖 Better development experience for Spark applications deployed as jobs on AWS services like Glue and EMR
  • 🌟 Common Spark operations for ETL steps available through custom classes and methods
  • ⚙️ No need to worry about complex service setup (e.g. with sparksnake you can get all the elements of a Glue job on AWS with a single line of code)
  • 👁️‍🗨️ Improved application observability through detailed log messages in CloudWatch
  • 🛠️ Exception handling already built into the library's methods
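
The last two features, detailed log messages and built-in exception handling, can be illustrated with a generic pattern. The sketch below is not sparksnake's actual implementation or API: `logged_step`, `drop_null_ids`, and the logger name `etl_sketch` are hypothetical names chosen for illustration, and only the Python standard library is used. It shows one way a library method could log its progress and log any failure before re-raising it.

```python
import logging
from functools import wraps

# Hypothetical logger setup; the name and format are assumptions, not sparksnake's.
logger = logging.getLogger("etl_sketch")
logger.setLevel(logging.INFO)
if not logger.handlers:
    handler = logging.StreamHandler()
    handler.setFormatter(
        logging.Formatter("%(levelname)s;%(asctime)s;%(name)s;%(message)s")
    )
    logger.addHandler(handler)


def logged_step(func):
    """Log the start and end of an ETL step; log any failure, then re-raise it."""
    @wraps(func)
    def wrapper(*args, **kwargs):
        logger.info("Starting step %s", func.__name__)
        try:
            result = func(*args, **kwargs)
        except Exception:
            # logger.exception records the message together with the traceback
            logger.exception("Step %s failed", func.__name__)
            raise
        logger.info("Finished step %s", func.__name__)
        return result
    return wrapper


@logged_step
def drop_null_ids(rows):
    """Hypothetical ETL step: keep only the rows whose 'id' is not None."""
    return [row for row in rows if row["id"] is not None]


if __name__ == "__main__":
    print(drop_null_ids([{"id": 1}, {"id": None}]))
```

Because the failure is logged and then re-raised, the caller still sees the original exception, while the log stream keeps a record of which step failed.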


Download files


Source Distribution

sparksnake-0.1.21.tar.gz (34.1 kB)

Built Distribution

sparksnake-0.1.21-py3-none-any.whl (36.6 kB, Python 3)
