Skip to main content

No project description provided

Project description

Datashack

What is this?

Create AWS Kinesis+Glue pipeline with Python, Javascript, Golang, Java. (currently Python only)

For example:

from datashack_sdk_py import StreamingTable, Column

db = Database("db1", "tables-bucket")

UserEvents = StreamingTable("users", "db1")

UserEvents['id'] = Column('string')
UserEvents['age'] = Column('int')
UserEvents['name'] = Column('string')
UserEvents['ts'] = Column('timestamp')

run datashack plan/apply to see the actual changes or actually applying them to your AWS account.

creates this pipeline:

Tux, the Linux mascot

  • Provision Kinesis+Spark+Glue+Iam
  • Automate schema evolution
  • Tests

Pre-requisites

To work with this project, you will need to have the following software installed on your machine:

Getting Started

run in your terminal

pip install datashack-sdk --upgrade
git clone https://github.com/datashack-dev/datashack-sdk-examples
datashack plan ./datashack-sdk-examples/py_my_app/models

Roadmap

  • Additional AWS services
  • Additional data sources and sinks, such as Apache Kafka or Elasticsearch
  • 3rd Party sources - Google Analytics, Salesforce ...
  • Spark based transformation and processing capabilities
  • Integration with CICD workflows
  • Integration with runtime applications
  • Testing
  • Automatic documentation generation
  • More languages support for Datashack SDK: Go, Java, Yaml

Stay tuned

We are working on a fully funcional beta with many more features. Join here so we can ping you

https://www.datashack.dev/stay-in-touch

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datashack_sdk-0.0.27.tar.gz (30.1 kB view details)

Uploaded Source

File details

Details for the file datashack_sdk-0.0.27.tar.gz.

File metadata

  • Download URL: datashack_sdk-0.0.27.tar.gz
  • Upload date:
  • Size: 30.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.16

File hashes

Hashes for datashack_sdk-0.0.27.tar.gz
Algorithm Hash digest
SHA256 7711b4949a0c5e1292223f4097894a9680b94e6da0b1dc618353fd127bf77e01
MD5 df812f17e655bc0cfd5b665bdf1d4f8f
BLAKE2b-256 5754169e13ed3a18d9839a5e2bc100fad516fefab2d22b3ff42ec239158facb8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page