No project description provided
Project description
Datashack
What is this?
Create AWS Kinesis+Glue pipeline with Python, Javascript, Golang, Java. (currently Python only)
For example:
from datashack_sdk_py import StreamingTable, Column
db = Database("db1", "tables-bucket")
UserEvents = StreamingTable("users", "db1")
UserEvents['id'] = Column('string')
UserEvents['age'] = Column('int')
UserEvents['name'] = Column('string')
UserEvents['ts'] = Column('timestamp')
run datashack plan/apply to see the actual changes or actually applying them to your AWS account.
creates this pipeline:
- Provision Kinesis+Spark+Glue+Iam
- Automate schema evolution
- Tests
Pre-requisites
To work with this project, you will need to have the following software installed on your machine:
-
Windows 10, macOS 10.13 or later, or a modern Linux distribution
-
4GB of RAM (8GB recommended)
-
Node.js: You can download and install Node.js from the official website: https://nodejs.org/en/download/
-
Python: You can download and install Python from the official website: https://www.python.org/downloads/
-
Terraform: These tools are used for defining infrastructure as code. To install them, follow the instructions on the HashiCorp website: https://developer.hashicorp.com/terraform/tutorials/aws-get-started/install-cli.
-
CDKTF: you can install the CDKTF CLI by running the following command:
npm install --global cdktf-cli@latest -
AWS Account: create an AWS account if you don't have one and configure your local computer aws configuration with
aws config- https://docs.aws.amazon.com/cli/latest/userguide/cli-configure-files.html
Getting Started
run in your terminal
pip install datashack-sdk --upgrade
git clone https://github.com/datashack-dev/datashack-sdk-examples
datashack plan ./datashack-sdk-examples/py_my_app/models
Roadmap
- Additional AWS services
- Additional data sources and sinks, such as Apache Kafka or Elasticsearch
- 3rd Party sources - Google Analytics, Salesforce ...
- Spark based transformation and processing capabilities
- Integration with CICD workflows
- Integration with runtime applications
- Testing
- Automatic documentation generation
- More languages support for Datashack SDK: Go, Java, Yaml
Stay tuned
We are working on a fully funcional beta with many more features. Join here so we can ping you
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file datashack_sdk-0.0.27.tar.gz.
File metadata
- Download URL: datashack_sdk-0.0.27.tar.gz
- Upload date:
- Size: 30.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.16
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7711b4949a0c5e1292223f4097894a9680b94e6da0b1dc618353fd127bf77e01
|
|
| MD5 |
df812f17e655bc0cfd5b665bdf1d4f8f
|
|
| BLAKE2b-256 |
5754169e13ed3a18d9839a5e2bc100fad516fefab2d22b3ff42ec239158facb8
|