Extend Apache Beam python API with new modules
Project description
beam-extended package
Provide extensions to existing SDKs (mainly Python)
It currently contains:
- mongoDB IO Connector
Installation
To install the package, please use the pip
installation as follows:
pip install beam-extended
Note: currently, due to Apache Beam constraints, beam-extended is only available for Python 3.6.
Example Usage
Here is a short example of using the package.
from beam_extended.io.mongodbio import ReadFromMongo, WriteToMongo
...
with beam.Pipeline(options=options) as pipeline:
( p | 'read' >> ReadFromMongo('myUrl1', 'myDb1', 'myColl1', query={}, projection=['_id'])
| 'transform' >> beam.Map(transform)
| 'save' >> WriteToMongo('myUrl2', 'myDb2', 'myColl2'))
Publishing
Run the pypi.sh
script.
Credits
Based on:
https://gist.github.com/dlebech/e9d6ba266014db8783dbbeb362593020 by David Volquartz Lebech
and
https://gist.github.com/sandboxws/08b9c5e373b94056733f8a662d9a2fa2 by Ahmed El.Hussaini
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
beam-extended-0.1.0.tar.gz
(5.0 kB
view details)
File details
Details for the file beam-extended-0.1.0.tar.gz
.
File metadata
- Download URL: beam-extended-0.1.0.tar.gz
- Upload date:
- Size: 5.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/40.0.0 requests-toolbelt/0.8.0 tqdm/4.24.0 CPython/2.7.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c36de21538ac099621464bad2d7a58551c19f7d0a2f4c34c40c61749ba0ff984 |
|
MD5 | 94dada44983bca000c5545b15dbc6341 |
|
BLAKE2b-256 | 218179a4eb8c9a8269f772d5029a37810bbfffbc9250113831c8c7a0c86ecc88 |