Skip to main content

Data modeling managing and transforming data

Project description

PyPI version Docs badge Repository License

Ding Dong

dingDong created for modeling devlop and maintin complex data integration projects - relational database or cloud APIs, Sql or no sql, data cleansing or modeling algorithms.

The project is currently support and batch loading, our next phase is to extend it for REST and websockecet APIs itegration as well.

Current project is purely python based. REST and websocket will be implemnted by nodeJs.

This project aims to use as a glue between diverse data storage types. we did not implementeed any JOIN or UNION method which can be used in much efficient way at the connectoer themself we did impleented pure meta data manager, data tranformation and extracting method. Using the capabiltes of existing connectors with dingDong allow us to create robust data project using the adavatges of all the componenents

Connectors :

Sql server - tested, ready for production Oracle - tested, ready for production SqlLite - tested, ready for production text files - tested, ready for production CSV - tested Vertica - partially tested MySql - partially tested MongoDB - partially tested Hadoop/Hive - not implemted

API Support SalesForce - partially, not tested

dingDong is splitted to two main moduls:
  • DING - create and manage overall metadata strucutre for all object at the workflow
    • creating new object

    • modify existing object by using back propogation mechnism

    • update data into now object

    • store old strucure

    • (todo) –> truck all changes in main repo for CI/CD processess

  • DONG - extract and load data from diverse realtion / not relational connectors
    • extract data - support multithreading for extracting massive data volume

    • transfer - enable to manipylate date by adding manipulation function on column
      • enable to add custom calculated fields

    • merge - merging source with target data can be done if source and merge located at the same connector

    • exec - enable to execute PL/SQL or sstored procedure command as part of the whole data workflow

Read more about dingDong at http://www.biSkilled.com (marketing) or at dingDong documentation

Installation

download from GitHub or install by using pip:

pip install dingDong

Samples

download samples CSV files DATAELEMENTDESCRIPTION.csv, DEMOGRAPHICS.csv, MEASURESOFBIRTHANDDEATH.csv located at samples/sampleHealthCare/csvData folder In this sample we use C:\dingDong as our main folder

In this sample

Hallo WORLD

print (“HAllo wrodl”)

Road map

We would like to create a platform that will enable to design, implement and maintenance and data integration project such as:

  • Any REST API connectivity from any API to any API using simple JSON mapping

  • Any Relational data base connectivity using JSON mapping

  • Any Non relational storage

  • Main platform for any middleware business logic - from sample if-than-else up to statistics algorithms using ML and DL algorithms

  • Enable Real time and scheduled integration

We will extend our connectors and Meta-data manager accordingly.

Cuurent supporting features

  • APIs : Salesforce

  • RMDBs : Sql-Server, Access, Oracle, Vertice, MySql

  • middleware : column transformation and simple data cleansing

  • DBs : mongoDb

  • Batch : Using external scheduler currently …..

  • onLine : Needs to be implemented …..

Authors

dingDong was created by Tal Shany (tal@biSkilled.com) We are looking for contributions !!!

License

GNU General Public License v3.0

See COPYING to see the full text.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dingDong-1.0.2.tar.gz (301.8 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page