Skip to main content

shaping data

Project description

protoform

Release Build status codecov Commit activity License

shaping data

Getting started with your project

Concept of protoform

protoform is a data operation management system implemented in python using a graph and network (python-igraph) to handle the manipulation of data structure. protoform leverages tasks and performed operations to the data graph by these steps.

  • Define the source of data (for instance from sql database, csv file).
  • Define the tasks.
  • Convert the source of data and tasks to a graph (data graph and task graph)
  • Execute protoform. protoform will digest the tasks and independently operate in parallel (unless you define dependencies) then apply those tasks to the raw data, and export new data.

Working modules

  • cli : captures the arguments of task and data paths and executes operations regarding the task plans.
  • io : converts tasks to task graphs, reads input data (csv, tsv, parquet, and sql), converts the data to data graph, exports the objects to graph, csv file, or pushes to sql database.
  • operator : contains a list of operations what to do to the data.
  • data : implements data operation classes.
  • graph : implements graph classs.

Installation

Execution

  • You need to prepare two files:
  1. task manifest file (such as task.json). Follow an instruction how to but a task graph here: https://github.com/dataxight/protoform/wiki/Execution-Graph-Specification
  2. data manifest file (such as data.json). Follow an instruction how to but a data graph here: https://github.com/dataxight/protoform/wiki/Data-Graph-Specification
  • Run the first command
protoform --task-plan task.json --data-graph data.json

Repository initiated with fpgmaas/cookiecutter-poetry.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

protoform-0.0.2.tar.gz (36.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

protoform-0.0.2-py3-none-any.whl (64.8 kB view details)

Uploaded Python 3

File details

Details for the file protoform-0.0.2.tar.gz.

File metadata

  • Download URL: protoform-0.0.2.tar.gz
  • Upload date:
  • Size: 36.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for protoform-0.0.2.tar.gz
Algorithm Hash digest
SHA256 745986ab51eb47c1639f7794a53a7b814d70ab8f5b540895894ba20a83e89530
MD5 4b8769944137e2c1dc367b769320ea10
BLAKE2b-256 9f681ec59fd9db966f49238e5728095103e5526497b440a83c4b5080de5f1649

See more details on using hashes here.

File details

Details for the file protoform-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: protoform-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 64.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for protoform-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 1574dfb8a004bba4bfd17bb5f9a68b0f531a3d27e66c2e9be9df124bb52cb30a
MD5 f46816a7299bfa21980afcfe817d101d
BLAKE2b-256 22c5e50e1fabb3de63712dec61cd6daf06f2fccdd769474fc3ccecaae9e17313

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page