Skip to main content

Data Processing is used for data processing through MinIO, databases, Web APIs, etc.

Project description

Data Processing

Current Version Main Features

Data Processing is used for data processing through MinIO, databases, Web APIs, etc. The data types handled include:

  • txt
  • json
  • doc
  • html
  • excel
  • csv
  • pdf
  • markdown
  • ppt

Current Text Type Processing

The data processing process includes: cleaning abnormal data, filtering, de-duplication, and anonymization.

Design

Design

Local Development

Software Requirements

Before setting up the local data-process environment, please make sure the following software is installed:

  • Python 3.10.x

Environment Setup

Install the Python dependencies in the requirements.txt file

Running

Run the server.py file in the src directory

isort

isort is a tool for sorting imports alphabetically within your Python code. It helps maintain a consistent and clean import order.

install

pip install isort

isort a file

isort src/server.py

isort a directory

isort .

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

one-data-processing-0.0.1.tar.gz (2.2 kB view hashes)

Uploaded Source

Built Distribution

one_data_processing-0.0.1-py3-none-any.whl (2.1 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page