Skip to main content

Aims to simplify and help with commonly used functions in the data processing areas.

Project description

Binary Rain Helper Toolkit: Data Processing

binaryrain_helper_data_processing is a python package that aims to simplify and help with common functions data processing areas. It builds on top of the pandas library and provides additional functionality to make data processing easier, reduces boilerplate code and provides clear error messages.

Supported File Formats

  • PARQUET: For efficient columnar storage
  • CSV: For common tabular data
  • JSON: For structured data exchange
  • DICT: For Python dictionary data

Key Functions

  • create_dataframe() simplifies creating pandas DataFrames from various formats:

      from binaryrain_helper_data_processing import FileFormat, create_dataframe
    
      # Create from CSV bytes
      df = create_dataframe(csv_bytes, FileFormat.CSV)
    
      # Create with custom options
      df = create_dataframe(parquet_bytes, FileFormat.PARQUET,
      file_format_options={'engine': 'pyarrow'})
    
  • convert_dataframe_to_type(): handles converting DataFrames to different formats:

      from binaryrain_helper_data_processing import FileFormat, convert_dataframe_to_type
    
      # ....df is a pandas DataFrame
    
      # Convert to CSV bytes
      csv_bytes = convert_dataframe_to_type(df, FileFormat.CSV)
    
      # Convert with custom options
      parquet_bytes = convert_dataframe_to_type(df, FileFormat.PARQUET,
      file_format_options={'engine': 'pyarrow'})
    
  • merge_dataframes(): provides a simple way to merge multiple DataFrames:

      from binaryrain_helper_data_processing import merge_dataframes
    
      # ....df1 and df2 are pandas DataFrames
    
      # Merge DataFrames
      merged_df = merge_dataframes(df1, df2, sort=True)
    

Benefits

  • Consistent interface for different file formats
  • Simplified error handling with clear messages
  • Optional format-specific configurations
  • Built on pandas for robust data processing
  • Type hints for better IDE support

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

binaryrain_helper_data_processing-0.0.6.tar.gz (3.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file binaryrain_helper_data_processing-0.0.6.tar.gz.

File metadata

File hashes

Hashes for binaryrain_helper_data_processing-0.0.6.tar.gz
Algorithm Hash digest
SHA256 b318d6d2bf3b075f6b9a53605c647989787af3e06534dc955475edcd418eb669
MD5 d8710e4b8b4f71591dab8936d36e6fcc
BLAKE2b-256 d49de7e4e4b3371a890a36e8d6a8eb4ad0c75bcc72b8b9241b0880522ed8e1a6

See more details on using hashes here.

File details

Details for the file binaryrain_helper_data_processing-0.0.6-py3-none-any.whl.

File metadata

File hashes

Hashes for binaryrain_helper_data_processing-0.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 5d4ebbc95dd4b0d90b0c90bba61b07fad7a5f254b91139763b256e226bd00e75
MD5 3d78cfb94ace659f0673c17fe7ac209c
BLAKE2b-256 45a7e9d607ec0f45f433a3fcabbae1d9c9cf504b3197fc730bf8a932dc84ac03

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page