Simplifies commonly used functions in data processing.

Project description

Binary Rain Helper Toolkit: Data Processing

binaryrain_helper_data_processing is a Python package that simplifies common data processing tasks. It builds on the pandas library and adds functionality that makes data processing easier, reduces boilerplate code, and provides clear error messages.

Supported File Formats

  • PARQUET: For efficient columnar storage
  • CSV: For common tabular data
  • JSON: For structured data exchange
  • DICT: For Python dictionary data

Key Functions

  • create_dataframe(): simplifies creating pandas DataFrames from various formats:

      from binaryrain_helper_data_processing import FileFormat, create_dataframe
    
      # ... csv_bytes and parquet_bytes hold the raw file contents as bytes

      # Create from CSV bytes
      df = create_dataframe(csv_bytes, FileFormat.CSV)

      # Create with custom, format-specific options
      df = create_dataframe(parquet_bytes, FileFormat.PARQUET,
                            file_format_options={'engine': 'pyarrow'})
    
  • convert_dataframe_to_type(): handles converting DataFrames to different formats:

      from binaryrain_helper_data_processing import FileFormat, convert_dataframe_to_type
    
      # ... df is a pandas DataFrame

      # Convert to CSV bytes
      csv_bytes = convert_dataframe_to_type(df, FileFormat.CSV)

      # Convert with custom, format-specific options
      parquet_bytes = convert_dataframe_to_type(df, FileFormat.PARQUET,
                                                file_format_options={'engine': 'pyarrow'})
    
  • merge_dataframes(): provides a simple way to merge multiple DataFrames:

      from binaryrain_helper_data_processing import merge_dataframes
    
      # ... df1 and df2 are pandas DataFrames
    
      # Merge DataFrames
      merged_df = merge_dataframes(df1, df2, sort=True)
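
For comparison, the kind of boilerplate these helpers aim to reduce can be sketched in plain pandas. The snippet below is only an illustration of a bytes-to-DataFrame-to-bytes round trip for CSV. The sample data is made up, and the source does not say whether the package performs these exact calls internally.

```python
import io

import pandas as pd

# Made-up sample data standing in for the contents of a CSV file.
csv_bytes = b"id,name\n1,alice\n2,bob\n"

# bytes -> DataFrame: pandas reads from a file-like object, so the raw
# bytes are wrapped in an in-memory buffer first.
df = pd.read_csv(io.BytesIO(csv_bytes))

# DataFrame -> bytes: to_csv() returns a string when no path is given,
# which is then encoded back to bytes.
round_tripped = df.to_csv(index=False).encode("utf-8")
```

With the helper functions, each direction becomes a single call that takes the data and a FileFormat member, as in the examples above.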
    

Benefits

  • Consistent interface for different file formats
  • Simplified error handling with clear messages
  • Optional format-specific configurations
  • Built on pandas for robust data processing
  • Type hints for better IDE support
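
The first three benefits describe a common pattern: one entry point, a format enum that selects the reader, an optional dictionary of format-specific options, and errors re-raised with a readable message. The sketch below illustrates that pattern only. It is not the package's implementation, and the names SketchFormat and load_records are hypothetical. To stay dependency-free it uses the standard library's csv and json modules and returns a list of dictionaries rather than a pandas DataFrame.

```python
import csv
import io
import json
from enum import Enum
from typing import Any, Dict, List, Optional


class SketchFormat(Enum):
    """Hypothetical stand-in for a format enum such as FileFormat."""
    CSV = "csv"
    JSON = "json"


def _read_csv(data: bytes, **options: Any) -> List[Dict[str, Any]]:
    # Options (e.g. delimiter) are forwarded to the underlying csv reader.
    return list(csv.DictReader(io.StringIO(data.decode("utf-8")), **options))


def _read_json(data: bytes, **options: Any) -> List[Dict[str, Any]]:
    # Options are forwarded to json.loads.
    return json.loads(data.decode("utf-8"), **options)


# One reader per format; the enum member selects which one runs.
_READERS = {SketchFormat.CSV: _read_csv, SketchFormat.JSON: _read_json}


def load_records(
    data: bytes,
    file_format: SketchFormat,
    file_format_options: Optional[Dict[str, Any]] = None,
) -> List[Dict[str, Any]]:
    """Parse raw bytes into records through a single, format-agnostic call."""
    if not isinstance(file_format, SketchFormat):
        raise ValueError(f"Unsupported file format: {file_format!r}")
    options = file_format_options or {}
    try:
        return _READERS[file_format](data, **options)
    except Exception as exc:
        # Wrap low-level parser errors in one clear, uniform message.
        raise ValueError(
            f"Could not read {file_format.name} data: {exc}"
        ) from exc


# Usage: the same call shape works for either format.
rows = load_records(b"a;b\n1;2\n", SketchFormat.CSV, {"delimiter": ";"})
items = load_records(b'[{"a": 1}]', SketchFormat.JSON)
```

Keeping the options in a plain dictionary means the entry point's signature stays the same when a new format is added. Only the reader table grows.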

Download files

Source Distribution

binaryrain_helper_data_processing-0.0.1.tar.gz (10.6 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file binaryrain_helper_data_processing-0.0.1.tar.gz.

File hashes

Hashes for binaryrain_helper_data_processing-0.0.1.tar.gz
Algorithm    Hash digest
SHA256       cc05069fb3b41e2643fcd1c533351ae1e94a57728b866c29542cc35cc359b5f0
MD5          c6ad1c1c6a3ac89daa0eadd4ffce6df6
BLAKE2b-256  6c4582fed5dc0202b40697c2058d5f7e39ae1c00e1bc3aefe242d7d9295ab2fd

File details

Details for the file binaryrain_helper_data_processing-0.0.1-py3-none-any.whl.

File hashes

Hashes for binaryrain_helper_data_processing-0.0.1-py3-none-any.whl
Algorithm    Hash digest
SHA256       b642e177c614d4a24e9b47d5f0f7da53ac7a46ee6f4ccce4b5c7730e28c51586
MD5          9d2fb11e6d4754afd2b33096cf96061d
BLAKE2b-256  f4d2c3b0070d242391e5bf809834fa056e0a5255b4033185bd7a96a6a6b8138a
