Skip to main content

Enhanced Google Sheets operations with advanced data type handling

Project description

gspreadplusplus

A Python library that enhances Google Sheets operations with additional functionality and improved data type handling.

Features

  • Transfer Spark DataFrames to Google Sheets with proper type conversion
  • Append data to existing sheets while maintaining structure
  • Intelligent handling of various data types (numbers, dates, timestamps, etc.)
  • Preserve or update sheet headers
  • Selective column clearing options
  • Automatic date formatting
  • Sheet dimension management
  • Configuration management with key-value storage

Installation

pip install gspreadplusplus

Requirements

  • Python 3.7+
  • gspread
  • pyspark
  • google-auth

Usage

Basic DataFrame Export

from gspreadplusplus import GPP
from pyspark.sql import SparkSession

# Initialize Spark and create a DataFrame
spark = SparkSession.builder.appName("example").getOrCreate()
df = spark.createDataFrame([
    ("2024-01-01", 100, "Complete"),
    ("2024-01-02", 150, "Pending")
], ["date", "amount", "status"])

# Your Google Sheets credentials
creds_json = {
    "type": "service_account",
    # ... rest of your service account credentials
}

# Export DataFrame to Google Sheets
GPP.df_to_sheets(
    df=df,
    spreadsheet_id="your_spreadsheet_id",
    sheet_name="Sheet1",
    creds_json=creds_json
)

Advanced DataFrame Export Options

# Export with custom options
GPP.df_to_sheets(
    df=df,
    spreadsheet_id="your_spreadsheet_id",
    sheet_name="Sheet1",
    creds_json=creds_json,
    keep_header=True,     # Preserve existing header row
    erase_whole=False,    # Clear only necessary columns
    create_sheet=True     # Create sheet if it doesn't exist
)

# Append data to existing sheet
GPP.df_append_to_sheets(
    df=df,
    spreadsheet_id="your_spreadsheet_id",
    sheet_name="Sheet1",
    creds_json=creds_json,
    keep_header=True,     # Keep existing header
    create_sheet=True     # Create sheet if it doesn't exist
)

Configuration Management

The library provides functionality to store and update configuration values in a Google Sheet. By default, it uses a sheet named "CONFIG" with keys in column A and values in column B.

# Store or update a configuration value
result = GPP.set_config(
    spreadsheet_id="your_spreadsheet_id",
    key="api_endpoint",
    value="https://api.example.com",
    creds_json=creds_json,
    sheet_name="CONFIG"  # Optional, defaults to "CONFIG"
)

if result == 0:
    print("Configuration updated successfully")
else:
    print("Error updating configuration")

Method Reference

df_to_sheets

Exports a Spark DataFrame to Google Sheets, optionally preserving existing headers.

Parameters:

  • df: Spark DataFrame containing the data to transfer
  • spreadsheet_id: The ID of the Google Spreadsheet
  • sheet_name: Name of the worksheet to update
  • creds_json: Dictionary containing Google service account credentials
  • keep_header: If True, preserve the first row of the sheet (default: False)
  • erase_whole: If True, clear all columns and rows (default: True)
  • create_sheet: If True, create the sheet if it doesn't exist (default: True)

df_append_to_sheets

Appends data from a Spark DataFrame to an existing Google Sheet.

Parameters:

  • df: Spark DataFrame containing the data to append
  • spreadsheet_id: The ID of the Google Spreadsheet
  • sheet_name: Name of the worksheet to update
  • creds_json: Dictionary containing Google service account credentials
  • keep_header: If True, preserve existing header (default: False)
  • create_sheet: If True, create the sheet if it doesn't exist (default: True)

set_config

Stores or updates configuration values in a designated sheet.

Parameters:

  • spreadsheet_id: The ID of the Google Spreadsheet
  • key: The key to store/update
  • value: The value to set
  • creds_json: Dictionary containing Google service account credentials
  • sheet_name: Name of the configuration worksheet (default: "CONFIG")

Returns:

  • 0 on success
  • 1 on error

Data Type Support

The library automatically handles conversion of various data types:

  • Strings
  • Integers (regular, long, bigint)
  • Floating point numbers (double, float)
  • Decimals
  • Dates
  • Timestamps
  • Booleans

Null values are converted to:

  • 0 for numeric types
  • Empty string for other types

Error Handling

The library implements comprehensive error handling:

  • Returns status codes for operations (0 for success, 1 for failure)
  • Prints detailed error messages for debugging
  • Gracefully handles missing keys, sheet access issues, and credential problems
  • Validates column count when appending with preserved headers

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gspreadplusplus-3.1.1.tar.gz (6.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gspreadplusplus-3.1.1-py3-none-any.whl (7.2 kB view details)

Uploaded Python 3

File details

Details for the file gspreadplusplus-3.1.1.tar.gz.

File metadata

  • Download URL: gspreadplusplus-3.1.1.tar.gz
  • Upload date:
  • Size: 6.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.12.6

File hashes

Hashes for gspreadplusplus-3.1.1.tar.gz
Algorithm Hash digest
SHA256 46f63b8fcaa0599f7dd2b22b3fdb96234b53d2565ab9e8d5ff1d3bfeb7d8f34f
MD5 cb25bcc6d4264b72015dc0879d605b6f
BLAKE2b-256 e5393dd1c548c124ea1bea92805ed287a5f3df3a1c7b1d315dd920c141f84bec

See more details on using hashes here.

File details

Details for the file gspreadplusplus-3.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for gspreadplusplus-3.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b5f09e8c8ce990eb8b1ffc4ea3c6a9ba99e3ad846618b6ff927b76d850cb0e71
MD5 02c76f25b6b9cca5d09e8fc3e710122a
BLAKE2b-256 3d666960fee90497e18717e7e55f0a0554be5dfede31ba5b3876b71adb97d3c8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page