Enhanced Google Sheets operations with advanced data type handling
Project description
gspreadplusplus
A Python library that enhances Google Sheets operations with additional functionality and improved data type handling.
Features
- Transfer Spark DataFrames to Google Sheets with proper type conversion
- Append data to existing sheets while maintaining structure
- Intelligent handling of various data types (numbers, dates, timestamps, etc.)
- Preserve or update sheet headers
- Selective column clearing options
- Automatic date formatting
- Sheet dimension management
- Configuration management with key-value storage
Installation
pip install gspreadplusplus
Requirements
- Python 3.7+
- gspread
- pyspark
- google-auth
Usage
Basic DataFrame Export
from gspreadplusplus import GPP
from pyspark.sql import SparkSession
# Initialize Spark and create a DataFrame
spark = SparkSession.builder.appName("example").getOrCreate()
df = spark.createDataFrame([
("2024-01-01", 100, "Complete"),
("2024-01-02", 150, "Pending")
], ["date", "amount", "status"])
# Your Google Sheets credentials
creds_json = {
"type": "service_account",
# ... rest of your service account credentials
}
# Export DataFrame to Google Sheets
GPP.df_to_sheets(
df=df,
spreadsheet_id="your_spreadsheet_id",
sheet_name="Sheet1",
creds_json=creds_json
)
Advanced DataFrame Export Options
# Export with custom options
GPP.df_to_sheets(
df=df,
spreadsheet_id="your_spreadsheet_id",
sheet_name="Sheet1",
creds_json=creds_json,
keep_header=True, # Preserve existing header row
erase_whole=False, # Clear only necessary columns
create_sheet=True # Create sheet if it doesn't exist
)
# Append data to existing sheet
GPP.df_append_to_sheets(
df=df,
spreadsheet_id="your_spreadsheet_id",
sheet_name="Sheet1",
creds_json=creds_json,
keep_header=True, # Keep existing header
create_sheet=True # Create sheet if it doesn't exist
)
Configuration Management
The library provides functionality to store and update configuration values in a Google Sheet. By default, it uses a sheet named "CONFIG" with keys in column A and values in column B.
# Store or update a configuration value
result = GPP.set_config(
spreadsheet_id="your_spreadsheet_id",
key="api_endpoint",
value="https://api.example.com",
creds_json=creds_json,
sheet_name="CONFIG" # Optional, defaults to "CONFIG"
)
if result == 0:
print("Configuration updated successfully")
else:
print("Error updating configuration")
Method Reference
df_to_sheets
Exports a Spark DataFrame to Google Sheets, optionally preserving existing headers.
Parameters:
df: Spark DataFrame containing the data to transferspreadsheet_id: The ID of the Google Spreadsheetsheet_name: Name of the worksheet to updatecreds_json: Dictionary containing Google service account credentialskeep_header: If True, preserve the first row of the sheet (default: False)erase_whole: If True, clear all columns and rows (default: True)create_sheet: If True, create the sheet if it doesn't exist (default: True)
df_append_to_sheets
Appends data from a Spark DataFrame to an existing Google Sheet.
Parameters:
df: Spark DataFrame containing the data to appendspreadsheet_id: The ID of the Google Spreadsheetsheet_name: Name of the worksheet to updatecreds_json: Dictionary containing Google service account credentialskeep_header: If True, preserve existing header (default: False)create_sheet: If True, create the sheet if it doesn't exist (default: True)
set_config
Stores or updates configuration values in a designated sheet.
Parameters:
spreadsheet_id: The ID of the Google Spreadsheetkey: The key to store/updatevalue: The value to setcreds_json: Dictionary containing Google service account credentialssheet_name: Name of the configuration worksheet (default: "CONFIG")
Returns:
- 0 on success
- 1 on error
Data Type Support
The library automatically handles conversion of various data types:
- Strings
- Integers (regular, long, bigint)
- Floating point numbers (double, float)
- Decimals
- Dates
- Timestamps
- Booleans
Null values are converted to:
- 0 for numeric types
- Empty string for other types
Error Handling
The library implements comprehensive error handling:
- Returns status codes for operations (0 for success, 1 for failure)
- Prints detailed error messages for debugging
- Gracefully handles missing keys, sheet access issues, and credential problems
- Validates column count when appending with preserved headers
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file gspreadplusplus-3.1.1.tar.gz.
File metadata
- Download URL: gspreadplusplus-3.1.1.tar.gz
- Upload date:
- Size: 6.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.12.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
46f63b8fcaa0599f7dd2b22b3fdb96234b53d2565ab9e8d5ff1d3bfeb7d8f34f
|
|
| MD5 |
cb25bcc6d4264b72015dc0879d605b6f
|
|
| BLAKE2b-256 |
e5393dd1c548c124ea1bea92805ed287a5f3df3a1c7b1d315dd920c141f84bec
|
File details
Details for the file gspreadplusplus-3.1.1-py3-none-any.whl.
File metadata
- Download URL: gspreadplusplus-3.1.1-py3-none-any.whl
- Upload date:
- Size: 7.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.12.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b5f09e8c8ce990eb8b1ffc4ea3c6a9ba99e3ad846618b6ff927b76d850cb0e71
|
|
| MD5 |
02c76f25b6b9cca5d09e8fc3e710122a
|
|
| BLAKE2b-256 |
3d666960fee90497e18717e7e55f0a0554be5dfede31ba5b3876b71adb97d3c8
|