Skip to main content

Lightweight wrapper around gspread that simplifies tasks like downloading, uploading, and appending data from Google Sheets using Pandas.

Project description

pandasheets

Logo

pandasheets is a convenient wrapper around gspread that simplifies common operations such as downloading, uploading, and appending, when working with Google Sheets using pandas.

Getting Started

To use pandasheets, you need to set up a Google Cloud service account credential that grants your code access to manage spreadsheets. Below are the steps to create and configure these credentials:

1. Enable API access for your project

  1. Go to the Google Cloud Console.
  2. From the left-hand navigation menu, select APIs & Services.
  3. Click Enable APIs & Services, then Create project if you don’t already have one.
  4. Provide a Project name and Location, then click Create.

2. Create a Service Account Credential

  1. Under APIs & Services, click Credentials.
  2. Select Create credentials and choose Service account.
  3. Fill out the required fields (e.g., name, description).
  4. Copy the generated email address (it ends with iam.gserviceaccount.com, e.g., myserviceaccount@my-awesome-project.iam.gserviceaccount.com).
  5. Click Create and continue.
  6. Assign a suitable Role for the service account.
  7. Finalize the creation process.

3. Generate the JSON key

  1. On the Service Accounts page, locate the service account you just created.
  2. Click on it to view the details.
  3. In the top menu, select Keys.
  4. Click Add key > Create new key > JSON > Create.
  5. A JSON file will be downloaded. Store it in a secure location.

4. Enable the Google Sheets API

  1. Return to APIs & Services in the Google Cloud Console.
  2. Click Enable APIs and services.
  3. Search for and select Google Sheets API.
  4. Click Enable.

5. Enable the Google Drive API

Repeat the process from Step 4 to enable the Google Drive API.

6. Share your spreadsheet

Finally, open the Google Sheet you want to access and click Share. Enter the service account email you copied in Step 2, and grant appropriate permissions (e.g., Editor). Click Done.

Important: Make sure the service account email has permission to access the spreadsheet and that both the Google Sheets API and the Google Drive API are enabled in your project.

Installation

pip install pandasheets

Usage

Instantiate once with the path to your JSON key

import pandasheets as ps
PS = ps.pandasheets(credential="path/to/credential.json")

Downloading a sheet from Google Sheets into a DataFrame

# Define the sheet name, spreadsheet name

sheet = "mySheet" # sheet to be downloaded and loaded as a DataFrame
spreadsheet = "mySpreadsheet" # Name of the spreadsheet containing the sheet

# Download the sheet and load it as a DataFrame
df = PS.get_sheet_to_dataframe(
    sheet=sheet,
    spreadsheet=spreadsheet,
)

Uploading a DataFrame to a new sheet

# Define your DataFrame, target sheet, spreadsheet name, and credential file

df = myDf # DataFrame to be uploaded
sheet = "mySheet" # Target sheet name for the upload
spreadsheet = "mySpreadsheet" # Name of the spreadsheet where the sheet will be created

PS.upload_dataframe_to_spreadsheet(
    df=df,
    sheet=sheet,
    spreadsheet=spreadsheet,
)

By default, the DataFrame will be uploaded formatted as:

  • Bold header
  • All cells in columns A to Z are set to clip text wrapping
  • The first row is frozen to keep the headers visible during scrolling.

If you wish to avoid the default formatting, call the function with the formatting parameter set to False:

PS.upload_dataframe_to_spreadsheet(
    df=df,
    sheet=sheet,
    spreadsheet=spreadsheet,
    formatting=False
)

By default, if a sheet with the same name already exists in the specified spreadsheet, a ValueError will be raised, as a new sheet will not be created. If you intend to overwrite the content of an existing sheet, set the parameter overwrite to True

PS.upload_dataframe_to_spreadsheet(
    df=df,
    sheet=sheet,
    spreadsheet=spreadsheet,
    formatting=False,
    overwrite=True
)

Important: All the content of the existing sheet will be overwritten if overwrite=True. If you only intend to append new values, use append_dataframe_to_sheet as shown below.

Appending a DataFrame to an existing sheet

# Create a sample DataFrame to be appended
df_to_append = pd.DataFrame({
    "column_A": ["A1", "A2", "A3"],
    "column_B": ["B1", "B2", "B3"],
})

sheet = "mySheet" # Sheet name to which the data will be appended
spreadsheet = "mySpreadsheet" # Spreadsheet name where the sheet exists

PS.append_dataframe_to_sheet(
    df=df_to_append,
    sheet=sheet,
    spreadsheet=spreadsheet,
)

Important: Only values for existing columns in the sheet are appended. Missing values in the DataFrame (present in the sheet) are set as empty strings. Duplicate values are not appended by default.

If duplicate values are acceptable and you would like to append them, set the duplicates parameter to True:

PS.append_dataframe_to_sheet(
    df=df,
    sheet=sheet,
    spreadsheet=spreadsheet,
    duplicates=True
)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pandasheets-0.3.1.tar.gz (44.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pandasheets-0.3.1-py3-none-any.whl (31.2 kB view details)

Uploaded Python 3

File details

Details for the file pandasheets-0.3.1.tar.gz.

File metadata

  • Download URL: pandasheets-0.3.1.tar.gz
  • Upload date:
  • Size: 44.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for pandasheets-0.3.1.tar.gz
Algorithm Hash digest
SHA256 19a20afdfe9d71879c0d43c39367ccee3edf324f066d801254493b98ac71e0e7
MD5 975f80a8c4780b660b5d4a8e8a4946c5
BLAKE2b-256 510571b2e715d945adf7a84f8cce83cdddc0c41e8430bc8254bf11acb3b39207

See more details on using hashes here.

Provenance

The following attestation bundles were made for pandasheets-0.3.1.tar.gz:

Publisher: publish.yml on diegomagela/pandasheets

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pandasheets-0.3.1-py3-none-any.whl.

File metadata

  • Download URL: pandasheets-0.3.1-py3-none-any.whl
  • Upload date:
  • Size: 31.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for pandasheets-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 61e2fc26370041204572106dd7c0cb40f00c8e62f8236ae6a7410627b713f592
MD5 b66e1c9a5aa513cab7b90d2fb427d4f5
BLAKE2b-256 2b4cb2bbe1ad92e6e1bf8cf42fb30e6866853e2930548178919a344700fd3c20

See more details on using hashes here.

Provenance

The following attestation bundles were made for pandasheets-0.3.1-py3-none-any.whl:

Publisher: publish.yml on diegomagela/pandasheets

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page