gspread-pandas

A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.

These details have not been verified by PyPI

Project links

Project description

author: Diego Fernandez

Links:

Attention!

There will be breaking API changes in v2. Mainly, I will be making the user key optional and OAuth credentials will be stored under a default file. This should make it easier to use for the common single user case, as well as for those using ServiceAccount credentials. I’d love to hear your opinion on the issue. I will also be standardizing the API for Spread.add_filter to match other functions. Feel free to check out the current work on the v2 branch.

To disable warnings:

import gspread_pandas.util as util
util.DEPRECATION_WARNINGS_ENABLED = False

Overview

A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames. It enables you to easily pull data from Google spreadsheets into DataFrames as well as push data into spreadsheets from DataFrames. It leverages gspread in the backend for most of the heavylifting, but it has a lot of added functionality to handle things specific to working with DataFrames as well as some extra nice to have features.

Some key goals/features:

Nicely handle headers and indexes
Run on Jupyter, headless server, and/or scripts
Allow storing different user credentials
Automatically handle token refreshes
Enable handling of frozen rows and columns
Enable handling of merged cells
Nicely handle large data sets and retries
Enable creation of filters
Handle retries when exceeding 100s quota
Handle cell merges with option to merge multi-level header cells

Installation / Usage

To install use pip:

$ pip install gspread-pandas

Or clone the repo:

$ git clone https://github.com/aiguofer/gspread-pandas.git
$ python setup.py install

Before using, you will need to download Google client credentials for your app.

Client Credentials

To allow a script to use Google Drive API we need to authenticate our self towards Google. To do so, we need to create a project, describing the tool and generate credentials. Please use your web browser and go to Google console and :

Choose Create Project in popup menu on the top.
A dialog box appears, so give your project a name and click on Create button.
On the left-side menu click on API Manager.
A table of available APIs is shown. Switch Drive API and click on Enable API button. Do the same for Sheets API. Other APIs might be switched off, for our purpose.
On the left-side menu click on Credentials.
In section OAuth consent screen select your email address and give your product a name. Then click on Save button.
In section Credentials click on Add credentials and switch OAuth 2.0 client ID.
A dialog box Create Cliend ID appears. Select Application type item as Other.
Click on Create button.
Click on Download JSON icon on the right side of created OAuth 2.0 client IDs and store the downloaded file on your file system. Please be aware, the file contains your private credentials, so take care of the file in the same way you care of your private SSH key; i.e. move downloaded JSON to ~/.config/gspread_pandas/google_secret.json (or you can configure the directory and file name by directly calling gspread_pandas.conf.get_config

Thanks to similar project df2gspread for this great description of how to get the client credentials.

User Credentials

Once you have your client credentials, you can have multiple user credentials stored in the same machine. This can be useful when you have a shared server (for example with a Jupyter notebook server) with multiple people that may want to use the library. The first parameter to Spread must be the key identifying a user’s credentials. The first time this is called for a specific key, you will have to authenticate through a text based OAuth prompt; this makes it possible to run on a headless server through ssh or through a Jupyter notebook. After this, the credentials for that user will be stored (by default in ~/.config/gspread_pandas/creds or you can manually set it in GSPREAD_PANDAS_CONFIG_DIR env var) and the tokens will berefreshed automatically any time the tool is used.

Users will only be able to interact with Spreadsheets that they have access to.

Handling Authentication

In the backend, the library is leveraging Google’s oauth2client to handle authentication. It conveniently stores everything as described above so that you don’t have to worry about boiler plate code to handle auth. However, if you need to customize how you handle authentication you can do so in a few different ways. You can change the directory where everything is stored using the GSPREAD_PANDAS_CONFIG_DIR env var. You can also generate your own oauth2client.client.OAuth2Credentials and pass them in when instanciating a Client or Spread object. For other ways to customize authentication, see gspread_pandas.conf.get_config and gspread_pandas.conf.get_creds

Contributing

Code should be run through black, isort, and flake8 before being merged. Pre-commit takes care of it for you, but you need to have Python 3 installed to be able to run black. To contribute, please fork the repo, create a feature branch, push it to your repo, then create a pull request.

To install and set up the environment after you fork it (replace aiguofer with your username):

$ git clone https://github.com/aiguofer/gspread-pandas.git && cd gspread-pandas
$ pip install -e ".[dev]"
$ pre-commit install

Example

from __future__ import print_function
import pandas as pd
from gspread_pandas import Spread, Client

file_name = "http://stats.idre.ucla.edu/stat/data/binary.csv"
df = pd.read_csv(file_name)

# 'Example Spreadsheet' needs to already exist and your user must have access to it
spread = Spread('example_user', 'Example Spreadsheet')
# This will ask to authenticate if you haven't done so before for 'example_user'

# Display available worksheets
spread.sheets

# Save DataFrame to worksheet 'New Test Sheet', create it first if it doesn't exist
spread.df_to_sheet(df, index=False, sheet='New Test Sheet', start='A2', replace=True)
spread.update_cells('A1', 'A1', ['Created by:', spread.email])
print(spread)
# <gspread_pandas.client.Spread - User: '<example_user>@gmail.com', Spread: 'Example Spreadsheet', Sheet: 'New Test Sheet'>

# You can now first instanciate a Client separately and query folders and
# instanciate other Spread objects by passing in the Client
client = Client('example_user')
# Assumming you have a dir called 'example dir' with sheets in it
available_sheets = client.find_spreadsheet_files_in_folders('example dir')
spreads = []
for sheet in available_sheets.get('example dir', []):
    spreads.append(Spread(client, sheet['id']))

Troubleshooting

SSL Error

If you’re getting an SSL related error or can’t seem to be able to open existing spreadsheets that you have access to, you might be running into an issue caused by certifi. This has mainly been experienced on RHEL and CentOS running Python 2.7. You can read more about it in issue 223 and issue 354 but, in short, the solution is to either install a specific version of certifi that works for you, or remove it altogether.

pip install certifi==2015.4.28

pip uninstall certifi

EOFError in Rodeo

If you’re trying to use gspread_pandas from within Rodeo you might get an EOFError: EOF when reading a line error when trying to pass in the verification code. The workaround for this is to first verify your account in a regular shell. Since you’re just doing this to get your Oauth token, the spreadsheet doesn’t need to be valid. Just run this in shell:

python -c "from gspread_pandas import Spread; Spread('<user_key>','')"

Then follow the instructions to create and store the OAuth creds.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

3.3.0

Feb 13, 2024

3.2.3

Aug 31, 2023

3.2.2

Jun 29, 2022

3.2.1

Jun 29, 2022

3.2.0

Mar 20, 2022

3.0.4

Jan 18, 2022

3.0.3

Jan 5, 2022

3.0.2

Dec 29, 2021

3.0.0

Dec 29, 2021

2.3.1

Nov 30, 2021

2.3.0

Mar 22, 2021

2.2.4

Mar 22, 2021

2.2.3

Mar 26, 2020

2.2.2

Mar 21, 2020

2.2.1

Jan 6, 2020

2.2.0

Nov 18, 2019

2.1.4

Nov 18, 2019

2.1.3

Aug 25, 2019

2.1.2

Jul 11, 2019

2.1.1

Mar 22, 2021

2.1.0

Mar 22, 2021

2.0.0

Jun 14, 2019

This version

1.3.1

May 18, 2019

1.3.0

Apr 30, 2019

1.2.2

Apr 16, 2019

1.2.1

Aug 30, 2018

1.1.3

Jul 8, 2018

1.1.2

Jun 23, 2018

1.1.1

Jun 13, 2018

1.1.0

Jun 2, 2018

1.0.5

Apr 14, 2018

1.0.4

Apr 8, 2018

1.0.3

Apr 2, 2018

1.0.2

Jun 14, 2019

1.0.1

Mar 26, 2018

1.0.0

Mar 26, 2018

0.16.4

Mar 27, 2018

0.16.3

Mar 27, 2018

0.16.2

Mar 26, 2018

0.16.1

Mar 24, 2018

0.16.0

Mar 27, 2018

0.15.6

Mar 12, 2018

0.15.5

Mar 12, 2018

0.15.4

Feb 13, 2018

0.15.3

Nov 21, 2017

0.15.2

Nov 18, 2017

0.15.1

Oct 5, 2017

0.15.0

Sep 11, 2017

0.14.3

Jun 22, 2017

0.14.2

Jun 19, 2017

0.14.1

Jun 5, 2017

0.14.0

May 25, 2017

0.13.0

Apr 28, 2017

0.12.1

Apr 25, 2017

0.12.0

Mar 31, 2017

0.11.2

Mar 22, 2017

0.11.1

Mar 22, 2017

0.11.0

Feb 15, 2017

0.10.1

Jan 26, 2017

0.10.0

Jan 18, 2017

0.9

Dec 8, 2016

0.8

Nov 11, 2016

0.7

Nov 11, 2016

0.6

Oct 27, 2016

0.5

Oct 19, 2016

0.4

Oct 19, 2016

0.3

Oct 19, 2016

0.2

Oct 12, 2016

0.1

Oct 12, 2016

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gspread-pandas-1.3.1.tar.gz (18.7 kB view hashes)

Uploaded May 18, 2019 Source

Built Distributions

gspread_pandas-1.3.1-py3.6.egg (37.9 kB view hashes)

Uploaded Jun 14, 2019 Source

gspread_pandas-1.3.1-py2.py3-none-any.whl (20.2 kB view hashes)

Uploaded May 18, 2019 Python 2 Python 3

Hashes for gspread-pandas-1.3.1.tar.gz

Hashes for gspread-pandas-1.3.1.tar.gz
Algorithm	Hash digest
SHA256	`3884eb13f69a4718f83f196558af8cd67f772dbd86d13636c795dc41c92c6e6e`
MD5	`cd78601c62c0cd886b571901592fd898`
BLAKE2b-256	`6e60431de1c1f14e492198317c025ac48a7f13cf98a8bad0f78622256d5a0d71`

Hashes for gspread_pandas-1.3.1-py3.6.egg

Hashes for gspread_pandas-1.3.1-py3.6.egg
Algorithm	Hash digest
SHA256	`a885b68a2ebd88a4c02965973c5ca330c7732f9bff9ec05b817fb28f638254c8`
MD5	`72deef1ca3e74584a9bb3e8a26327168`
BLAKE2b-256	`2c94e9cf7004fc4c0f12978a645ac98b17deca05b9ac09f527e2bbfc704d1daa`

Hashes for gspread_pandas-1.3.1-py2.py3-none-any.whl

Hashes for gspread_pandas-1.3.1-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`252329a78ab31da75441f612ee48ca24341b492559940f72f7f1dc5d25a1f767`
MD5	`baa1082a77f9fa127a2ec493dc168082`
BLAKE2b-256	`bae7bc325c30b3197cd2cc325116f023f35f973dc23a501bbb8b43fdec275589`

gspread-pandas 1.3.1

Navigation

Verified details (What is this?)

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Overview

Installation / Usage

Client Credentials

User Credentials

Handling Authentication

Contributing

Example

Troubleshooting

SSL Error

EOFError in Rodeo

Project details

Verified details (What is this?)

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distributions