Library to get data from Tableau Viz
Project description
Tableau Scraper
Python library to scrape data from Tableau viz
R library is under development but a script is available to get the worksheets, see this
Python
Install
pip install tableau-scraper
Usage
- Get worksheets data
from tableauscraper import TableauScraper as TS
url = "https://public.tableau.com/views/PlayerStats-Top5Leagues20192020/OnePlayerSummary"
ts = TS()
ts.loads(url)
dashboard = ts.getDashboard()
for t in dashboard.worksheets:
#show worksheet name
print(f"WORKSHEET NAME : {t.name}")
#show dataframe for this worksheet
print(t.data)
- select a selectable item
from tableauscraper import TableauScraper as TS
url = "https://public.tableau.com/views/PlayerStats-Top5Leagues20192020/OnePlayerSummary"
ts = TS()
ts.loads(url)
#show selectable columns
columns = ts.getWorksheet("ATT MID CREATIVE COMP").getSelectableColumns()
print(columns)
#show values by column name
values = ts.getWorksheet("ATT MID CREATIVE COMP").getValues("ATTR(Player)")
print(values)
#select that value
dashboard = ts.getWorksheet("ATT MID CREATIVE COMP").select("ATTR(Player)", "Vinicius Júnior")
#display worksheets
for t in dashboard.worksheets:
print(t.data)
- select item in a dropdown
from tableauscraper import TableauScraper as TS
url = "https://public.tableau.com/views/PlayerStats-Top5Leagues20192020/OnePlayerSummary"
ts = TS()
ts.loads(url)
dashboard = ts.getDashboard()
#show dropdown input name
inputNames = dashboard.getDropdownInputs()
print(inputNames)
#show dropdown values for a given input name
values = dashboard.getDropdownValues("P.League 2")
print(values)
#select that value
dashboard = dashboard.setDropdown("P.League 2", "Ligue 1")
#display worksheets
for t in dashboard.worksheets:
print(t.data)
Important Note
In ts.loads(url)
you must input the Tableau URL which can be different from the one you're looking at in the browser. Open the network tab under Chrome Development tools and look for URL with those query params : :embed=y
and :showVizHome=no
. Before the session query which looks like 315165BFBC204028B80CD3FB73880452-0:0
in the network tabs
In the following screenshot, the first request is the corret URL. The last request is the session query :
Testing Python script
To discover all worksheets, selectable columns and dropdowns, run prompt.py
script under scripts
directory :
git clone git@github.com:bertrandmartel/tableau-scraping.git
cd tableau-scraping/scripts
#get worksheets data
python3 prompt.py -get dashboard -url "https://public.tableau.com/views/COVID-19inMissouri/COVID-19inMissouri"
#select a selectable item
python3 prompt.py -get select -url "https://public.tableau.com/views/MKTScoredeisolamentosocial/VisoGeral"
#select an item in dropdown
python3 tableau.py -get dropdown -url "https://public.tableau.com/views/COVID-19DailyDashboard_15960160643010/Casesbyneighbourhood"
Settings
TableauScraper
class has the following optional parameters :
Parameters | default value | description |
---|---|---|
logLevel | logging.INFO | log level |
delayMs | 500 | minimum delay in millis between actions (select/dropdown request) |
R
under R
directory :
Rscript tableau.R
R library is under development
How it works
Tableau dashboard is rendered get its data from an internal API. In order to get the data, you must get the initial tableau URL which is called with the query parameter :embed=y
and :showVizHome=no
(checkout networks logs)
In the html body, you have a textarea
tag with id tsConfigContainer
with a JSON configuration. The session_id
field and root path vizql_root
makes possible to build the session uri :
POST https://public.tableau.com/ROOT_PATH/bootstrapSession/sessions/SESSION_ID
The result is 2 json objects. One of them contains the data. The JSON object is complex since it dissociates the value indices from the column indices and from the worksheets.
Stackoverflow Questions
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for TableauScraper-0.0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 32a18a4b00cde42dbb2e1ea90444d7d1cebeca69548a41004c60c4c62e39ce38 |
|
MD5 | 311e28f3c81da7e37ddae1dd541a990d |
|
BLAKE2b-256 | fab317cf406f946112883216c0b8e4b4feb13f43904ab0f3c7b22692e99df581 |