Skip to main content

An application for dowloading daily water usage data from Thames Water (UK).

Project description

Disclaimer: I have no connection to Thames Water at all (other than a water supply and sewage pipe ;-) ) and this is a utility that I wrote because I was frustrated that I could not download this information directly from the website.

What It Does

This is a Python utility that allows you to download your daily water usage, over the last 6 months, from the Thames Water website and produce a comma separated values (CSV) file that you can import into a spreadsheet or use in whatever way you wish.

Under the covers the utility uses the Chrome browser to surf the Thames Water website and download the information that you want.

Installation

You will need both the Chrome browser and the ChromeDriver - WebDriver for Chrome. The ChromeDriver - WebDriver for Chrome allows this Python application to drive the Chrome browser just as if you were sat their typing and clicking links – you can sit and watch this happen if you like by not running this script ‘headless’.

So install the Chrome browser if you haven’t already and then copy the ChromeDriver - WebDriver for Chrome to somewhere on your system before installing this utility.

It is recommended that you install the utility into either Virtualenv or Venv environment and you must use Python3 not Python2. Once the environment is created, the following should install this utility:

$ pip install thameswater

The thameswater application can then be run to download your daily water usage data.

Usage

usage: thameswater.py [-h] --login LOGIN --password PASSWORD --driver DRIVER
                      [--headless] --csv CSV

Read daily water usage data from the Thames Water website and write it out to
a CSV file.

optional arguments:
-h, --help           show this help message and exit
--login LOGIN        your Thames Water login e-mail address
--password PASSWORD  your Thames Water login password
--driver DRIVER      location of browser driver e.g. chromedriver
--headless           use headless browser
--csv CSV            Name of CSV file to be written

You probably want to create a batch/shell script to avoid having to type the longish command line each time.

It Went Wrong!

Very occasionally it does, normally because the Thames Water website has stopped responding in a timely manner; go make a cup of tea and try later because there’s nothing you or I can do about that!

Under the Covers

The Python script uses Selenium to drive the Chrome browser via the ChromeDriver - WebDriver for Chrome. It then runs around the Thames Water website just as you would, including logging in using the e-mail address and password that you provided, until it reaches the ‘Daily Usage’ page.

It turns out that in order to show the daily bar charts, the Thames Water website downloads the last 6 months of daily usage in a JSON encoded file, which this script gets hold of, tweaks the dates into a more sensible format and then writes out to the CSV file. The JSON file is accessed via Chrome’s performance data, the data that Chrome uses to show network access if you watch your browsing with the Developer Tools open.

Why Chrome?

  • It’s the browser that I normally use; the code might well work with Firefox and the Firefox driver or perhaps even Edge or other browsers, providing a suitable Selenium driver is available.

  • Chrome can be run in ‘headless’ mode which is nice if you don’t want a browser to launch onto your desktop just to download these figures.

Debugging/Logging

At the time of writing you will see the following log appearing on screen:

DevTools listening on ws://127.0.0.1...

There does not appear to be a way to suppress this at present but if you find one, please let me know.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

thameswater-0.5.tar.gz (6.0 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page