A series of fixes for Data originating from Northern Ireland
Project description
tornamona
This is very much a work in progress, and I'll only really add to it as things annoy me. If you have a messy data set and can use workflows similar to those demonstrated in this repo, please feel free to use this and PR the crap out of it.
I make no claims of liability, credit, or property over any of this, and make no promises that anything will ever work or be fixed.
Current Datasets
NISRA Weekly Deaths Datasets
Source: https://www.nisra.gov.uk/publications/weekly-deaths
What is this data?
- Weekly death registrations in Northern Ireland, from 2009 to 2020
- Also includes tracking of the 2020 COVID-19 outbreak
What is wrong with the data as presented?
- The usual boilerplate (i.e. non-numerical footers, non-merged multi-row headers, overly complex column headers, inconsistent structure)
- A typo in the 2014 Week Start date (which actually inspired me to make this repo in the first place; this has since been fixed)
How do I use the new one?
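To make the "usual boilerplate" concrete, here is a minimal sketch of that kind of cleanup in plain pandas. It is not the code tornamona uses internally; the `raw` frame, the `tidy` helper, and the column labels are all made up for illustration, and it assumes the first column of every real data row is numeric.

```python
import pandas as pd

# Hypothetical raw sheet: two header rows, two data rows, one text footer.
raw = pd.DataFrame(
    [
        ["Week", "Week", "Total"],
        ["Number", "Start", "Deaths"],
        [1, "2008-12-27", 373],
        [2, "2009-01-03", 454],
        ["Source: NISRA", None, None],
    ]
)


def tidy(sheet: pd.DataFrame, header_rows: int = 2) -> pd.DataFrame:
    """Merge multi-row headers and drop non-numerical footer rows."""
    # Join the header rows column by column into one label per column.
    header = sheet.iloc[:header_rows].fillna("").astype(str)
    labels = [
        " ".join(part for part in header[col] if part) for col in header.columns
    ]
    body = sheet.iloc[header_rows:].copy()
    body.columns = labels
    # Footers are free text, so their first cell fails numeric conversion.
    first = pd.to_numeric(body[labels[0]], errors="coerce")
    body = body[first.notna()].reset_index(drop=True)
    body[labels[0]] = body[labels[0]].astype(int)
    return body


clean = tidy(raw)
```

On the toy sheet this leaves two rows with the columns `Week Number`, `Week Start` and `Total Deaths`; real spreadsheets will need more care (merged cells, footnote markers inside numbers, and so on).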
from tornamona.fixes import nisra
dataset = nisra.WeeklyDeaths().get().clean()
dataset.data.head().to_markdown()
| | Week | Week Start | Week End | Total Deaths | Average Deaths for previous 5 years | Min 5 year deaths | Max 5 year deaths | Respiratory Deaths | Average Respiratory Deaths for previous 5 years | COVID19 Deaths |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 2008-12-27 00:00:00 | 2009-01-02 00:00:00 | 373 | 332.4 | 309 | 364 | nan | nan | nan |
| 1 | 2 | 2009-01-03 00:00:00 | 2009-01-09 00:00:00 | 454 | 329.2 | 302 | 377 | nan | nan | nan |
| 2 | 3 | 2009-01-10 00:00:00 | 2009-01-16 00:00:00 | 388 | 310.2 | 290 | 340 | nan | nan | nan |
| 3 | 4 | 2009-01-17 00:00:00 | 2009-01-23 00:00:00 | 402 | 324 | 281 | 367 | nan | nan | nan |
| 4 | 5 | 2009-01-24 00:00:00 | 2009-01-30 00:00:00 | 353 | 305.6 | 272 | 333 | nan | nan | nan |
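Once the data is in this shape, ordinary pandas operations work on it directly. The sketch below assumes only the column names shown in the table above; the `sample` frame is typed in by hand from the first three rows so it runs offline, and the `Excess Deaths` column and `bad_weeks` check are my own additions rather than anything the package provides.

```python
import pandas as pd

# First three rows of the table above, entered by hand (no download needed).
sample = pd.DataFrame(
    {
        "Week": [1, 2, 3],
        "Week Start": pd.to_datetime(["2008-12-27", "2009-01-03", "2009-01-10"]),
        "Week End": pd.to_datetime(["2009-01-02", "2009-01-09", "2009-01-16"]),
        "Total Deaths": [373, 454, 388],
        "Average Deaths for previous 5 years": [332.4, 329.2, 310.2],
    }
)

# Consecutive weeks should start exactly seven days apart; a typo in a
# start date shows up as a gap of any other length.
gaps = sample["Week Start"].diff().dropna()
bad_weeks = sample.loc[gaps[gaps != pd.Timedelta(days=7)].index, "Week"]

# Deaths above (or below) the five-year average for the same week.
sample["Excess Deaths"] = (
    sample["Total Deaths"] - sample["Average Deaths for previous 5 years"]
).round(1)
```

With a real download you would replace `sample` with `dataset.data`; a non-empty `bad_weeks` is the sort of thing that would have flagged the Week Start typo mentioned earlier.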
Proposed Datasets
Everything that has ever pissed me off about open data
Credits
This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.
History
0.1.2 (2020-04-27)
- Reflect upstream data fix from NISRA
0.1.1 (2020-04-26)
- Instrumentation, Testing and Documentation Fixes
- Security updates
0.1.0 (2020-04-25)
- First release on PyPI.