Routinely collecting heterogeneous surveillance data into a coherent database
Project description
PANDEM - Source
Manage heterogeneous data sources for the PANDEM-2 project.
Description
Identify, map and integrate pandemic-related data from multiple sources into a coherent pandemic-management database. Developed within the H2020 project PANDEM-2, Pandem-Source allows users to systematically capture, standardize and analyze data coming from international and national surveillance databases, participatory surveillance projects, social networks and mass media. The tool is built for flexibility, so new sources or variables can be added easily as the need arises during a pandemic.
Approach
Data integration follows a semantic approach. Each data source only needs to be described in a Data Labelling Schema (DLS) file that specifies the acquisition channel (URL, git, local file or script), the format, and how to map the input data to Pandem-Source variables.
The DLS ensures that each source and variable is properly documented, recording its origin, meaning and data quality.
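To make the idea concrete, here is a minimal sketch in Python of what a source description carries: an acquisition channel, a format and a column mapping. It does not reproduce the real DLS file layout. Every key, variable name and the example URL below are hypothetical, and `check_source` and `apply_mapping` are illustrative helpers, not part of Pandem-Source.

```python
from urllib.parse import urlparse

# The four acquisition channels named in the text above.
CHANNELS = {"url", "git", "local file", "script"}

# Hypothetical source description: every key and value here is illustrative
# and does not reproduce the actual DLS file layout.
example_source = {
    "source_name": "example-weekly-cases",
    "acquisition": {"channel": "url", "location": "https://example.org/cases.csv"},
    "format": "csv",
    # Maps input column names to (hypothetical) target variable names.
    "columns": {"date": "reporting_date", "country": "geo_code", "cases": "confirmed_cases"},
}


def check_source(desc):
    """Return a list of problems found in a source description (empty if none)."""
    problems = []
    acquisition = desc.get("acquisition", {})
    channel = acquisition.get("channel")
    if channel not in CHANNELS:
        problems.append(f"unknown acquisition channel: {channel!r}")
    if channel == "url" and urlparse(acquisition.get("location", "")).scheme not in ("http", "https"):
        problems.append("url channel needs an http(s) location")
    if not desc.get("format"):
        problems.append("missing format")
    if not desc.get("columns"):
        problems.append("missing column mapping")
    return problems


def apply_mapping(row, desc):
    """Rename the keys of one input row according to the column mapping."""
    return {target: row[source] for source, target in desc["columns"].items()}
```

Calling `check_source(example_source)` returns an empty list, and `apply_mapping` turns one input row into a row keyed by the target variable names. The actual DLS files linked from the source list below remain the reference for the real structure.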
Supported sources
Pandem-Source is designed to be flexible and extensible so that end users can easily add new sources. To demonstrate this principle, the following sources are supported out of the box:
- COVID19 Data Hub see source definition
- ICD-10-CM see source definition
- ECDC Atlas see source definition
- ECDC COVID19 Datasets see source definition
- Influenza Net see source definition
- MediSys see source definition
- Eurostat NUTS see source definition
- Twitter see source definition
- User-provided data. You can use this data template to easily integrate your own real or fictitious datasets.
User Interface - timeseries explorer
User Interface - Integration dashboard
Requirements
- Python 3.7 or higher
- R 3.6.3 or higher
- Docker (optional, needed only for machine learning article classification)
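A quick way to check these prerequisites before installing is sketched below. It uses only the Python standard library. The function name `check_requirements` is hypothetical, and the sketch assumes that R is reachable through an `Rscript` executable on the `PATH`. It checks only that R is present, not that it is version 3.6.3 or higher.

```python
import shutil
import sys


def check_requirements(min_python=(3, 7)):
    """Report which prerequisites appear to be available on this machine."""
    return {
        "python": sys.version_info[:2] >= min_python,
        # Assumption: R is installed in a way that puts Rscript on the PATH.
        "r": shutil.which("Rscript") is not None,
        # Docker is optional, so a False here is not necessarily a problem.
        "docker": shutil.which("docker") is not None,
    }
```

`check_requirements()` returns a dictionary of booleans keyed by `python`, `r` and `docker`.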
Installation
pip install pandem-source
Running Pandem-Source
- Set the PANDEM_HOME variable to a local folder
export PANDEM_HOME=your data folder here
- Load default sources
python -m pandemsource reset --restore-factory-defaults
- Run monitoring and dashboard
python -m pandemsource start -d
- Access the dashboard at http://localhost:8001 to see the progress and the integrated time series
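The steps above can also be scripted. The sketch below wraps the same two commands in Python using only the standard library. The helper names `pandem_env`, `startup_commands` and `run_startup` are hypothetical. Whether `start -d` returns immediately or keeps running in the foreground is not stated here, so `run_startup` may block on the second command.

```python
import os
import subprocess
import sys
from pathlib import Path


def pandem_env(home):
    """Copy the current environment and point PANDEM_HOME at `home`."""
    env = dict(os.environ)
    env["PANDEM_HOME"] = str(Path(home).expanduser().resolve())
    return env


def startup_commands(python=sys.executable):
    """The two commands from the steps above, as argument lists."""
    return [
        [python, "-m", "pandemsource", "reset", "--restore-factory-defaults"],
        [python, "-m", "pandemsource", "start", "-d"],
    ]


def run_startup(home):
    """Create the data folder, then run each command, stopping on the first failure."""
    Path(home).expanduser().mkdir(parents=True, exist_ok=True)
    env = pandem_env(home)
    for command in startup_commands():
        # check=True raises CalledProcessError if a command exits with an error.
        subprocess.run(command, env=env, check=True)
```

For example, `run_startup("~/pandem-data")` would create that folder if needed and run both commands with `PANDEM_HOME` pointing at it. The folder name is only an illustration.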
PANDEM 2
PANDEM-2 is an H2020 EU-funded project that aims to develop new solutions for efficient, EU-wide pandemic management. The goal of PANDEM-2 is to prepare Europe for future pandemics through innovations in training and to build capacity between EU member states responding to pandemics on a cross-border basis.