A Python package for the National Syndromic Surveillance Program (NSSP) and its Community of Practice. A collection of classes and methods to advance the practice of Syndromic Surveillance.
Project description
pynssp
Overview
A Python package for the National Syndromic Surveillance Program (NSSP) and its Community of Practice. A collection of classes and methods to advance the practice of Syndromic Surveillance. This python package serves as a Python alternative to the Rnssp R package.
Installation
Stable release
To install pynssp
, run this command in your terminal:
pip install pynssp
This is the preferred method to install pynssp
, as it will always install the most recent stable release.
Development version
You can install the development version of pynssp
from Github:
pip install git+https://github.com/CDCgov/pynssp.git
Usage
from pynssp import *
from datetime import date, timedelta
import pandas as pd
## Creating a user profile (token)
myProfile = create_token_profile()
## Creating a user profile (username and password)
myProfile = create_profile()
## JSON URL from NSSP-ESSENCE API
url = "https://essence.syndromicsurveillance.org/nssp_essence/api/alerts/regionSyndromeAlerts?end_date=31Jan2021&start_date=29Jan2021"
## Update Start and End dates in NSSP-ESSENCE API URL
startDate = date.today() - timedelta(days=30)
endDate = date.today()
url = change_dates(url, start_date = startDate, end_date = endDate)
## Pull Time Series Data from NSSP-ESSENCE
api_data = get_api_data(url, profile=myProfile)
## Inspect data object structure
api_data.columns
## Extract table of interest
api_data = pd.json_normalize(api_data["regionSyndromeAlerts"][0])
## Get a glimpse of the pulled dataset
api_data.head()
Contributing to this project
Should you want to contribute to this project, submit a push request to this Github repository and consider submitting a request to be added as a developer to gazondekon@cdc.gov.
Getting Help
If you encounter a clear bug, please consider emailing the author at gazondekon@cdc.gov and/or file an issue with a minimal reproducible example.
Public Domain Standard Notice
This repository constitutes a work of the United States Government and is not subject to domestic copyright protection under 17 USC ? 105. This repository is in the public domain within the United States, and copyright and related rights in the work worldwide are waived through the CC0 1.0 Universal public domain dedication. All contributions to this repository will be released under the CC0 dedication. By submitting a pull request you are agreeing to comply with this waiver of copyright interest.
License Standard Notice
The repository utilizes code licensed under the terms of the Apache Software License and therefore is licensed under ASL v2 or later.
This source code in this repository is free: you can redistribute it and/or modify it under the terms of the Apache Software License version 2, or (at your option) any later version.
This source code in this repository is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the Apache Software License for more details.
You should have received a copy of the Apache Software License along with this program. If not, see http://www.apache.org/licenses/LICENSE-2.0.html
The source code forked from other open source projects will inherit its license.
Privacy Standard Notice
This repository contains only non-sensitive, publicly available data and information. All material and community participation is covered by the Disclaimer and Code of Conduct. For more information about CDC's privacy policy, please visit http://www.cdc.gov/other/privacy.html.
Contributing Standard Notice
Anyone is encouraged to contribute to the repository by forking and submitting a pull request. (If you are new to GitHub, you might start with a basic tutorial.) By contributing to this project, you grant a world-wide, royalty-free, perpetual, irrevocable, non-exclusive, transferable license to all users under the terms of the Apache Software License v2 or later.
All comments, messages, pull requests, and other submissions received through CDC including this GitHub page may be subject to applicable federal law, including but not limited to the Federal Records Act, and may be archived. Learn more at http://www.cdc.gov/other/privacy.html.
Records Management Standard Notice
This repository is not a source of government records, but is a copy to increase collaboration and collaborative potential. All government records will be published through the CDC web site.
Release Notes
0.1.0 (2023-06-22)
- First release on PyPI.
Features
Credentials Management
create_profile()
initializes aCredentials
object from a username and password.create_token_profile()
initializes aToken
object from a REST API token.
REST API data pulls
get_api_response()
retrieves a response from an API service using a provided profile of typeCredentials
orToken
. Used as a wrapper to the.get_api_response()
method of a provided profile of typeCredentials
orToken
.get_api_data()
retrieves data (in JSON or CSV) from an API service using a provided profile of typeCredentials
orToken
. Used as a wrapper to the.get_api_data()
method of a provided profile of typeCredentials
orToken
.get_api_graph()
retrieves a graph from an API service using a provided profile of typeCredentials
orToken
. Used as a wrapper to the.get_api_graph()
method of a provided profile of typeCredentials
orToken
.get_essence_data()
retrieves data from the NSSP-ESSENCE API service using a provided profile of typeCredentials
orToken
.
Anomaly Detection and Trend Classification
alert_ewma()
implements the EWMA time series anomaly detection algorithm.alert_regression()
implements the Multiple Adaptive Regression time series anomaly detection algorithm.alert_switch()
implements the Regression/EWMA Switch time series anomaly detection algorithmalert_nbinom()
implements the Negative Binomial Regression time series anomaly detection algorithm.alert_serfling()
implements the original and modified Serfling method for time series anomaly detection.classify_trend()
fits rolling binomial models to a daily time series of percentages or proportions to classify the overall trend.
Added Data
load_simulated_ts()
loads a dataframe of simulated time series.load_nssp_stopwords()
loads a dataframe of NSSP-curated stopwords.get_scenario1()
loads a a subset of the simulated time series data ("scenario #1").get_scenario2()
loads a a subset of the simulated time series data ("scenario #2").
Utility Functions
change_dates()
modifies the start and end dates in a given URL to new dates, if provided.webscrape_icd()
scrapes ICD Codes from the Web.
Classes
Credentials
is an abstract representation of a profile object given a username and password strings.Token
is an abstract representation of a profile object given a token string.APIGraph
is an abstract representation of a graph object returns from an API service.NSSPContainer
encapsulates a value or an object to store.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file pynssp-0.1.0.tar.gz
.
File metadata
- Download URL: pynssp-0.1.0.tar.gz
- Upload date:
- Size: 1.1 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b419c9dcdbef21fba1966fe4b99e2b828797cc7ad08d198b191b35b9f4b4c672 |
|
MD5 | 16d71c9ef96f32287bf835c5e7c8a7cd |
|
BLAKE2b-256 | c5b321d5b48b9da0b57280ab86170e3ec5846da108594d1cf9386f45a5ed9528 |
File details
Details for the file pynssp-0.1.0-py2.py3-none-any.whl
.
File metadata
- Download URL: pynssp-0.1.0-py2.py3-none-any.whl
- Upload date:
- Size: 42.8 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8900d39b2419831b59c1961103e20bbe547446a5f4a0349f634466a5689ec555 |
|
MD5 | 6c4fb856fcd6847f84b44bbcc29c9a69 |
|
BLAKE2b-256 | 93e99cf1a17c83f7c90d7a6364fd7c552f3372210f8a69eb6985d5f5ab6b9d0e |