USGS and Twitter data gathering and analysis tools
Project description
Project Title
Flood Analytics Information System (FAIS) - Big Data Gathering package from twitter and USGS
Getting Started
FAIS is a python package developed using python 3.7, which is not supported by python2. The package is used as a data gathering tool for USGS flood data and Twitter Data. The user can specifically target the flood station in each State in the US for the historical data as well as real time flood data. To gather Twitter feeds, the user can specified the targeted username or keyword with date gape to gather all the tweets that match the criteria
Prerequisites
The package dependencies are:
- pandas
- numpy
- rpy2
- urllib3
- requests
- opencv-python
- netCDF4
- matplotlib
- textblob
- pyquery
- tweepy
Installing
The package can be installed using Pypi installation
$ pip install fais
Usage
Real Time USGS Flood Data
Each state at the US has its own flood monitoring stations that are identified with specific id, coordinates, etc. The API allows the user to target specific state and get the real time flood information. The user also has choices to either save the data as csv file or return it as a panda dataframe.
Example 1
The user wants to gather South Carolina real time data, and save it as a csv file called “sc_realtime.csv”
>>> from fais import usgsgatherer as usgs
>>> usgs.get_realtime_flood_csv(“sc”, “sc_realtime.csv”)
The software will download the real time flood reading from any station in South Carolina, delete the null result, and save it as sc_realtime.csv file at the specified directory. The inputs of this function are the targeted State abbreviation and the file name.
Example 2
The user wants to gather real time flood data from Arizona and return it as a data frame format.
>>> from fais import usgsgatherer as usgs
>>> df = usgs.get_realtime_flood_(“az”)
The object df is contain a result of the realtime flood data as a panda dataframe format.
Example 3
The user wants to gather river cam image at Rocky Creek cam station 021603273 located in South Carolina. The station list can be found at rivercam
>>> from fais import usgsgatherer as usgs
>>> img = usgs.get_river_cam_sc_color(“021603273”)
The image file contains an image array of the real time river cam. The image is supported by
Historical USGS Flood Data
Example 1
The user wants to gather North Carolina Flood data for Hurricane Matthew, October 06-07, 2016 with specific station number 0212427947, which is REEDY CREEK AT SR NR CHARLOTTE, NC and save it as a csv file called “nc_mathew.csv”
>>> from fais import usgsgatherer
>>> criteria = usgsgatherer.create_usgs_criteria("NC","0212427947", ["00065", "00045","00060"], "2016-10-06", "2016-10-07")
>>> usgs.get_flood_data_csv(criteria, "nc_matthew.csv")
The software will download the flood data from REEDY CREEK AT SR NR CHARLOTTE, NC delete the null result, and save it as nc_mathew.csv file at the specified directory. The inputs of this function are the targeted criteria which include the targeted state, station number, parameters, date and the file name. If the user wish to use the result as a data frame object, simply change get_flood_data_csv() to get_flood_data_dataframe()
Example 2
The user can receive a list of flood stations from each state and use the historical data gathering module to collect the data. the module returns the station list from North Carolina and save it as a csv file called “nc_station.csv”
>>>from fais import usgsgatherer
>>>criteria = usgsgatherer.get_station_list_csv("NC", "nc_station.csv")
Twitter Data
Example 1
The user wants to gather Tweets from governemnet official twitter account such as the National Weather Service during Hurricane Florence --September 20, 2018 – October 20, 2018 and save it as a csv file called “nws_florence.csv”
>>> from fais import twittergatherer
>>> twitter_username = "nws"
>>> twitter_keyword = None
>>> twitter_since = "2018-09-20"
>>> twitter_until = "2018-10-20"
>>> twitter_criteria = twittergatherer.create_twitter_criteria(twitter_username, twitter_keyword,twitter_since,twitter_until, 10000)
>>> twittergatherer.get_tweets_csv(twitter_criteria, "nws_florence.csv")
The software will download all Tweets posted by National Weather Service’s Twitter account during September 20, 2018 to October 20, 2018 with the maximum of 10,000 tweets.
Authors
- Nattapon Donratanapat
- Dr. Vidya Samadi
- Dr. Jose M. Vidal
License
This project is licensed under the MIT License - see the LICENSE.md file for details
Acknowledgments
- Jefferson Henrique - Github
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file fais-0.0.15.tar.gz
.
File metadata
- Download URL: fais-0.0.15.tar.gz
- Upload date:
- Size: 11.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b2131e0e563f5d6ab89eb5791c5771100b2bb067ff2230acc0eb7dac90d61ddc |
|
MD5 | 4c998c3b2b1abebf54f1a8334e2e52cc |
|
BLAKE2b-256 | c97b7f9b3d32531b98061f0896d63c303877224d58138a346f4c363678e40233 |
File details
Details for the file fais-0.0.15-py3-none-any.whl
.
File metadata
- Download URL: fais-0.0.15-py3-none-any.whl
- Upload date:
- Size: 15.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0bca3fcfa79907ebf9697e2c5b68a6d52c725ebb914215ea4578a1371bfa36d6 |
|
MD5 | faa80288843992f07ad61ebc0e7fd833 |
|
BLAKE2b-256 | de77e3f41e30f5111ad2ca7771b91e688a2913509ab36a8489b8d53903187ff9 |