
A Python package for obtaining and cleaning brightness temperature (Tb) files

Project description

Disclaimer

  • All study area boxes should be oriented to the North when choosing upper left and lower right bounding coordinates.

  • Currently this Python library is not supported on Windows, because pynco only supports macOS and Unix.

  • Requires Python 3.6 and Anaconda 3

SWEpy

SWEpy is a Python library designed to give you quick and easy access to brightness temperature imagery from the MEaSUREs NSIDC-0630 dataset. SWEpy contains tools to web scrape, geographically subset, and concatenate files into time cubes. There is an automated workflow that scrapes long time series while periodically stopping to geographically subset and concatenate files in order to reduce disk impact.

Setup:

1. Setup Earthdata Login

Create an Earthdata account to be able to download data: https://urs.earthdata.nasa.gov/

Optional (.netrc file vs. passing username and password):

Store your username and password in a .netrc file in your home directory by running:

echo "machine urs.earthdata.nasa.gov login <uid> password <password>" >> ~/.netrc
chmod 0600 ~/.netrc

uid is your Earthdata username. Do not include the brackets <>.

https://nsidc.org/support/faq/what-options-are-available-bulk-downloading-data-https-earthdata-login-enabled

2. Install SWEpy Using Conda (Recommended):

SWEpy is available from Anaconda and will install all of its dependencies when installed.

** Important ** conda-forge must be the first channel in your .condarc file:

channels:
  - conda-forge
  - wino6687
  - defaults

Then install SWEpy:

conda install swepy

** Note ** If you do not have the wino6687 channel in your .condarc file, you will need to specify it explicitly: conda install -c wino6687 swepy

Alternative: Setup conda environment from yaml

The libraries used in this analysis, namely pynco, can be finicky about the channels their dependencies are installed from. Using the provided YAML file to build an environment for this project will therefore make your life simpler. You can add more packages on top of the provided environment as long as you install them from the conda-forge channel.

Using the YAML file (.yml), create a new conda environment:

conda env create -f swepy_env.yml

3. Install ipykernel (if using Jupyter with conda environments)

source activate swepy_env
python -m ipykernel install --user --name <env name> --display-name "<display name>"

Do not include the brackets <>

Using SWEpy for analyzing SWE:

  1. Import the library:
from swepy.swepy import swepy
  2. Instantiate the class with a working directory, date range, bounding coordinates, and your Earthdata username and password:
  • Reminder: Don't forget to orient your upper-left and lower-right bounding coordinates to the North.

  • By default, the high_res parameter is set to True, meaning SWEpy will scrape high-resolution (6.25 km) images. If it is passed as False, SWEpy will scrape 25 km images instead.

import datetime
import os

upper_left = [lon_upleft, lat_upleft]
lower_right = [lon_lowright, lat_lowright]

start = datetime.date(startY, startM, startD)
end = datetime.date(endY, endM, endD)

path = os.getcwd()

username = "<username>"  # Earthdata credentials; do not include the brackets <>
password = "<password>"

swepy = swepy(path, start, end, upper_left, lower_right, username, password, high_res=True)
  3. Use the desired functions, either individually or chained together:
swepy.scrape()
swepy.subset()
swepy.concatenate()

swepy.concatenate(swepy.subset(swepy.scrape()))
  4. Or, use scrape_all to avoid massive file sizes:
swepy.scrape_all()

This limits the number of full-size images on your disk at one time.
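For example, a minimal end-to-end run might look like the following sketch (the study area, date range, and credentials here are hypothetical placeholders):

import datetime
import os

from swepy.swepy import swepy

# Hypothetical study area in Alaska; corners are [longitude, latitude],
# oriented to the North
upper_left = [-150.0, 66.0]
lower_right = [-147.0, 64.0]

start = datetime.date(2015, 1, 1)
end = datetime.date(2015, 1, 31)

s = swepy(os.getcwd(), start, end, upper_left, lower_right,
          username="myuser", password="mypass", high_res=True)

s.scrape_all()  # download, subset, and concatenate in manageable chunks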

If you would like a full grid file with no subsetting, simply pass the grid ID as both your upper left and lower right bounding coordinates:

  • North = "N"
  • South = "S"
  • Equator = "T"
upper_left = "N"
lower_right = "N"
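For instance, a minimal sketch (assuming the constructor accepts the grid ID in place of coordinate lists, as described above):

# "N" in both positions requests the full Northern Hemisphere grid, no subsetting
s = swepy(path, start, end, "N", "N", username, password)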
  5. If you need to give the class more information, or change information it already has, use the set_params function:
swepy.set_params(ul = [-145,66], lr = [-166, -16])

Using SWEpy's Web Scraper Alone:

  • Note: The web scraper is invoked automatically in the scrape_all workflow; however, it can also be used on its own!
from datetime import datetime

from swepy.nsidcDownloader import nsidcDownloader

## Ways to instantiate nsidcDownloader
nD = nsidcDownloader(username="user", password="pass", folder=".")  ## user/pass combo and folder

nD = nsidcDownloader(sensor="SSMIS")  ## user/pass from .netrc and default folder, setting the default sensor

## Download a file:

file = {
    "resolution": "3.125km",
    "platform": "F17",
    "sensor": "SSMIS",
    "date": datetime(2015, 10, 10),
    "channel": "37H"
}

nD.download_file(**file)

## Download a range of dates:
nD.download_range(sensor="SSMIS", date=[datetime(2014, 1, 1), datetime(2015, 1, 1)])
  • Authentication will work if the user/pass combo is saved in ~/.netrc, or if it is passed when instantiating nsidcDownloader

Function Summaries

Descriptions of included functions

swepy = swepy(working_dir, start, end, ll_ul, ll_lr, username, password)
  • Instantiate the class with the working directory path, the start date, the end date, the bounding coordinates, and your Earthdata username and password.
  • Once the class is instantiated, either call scrape_all or call scrape, then subset, then concatenate as desired.
swepy.set_params(start=None, end=None, username=None, password=None, ul=None, lr=None)
  • Parameters:
    • start/end: datetime objects
    • username/password: strings
    • ul/lr: lists of [longitude, latitude]
  • Sets any class members that you want to change or add without re-instantiating the class
  • Allows users to scrape files based on date and grid and subset later
swepy.get_xy(latlon_ul, latlon_lr)
  • Parameters: lists of [longitude, latitude] for the upper left and lower right corners
  • Uses NSIDC scripts to convert user-input lat/lon into EASE-Grid 2.0 coordinates (see the sketch after this list)
  • Returns: EASE-Grid 2.0 coordinates of the input lat/lon points
swepy.subset()
  • Parameters: none; uses the list of downloaded files stored in the class by the scrape() function
  • Geographically subsets the downloaded files to match the input study area
  • Returns: the subsetted files
swepy.concatenate()
  • Parameters: current working directory, output file for 19 GHz, output file for 37 GHz
  • The concatenate function merges all netCDF files into one large file
  • Returns: concatenated netCDF file
swepy.scrape_all()
  • Parameters: none, everything needed comes from class instantiation
  • Complete function that downloads, subsets, and concatenates the data
  • Returns: file names of the concatenated 19/37 GHz time cubes
swepy.plot_a_day(token)
  • Parameters: Mapbox token; everything else comes from the stored concatenated file list
  • Plots a day of data using Mapbox in Jupyter
  • Returns: interactive map of the input data
swepy.get_file(path, date, channel)
  • Parameters: the date of the desired file and the channel (19 GHz vs 37 GHz)
  • Gets the file path of the file to download for a specific day
  • Returns: the file specification to download, based on date and channel
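The sketch below shows hypothetical calls to get_xy and plot_a_day; s is an instantiated swepy object as in the earlier examples, the coordinates and Mapbox token are made up, and the exact structure of get_xy's return value is an assumption:

# Convert hypothetical lat/lon corners to EASE-Grid 2.0 coordinates
# (the exact shape of the returned value is an assumption)
bounds = s.get_xy([-150.0, 66.0], [-147.0, 64.0])

# Plot one day of the concatenated data; requires a valid Mapbox token
s.plot_a_day(token="pk.your_mapbox_token")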

Main Dependencies:

  • gdal
  • affine
  • requests
  • scikit-image
  • pynco
  • netCDF4
  • datetime
  • tqdm
  • pandas
  • cartopy

Troubleshooting

  1. ‘image not found’ errors: If you encounter ‘image not found’ errors, it is likely you are having channel dependency issues. Check that conda-forge is the first channel in your .condarc file:

    $ cat ~/.condarc
    channels:
      - conda-forge
      - defaults

After saving this file, update conda:

conda update --all

https://conda-forge.org/docs/conda-forge_gotchas.html#using-multiple-channels

  2. HDF5 errors: If you are getting HDF5 errors, try deleting all the netCDF files in your directories and starting over. This usually occurs when there are already files in the data directories before calling scrape_all, and ncks gets confused in the subset step.
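If you want to clear out the old netCDF files before re-running, a minimal sketch (the "**/*.nc" pattern is an assumption about your directory layout; adjust it to wherever your files actually live):

import glob
import os

# Remove every netCDF file under the current directory before re-running scrape_all
for f in glob.glob("**/*.nc", recursive=True):
    os.remove(f)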

Known Bugs:

  1. Missing image error when importing swepy or when calling swepy functions

    • There is an issue where gdal is not installed from the conda-forge channel when swepy is installed through conda.
    • Solution: make sure conda-forge is at the top of the channels list in your .condarc file, then run the command conda update --all.
  2. Missing data can cause plotting to error out.

    • Missing data is common in the mid-latitudes, so if your mid-latitude study area errors out when plotting, this is likely the issue.
  3. 25km data will not plot properly in mapbox.

    • For some reason, there is an issue converting dataframes to GeoJSON, but only with the 25 km data. I am looking into why this is, but for now visualization only works on high-resolution data.

Citations:

This library is designed to work with the MEaSUREs CETB dataset:

Brodzik, M. J., D. G. Long, M. A. Hardman, A. Paget, and R. Armstrong. 2016. MEaSUREs Calibrated Enhanced-Resolution Passive Microwave Daily EASE-Grid 2.0 Brightness Temperature ESDR, Version 1. [Indicate subset used]. Boulder, Colorado USA. NASA National Snow and Ice Data Center Distributed Active Archive Center. doi: https://doi.org/10.5067/MEASURES/CRYOSPHERE/NSIDC-0630.001. [June 2018].
