Provides utility functions for accessing the data repository used by ARM data examples and notebooks
arm-test-data
A place to share atmospheric data with the community, shared throughout the Atmospheric Radiation Measurement user facility and beyond!
Sample data sets
These files are used as sample data in openradar examples/notebooks and are downloaded by the arm-test-data package:
- 201509021500.bi
- AAFNAV_COR_20181104_R0.ict
- AMF_US-CU1_BASE_HH_1-5.csv
- AMF_US-CU1_BIF_20250318.xlsx
- NEON.D18.BARR.DP1.00002.001.000.010.001.SAAT_1min.2022-10.expanded.20221107T205629Z.csv
- NEON.D18.BARR.DP1.00002.001.sensor_positions.20221107T205629Z.csv
- NEON.D18.BARR.DP1.00002.001.variables.20221201T110553Z.csv
- anltwr_mar19met.data
- ayp22199.21m
- ayp22200.00m
- brw21001.dat
- brw_12_2020_hour.dat
- brw_CCl4_Day.dat
- co2_brw_surface-insitu_1_ccgg_MonthlyData.txt
- ctd21125.15w
- ctd22187.00t.txt
- enametC1.b1.20221109.000000.cdf
- gucmetM1.b1.20230301.000000.cdf
- list_of_files.txt
- maraosmetM1.a1.20180201.000000.nc
- marirtsstM1.b1.20190320.000000.nc
- marnavM1.a1.20180201.000000.nc
- met_brw_insitu_1_obop_hour_2020.txt
- met_lcl.nc
- mosaossp2M1.00.20191216.000601.raw.20191216000000.ini
- mosaossp2M1.00.20191216.130601.raw.20191216x193.sp2b
- mosaossp2auxM1.00.20191217.010801.raw.20191216000000.hk
- nsacloudphaseC1.c1.20180601.000000.nc
- nsasurfspecalb1mlawerC1.c1.20160609.080000.nc
- sgp30ebbrE13.b1.20190601.000000.nc
- sgp30ebbrE32.b1.20191125.000000.nc
- sgp30ebbrE32.b1.20191130.000000.nc
- sgp30ecorE14.b1.20190601.000000.cdf
- sgpaerich1C1.b1.20190501.000342.nc
- sgpaosacsmE13.b2.20230420.000109.nc
- sgpaosccn2colaE13.b1.20170903.000000.nc
- sgpbrsC1.b1.20190705.000000.cdf
- sgpceilC1.b1.20190101.000000.nc
- sgpco2flx4mC1.b1.20201007.001500.nc
- sgpdlppiC1.b1.20191015.120023.cdf
- sgpdlppiC1.b1.20191015.121506.cdf
- sgpirt25m20sC1.a0.20190601.000000.cdf
- sgpmetE13.b1.20190101.000000.cdf
- sgpmetE13.b1.20190102.000000.cdf
- sgpmetE13.b1.20190103.000000.cdf
- sgpmetE13.b1.20190104.000000.cdf
- sgpmetE13.b1.20190105.000000.cdf
- sgpmetE13.b1.20190106.000000.cdf
- sgpmetE13.b1.20190107.000000.cdf
- sgpmetE13.b1.20190508.000000.cdf
- sgpmetE13.b1.20210401.000000.csv
- sgpmetE13.b1.yaml
- sgpmetE15.b1.20190508.000000.cdf
- sgpmetE31.b1.20190508.000000.cdf
- sgpmetE32.b1.20190508.000000.cdf
- sgpmetE33.b1.20190508.000000.cdf
- sgpmetE34.b1.20190508.000000.cdf
- sgpmetE35.b1.20190508.000000.cdf
- sgpmetE36.b1.20190508.000000.cdf
- sgpmetE37.b1.20190508.000000.cdf
- sgpmetE38.b1.20190508.000000.cdf
- sgpmetE39.b1.20190508.000000.cdf
- sgpmetE40.b1.20190508.000000.cdf
- sgpmetE9.b1.20190508.000000.cdf
- sgpmet_no_time.nc
- sgpmet_test_time.nc
- sgpmfrsr7nchE11.b1.20210329.070000.nc
- sgpmmcrC1.b1.1.cdf
- sgpmmcrC1.b1.2.cdf
- sgpmplpolfsC1.b1.20190502.000000.cdf
- sgprlC1.a0.20160131.000000.nc
- sgpsebsE14.b1.20190601.000000.cdf
- sgpsirsE13.b1.20190101.000000.cdf
- sgpsondewnpnC1.b1.20190101.053200.cdf
- sgpstampE13.b1.20200101.000000.nc
- sgpstampE31.b1.20200101.000000.nc
- sgpstampE32.b1.20200101.000000.nc
- sgpstampE33.b1.20200101.000000.nc
- sgpstampE34.b1.20200101.000000.nc
- sgpstampE9.b1.20200101.000000.nc
- sodar.20230404.mnd
- twpsondewnpnC3.b1.20060119.050300.custom.cdf
- twpsondewnpnC3.b1.20060119.112000.custom.cdf
- twpsondewnpnC3.b1.20060119.163300.custom.cdf
- twpsondewnpnC3.b1.20060119.231600.custom.cdf
- twpsondewnpnC3.b1.20060120.043800.custom.cdf
- twpsondewnpnC3.b1.20060120.111900.custom.cdf
- twpsondewnpnC3.b1.20060120.170800.custom.cdf
- twpsondewnpnC3.b1.20060120.231500.custom.cdf
- twpsondewnpnC3.b1.20060121.051500.custom.cdf
- twpsondewnpnC3.b1.20060121.111600.custom.cdf
- twpsondewnpnC3.b1.20060121.171600.custom.cdf
- twpsondewnpnC3.b1.20060121.231600.custom.cdf
- twpsondewnpnC3.b1.20060122.052600.custom.cdf
- twpsondewnpnC3.b1.20060122.111500.custom.cdf
- twpsondewnpnC3.b1.20060122.171800.custom.cdf
- twpsondewnpnC3.b1.20060122.232600.custom.cdf
- twpsondewnpnC3.b1.20060123.052500.custom.cdf
- twpsondewnpnC3.b1.20060123.111700.custom.cdf
- twpsondewnpnC3.b1.20060123.171600.custom.cdf
- twpsondewnpnC3.b1.20060123.231500.custom.cdf
- twpsondewnpnC3.b1.20060124.051500.custom.cdf
- twpsondewnpnC3.b1.20060124.111800.custom.cdf
- twpsondewnpnC3.b1.20060124.171700.custom.cdf
- twpsondewnpnC3.b1.20060124.231500.custom.cdf
- twpvisstgridirtemp.c1.20050705.002500.nc
- vdis.b1
Adding new datasets
To add a new dataset file, please follow these steps:
- Add the dataset file to the `data/` directory
- From the command line, run the `python make_registry.py` script to update the registry file residing in `arm-test-data/registry.txt`
- Commit and push your changes to GitHub
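
To make the registry step less opaque, here is a minimal sketch of what a registry-building script could do. It is not the project's actual `make_registry.py`, whose contents this README does not show. The function names `sha256_of` and `build_registry` are hypothetical, and the one-line-per-file `filename sha256` layout is an assumption about the registry format.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=65536):
    """Return the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_registry(data_dir, registry_path):
    """Write one 'filename sha256' line per file in data_dir (assumed layout)."""
    files = sorted(p for p in Path(data_dir).iterdir() if p.is_file())
    lines = [f"{p.name} {sha256_of(p)}" for p in files]
    Path(registry_path).write_text("\n".join(lines) + "\n")
    return lines
```

Calling `build_registry("data", "registry.txt")` from the repository root would regenerate the file in this sketch; in the real repository, run the provided script instead.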
Using datasets in notebooks and/or scripts
- Ensure the `arm-test-data` package is installed in your environment:

      python -m pip install arm-test-data
      # or
      python -m pip install git+https://github.com/ARM-DOE/arm-test-data
      # or
      conda install -c conda-forge arm-test-data

- Import `DATASETS` and inspect the registry to find out which datasets are available:

      In [1]: from arm_test_data import DATASETS

      In [2]: DATASETS.registry_files
      Out[2]: ['sample_file.nc']

- To fetch a data file of interest, use the `.fetch` method and provide the filename of the data file. This will:
  - download and cache the file if it doesn't exist already
  - retrieve and return the local path

      In [4]: filepath = DATASETS.fetch('sample_data.nc')

      In [5]: filepath
      Out[5]: '/Users/mgrover/Library/Caches/arm-test-data/sample_data.nc'

- Once you have access to the local filepath, you can use it to load your dataset into pandas, xarray, or your package of choice:

      In [6]: import xarray as xr

      In [7]: ds = xr.open_dataset(filepath)
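
Because the registry is a flat list of file names, it can help to filter it before fetching. The sketch below assumes `DATASETS.registry_files` is a plain list of strings, as the output in the steps above suggests. The helper `find_datasets` is hypothetical and not part of the package, and the short list stands in for the real registry so the example runs without the package installed.

```python
from fnmatch import fnmatchcase


def find_datasets(registry_files, pattern):
    """Return registry entries whose names match a shell-style pattern."""
    return sorted(name for name in registry_files if fnmatchcase(name, pattern))


# Stand-in for DATASETS.registry_files; names taken from the sample data list.
registry_files = [
    "sgpmetE13.b1.20190101.000000.cdf",
    "sgpmetE13.b1.20190102.000000.cdf",
    "sgpceilC1.b1.20190101.000000.nc",
    "met_lcl.nc",
]

# Select every sgpmet E13 b1-level file, whatever its date.
matches = find_datasets(registry_files, "sgpmetE13.b1.*.cdf")
print(matches)
```

With the package installed, the same helper could be given `DATASETS.registry_files`, and each match passed to `DATASETS.fetch` to obtain its local path.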
Changing the default data cache location
The default cache location (where the data are saved on your local system) depends on the operating system. Use the `locate()` function to identify it:

    from arm_test_data import locate
    locate()

To override this location, set the `ACT_TEST_DATA_DIR` environment variable to the desired destination.
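
As an illustration of the override, the sketch below sets the variable from Python and shows the kind of lookup an environment-based override implies. The helper `resolve_cache_dir` and both paths are hypothetical. The package's own resolution logic is not shown in this README, so treat this as a model of the behavior, not the implementation.

```python
import os
from pathlib import Path


def resolve_cache_dir(default_dir, env_var="ACT_TEST_DATA_DIR"):
    """Return the directory named by env_var if it is set, else default_dir."""
    override = os.environ.get(env_var)
    return Path(override) if override else Path(default_dir)


# Hypothetical destination. Setting the variable before importing
# arm_test_data is the conservative choice, in case the package reads
# it only once at import time (an assumption).
os.environ["ACT_TEST_DATA_DIR"] = "/path/to/arm-data-cache"

print(resolve_cache_dir("/path/to/default-cache"))
```

After setting the variable in a real session, calling `locate()` is a simple way to confirm which directory the package is actually using.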
References
AmeriFlux data
AmeriFlux BASE: https://doi.org/10.17190/AMF/2531143
Citation: Bhupendra Raut, Sujan Pal, Paytsar Muradyan, Joseph R. O'Brien, Max Berkelhammer, Matthew Tuftedal, Max Grover, Scott Collis, Robert C. Jackson (2025), AmeriFlux BASE US-CU1 UIC Plant Research Laboratory Chicago, Ver. 1-5, AmeriFlux AMP, (Dataset). https://doi.org/10.17190/AMF/2531143
Download files
File details
Details for the file arm_test_data-0.1.4.tar.gz.
File metadata
- Download URL: arm_test_data-0.1.4.tar.gz
- Upload date:
- Size: 18.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 40fdbbc2e37a48e1d6a394b061404df20c12525e80a38f07ecc5b1bf8837fae4 |
| MD5 | 997c354991ce137d63e2ff56002fc1ea |
| BLAKE2b-256 | fb4cfd7d5473260388b5835a78ab33be9a4ce2cff1df3872b1b0ba689873a1c6 |
File details
Details for the file arm_test_data-0.1.4-py3-none-any.whl.
File metadata
- Download URL: arm_test_data-0.1.4-py3-none-any.whl
- Upload date:
- Size: 13.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | f7a9b6117d0cf61ac297e5aff6a3139589733c7692a95af015ad6e1f0f902587 |
| MD5 | 6cf924b1c660058686f9b0e25566ae31 |
| BLAKE2b-256 | c8004683d524ac7e8f81a5e5198cd63d894c4f2c2bdba4f5e590525de6f492bc |