a set of tools for geophysical data processing
Project description
gdp: Geophysical Data Processing
gdp provides a set of tools that are available through command-line-interface (CLI) to process and/or convert common geophysical data types.
Release notes
Version 0.1.1
Only README is updated for this version.
Version 0.1.0
This version is the first version that is published on PyPI and it includes the following tools:
Tool | Description |
---|---|
cat | concatenate/reformat numerical or non-numerical data |
union | generate the union of input data files |
intersect | generate the intersect of input data files |
difference | generate the difference of input data files |
split | split a concatenated dataset into multiple data files |
min | calculate minimum of values in numerical column(s) |
max | calculate maximum of values in numerical column(s) |
sum | calculate summation of values in numerical column(s) |
mean | calculate mean of values in numerical column(s) |
median | calculate median of values in numerical column(s) |
std | calculate standard deviation of values in numerical column(s) |
pip | output points inside/outside a polygon (ray tracing method) |
gridder | gridding/interpolation of 2D/map data with Gaussian smoothing applied |
mseed2sac | convert mseed to sac. this script also handles data fragmentation issue. |
sac2dat | convert sac to dat (ascii); output format: time, amplitude |
nc2dat | convert nc data to dat/ascii |
Examples
Some example gdp commands are explained below:
gdp cat file* -x 1 2 -v 5 3 4 --header 2 --footer 4 --fmt .2 .4 --sort --uniq --noextra -o concatenated.txt
Description: This command will concatenate files in current directory matching names 'files*'. While reading, 2 header lines and 4 footer lines will be omitted. Positional columns are the first and second columns (-x 1 2), and value/numerical columns are [5 3 4]. Positional columns will be printed in %.2f format, and value columns will be printed in %.4f. If files have extra (non-numerical) columns other than the first 5 columns, '--noextra' will cause not printing them. Flag '-o' can be used to set the output file name and if not specified, the results will be printed to standard output.Many of these flags are also common for the following commands.
gdp union file_1.dat file_2.dat file_3.dat
Description: Output union of a set of numerical data files (two or more) while considering positional columns (default=[1 2]) and value columns as [3] (defaults; these could be modified using '-x' '-v' flags).
gdp intersect file_1.dat file_2.dat file_3.dat
Description: Output intersect of a set of numerical data files (two or more) considering positional columns (similar positional columns that could be specified using '-x' flag; the value of the first file will be the output). Note that the first value of the flag '--fmt' will be important here.
gdp difference file_1.dat file_2.dat file_3.dat
Description: Output difference of a set of numerical data files (two or more) considering positional columns. In this case, data points that are unique to 'file_1.dat' will be the output results.
gdp split dataset.dat --method ncol --number 4 --start -2 --name 3 -o outdir
Description: This command is useful to split/unmerge a concatenated dataset ('dataset.dat'). Two methods can be choosen: (1) nrow: split based on a fixed number of rows, (2) ncol: split based on a row that has a unique number of columns as an identifier. In case of method 'ncol' above: '--number 4' specifies that the row with unique number of columns has 4 columns (reference row); '--start -2' specifies the start line or row offset relative to the reference line; '--name 3' specifies the row offset relative to 'start line' that will be used for output file names; '-o outdir' specifies output directory (it can be omitted for printing to the standard output)
gdp min *.xyz -v 1 2 3
gdp max *.xyz -v 1 2 3
gdp sum *.xyz -v 1 2 3
gdp mean *.xyz -v 1 2 3
gdp median *.xyz -v 1 2 3
gdp std *.xyz -v 1 2 3
Description: Output min, max, sum, mean, median, or std of the three first columns in *.xyz files.
gdp pip *.xyz --polygon polygon.dat
gdp pip *.xyz --polygon polygon.dat -i
Description: Only output points inside or outside ('-i') of the given polygon. Alternatively '--lonrange' and '--latrange' flags could be used to define the polygon.
gdp gridder vs_model/depth* --spacing 0.2 --smoothing 50 --polygon polygon.dat -o outdir
Description: This command will perform gridding (2D interpolation) to the input xyz format data files. In case of the above command: '--spacing 0.2' specifies that grid spacing along both longitude and latitude is 0.2 degrees (two values can be given as well; [lon_spacing, lat_spacing]); '--smoothing 50' sets a 50 km Gaussian smoothing to the output data; '--polygon polygon.dat' is optional: if given, only points inside the given polygon will be printed out.
gdp mseed2sac mseed_dataset/* --reformat --offset -500 --resample 10 -o sac_dataset
Description: This command will convert mseed files in 'mseed_dataset' to another directory named 'sac_dataset'. Flag '--reformat' will cause creating of sub-folders in the output directory in 'YYJJJHHMMSS' format, and the sacfiles within these sub-directories will be renamed as 'YYJJJHHMMSS_STA.CHN', where 'STA' is the station code and 'CHN' is the channel code. If reformat is enabled, offset time can be adjusted using '--offset'. Finally, '--resample 10' results in resampling of output timeseries to 10 Hz.
gdp sac2dat sac_dataset/* -o timeseries --timerange 0 3600
Description: This command will output the first hour (0-3600 s) of the sac data in sac_dataset/*
gdp nc2dat model.nc --metadata
gdp nc2dat model.nc -v vs vp --fmt .2 .6 -o model.dat
Description: This tool can be used to convert NetCDF files to ascii format. In this example, by running the first command, the program will output meta data information related to 'model.nc'. It's necessary to figure out the data fields that one is interested to extract from the nc file first (in this case, they are 'vp' and 'vs'). The second command will print to file the results in a custom format to 'model.dat'.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
File details
Details for the file gdp-0.1.1.tar.gz
.
File metadata
- Download URL: gdp-0.1.1.tar.gz
- Upload date:
- Size: 140.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.8.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9e25b1a4e2e77e10fc7b260d77db0c2618680cb731b514ecb3897451d96604fd |
|
MD5 | 42f824b2fe23c152c3c39aa867b9b209 |
|
BLAKE2b-256 | d78038011a38c87578a1a9a2c2d751e1d8e7b680f74a70d04b9acb2ed40533f7 |
File details
Details for the file gdp-0.1.1-cp310-abi3-macosx_10_9_universal2.whl
.
File metadata
- Download URL: gdp-0.1.1-cp310-abi3-macosx_10_9_universal2.whl
- Upload date:
- Size: 291.4 kB
- Tags: CPython 3.10+, macOS 10.9+ universal2 (ARM64, x86-64)
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.8.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ec91adb0343c3ec5b8ac8580f550f1208a6e114c7fe10245714aad2d5c188f9b |
|
MD5 | 962f5794071ec21667c86e3690c50472 |
|
BLAKE2b-256 | d8140eb60edd5c8799e48d753e3c48cb3832a68cbff1212e667e9fad6a4e3234 |
File details
Details for the file gdp-0.1.1-cp38-abi3-macosx_10_9_x86_64.whl
.
File metadata
- Download URL: gdp-0.1.1-cp38-abi3-macosx_10_9_x86_64.whl
- Upload date:
- Size: 218.3 kB
- Tags: CPython 3.8+, macOS 10.9+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.8.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7dd0255fff5174a0a83a5840c8597ab27596e35c58a30575696028d478c8bce6 |
|
MD5 | 3a25f5e0f5b9500b53cf09d18345ba2a |
|
BLAKE2b-256 | 610d9871aba8758396d3afa3f341f2e65d92af52102b8fcc43879129f4c65283 |