Python library for fast access to seismic data using TileDB
Project description
tilesegy
Tilesegy is a small MIT licensed Python library for easy interaction with seismic data, powered by TileDB. It combines an intuitive, segyio-like API with a powerful storage engine.
Feature summary
Available features
- Converting from SEG-Y and Seismic Unix formatted seismic data to TileDB arrays.
- Simple and powerful read-only API, closely modeled after segyio.
- 100% unit test coverage.
- Fully type-annotated.
Currently missing features
- API for write operations.
- Converting back to SEG-Y.
- TileDB configuration and performance tuning.
- Comprehensive documentation.
- Real-world usage.
Installation
Tilesegy can be installed
-
from PyPI by
pip:pip install tilesegy -
from source by cloning the Git repository:
git clone https://github.com/gsakkis/tilesegy.git cd tilesegy python setup.py installYou may run the test suite with:
python setup.py test
Converting from SEG-Y
Tilesegy comes with a commandline interface (CLI) called segy2tiledb for converting
SEG-Y and Seismic Unix formatted files to TileDB formatted arrays. At minimum it takes
an input file and generates a tilesegy directory at the same parent directory with the
input and extension .tsgy:
$ segy2tiledb a123.segy
$ du -sh a123.*
73M a123.sgy
55M a123.tsgy
To see the full list of options run:
$ segy2tiledb -h
usage: segy2tiledb [-h] [-o] [-g {auto,structured,unstructured}] [--su]
[--iline ILINE] [--xline XLINE]
[--endian {big,msb,little,lsb}] [-s TILE_SIZE]
[--consolidation-buffersize CONSOLIDATION_BUFFERSIZE]
input [output]
Convert a segy file to tilesegy format
positional arguments:
input Input segy file path
output Output tilesegy directory path (default: None)
optional arguments:
-h, --help show this help message and exit
-o, --overwrite Overwrite the output directory if it already exists (default: False)
-g {auto,structured,unstructured}, --geometry {auto,structured,unstructured}
Geometry of the converted tilesegy:
- auto: same as the input segy.
- structured: same as `auto` but abort if a geometry cannot be inferred.
- unstructured: opt out on building geometry information.
(default: auto)
segyio options:
--su Open a seismic unix file instead of SEG-Y (default: False)
--iline ILINE Inline number field in the trace headers (default: 189)
--xline XLINE Crossline number field in the trace headers (default: 193)
--endian {big,msb,little,lsb}
File endianness, big/msb (default) or little/lsb (default: big)
tiledb options:
-s TILE_SIZE, --tile-size TILE_SIZE
Tile size in bytes.
Larger tile size improves disk access time at the cost of higher memory (default: 4000000)
--consolidation-buffersize CONSOLIDATION_BUFFERSIZE
The size in bytes of the attribute buffers used during consolidation (default: 5000000)
API
Tilesegy generally follows the segyio API so you may consult its excellent
documentation to learn about
the public attributes (ilines, xlines, offsets, samples) and addressing modes
(trace, header, attributes', iline, xline, fast, slow, depth_slice,
gather, text, bin).
You can find usage examples in the included Jupyter notebook.
The following list outlines the main differences from segyio:
-
Probably the biggest difference is that addressing modes that return a generator of numpy arrays in segyio, in tilesegy return a single numpy array of higher dimension(s). For example, in a SEG-Y with 50 ilines, 20 xlines, 100 samples, and 3 offsets:
f.iline[0:5]:segyioreturns a generator that yields 5 2D numpy arrays of (20, 100) shapetilesegyreturns a 3D numpy array of (5, 20, 100) shape
f.iline[0:5, :]:segyioreturns a generator that yields 15 2D numpy arrays of (20, 100) shapetilesegyreturns a 4D numpy array of (5, 3, 20, 100) shape
-
The mappings returned by
bin,headerandattributes(name)have string keys instead ofsegyio.TraceFieldenums or integers. -
tilesegy.open(dir_path), thesegyio.open(file_path)equivalent, does not currently take any optional parameters (e.g.strictorignore_geometry). -
tilesegyexposes two classes,TileSegyfor unstructured SEG-Y andStructuredTileSegyfor structured.StructuredTileSegyextendsTileSegy, so the whole unstructured API is inherited by the structured.- All attributes and addressing modes specific to structured files (e.g.
ilinesorgather) are available only toStructuredTileSegy. In contrastsegyioreturnsNoneor raises an exception if these properties are accessed on unstructured files. - There is no
unstructuredattibute; usenot isinstance(f, StructuredTileSegy)instead.
-
There is no
tracecountattribute; uselen(trace)instead. -
There is no
ext_headersattribute; uselen(text[1:])instead.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file tilesegy-0.1.0.tar.gz.
File metadata
- Download URL: tilesegy-0.1.0.tar.gz
- Upload date:
- Size: 12.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.3.1 requests-toolbelt/0.9.1 tqdm/4.50.0 CPython/3.6.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
351fb7ed5d5d4a1aa03cf866fb474e70696d1cc5d5e20f5bafd5ea74ac1365ce
|
|
| MD5 |
7c6e31f9b0d79d4af14bc0330b5729ba
|
|
| BLAKE2b-256 |
0cd2735736b331f8adb4e2db87dc12cb56a54a52e05474873d8af62f99c3ed1f
|
File details
Details for the file tilesegy-0.1.0-py3-none-any.whl.
File metadata
- Download URL: tilesegy-0.1.0-py3-none-any.whl
- Upload date:
- Size: 14.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.3.1 requests-toolbelt/0.9.1 tqdm/4.50.0 CPython/3.6.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
50c0acc514b4fce3720ec47be5e2115f4bb0b6a167cbce994e8e5682ca68ba46
|
|
| MD5 |
af704f0974c4b16a53594c90848dcebc
|
|
| BLAKE2b-256 |
46ce9d85009f69f544d708d41763b77cf049d5b92ae3c3ef051e81472717c2d5
|