Skip to main content

Compress and decompress seismic data

Project description

seismic-zfp

Python library to convert SEG-Y files to compressed cubes and retrieve arbitrary sub-volumes from these, fast.

Motivation

Reading whole SEG-Y volumes to retrieve, for example, a single time-slice is wasteful.

Copying whole SEG-Y files uncompressed over networks is also wasteful.

This library addresses both issues by implementing the seismic-zfp (.SZ) format. This format is based on ZFP compression from Peter Lindstrom's paper, using the Python wrapper developed by Navjot Kukreja.

ZFP compression enables smoothly varying d-dimensional data in 4d subvolumes to be compressed at a fixed bitrate. The 32-bit floating point values in 4x4x4 units of a 3D post-stack SEG-Y file are well suited to this scheme.

Decomposing an appropriately padded 3D seismic volume into groups of these units which exactly fill one 4KB disk block, compressing these groups, and writing them sequentially to disk yields a file with the following properties:

  • Compression ratio of 2n:1 compression, typically a bitrate of 4 works well, implying a ratio of 8:1
  • The location of any seismic sample is known
  • Arbitrary subvolumes can be read with minimal redundant I/O, for example:
    • Padding IL/XL dimensions with 4, and the z-dimension depending on bitrate
    • Padding IL/XL dimensions with 64 and the z-dimension with 4 (16:1 compression)

Using IL/XL optimized layout

  • Groups of 4 inlines or crosslines can be read with no redundant I/O
  • A single inline can be read and with no additional I/O compared to the SEG-Y best-case scenario (provided at least 4:1 compression ratio)
  • A z-slice can be read by accessing n_traces/16 disk blocks, compared to n_traces disk blocks for SEG-Y

Using z-slice optimized layout

  • A z-slice can be read by accessing just n_traces/4096 disk blocks, compared to n_traces disk blocks for SEG-Y

The seismic-zfp format also allows for preservation of information in SEG-Y file and trace headers, with compression code identifying constant and varying trace header values and storing these appropriately.

Examples

Full example code is provided, but the following reference is useful:

Create SZ files from SEGY

from seismic_zfp.conversion import SegyConverter
with SegyConverter("in.segy") as converter:
    # Create a "standard" SZ file with 8:1 compression, using in-memory method
    converter.run("out-standard.sz", bits_per_voxel=4,
                  method="InMemory")
    # Create a "z-slice optimized" SZ file
    converter.run("out-advanced.sz", bits_per_voxel=2, 
                  blockshape=(64, 64, 4))

Read an SZ file

from seismic_zfp.read import SzReader
with SzReader("in.sz") as reader:
    inline_slice = reader.read_inline(LINE_NO)
    crossline_slice = reader.read_crossline(LINE_NO)
    z_slice = reader.read_zslice(LINE_NO)
    sub_vol = reader.read_subvolume(min_il=min_il, max_il=max_il, 
                                    min_xl=min_xl, max_xl=max_xl, 
                                    min_z=min_z, max_z=max_z)

Use segyio-like interface to read SZ files

import seismic_zfp
with seismic_zfp.open("in.sz")) as szfile:
    inline_slice = szfile.iline[szfile.ilines[LINE_ID]]
    xslice_sz = szfile.xline[szfile.xlines[LINE_ID]]
    zslice_sz = szfile.depth_slice[szfile.zslices[SLICE_ID]]
    trace = szfile.trace[TRACE_ID]
    trace_header = szfile.header[TRACE_ID]
    binary_file_header = szfile.bin
    text_file_header = szfile.text[0]

Convert an SZ file to SEGY

from seismic_zfp.conversion import SzConverter
with SzConverter("in.sz") as converter:
    converter.convert_to_segy("out.segy")

Installation Troubleshooting

  • Check your machine has these packages available: python3-devel, git, gcc, gcc-c++

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for seismic-zfp, version 0.0.7
Filename, size File type Python version Upload date Hashes
Filename, size seismic_zfp-0.0.7-py3-none-any.whl (19.5 kB) File type Wheel Python version py3 Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page