A parser intended for use with VirtualiZarr to create virtual Zarr stores from TIFFs

Virtual TIFF

A parser for creating virtual Zarr stores from TIFF files using VirtualiZarr 2.0 and async-tiff.

Background

First, some thoughts on why we should virtualize GeoTIFFs and/or COGs:

  1. Provide faster access to non-cloud-optimized GeoTIFFs that contain some form of internal tiling, without any data duplication (see notebook #1).
  2. Provide fully async I/O for both GeoTIFFs and COGs using Zarr-Python.
  3. Allow loading a stack of GeoTIFFs/COGs into a data cube while minimizing the number of GET requests relative to using stackstac/odc-stac, thereby decreasing cost and increasing performance.
  4. Provide users access to a lazily loaded DataTree containing both the data and the overviews, allowing scientists to use the overviews not only for tile-based visualization but also for quickly iterating on analytics.
  5. Include ETags in the virtualized datasets to support reproducibility.
  6. A motivation that's less clear to me, but may be possible, is using the virtualization layer to access COGs with disparate CRSs as a single dataset (https://github.com/zarr-developers/geozarr-spec/issues/53).
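
To make the first point concrete, here is a minimal sketch of how internal tiling can be exposed without copying data: each tile's byte offset and length (the TIFF TileOffsets and TileByteCounts tags) become a reference keyed by its chunk position. This is an illustration only, not virtual-tiff's implementation. The function name tile_manifest, the example URL, and the offsets are hypothetical, and the sketch assumes a single-band or chunky (pixel-interleaved) image whose tiles are listed left to right, top to bottom.

```python
import math


def tile_manifest(path, image_height, image_width, tile_height, tile_width,
                  tile_offsets, tile_byte_counts):
    """Map each internal TIFF tile to a chunk key and a byte range.

    Hypothetical helper: the pixel data stays in the original file and
    only (path, offset, length) references are recorded.
    """
    # Edge tiles are padded to full size in TIFF, so round the grid up.
    rows = math.ceil(image_height / tile_height)
    cols = math.ceil(image_width / tile_width)
    expected = rows * cols
    if len(tile_offsets) != expected or len(tile_byte_counts) != expected:
        raise ValueError("expected one offset and one byte count per tile")

    manifest = {}
    for index, (offset, length) in enumerate(zip(tile_offsets, tile_byte_counts)):
        # Tiles are ordered row by row, so the flat index splits into (row, col).
        row, col = divmod(index, cols)
        manifest[f"{row}.{col}"] = {"path": path, "offset": offset, "length": length}
    return manifest


# A 512 x 768 image with 256 x 256 tiles gives a 2 x 3 chunk grid.
manifest = tile_manifest(
    "s3://example-bucket/scene.tif",  # assumed location, for illustration
    image_height=512, image_width=768,
    tile_height=256, tile_width=256,
    tile_offsets=[1024, 5120, 9216, 13312, 17408, 21504],
    tile_byte_counts=[4096] * 6,
)
print(manifest["1.2"])
# {'path': 's3://example-bucket/scene.tif', 'offset': 21504, 'length': 4096}
```

A reader can then fetch any chunk with a single ranged GET against the original file, which is why no data needs to be duplicated.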

Getting started

  1. Clone the repository using git clone https://github.com/virtual-zarr/virtual-tiff.git
  2. Pull the baseline image data from the DVC remote using pixi run -e test download-test-images (WARNING: This will download ~1.4 GB of TIFFs for testing to your machine.)
  3. Run the test suite using pixi run -e test run-tests (WARNING: Some tests will fail because the implementation is still incomplete.)
  4. If needed, start a shell in the development environment using pixi run -e test zsh

License

virtual-tiff is distributed under the terms of the MIT license.

Download files

Source Distribution

virtual_tiff-0.2.0.tar.gz (33.4 kB)

Uploaded Source

Built Distribution

virtual_tiff-0.2.0-py3-none-any.whl (14.3 kB)

Uploaded Python 3

File details

Details for the file virtual_tiff-0.2.0.tar.gz.

File metadata

  • Download URL: virtual_tiff-0.2.0.tar.gz
  • Size: 33.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for virtual_tiff-0.2.0.tar.gz
Algorithm Hash digest
SHA256 b005002eec0a84080c2e22bd631c577bd5febcacf25e4e2a9f23483fb1423d31
MD5 c56a185d6e1d7ac72940dc3e44f1bc23
BLAKE2b-256 3bba8eec975ab84316e544ae3aae4d8a5373d8d6742f2c68559d30287b0a718d

File details

Details for the file virtual_tiff-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: virtual_tiff-0.2.0-py3-none-any.whl
  • Size: 14.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for virtual_tiff-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 0ecd52e7ccc9885095c590547577778af841f2ca9a7ab668ed51dfc24025d311
MD5 e2247aaab14862aae47c0223875f8c3a
BLAKE2b-256 de45bfdee9db2c11fb89c288158912545ecb799f7464cfa61311ad6a4ce24033
