Experimental CLI tool for converting TIFF raster data to FLAC format
Project description
FLAC-Raster: Experimental Raster to FLAC Converter
An experimental CLI tool that converts TIFF raster data files into FLAC audio format while preserving all geospatial metadata, CRS, and bounds information. This proof-of-concept explores using FLAC's lossless compression for geospatial data storage.
Features
- Bidirectional conversion: TIFF → FLAC and FLAC → TIFF
- Complete metadata preservation: CRS, bounds, transform, data type, nodata values
- Intelligent audio parameters: Automatically selects sample rate and bit depth based on raster properties
- Multi-band support: Seamlessly handles multi-band rasters (RGB, multispectral) as multi-channel audio
- Lossless compression: Perfect reconstruction verified - no data loss
- FLAC chunking: Uses FLAC's frame-based compression (4096 samples/frame)
- Comprehensive logging: Verbose mode with detailed progress tracking
- Colorful CLI: Built with Typer and Rich for an intuitive experience
Installation
Prerequisites
First, install pixi:
# Install pixi (cross-platform package manager)
curl -fsSL https://pixi.sh/install.sh | bash
# or via conda: conda install -c conda-forge pixi
Clone and Setup
git clone https://github.com/Youssef-Harby/flac-raster.git
cd flac-raster
pixi install # Install all dependencies
Install the CLI tool
# For regular use:
pixi run pip install .
# For development (editable install):
pixi run pip install -e .
Alternative: Direct pip installation
pip install rasterio numpy typer rich tqdm pyflac
# Or install from PyPI (when published):
pip install flac-raster
Usage
Basic Commands
After installation, you can use the CLI directly:
-
Convert TIFF to FLAC:
flac-raster convert input.tif -o output.flac
-
Convert FLAC back to TIFF:
flac-raster convert input.flac -o output.tif
-
Get file information:
flac-raster info file.tif flac-raster info file.flac
-
Compare two TIFF files:
flac-raster compare original.tif reconstructed.tif
Alternative: Use python main.py if you haven't installed the package:
python main.py convert input.tif # Direct script usage
Options
Convert command:
--output, -o: Specify output file path (auto-generates if not provided)--compression, -c: FLAC compression level 0-8 (default: 5)--force, -f: Overwrite existing output files--verbose, -v: Enable verbose logging for detailed progress
Compare command:
--show-bands/--no-bands: Show per-band statistics (default: True)--export, -e: Export comparison results to JSON file--help: Show help message
Example Workflow
# Create sample data
python examples/create_test_data.py
# Convert DEM to FLAC
flac-raster convert test_data/sample_dem.tif -v
# Check the FLAC file info
flac-raster info test_data/sample_dem.flac
# Convert back to TIFF
flac-raster convert test_data/sample_dem.flac -o test_data/dem_reconstructed.tif
# Compare original and reconstructed
flac-raster compare test_data/sample_dem.tif test_data/dem_reconstructed.tif
# Export comparison to JSON
flac-raster compare test_data/sample_dem.tif test_data/dem_reconstructed.tif --export comparison.json
# Test with multi-band data
flac-raster convert test_data/sample_rgb.tif
flac-raster convert test_data/sample_rgb.flac -o test_data/rgb_reconstructed.tif
flac-raster compare test_data/sample_rgb.tif test_data/rgb_reconstructed.tif
# Open in QGIS to verify
# The reconstructed files should be viewable in QGIS with all metadata intact
How It Works
TIFF to FLAC Conversion
- Read raster data and extract all metadata (CRS, bounds, transform, etc.)
- Calculate audio parameters:
- Sample rate: Based on image resolution (44.1kHz to 192kHz)
- Bit depth: Matches the raster's bit depth (16 or 24-bit, minimum 16-bit due to FLAC decoder limitations)
- Normalize data to audio range (-1 to 1)
- Reshape data: Bands become audio channels, pixels become samples
- Single-band → Mono audio
- Multi-band (RGB, multispectral) → Multi-channel audio
- Encode to FLAC with configurable compression
- Save metadata as sidecar JSON file
- Use FLAC frames for natural chunking of the data
FLAC to TIFF Conversion
- Decode FLAC file and extract audio samples
- Load metadata from sidecar JSON file
- Reshape audio back to raster dimensions
- Mono → Single-band raster
- Multi-channel → Multi-band raster
- Denormalize to original data range
- Write GeoTIFF with all original metadata
Metadata Preservation
The tool preserves:
- Width and height dimensions
- Number of bands
- Data type (uint8, int16, float32, etc.)
- Coordinate Reference System (CRS)
- Geospatial transform
- Bounding box coordinates
- Original data min/max values
- NoData values
- Original driver information
Technical Details
- FLAC frames: Utilizes FLAC's frame structure for efficient chunking (4096 samples/frame)
- Multi-band support: Each raster band becomes an audio channel (up to 8 channels supported by FLAC)
- Lossless conversion: Data is normalized but the process is completely reversible
- Compression: Leverages FLAC's compression algorithms for size reduction
- Metadata storage: Sidecar JSON files ensure all geospatial information is preserved
- Data type mapping:
- uint8 → 16-bit FLAC (due to decoder limitations)
- int16/uint16 → 16-bit FLAC
- int32/uint32/float32 → 24-bit FLAC
Performance Examples
From testing:
- DEM file (1201x1201, int16): 2.75 MB → 1.60 MB FLAC (41.7% compression)
- Multi-band (256x256x3, uint8): 0.19 MB → 0.25 MB FLAC (small files may increase due to overhead)
- All conversions are perfectly lossless (verified with numpy array comparison)
Limitations
- Maximum 8 bands (FLAC channel limitation)
- Minimum 16-bit encoding (pyflac decoder limitation)
- Large rasters may take time to process
- FLAC format limitations apply (specific bit depths: 16, 24-bit)
- Metadata stored separately in JSON sidecar files
- Experimental: Not recommended for production use
Project Structure
flac-raster/
├── src/flac_raster/ # Main package
│ ├── __init__.py # Package initialization
│ ├── cli.py # Command-line interface
│ ├── converter.py # Core conversion logic
│ └── compare.py # Comparison utilities
├── examples/ # Example scripts
│ ├── create_test_data.py # Generate test datasets
│ └── example_usage.py # Usage examples
├── tests/ # Test scripts
│ ├── compare_tiffs.py # TIFF comparison script
│ └── compare_multiband.py # Multi-band comparison
├── testing_dataset/ # Test data
├── main.py # Main entry point
├── pyproject.toml # Project configuration
├── README.md # This file
└── requirements.txt # Dependencies
CI/CD & Publishing
This project uses GitHub Actions for:
- ✅ Continuous Integration: Tests on Python 3.9-3.12 across Windows, macOS, and Linux
- ✅ Automated Building: Package building and validation
- ✅ PyPI Publishing: Automatic publishing on release creation
- ✅ Quality Assurance: Integration testing via CLI commands
Publishing to PyPI
See PUBLISHING.md for detailed instructions on publishing releases.
Contributing
- Fork the repository
- Create a feature branch:
git checkout -b feature-name - Make your changes and test them
- Commit your changes:
git commit -am 'Add feature' - Push to the branch:
git push origin feature-name - Create a Pull Request
Future Improvements
- Support for more raster formats (HDF5, NetCDF, etc.)
- Streaming support for large files
- Parallel processing for faster conversion
- Better handling of special data types
- Integration with Zarr-like chunking strategies
- Enhanced unit test coverage
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file flac_raster-0.1.0.tar.gz.
File metadata
- Download URL: flac_raster-0.1.0.tar.gz
- Upload date:
- Size: 16.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1d8cc3adbe044085a86a140d23edf0c5d978bc369bd9e20298fa85e8d7442b1a
|
|
| MD5 |
9d29d3cfa4a727f48027d3eac7c8f73f
|
|
| BLAKE2b-256 |
0861d48165783d3f743a9eaa110068560b85bee900fdfd3d33f564ab3d283d34
|
Provenance
The following attestation bundles were made for flac_raster-0.1.0.tar.gz:
Publisher:
ci.yml on Youssef-Harby/flac-raster
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
flac_raster-0.1.0.tar.gz -
Subject digest:
1d8cc3adbe044085a86a140d23edf0c5d978bc369bd9e20298fa85e8d7442b1a - Sigstore transparency entry: 295327764
- Sigstore integration time:
-
Permalink:
Youssef-Harby/flac-raster@658969eca820b2565e09244e36a8fe8e6f3d1f35 -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/Youssef-Harby
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
ci.yml@658969eca820b2565e09244e36a8fe8e6f3d1f35 -
Trigger Event:
release
-
Statement type:
File details
Details for the file flac_raster-0.1.0-py3-none-any.whl.
File metadata
- Download URL: flac_raster-0.1.0-py3-none-any.whl
- Upload date:
- Size: 14.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d696ffd6e7fd914aa5d0f09dcc739753d4bda2ab90d9f13942e33db6f2a341e6
|
|
| MD5 |
e23c439903b907a8f19f64a95346c9bf
|
|
| BLAKE2b-256 |
034ec5e28aa7a81ed1c67646fba1a3ca8e0664879a499d0c221ef3d870583400
|
Provenance
The following attestation bundles were made for flac_raster-0.1.0-py3-none-any.whl:
Publisher:
ci.yml on Youssef-Harby/flac-raster
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
flac_raster-0.1.0-py3-none-any.whl -
Subject digest:
d696ffd6e7fd914aa5d0f09dcc739753d4bda2ab90d9f13942e33db6f2a341e6 - Sigstore transparency entry: 295327773
- Sigstore integration time:
-
Permalink:
Youssef-Harby/flac-raster@658969eca820b2565e09244e36a8fe8e6f3d1f35 -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/Youssef-Harby
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
ci.yml@658969eca820b2565e09244e36a8fe8e6f3d1f35 -
Trigger Event:
release
-
Statement type: