Unified CGM data format converter for ML training and inference

These details have not been verified by PyPI

Project description

cgm_format

Python library for converting vendor-specific Continuous Glucose Monitoring (CGM) data (Dexcom, Libre) into a standardized unified format for ML training and inference.

Features

Vendor format detection: Automatic detection of Dexcom, Libre, and Unified formats
Robust parsing: Handles BOM marks, encoding artifacts, and vendor-specific CSV quirks
Unified schema: Standardized data format with service columns (metadata) and data columns
Schema validation: Frictionless Data Table Schema support for validation
Type-safe: Polars-based with strict type definitions and enum support
Extensible: Clean abstract interfaces for adding new vendor formats

Installation

# Using uv (recommended)
uv pip install -e .

# Or using pip
pip3 install -e .

# Optional dependencies
uv pip install -e ".[extra]"  # pandas, pyarrow, frictionless
uv pip install -e ".[dev]"    # pytest

Quick Start

Basic Parsing

from cgm_format import FormatParser
import polars as pl

# Parse any supported CGM file (Dexcom, Libre, or Unified)
unified_df = FormatParser.parse_file("data/example.csv")

# Or parse from base64 (useful for web APIs)
unified_df = FormatParser.parse_base64(base64_encoded_csv)

# Access the data
print(unified_df.head())

# Save to unified format
FormatParser.to_csv_file(unified_df, "output.csv")

Complete Inference Pipeline

from cgm_format import FormatParser, FormatProcessor

# Stage 1-3: Parse vendor format to unified
unified_df = FormatParser.parse_file("data/dexcom_export.csv")

# Stage 4-5: Process for inference
processor = FormatProcessor(
    expected_interval_minutes=5,
    small_gap_max_minutes=15
)

# Fill gaps and create sequences
processed_df = processor.interpolate_gaps(unified_df)

# Prepare final inference data (returns full UnifiedFormat)
unified_df, warnings = processor.prepare_for_inference(
    processed_df,
    minimum_duration_minutes=180,      # Require 3 hours minimum (default: 60)
    maximum_wanted_duration=1440       # Truncate to last 24 hours if longer (default: 480)
)

# Strip service columns for ML model
inference_df = FormatProcessor.to_data_only_df(unified_df)

# Feed to ML model
predictions = your_model.predict(inference_df)

Split Glucose and Events

from cgm_format import FormatParser, FormatProcessor

# Parse mixed data
unified_df = FormatParser.parse_file("data/cgm_with_events.csv")

# Split into glucose readings and other events (insulin, carbs, etc.)
glucose_df, events_df = FormatProcessor.split_glucose_events(unified_df)

# Process glucose data separately
processor = FormatProcessor()
glucose_df = processor.interpolate_gaps(glucose_df)
unified_df, warnings = processor.prepare_for_inference(glucose_df)

# Strip service columns if needed for ML
inference_df = FormatProcessor.to_data_only_df(unified_df)

# Analyze events separately
insulin_events = events_df.filter(pl.col('event_type').str.contains('INSULIN'))

See USAGE.md for complete inference workflows and examples/usage_example.py for runnable examples.

Unified Format Schema

The library converts all vendor formats to a standardized schema with two types of columns:

Service Columns (Metadata)

Column	Type	Description
`sequence_id`	`Int64`	Unique sequence identifier
`event_type`	`Utf8`	Event type (8-char code: EGV_READ, INS_FAST, CARBS_IN, etc.)
`quality`	`Int64`	Data quality flags (bitwise): 0=GOOD, 1=OUT_OF_RANGE, 2=SENSOR_CALIBRATION, 4=IMPUTATION, 8=TIME_DUPLICATE

Data Columns

Column	Type	Unit	Description
`datetime`	`Datetime`	-	Timestamp (ISO 8601)
`glucose`	`Float64`	mg/dL	Blood glucose reading
`carbs`	`Float64`	g	Carbohydrate intake
`insulin_slow`	`Float64`	u	Long-acting insulin dose
`insulin_fast`	`Float64`	u	Short-acting insulin dose
`exercise`	`Int64`	seconds	Exercise duration

See formats/UNIFIED_FORMAT.md for complete specification and event type enums.

Processing Pipeline

The library implements a 3-stage parsing pipeline defined in the CGMParser interface:

Stage 1: Preprocess Raw Data

Remove BOM marks, encoding artifacts, and normalize text encoding.

text_data = FormatParser.decode_raw_data(raw_bytes)

Stage 2: Format Detection

Automatically detect vendor format from CSV headers.

from cgm_format.interface.cgm_interface import SupportedCGMFormat

format_type = FormatParser.detect_format(text_data)
# Returns: SupportedCGMFormat.DEXCOM, .LIBRE, or .UNIFIED_CGM

Stage 3: Vendor-Specific Parsing

Parse vendor CSV to unified format, handling vendor-specific quirks:

Dexcom: High/Low glucose markers, variable-length rows, metadata rows
Libre: Record type filtering, timestamp format variations

unified_df = FormatParser.parse_to_unified(text_data, format_type)

All stages can be chained with convenience methods:

# Parse from file path (recommended)
unified_df = FormatParser.parse_file("data.csv")

# Parse from base64 string (web APIs)
unified_df = FormatParser.parse_base64(base64_encoded_csv)

# Parse from bytes (lower-level)
unified_df = FormatParser.parse_from_bytes(raw_data)

# Parse from string (manual control)
unified_df = FormatParser.parse_from_string(text_data)

See interface/PIPELINE.md for complete pipeline documentation.

Stage 4: Gap Interpolation and Sequence Creation

The FormatProcessor.interpolate_gaps() method handles data continuity:

from cgm_format import FormatProcessor

processor = FormatProcessor(
    expected_interval_minutes=5,    # Normal CGM reading interval
    small_gap_max_minutes=15        # Max gap size to interpolate
)

# Detect gaps, create sequences, and interpolate missing values
processed_df = processor.interpolate_gaps(unified_df)

What it does:

Gap Detection: Identifies gaps in continuous glucose monitoring data
Sequence Creation: Splits data at large gaps (>15 min default) into separate sequences
Small Gap Interpolation: Fills small gaps (≤15 min) with linearly interpolated glucose values
Calibration Marking: Marks 24-hour periods after gaps ≥2h45m with SENSOR_CALIBRATION quality flag
Warning Collection: Tracks imputation events via ProcessingWarning.IMPUTATION

Example - Analyze sequences created:

# Check sequences
sequence_count = processed_df['sequence_id'].n_unique()
print(f"Created {sequence_count} sequences")

# Analyze each sequence
import polars as pl
sequence_info = processed_df.group_by('sequence_id').agg([
    pl.col('datetime').min().alias('start_time'),
    pl.col('datetime').max().alias('end_time'),
    pl.col('datetime').count().alias('num_points'),
])

for row in sequence_info.iter_rows(named=True):
    duration_hours = (row['end_time'] - row['start_time']).total_seconds() / 3600
    print(f"Sequence {row['sequence_id']}: {duration_hours:.1f}h, {row['num_points']} points")

Stage 5: Timestamp Synchronization (Optional)

Align timestamps to fixed-frequency intervals for ML models requiring regular time steps:

# After interpolate_gaps(), synchronize to exact intervals
synchronized_df = processor.synchronize_timestamps(processed_df)

# Now all timestamps are at exact 5-minute intervals: 10:00:00, 10:05:00, 10:10:00, etc.

What it does:

Rounds timestamps to nearest minute boundary (removes seconds)
Creates fixed-frequency timestamps at expected_interval_minutes intervals
Linearly interpolates glucose values between measurements
Shifts discrete events (carbs, insulin, exercise) to nearest timestamp
Preserves sequence boundaries (processes each sequence independently)

When to use: Time-series models expecting fixed intervals (LSTM, transformers, ARIMA)
When to skip: Models handling irregular timestamps, or when original timing is critical

Stage 6: Inference Preparation

The prepare_for_inference() method performs final quality assurance and returns full UnifiedFormat:

# Prepare final inference-ready data (returns full UnifiedFormat)
unified_df, warnings = processor.prepare_for_inference(
    processed_df,
    minimum_duration_minutes=180,      # Require 3 hours minimum (default: 60)
    maximum_wanted_duration=1440       # Truncate to last 24 hours if longer (default: 480)
)

# Optionally strip service columns for ML models
inference_df = FormatProcessor.to_data_only_df(unified_df)

# Check for quality issues
from cgm_format.interface.cgm_interface import ProcessingWarning

if warnings & ProcessingWarning.TOO_SHORT:
    print("Warning: Sequence shorter than minimum duration")
if warnings & ProcessingWarning.QUALITY:
    print("Warning: Data contains quality issues (OUT_OF_RANGE or SENSOR_CALIBRATION)")
if warnings & ProcessingWarning.IMPUTATION:
    print("Warning: Data contains interpolated values")

What it does:

Validation: Raises ZeroValidInputError if no valid glucose data exists
Sequence Selection: Keeps only the latest sequence (most recent timestamps)
Duration Checks: Warns if sequence < minimum_duration_minutes
Quality Checks: Collects warnings for calibration events and quality flags
Truncation: Keeps last N minutes if exceeding maximum_wanted_duration
Returns: Full UnifiedFormat with all columns (use to_data_only_df() to strip service columns)

Output DataFrame:

# inference_df contains only data columns:
# ['datetime', 'glucose', 'carbs', 'insulin_slow', 'insulin_fast', 'exercise']

# Feed directly to ML model
predictions = your_model.predict(inference_df)

Complete Processor Configuration

from cgm_format import FormatProcessor
from cgm_format.interface.cgm_interface import MINIMUM_DURATION_MINUTES, MAXIMUM_WANTED_DURATION_MINUTES

# Initialize processor with custom intervals
processor = FormatProcessor(
    expected_interval_minutes=5,     # CGM reading interval (5 min for Dexcom, 15 min for Libre)
    small_gap_max_minutes=15         # Max gap to interpolate (larger gaps create new sequences)
)

# Stage 4: Fill gaps and create sequences
processed_df = processor.interpolate_gaps(unified_df)

# Stage 5 (Optional): Synchronize to fixed intervals
# synchronized_df = processor.synchronize_timestamps(processed_df)

# Stage 6: Prepare for inference (returns full UnifiedFormat)
unified_df, warnings = processor.prepare_for_inference(
    processed_df,  # or synchronized_df if using Stage 5
    minimum_duration_minutes=MINIMUM_DURATION_MINUTES,        # Default: 60 (1 hour)
    maximum_wanted_duration=MAXIMUM_WANTED_DURATION_MINUTES   # Default: 480 (8 hours)
)

# Optional: Strip service columns for ML models
inference_df = FormatProcessor.to_data_only_df(unified_df)

# Check warnings
if processor.has_warnings():
    all_warnings = processor.get_warnings()
    print(f"Processing collected {len(all_warnings)} warnings")

Advanced Usage

Working with Schemas

from cgm_format.formats.unified import CGM_SCHEMA, UnifiedEventType, Quality

# Get Polars schema
polars_schema = CGM_SCHEMA.get_polars_schema()
data_only_schema = CGM_SCHEMA.get_polars_schema(data_only=True)

# Get column names
all_columns = CGM_SCHEMA.get_column_names()
data_columns = CGM_SCHEMA.get_column_names(data_only=True)

# Get cast expressions for Polars
cast_exprs = CGM_SCHEMA.get_cast_expressions()
df = df.with_columns(cast_exprs)

# Use enums
event = UnifiedEventType.GLUCOSE  # "EGV_READ"
quality = 0                       # GOOD_QUALITY (no flags)

Batch Processing with Inference Preparation

from pathlib import Path
from cgm_format import FormatParser, FormatProcessor
import polars as pl

data_dir = Path("data")
output_dir = Path("data/inference_ready")
output_dir.mkdir(exist_ok=True)

processor = FormatProcessor()
results = []

for csv_file in data_dir.glob("*.csv"):
    try:
        # Parse to unified format
        unified_df = FormatParser.parse_from_file(csv_file)
        
        # Process for inference
        processed_df = processor.interpolate_gaps(unified_df)
        unified_df, warnings = processor.prepare_for_inference(processed_df)
        inference_df = FormatProcessor.to_data_only_df(unified_df)
        
        # Add patient identifier
        patient_id = csv_file.stem
        inference_df = inference_df.with_columns([
            pl.lit(patient_id).alias('patient_id')
        ])
        
        results.append(inference_df)
        
        # Save individual file
        output_file = output_dir / f"{patient_id}_inference.csv"
        FormatParser.to_csv_file(inference_df, str(output_file))
        
        warning_str = f"warnings={warnings.value}" if warnings else "OK"
        print(f"✓ {csv_file.name}: {len(inference_df)} records, {warning_str}")
        
    except Exception as e:
        print(f"✗ Failed {csv_file.name}: {e}")

# Combine all processed data
if results:
    combined_df = pl.concat(results)
    FormatParser.to_csv_file(combined_df, str(output_dir / "combined_inference.csv"))
    print(f"\n✓ Combined {len(results)} files into single dataset")

Format Detection and Validation

from examples.example_schema_usage import run_format_detection_and_validation
from pathlib import Path

# Validate all files in data directory
run_format_detection_and_validation(
    data_dir=Path("data"),
    parsed_dir=Path("data/parsed"),
    output_file=Path("validation_report.txt")
)

This generates a detailed report with:

Format detection statistics
Frictionless schema validation results (if library installed)
Known vendor quirks automatically suppressed

Supported Formats

Dexcom Clarity Export

CSV with metadata rows (rows 2-11)
Variable-length rows (non-EGV events missing trailing columns)
High/Low glucose markers for out-of-range values
Event types: EGV, Insulin, Carbs, Exercise
Multiple timestamp format variants

FreeStyle Libre

CSV with metadata row 1, header row 2
Record type filtering (0=glucose, 4=insulin, 5=food)
Multiple timestamp format variants
Separate rapid/long insulin columns

Unified Format

Standardized CSV with header row 1
ISO 8601 timestamps
Service columns + data columns
Validates existing unified format files

Project Structure

cgm_format/
├── src/
│   └── cgm_format/              # Main package
│       ├── __init__.py          # Package exports (FormatParser, FormatProcessor)
│       ├── format_parser.py  # FormatParser implementation (Stages 1-3)
│       ├── format_processor.py  # FormatProcessor implementation (Stages 4-6)
│       ├── interface/           # Abstract interfaces and schema infrastructure
│       │   ├── cgm_interface.py # CGMParser and CGMProcessor interfaces
│       │   ├── schema.py        # Base schema definition system
│       │   └── PIPELINE.md      # Pipeline documentation
│       └── formats/             # Format-specific schemas and definitions
│           ├── unified.py       # Unified format schema and enums
│           ├── unified.json     # Frictionless schema export
│           ├── dexcom.py        # Dexcom format schema and constants
│           ├── dexcom.json      # Frictionless schema for Dexcom
│           ├── libre.py         # Libre format schema and constants
│           ├── libre.json       # Frictionless schema for Libre
│           └── UNIFIED_FORMAT.md # Unified format specification
├── examples/                    # Example scripts
│   ├── usage_example.py         # Runnable usage examples
│   └── example_schema_usage.py  # Format detection & validation examples
├── tests/                       # Pytest test suite
│   ├── test_format_parser.py # Parsing and conversion tests
│   ├── test_format_processor.py # Processing tests
│   └── test_schema.py           # Schema validation tests
├── data/                        # Test data and parsed outputs
│   └── parsed/                  # Converted unified format files
├── pyproject.toml               # Package configuration (hatchling)
├── USAGE.md                     # Complete usage guide for inference
└── README.md                    # This file

Architecture

Two-Layer Interface Design

CGMParser (Stages 1-3): Vendor-specific parsing to unified format

decode_raw_data() - Encoding cleanup
detect_format() - Format detection
parse_to_unified() - Vendor CSV → UnifiedFormat

CGMProcessor (Stages 4-5): Vendor-agnostic operations on unified data

synchronize_timestamps() - Timestamp alignment to fixed intervals
interpolate_gaps() - Gap detection, sequence creation, and interpolation
prepare_for_inference() - ML preparation with quality checks and truncation

The current implementation:

FormatParser implements the CGMParser interface (Stages 1-3)
FormatProcessor implements the CGMProcessor interface (Stages 4-5)

Processing Stages Implementation

Stage 1-3 (FormatParser):

BOM removal and encoding normalization
Pattern-based format detection (first 15 lines)
Vendor-specific CSV parsing with quirk handling
Column mapping to unified schema
Service field population (sequence_id, event_type, quality)

Stage 4 (FormatProcessor.interpolate_gaps):

Time difference calculation between consecutive readings
Sequence boundary detection (gaps > small_gap_max_minutes)
Linear interpolation for small gaps (≤ small_gap_max_minutes)
Imputation row creation with Quality.IMPUTATION flag
Calibration period marking (24h after gaps ≥ 2h45m) with Quality.SENSOR_CALIBRATION flag
Warning collection for imputed data

Stage 5 (FormatProcessor.synchronize_timestamps):

Timestamp rounding to minute boundaries
Fixed-frequency grid generation at expected_interval_minutes
Asof join (backward/forward) for value alignment
Linear glucose interpolation between grid points
Discrete event shifting to nearest timestamp

Stage 6 (FormatProcessor.prepare_for_inference):

Zero-data validation (raises ZeroValidInputError)
Latest sequence selection (max timestamp)
Duration verification with TOO_SHORT warning
Quality flag detection (OUT_OF_RANGE, SENSOR_CALIBRATION)
Sequence truncation from beginning (preserves most recent data)
Service column removal (data columns only)
Warning flag aggregation and return

Processing Configuration Parameters

FormatProcessor initialization:

Parameter	Default	Description	Effect
`expected_interval_minutes`	5	Normal reading interval	Grid spacing for synchronization; gap detection baseline
`small_gap_max_minutes`	15	Max gap to interpolate	Gaps > this create new sequences; gaps ≤ this are filled

Common configurations:

# Dexcom G6/G7 (5-minute readings)
processor = FormatProcessor(expected_interval_minutes=5, small_gap_max_minutes=15)

# FreeStyle Libre (manual scans, typically 15 min)
processor = FormatProcessor(expected_interval_minutes=15, small_gap_max_minutes=45)

# Strict quality (minimal imputation)
processor = FormatProcessor(expected_interval_minutes=5, small_gap_max_minutes=10)

# Lenient (more gap filling for sparse data)
processor = FormatProcessor(expected_interval_minutes=5, small_gap_max_minutes=30)

prepare_for_inference parameters:

Parameter	Default	Description
`minimum_duration_minutes`	60	Minimum sequence duration required (warns if shorter)
`maximum_wanted_duration`	480	Maximum duration to keep (truncates from beginning)

Constants from interface:

from cgm_format.interface.cgm_interface import (
    MINIMUM_DURATION_MINUTES,           # 60 (1 hour)
    MAXIMUM_WANTED_DURATION_MINUTES,    # 480 (8 hours)
    CALIBRATION_GAP_THRESHOLD,          # 9900 seconds (2h45m)
    CALIBRATION_PERIOD_HOURS,           # 24 hours
)

Schema System

Schemas are defined using CGMSchemaDefinition from interface/schema.py:

Type-safe: Polars dtypes with constraints
Vendor-specific: Each format has its own schema with quirks documented
Frictionless export: Auto-generate validation schemas
Dialect support: CSV parsing hints (header rows, comment rows, etc.)

Error Handling

Exceptions

Exception	Base	Description
`UnknownFormatError`	`ValueError`	Format cannot be detected
`MalformedDataError`	`ValueError`	CSV parsing or conversion failed
`ZeroValidInputError`	`ValueError`	No valid data points found

Processing Warnings

The FormatProcessor collects quality warnings during processing:

Warning Flag	Description	Triggered By
`ProcessingWarning.TOO_SHORT`	Sequence duration < minimum_duration_minutes	`prepare_for_inference()`
`ProcessingWarning.QUALITY`	Data contains OUT_OF_RANGE or SENSOR_CALIBRATION quality flags	`prepare_for_inference()`
`ProcessingWarning.OUT_OF_RANGE`	Data contains OUT_OF_RANGE quality flag	`prepare_for_inference()`
`ProcessingWarning.IMPUTATION`	Data contains IMPUTATION quality flag	`interpolate_gaps()`
`ProcessingWarning.CALIBRATION`	Data contains SENSOR_CALIBRATION quality flag	`prepare_for_inference()`
`ProcessingWarning.TIME_DUPLICATES`	Data contains TIME_DUPLICATE quality flag	`prepare_for_inference()`

Usage:

processor = FormatProcessor()
processed_df = processor.interpolate_gaps(unified_df)
inference_df, warnings = processor.prepare_for_inference(processed_df)

# Check individual warnings
if warnings & ProcessingWarning.QUALITY:
    print("Quality issues detected")

# Get all warnings as list
all_warnings = processor.get_warnings()
print(f"Collected {len(all_warnings)} warnings")

# Check if any warnings exist
if processor.has_warnings():
    print("Processing completed with warnings")

Testing

# Run all tests
pytest tests/

# Run specific test
pytest tests/test_format_parser.py -v

# Generate validation report
uv run python examples/example_schema_usage.py

# Run usage examples with real data
uv run python examples/usage_example.py

Development

Regenerating Schema JSON Files

After modifying schema definitions:

# Regenerate unified.json
python3 -c "from cgm_format.formats.unified import regenerate_schema_json; regenerate_schema_json()"

# Regenerate dexcom.json
python3 -c "from cgm_format.formats.dexcom import regenerate_schema_json; regenerate_schema_json()"

# Regenerate libre.json
python3 -c "from cgm_format.formats.libre import regenerate_schema_json; regenerate_schema_json()"

Adding New Vendor Formats

Create schema in src/cgm_format/formats/your_vendor.py using CGMSchemaDefinition
Add format to SupportedCGMFormat enum in src/cgm_format/interface/cgm_interface.py
Add detection patterns and implement parsing in src/cgm_format/format_parser.py
Add tests in tests/test_format_parser.py

Requirements

Python 3.10+
polars 1.34.0+

Optional:

pandas 2.3.3+ (compatibility layer)
pyarrow 21.0.0+ (pandas conversion)
frictionless 5.18.1+ (schema validation)
pytest 8.0.0+ (testing)

Documentation

USAGE.md - Complete usage guide for inference workflows
examples/usage_example.py - Runnable examples with real data
src/cgm_format/interface/PIPELINE.md - Detailed pipeline architecture
src/cgm_format/formats/UNIFIED_FORMAT.md - Unified schema specification
examples/example_schema_usage.py - Schema validation examples

License

See LICENSE file.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.8.2

Apr 22, 2026

0.8.1

Apr 10, 2026

0.7.0

Dec 12, 2025

0.6.2

Dec 12, 2025

0.6.1

Dec 9, 2025

0.6.0

Dec 9, 2025

0.5.2

Dec 3, 2025

0.5.1

Dec 3, 2025

This version

0.4.4

Nov 30, 2025

0.4.3

Nov 30, 2025

0.4.2

Nov 30, 2025

0.4.1

Nov 30, 2025

0.4.0

Nov 30, 2025

0.3.7

Nov 29, 2025

0.3.6

Nov 29, 2025

0.3.5

Nov 29, 2025

0.3.3

Nov 26, 2025

0.3.2

Nov 26, 2025

0.2.2

Nov 26, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cgm_format-0.4.4.tar.gz (168.1 kB view details)

Uploaded Nov 30, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

cgm_format-0.4.4-py3-none-any.whl (55.5 kB view details)

Uploaded Nov 30, 2025 Python 3

File details

Details for the file cgm_format-0.4.4.tar.gz.

File metadata

Download URL: cgm_format-0.4.4.tar.gz
Upload date: Nov 30, 2025
Size: 168.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.3

File hashes

Hashes for cgm_format-0.4.4.tar.gz
Algorithm	Hash digest
SHA256	`eed1b2edf27c2e277a6ed949bf585b93873e864022d11dcd09788ca21a13442e`
MD5	`2f32c4bd7c4116540ac821b79cc8ef1b`
BLAKE2b-256	`3c283ff56ea514cc4974ec551520111c39bd74a8e15bd1ed8096b32bed044c6f`

See more details on using hashes here.

File details

Details for the file cgm_format-0.4.4-py3-none-any.whl.

File metadata

Download URL: cgm_format-0.4.4-py3-none-any.whl
Upload date: Nov 30, 2025
Size: 55.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.3

File hashes

Hashes for cgm_format-0.4.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`766b519a0bca4b847ae81dc2470ff4026ed4d397fdad41bdf01b592426d1afa6`
MD5	`a930d531296a768674f1534bc55773c4`
BLAKE2b-256	`fcd3c3db82566cc191264c7e0a8cf3fe8a27aa2d0d1068f19c14b55050a56024`

See more details on using hashes here.

cgm-format 0.4.4

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

cgm_format

Features

Installation

Quick Start

Basic Parsing

Complete Inference Pipeline

Split Glucose and Events

Unified Format Schema

Service Columns (Metadata)

Data Columns

Processing Pipeline

Stage 1: Preprocess Raw Data

Stage 2: Format Detection

Stage 3: Vendor-Specific Parsing

Stage 4: Gap Interpolation and Sequence Creation

Stage 5: Timestamp Synchronization (Optional)

Stage 6: Inference Preparation

Complete Processor Configuration

Advanced Usage

Working with Schemas

Batch Processing with Inference Preparation

Format Detection and Validation

Supported Formats

Dexcom Clarity Export

FreeStyle Libre

Unified Format

Project Structure

Architecture

Two-Layer Interface Design

Processing Stages Implementation

Processing Configuration Parameters

Schema System

Error Handling

Exceptions

Processing Warnings

Testing

Development

Regenerating Schema JSON Files

Adding New Vendor Formats

Requirements

Documentation

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes