CSCS-DWDI reporting plugin for Waldur Site Agent
Project description
CSCS-DWDI Plugin for Waldur Site Agent
This plugin provides integration with the CSCS Data Warehouse Data Intelligence (DWDI) system to report both computational and storage usage data to Waldur. The plugin supports secure OIDC authentication and optional SOCKS proxy connectivity for accessing DWDI API endpoints from restricted networks.
Features
- Dual Backend Support: Separate backends for compute and storage resource usage reporting
- OIDC Authentication: Secure client credentials flow with automatic token refresh
- Proxy Support: SOCKS and HTTP proxy support for network-restricted environments
- Flexible Configuration: Configurable unit conversions and component mappings
- Production Ready: Comprehensive error handling and logging
Overview
The plugin implements two separate backends to handle different types of accounting data:
- Compute Backend (
cscs-dwdi-compute): Reports CPU and node hour usage from HPC clusters - Storage Backend (
cscs-dwdi-storage): Reports storage space and inode usage from filesystems
Backend Types
Compute Backend
The compute backend queries the DWDI API for computational resource usage and reports:
- Node hours consumed by accounts and users
- CPU hours consumed by accounts and users
- Account-level and user-level usage aggregation
API Endpoints Used:
/api/v1/compute/usage-month/account- Monthly usage data/api/v1/compute/usage-day/account- Daily usage data
Storage Backend
The storage backend queries the DWDI API for storage resource usage and reports:
- Storage space used (converted from bytes to configured units)
- Inode (file count) usage
- Path-based resource identification
API Endpoints Used:
/api/v1/storage/usage-month/filesystem_name/data_type- Monthly storage usage/api/v1/storage/usage-day/filesystem_name/data_type- Daily storage usage
Configuration
Compute Backend Configuration
backend_type: "cscs-dwdi-compute"
backend_settings:
cscs_dwdi_api_url: "https://dwdi.cscs.ch"
cscs_dwdi_client_id: "your_oidc_client_id"
cscs_dwdi_client_secret: "your_oidc_client_secret"
cscs_dwdi_oidc_token_url: "https://auth.cscs.ch/realms/cscs/protocol/openid-connect/token"
cscs_dwdi_oidc_scope: "openid" # Optional
backend_components:
nodeHours:
measured_unit: "node-hours"
unit_factor: 1
accounting_type: "usage"
label: "Node Hours"
cpuHours:
measured_unit: "cpu-hours"
unit_factor: 1
accounting_type: "usage"
label: "CPU Hours"
Storage Backend Configuration
backend_type: "cscs-dwdi-storage"
backend_settings:
cscs_dwdi_api_url: "https://dwdi.cscs.ch"
cscs_dwdi_client_id: "your_oidc_client_id"
cscs_dwdi_client_secret: "your_oidc_client_secret"
cscs_dwdi_oidc_token_url: "https://auth.cscs.ch/realms/cscs/protocol/openid-connect/token"
# Storage-specific settings
storage_filesystem: "lustre"
storage_data_type: "projects"
storage_tenant: "cscs" # Optional
# Map Waldur resource IDs to storage paths
storage_path_mapping:
"project_123": "/store/projects/proj123"
"project_456": "/store/projects/proj456"
backend_components:
storage_space:
measured_unit: "GB"
unit_factor: 0.000000001 # Convert bytes to GB
accounting_type: "usage"
label: "Storage Space (GB)"
storage_inodes:
measured_unit: "count"
unit_factor: 1
accounting_type: "usage"
label: "File Count"
Authentication
Both backends use OIDC client credentials flow for authentication with the DWDI API. The authentication tokens are automatically managed with refresh capabilities.
Required Settings
cscs_dwdi_client_id: OIDC client identifiercscs_dwdi_client_secret: OIDC client secretcscs_dwdi_oidc_token_url: OIDC token endpoint URL
Optional Settings
cscs_dwdi_oidc_scope: OIDC scope (defaults to "openid")
Token Management
- Tokens are automatically acquired and cached
- Automatic token refresh before expiration
- Error handling for authentication failures
SOCKS Proxy Support
Both backends support SOCKS proxy for network connectivity. This is useful when the DWDI API is only accessible through a proxy or jump host.
SOCKS Proxy Configuration
Add the SOCKS proxy setting to your backend configuration:
backend_settings:
# ... other settings ...
socks_proxy: "socks5://localhost:12345" # SOCKS5 proxy URL
Supported Proxy Types
- SOCKS5:
socks5://hostname:port - SOCKS4:
socks4://hostname:port - HTTP:
http://hostname:port
Usage Examples
SSH Tunnel with SOCKS5:
# Create SSH tunnel to jump host
ssh -D 12345 -N user@jumphost.cscs.ch
# Configure backend to use tunnel
backend_settings:
socks_proxy: "socks5://localhost:12345"
HTTP Proxy:
backend_settings:
socks_proxy: "http://proxy.cscs.ch:8080"
Resource Identification
Compute Resources
For compute resources, the system uses account names as returned by the DWDI API. The Waldur resource
backend_id should match the account name in the cluster accounting system.
Storage Resources
For storage resources, there are two options:
- Direct Path Usage: Set the Waldur resource
backend_idto the actual filesystem path - Path Mapping: Use the
storage_path_mappingsetting to map resource IDs to paths
Usage Reporting
Both backends are read-only and designed for usage reporting. They implement the _get_usage_report() method
but do not support:
- Account creation/deletion
- Resource management
- User management
- Limit setting
Historical Usage Loading
The core waldur_site_load_historical_usage command can be used to bulk-load past usage data
from DWDI into Waldur. This requires implementing get_usage_report_for_period() on the
backend, which queries the same DWDI API endpoints but for a specific historical month instead
of the current one.
Cluster Filtering
For compute backends, the normal reporting flow reads the cluster from each Waldur resource's
offering_backend_id. The historical loader does not pass Waldur resources, so you can
configure a cluster filter in backend_settings:
backend_settings:
cscs_dwdi_cluster: "alps" # optional: filter historical queries by cluster
When set, all historical compute queries will include cluster=["alps"] in the API request.
If omitted, no cluster filter is applied.
How It Works
The DWDI API already supports date-range queries (from/to parameters on
/compute/usage-month/account and exact-month on /storage/usage-month). The historical
loader calls get_usage_report_for_period(resource_backend_ids, year, month) for each month in
the requested range, then submits the returned usage to Waldur.
Running Historical Loads
# Load compute usage for all of 2024
waldur_site_load_historical_usage \
--config /etc/waldur/cscs-dwdi-config.yaml \
--offering-uuid <compute-offering-uuid> \
--user-token <staff-or-provider-token> \
--start-date 2024-01-01 \
--end-date 2024-12-31 \
--no-staff-check
# Load resource-level totals only (skip per-user breakdown)
waldur_site_load_historical_usage \
--config /etc/waldur/cscs-dwdi-config.yaml \
--offering-uuid <compute-offering-uuid> \
--user-token <provider-token> \
--start-date 2024-06-01 \
--end-date 2024-09-30 \
--no-staff-check \
--skip-user-usage
CLI Flags
| Flag | Description |
|---|---|
--config |
Path to waldur-site-agent configuration file |
--offering-uuid |
UUID of the Waldur offering to load data for |
--user-token |
Waldur API token (staff or service provider) |
--start-date |
Start date in YYYY-MM-DD format |
--end-date |
End date in YYYY-MM-DD format |
--no-staff-check |
Skip staff user validation (use with service provider tokens) |
--skip-user-usage |
Only submit resource-level totals, skip per-user breakdown |
Notes
- The
--no-staff-checkflag is useful when using a service provider token instead of a staff token — the Waldur API enforces permissions server-side regardless - The
--skip-user-usageflag skips themarketplace_offering_users_listAPI call and all per-user usage submission, which can significantly speed up large loads - Resources must already exist in Waldur with valid
backend_idvalues matching DWDI accounts - Maximum date range is 5 years
Example Configurations
See the examples/ directory for complete configuration examples:
cscs-dwdi-compute-config.yaml- Compute backend onlycscs-dwdi-storage-config.yaml- Storage backend onlycscs-dwdi-combined-config.yaml- Both backends in one configuration
Installation
The plugin is automatically discovered when the waldur-site-agent-cscs-dwdi package is installed alongside waldur-site-agent.
UV Workspace Installation
# Install all workspace packages including cscs-dwdi plugin
uv sync --all-packages
# Or install specific plugin dependencies only
uv sync --extra cscs-dwdi
Manual Installation
# Install from PyPI (when published)
pip install waldur-site-agent-cscs-dwdi
# Install from source
pip install -e plugins/cscs-dwdi/
Testing
Running Tests
# Run all cscs-dwdi tests
uv run pytest plugins/cscs-dwdi/tests/
# Run with coverage
uv run pytest plugins/cscs-dwdi/tests/ --cov=waldur_site_agent_cscs_dwdi
# Run specific test files
uv run pytest plugins/cscs-dwdi/tests/test_cscs_dwdi.py -v
Test Coverage
The test suite covers:
- Client initialization and configuration
- OIDC authentication flow
- API endpoint calls (mocked)
- Usage data processing
- Error handling scenarios
- Backend initialization and validation
API Compatibility
This plugin is compatible with DWDI API version 1 (/api/v1/). It requires the following API endpoints to be available:
Compute API:
/api/v1/compute/usage-month/account/api/v1/compute/usage-day/account
Storage API:
/api/v1/storage/usage-month/filesystem_name/data_type/api/v1/storage/usage-day/filesystem_name/data_type
Troubleshooting
Authentication Issues
Problem: Authentication failures or token errors
Solutions:
- Verify OIDC client credentials are correct
- Check that the token endpoint URL is accessible
- Ensure the client has appropriate scopes for DWDI API access
- Verify network connectivity to the OIDC provider
- Check logs for specific authentication error messages
Testing authentication:
# Test OIDC token acquisition manually
curl -X POST "https://auth.cscs.ch/realms/cscs/protocol/openid-connect/token" \
-H "Content-Type: application/x-www-form-urlencoded" \
-d "grant_type=client_credentials&client_id=YOUR_CLIENT_ID&client_secret=YOUR_SECRET&scope=openid"
Storage Backend Issues
Problem: Storage usage data not found or incorrect
Solutions:
- Verify
storage_filesystemandstorage_data_typematch available values in DWDI - Check
storage_path_mappingif using custom resource IDs - Ensure storage paths exist in the DWDI system
- Validate that the paths have usage data for the requested time period
Connection Issues
Problem: Network connectivity or API access failures
Solutions:
- Use the
ping()method to test API connectivity - Check network connectivity to the DWDI API endpoint
- Verify SSL/TLS configuration and certificates
- If behind a firewall, configure SOCKS proxy (
socks_proxysetting) - Check DNS resolution for the API hostname
Proxy Issues
Problem: SOCKS or HTTP proxy connection failures
Solutions:
- Verify proxy server is running and accessible
- Check proxy authentication if required
- Test proxy connectivity manually:
curl --proxy socks5://localhost:12345 https://dwdi.cscs.ch - Ensure proxy supports the required protocol (SOCKS4/5, HTTP)
- Verify proxy URL format is correct (e.g.,
socks5://hostname:port)
Debugging Tips
Enable debug logging:
import logging
logging.getLogger('waldur_site_agent_cscs_dwdi').setLevel(logging.DEBUG)
Test API connectivity:
# Test direct API access
curl -H "Authorization: Bearer YOUR_TOKEN" https://dwdi.cscs.ch/api/v1/
# Test through proxy
curl --proxy socks5://localhost:12345 -H "Authorization: Bearer YOUR_TOKEN" https://dwdi.cscs.ch/api/v1/
Development
Project Structure
plugins/cscs-dwdi/
├── pyproject.toml # Plugin configuration
├── README.md # This documentation
├── examples/ # Configuration examples
├── waldur_site_agent_cscs_dwdi/
│ ├── __init__.py # Package init
│ ├── backend.py # Backend implementations
│ └── client.py # CSCS-DWDI API client
└── tests/
└── test_cscs_dwdi.py # Plugin tests
Key Classes
CSCSDWDIComputeBackend: Compute usage reporting backendCSCSDWDIStorageBackend: Storage usage reporting backendCSCSDWDIClient: HTTP client for CSCS-DWDI API communication with OIDC authentication
Key Features
- Automatic Token Management: OIDC tokens are cached and refreshed automatically
- Proxy Support: Built-in SOCKS and HTTP proxy support using httpx
- Error Handling: Comprehensive error handling with detailed logging
- Flexible Configuration: Support for custom unit conversions and component mappings
Extension Points
To extend the plugin:
- Additional Endpoints: Modify
CSCSDWDIClientto support more API endpoints - Authentication Methods: Update authentication logic in
client.py - Data Processing: Enhance response processing methods for additional data formats
- Proxy Types: Extend proxy support for additional proxy protocols
Contributing
When contributing to this plugin:
- Follow the existing code style and patterns
- Add tests for new functionality
- Update documentation for new features
- Ensure backward compatibility with existing configurations
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file waldur_site_agent_cscs_dwdi-1.0.1rc11.tar.gz.
File metadata
- Download URL: waldur_site_agent_cscs_dwdi-1.0.1rc11.tar.gz
- Upload date:
- Size: 16.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.11.3 {"installer":{"name":"uv","version":"0.11.3","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Alpine Linux","version":"3.23.3","id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
aa6643f8c7410b5ae7d7e1bb36a04ad6ab23c55ff96b7eaf41156e0e219735ed
|
|
| MD5 |
211f115c83a12166c131adfc42d74db4
|
|
| BLAKE2b-256 |
d9243c5d32adfffb25eed286041ba9ef7acb7b78eb862ce0d9195b08f972549f
|
File details
Details for the file waldur_site_agent_cscs_dwdi-1.0.1rc11-py3-none-any.whl.
File metadata
- Download URL: waldur_site_agent_cscs_dwdi-1.0.1rc11-py3-none-any.whl
- Upload date:
- Size: 14.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.11.3 {"installer":{"name":"uv","version":"0.11.3","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Alpine Linux","version":"3.23.3","id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1a5f43efef6a82bfdaa0d4f10df14895fab08e5a853d5c41ffe0252d7f5b9a1e
|
|
| MD5 |
bd04810cede9a034f7d29d1c6e0d42a0
|
|
| BLAKE2b-256 |
1699eb6d1b06f17cd53bafbc73c770c2626d387a8b35bf50a79278c3a5ce1205
|