pySigma DuckDB backend for local Sigma rule validation against JSON logs
Project description
pySigma DuckDB Backend
This is a pySigma backend that generates DuckDB SQL queries from Sigma rules. It's designed for local validation of Sigma rules against JSON log files, making it ideal for CI/CD pipelines and regression testing.
Features
- Convert Sigma rules to DuckDB SQL queries
- Validate rules against local JSON log files
- Support for JSON arrays, single objects, and NDJSON formats
- Built-in
LogIndexclass for efficient log loading and querying ValidationResultwith match counts and matched log details
Installation
pip install pySigma-backend-duckdb
Usage
Basic Query Generation
from sigma.rule import SigmaRule
from sigma.collection import SigmaCollection
from sigma.backends.duckdb import DuckDBBackend
rule = SigmaRule.from_yaml("""
title: Suspicious PowerShell Execution
logsource:
category: process_creation
product: windows
detection:
selection:
CommandLine|contains: powershell
condition: selection
""")
backend = DuckDBBackend()
queries = backend.convert(SigmaCollection([rule]))
print(queries[0])
# SELECT * FROM logs WHERE CommandLine ILIKE '%powershell%'
Validating Rules Against Local Logs
from sigma.backends.duckdb import DuckDBBackend, LogIndex
# Load logs from JSON files
index = LogIndex()
index.load_json_file("logs.json")
# Or load from a directory
index.load_directory("logs/")
# Validate a rule
backend = DuckDBBackend()
result = backend.validate_rule(rule_yaml, index)
print(f"Rule: {result.rule_title}")
print(f"Matches: {result.match_count}/{result.total_logs}")
print(f"Success: {result.success}")
for log in result.matched_logs:
print(f" - {log.get('CommandLine', 'N/A')}")
Directory Validation for CI
from sigma.backends.duckdb import validate_rules_directory
# Validate all rules against all logs
results = validate_rules_directory(
rules_dir="rules/",
logs_dir="test_logs/",
)
for result in results:
status = "PASS" if result.has_matches else "FAIL"
print(f"{status}: {result.rule_title} ({result.match_count} matches)")
Log Format
The backend expects logs in JSON format. By default, it uses Sysmon field names (e.g., CommandLine, Image, ParentImage). You can use custom pipelines for different schemas.
Splunk Sysmon format (default):
{
"CommandLine": "powershell.exe -e ...",
"Image": "C:\\Windows\\System32\\powershell.exe",
"ParentImage": "C:\\Windows\\System32\\cmd.exe"
}
Elastic ECS format (with elastic_ecs pipeline):
{
"process": {
"command_line": "powershell.exe -e ...",
"executable": "C:\\Windows\\System32\\powershell.exe"
}
}
Pipelines
The backend includes pipelines for common log formats:
from sigma.backends.duckdb import DuckDBBackend
from sigma.backends.duckdb.pipelines import splunk_sysmon, elastic_ecs
# For Splunk with Sysmon TA (default)
backend = DuckDBBackend(processing_pipeline=splunk_sysmon())
# For Elasticsearch with ECS
backend = DuckDBBackend(processing_pipeline=elastic_ecs())
License
MIT
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pysigma_backend_duckdb-0.1.2.tar.gz.
File metadata
- Download URL: pysigma_backend_duckdb-0.1.2.tar.gz
- Upload date:
- Size: 58.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.9 {"installer":{"name":"uv","version":"0.9.9"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a9859a91ffd42b9df7b5389c5b46ee5643f9c4f077b16fd3ec048e7bf367b403
|
|
| MD5 |
30a376e31965c1a8a8188c7f30b9a17f
|
|
| BLAKE2b-256 |
81b1ed3dd5915f58bf1b699ae143beea23ed563c6dc713548fad32bacfe3621f
|
File details
Details for the file pysigma_backend_duckdb-0.1.2-py3-none-any.whl.
File metadata
- Download URL: pysigma_backend_duckdb-0.1.2-py3-none-any.whl
- Upload date:
- Size: 8.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.9 {"installer":{"name":"uv","version":"0.9.9"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5f7cbf467a9d4fd6d7d6ed109328875cb4559814906a4ac83c4c307f02034d96
|
|
| MD5 |
f546435cf722977226f40792230e0b64
|
|
| BLAKE2b-256 |
f87014e1fce4cd3040497a6bb4f57c4f24d63b1606c155fea402ffdbe783659b
|