Log-to-InfluxDB translator (rsyslog and journald)

These details have not been verified by PyPI

Project links

Homepage

Project description

logflux

Parse log messages and send regex-matched values to InfluxDB. Supports rsyslog (via Unix socket) and journald (via systemd journal API) as log sources.

Installation

pip install logflux

For journald support, you also need python-systemd, a C binding for libsystemd that should be installed via your system package manager:

# Fedora/RHEL/CentOS
dnf install python3-systemd

# Debian/Ubuntu
apt install python3-systemd

Usage

logflux [-s {rsyslog,journald}] [-c CONFIG] [-v] [-d] [-t]

Option	Description
`-s`, `--source`	Log source: `rsyslog` (default) or `journald`
`-c`, `--config`	Config file path (default: `logflux.yaml`)
`-v`, `--verbose`	Print InfluxDB line protocol to stdout
`-d`, `--debug`	Enable debug logging
`-t`, `--telegraf`	Telegraf mode: process new entries since last run and exit (journald only)

rsyslog

rsyslog 8.33.1 and newer supports creating log output in JSON format, which is preferred. To use, configure rsyslog (the template format is important) with:

template(name="logflux" type="list" option.jsonf="on") {
	property(outname="@timestamp" name="timereported" dateFormat="rfc3339" format="jsonf")
	property(outname="host" name="hostname" format="jsonf")
	property(outname="severity" name="syslogseverity-text" caseConversion="upper" format="jsonf")
	property(outname="facility" name="syslogfacility-text" format="jsonf")
	property(outname="syslog-tag" name="syslogtag" format="jsonf")
	property(outname="source" name="app-name" format="jsonf")
	property(outname="message" name="msg" format="jsonf")
}

module(load="omuxsock")
$OMUxSockSocket /tmp/logflux.sock
*.*	:omuxsock:;logflux

If you have older versions of rsyslog that do not support JSON output, you can use the "legacy" format with the following template:

template(name="logflux" type="list") {
	constant(value="@timestamp: ")
	property(outname="@timestamp" name="timereported" dateFormat="rfc3339")
	constant(value="\nhost: ")
	property(outname="host" name="hostname")
	constant(value="\nseverity: ")
	property(outname="severity" name="syslogseverity-text" caseConversion="upper")
	constant(value="\nfacility: ")
	property(outname="facility" name="syslogfacility-text")
	constant(value="\nsyslog-tag: ")
	property(outname="syslog-tag" name="syslogtag")
	constant(value="\nsource: ")
	property(outname="source" name="app-name")
	constant(value="\n\n")
	property(outname="message" name="msg")
}

logflux will automatically detect the format of the first message received and assume this format for all subsequent messages. If you change message formats, restart logflux.

Note: You may only want to send a subset of syslog messages to logflux, you can do so with filter conditions. Note that advanced/RainerScript configuration syntax is not supported with omuxsock as of rsyslog 8.34.0.

journald

logflux can read directly from the systemd journal. In continuous mode (default), it tails the journal and processes new entries as they appear. In telegraf mode (-t), it processes all entries since the last run and exits, making it suitable for use as a Telegraf exec input plugin.

Telegraf integration

To use logflux as a Telegraf exec input, add to your telegraf.conf:

[[inputs.exec]]
    commands = ["logflux -s journald -t -c /etc/logflux.yaml"]
    data_format = "influx"

In telegraf mode, logflux tracks its position using a timestamp file (default: .last_timestamp, configurable via last_timestamp_file in the config). It writes InfluxDB line protocol to stdout instead of connecting to InfluxDB directly.

journald configuration

journald mode uses filters to select which journal entries to process:

---

last_timestamp_file: /var/lib/telegraf/logflux_last_timestamp

filters:
  - key: _SYSTEMD_UNIT
    value: nginx.service

rules:
  - name: nginx_rate_limit
    match:
      key: MESSAGE
      regex: '^.*\[error\].* limiting requests, excess: (?P<excess>\d+\.\d+) by zone "(?P<zone>[^"]+)", client: (?P<client>[^,]+), server: [^,]+, request: "(?P<method>\S+) (?P<path>\S+) \S+", host: "[^"]+"'
    fields:
      value:
        lookup: MESSAGE.excess
        type: float
    tags:
      zone: MESSAGE.zone
      client: MESSAGE.client
      path:
        lookup: MESSAGE.path
        transform:
          - match: '(?<=/)[0-9a-fA-F]{16,}(?=(?:/|$|[/?]))'
            sub: 'ID'
          - match: '\?.*'
            sub: ''

Filters correspond to systemd journal fields (e.g. _SYSTEMD_UNIT, _HOSTNAME). Multiple filters with the same key are ORed together, as per the systemd journal API.

Note that journald message fields use different names than rsyslog. The journal message body is in the MESSAGE field (uppercase), and timestamps are handled automatically from __REALTIME_TIMESTAMP.

Configuration

logflux uses a YAML configuration file (default: logflux.yaml, override with -c).

rsyslog-specific options

socket: /tmp/logflux.sock       # Unix socket path (default: /run/logflux.sock)
socket_mode: "0660"              # Unix socket permissions (octal, default: inherited from umask)
message_format: json             # "json", "legacy", or omit for auto-detection
server_type: threading           # "forking", "threading", or omit for single-threaded

The socket_mode option sets the file permissions on the Unix socket after creation. The value is an octal string (e.g. "0660" to allow group write access, "0600" to restrict to the owner only). If omitted, the socket permissions are determined by the process umask. Quote the value in YAML to prevent it from being parsed as a decimal integer.

journald-specific options

last_timestamp_file: .last_timestamp   # Timestamp file for telegraf mode
namespace: mynamespace                  # Journal namespace (requires python-systemd >= 235)
filters:                                # Journal match filters
  - key: _SYSTEMD_UNIT
    value: nginx.service

The namespace option reads from a specific journald namespace (see systemd-journald.service(8)). With python-systemd version 235 or newer, the native namespace parameter is used. On older versions, logflux falls back to opening the namespace journal directory directly via path= (journals are stored at /var/log/journal/<machine-id>.<namespace>/). Omit the option to read from the default namespace.

Common options

influx:                          # InfluxDB connection parameters (not used in telegraf mode)
    host: localhost
    port: 8086

database: logflux                # InfluxDB database name

Rules

Rules define how log messages are matched and converted to InfluxDB points:

rules:
  - name: measurement_name       # InfluxDB measurement name
    match:
      key: message                # Message field to match against
      regex: '(?P<group>pattern)' # Regex with named capture groups
    fields:                       # InfluxDB fields (at least one required)
      field_name: message.group   # Simple: reference a capture group
      field_name:                 # Extended: with type conversion
        lookup: message.group
        type: float               # "int" or "float"
    tags:                         # InfluxDB tags (optional)
      tag_name: message.group     # Simple: reference a capture group
      tag_name: host              # Or reference a top-level message field
      tag_name:                   # Extended: with transforms
        lookup: message.group
        transform:                # Apply regex substitutions in order
          - match: 'pattern'
            sub: 'replacement'

Field and tag lookups

Values for fields and tags are specified as lookups:

message_key - Reference a top-level field from the message (e.g. host, severity)
match_key.group_name - Reference a named capture group from the regex match (e.g. message.excess). The match_key must be the same key used in the rule's match.key.

Type conversion

Fields can specify a type to convert the captured string value:

fields:
  value:
    lookup: message.excess
    type: float    # Convert to float (also supports "int")

If no type is specified, the value is stored as a string.

Transforms

Tags (and fields) support a transform list that applies regex substitutions to the value before storing it. Transforms are applied in order, each operating on the result of the previous one.

This is useful for normalizing high-cardinality values. For example, to aggregate nginx rate limit metrics by URL path while replacing unique IDs:

tags:
  path:
    lookup: MESSAGE.path
    transform:
      - match: '(?<=/)[0-9a-fA-F]{16,}(?=(?:/|$|[/?]))'
        sub: 'ID'
      - match: '\?.*'
        sub: ''

This replaces long hex IDs in URL paths with ID (e.g. /api/datasets/abcdef1234567890 becomes /api/datasets/ID) and strips query strings, reducing cardinality for InfluxDB tags while preserving meaningful path structure.

Computed fields (math)

Fields can compute a value from multiple captured groups using a math expression. Instead of lookup, specify a math expression and a vars map that defines the variables used in the expression:

fields:
  rate:
    math: "bytes / seconds"
    vars:
      bytes:
        lookup: MESSAGE.bytes
        type: float
      seconds:
        lookup: MESSAGE.seconds
        type: float

Each variable in vars is resolved using the same lookup syntax as regular fields (including type conversion). The math expression is then evaluated with those variables. The result can optionally be type-converted with type on the field itself.

Expressions support standard arithmetic operators (+, -, *, /, //, %, **) and a subset of Python math functions: ceil, floor, log, log2, log10, sqrt, abs, pow. Constants can be used directly in the expression (e.g. bytes / 1024).

If any variable resolves to None (e.g. a capture group that didn't match) or is non-numeric, the entire field is skipped.

If no fields are specified, the default field is {value: message} for rsyslog and {value: MESSAGE} for journald, which stores the full message body.

Security considerations

Regex performance (ReDoS)

Rule regex patterns and transform patterns from the configuration are applied to untrusted log message content. Python's re engine uses backtracking, which means a poorly written regex can cause catastrophic performance on crafted input — a class of denial-of-service known as ReDoS.

To avoid this, ensure your patterns do not contain nested quantifiers (e.g. (a+)+, (a*)*, (a|b*)+) or other constructs that create exponential backtracking. Prefer specific character classes (\d+, [^"]+) over greedy wildcards (.*), and anchor patterns where possible.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

1.0.0

Mar 25, 2026

0.2.2

Sep 7, 2018

0.2.1

Sep 7, 2018

0.2

Sep 7, 2018

0.1.1

Sep 6, 2018

0.1

Sep 6, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

logflux-1.0.0.tar.gz (23.4 kB view details)

Uploaded Mar 25, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

logflux-1.0.0-py3-none-any.whl (14.7 kB view details)

Uploaded Mar 25, 2026 Python 3

File details

Details for the file logflux-1.0.0.tar.gz.

File metadata

Download URL: logflux-1.0.0.tar.gz
Upload date: Mar 25, 2026
Size: 23.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for logflux-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`53c43bc22b1a7eb2307bd43de0995ed154f2651a3542268f967fbef5094b0b1c`
MD5	`ff40cc2f6dabccb3ebc6b4045b25aa2b`
BLAKE2b-256	`a134b2e2633a5dfad7590e4bda826653cbb18998cb1b132c7427d2e6b36ae966`

See more details on using hashes here.

File details

Details for the file logflux-1.0.0-py3-none-any.whl.

File metadata

Download URL: logflux-1.0.0-py3-none-any.whl
Upload date: Mar 25, 2026
Size: 14.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for logflux-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`293f929ad187f92e3188b65cbc330940db6fea566084b35b2b12ae7f83dbce06`
MD5	`8dae6009d306e9087e2f817e7e39967e`
BLAKE2b-256	`52108da81469fe22e37923bade265c0da881e96df33d684b85564d078d3d9960`

See more details on using hashes here.

logflux 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

logflux

Installation

Usage

rsyslog

journald

Telegraf integration

journald configuration

Configuration

rsyslog-specific options

journald-specific options

Common options

Rules

Field and tag lookups

Type conversion

Transforms

Computed fields (math)

Security considerations

Regex performance (ReDoS)

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes