netflix-spectator-py

Python library for reporting metrics to Atlas.

These details have not been verified by PyPI

Project links

Homepage

Project description

Introduction

Python port of the Spectator library for Java.

See the Spectator documentation for an overview of core concepts and details on usage.

Supports Python >= 3.5, which is the oldest system Python 3 available on our commonly used OSes.

Local Development

Install pyenv, possibly with Homebrew.
Install Python versions: 3.5, 3.6, 3.7, and 3.8. Enable all versions globally.
Make changes and add tests.
tox

Usage

Installing

The netflix-spectator-py package alone is not sufficient to report data to an Atlas backend - a configuration package must also be installed.

At Netflix, the internal configuration package is named netflix-spectator-pyconf and it declares a dependency on this client package.

pip3 install netflix-spectator-pyconf

Importing

Standard Usage

At Netflix, your initialization script should load the environment, to ensure that the standard variables are available to the Python application.

source /etc/nflx/environment

Importing the GlobalRegistry configures common tags based on environment variables, sets the Atlas Aggregator URL, and starts a background thread which reports metrics data every five seconds.

from spectator import GlobalRegistry

Once the GlobalRegistry is imported, it is used to create and manage Meters.

Concurrent Usage

There is a known issue in Python where forking a process after a thread is started can lead to deadlocks. This is commonly seen when using the multiprocessing module with default settings. The root cause of the deadlocks is that fork() copies everything in memory, including globals that have been set in imported modules, but it does not copy threads - any threads started in the parent process will not exist in the child process. The possibility of deadlocks occurs when global state sets a lock and it then depends upon a thread to remove the lock.

Under the standard usage model for spectator-py, it starts a background publishing thread when the module is imported, which is responsible for manipulating a lock. Below, there are a couple of options described for working around this issue.

Gunicorn

If you are using spectator-py while running under Gunicorn, then do not use the --preload flag, which loads application code before worker processes are forked. Preloading triggers the conditions that allow deadlocks to occur in the background publish thread.

At Netflix, you can set the following flag in /etc/default/ezconfig to achieve this configuration for Gunicorn:

WSGI_GUNICORN_PRELOAD = undef

Task Worker Forking

For other pre-fork worker processing frameworks, such as huey, you need to be careful about how and when you start the GlobalRegistry to avoid deadlocks in the background publish thread. You should set the SPECTATOR_PY_DISABLE_AUTO_START_GLOBAL environment variable to disable automatic startup of the Registry, so that you can plan to start it manually after all of the workers have been forked.

You can set the variable as a part of your initialization script:

export SPECTATOR_PY_DISABLE_AUTO_START_GLOBAL=1

You can set the variable in Python code, as long as it is at the top of the module where you plan to use spectator-py:

import os

os.environ["SPECTATOR_PY_DISABLE_AUTO_START_GLOBAL"] = "1"

After your workers have started, you can then start the GlobalRegistry as follows:

from spectator import GlobalRegistry

GlobalRegistry.start()

It is often best to have the import and start() within an initialization function for the workers, to help ensure that it is not started when the module is loaded.

Generic Multiprocessing

In Python 3, you can configure the start method to spawn for multiprocessing. This will cause the module to do a fork() followed by an execve() to start a brand new Python process.

To configure this option globally:

from multiprocessing import set_start_method
set_start_method("spawn")

To configure thid option within a context:

from multiprocessing import get_context

def your_func():
    with get_context("spawn").Pool() as pool:
        pass

Logging

This package provides three loggers:

spectator.init
spectator.HttpClient
spectator.Registry

When troubleshooting metrics collection and reporting, you should set Registry logging to the DEBUG level. For example:

import logging

# record the human-readable time, name of the logger, logging level, thread id and message
logging.basicConfig(
    level=logging.DEBUG,
    format='%(asctime)s - %(name)s - %(levelname)s - %(thread)d - %(message)s'
)

# silence the HttpClient logger output, to minimize confusion while reading logs
logging.getLogger('spectator.HttpClient').setLevel(logging.ERROR)

# set the Registry logger to INFO or ERROR when done troubleshooting
logging.getLogger('spectator.Registry').setLevel(logging.DEBUG)

Detecting Deadlocks

If you need to detect whether or not your application is affected by deadlocks, then you can use sys._current_frames to collect stack frames periodically and check them. A common pattern is to run this on a background thread every 10 seconds.

Working with IDs

The IDs used for looking up a meter in the GlobalRegistry consist of a name and a set of tags. IDs will be consumed by users many times after the data has been reported, so they should be chosen thoughtfully, while considering how they will be used. See the naming conventions page for general guidelines.

IDs are immutable, so they can be freely passed around and used in a concurrent context. Tags can be added to an ID when it is created, to track the dimensionality of the metric. All tag keys and values must be strings. For example, if you want to keep track of the number of successful requests, you must cast integers to strings.

requests_id = GlobalRegistry.counter('server.numRequests', {'statusCode': str(200)})
requests_id.increment()

Meter Types

Counters

A Counter is used to measure the rate at which an event is occurring. Considering an API endpoint, a Counter could be used to measure the rate at which it is being accessed.

Counters are reported to the backend as a rate-per-second. In Atlas, the :per-step operator can be used to convert them back into a value-per-step on a graph.

Call increment() when an event occurs:

GlobalRegistry.counter('server.numRequests').increment()

You can also pass a value to increment(). This is useful when a collection of events happens together:

GlobalRegistry.counter('queue.itemsAdded').increment(10)

Distribution Summaries

A Distribution Summary is used to track the distribution of events. It is similar to a Timer, but more general, in that the size does not have to be a period of time. For example, a Distribution Summary could be used to measure the payload sizes of requests hitting a server.

Always use base units when recording data, to ensure that the tick labels presented on Atlas graphs are readable. If you are measuring payload size, then use bytes, not kilobytes (or some other unit). This means that a 4K tick label will represent 4 kilobytes, rather than 4 kilo-kilobytes.

Call record() with a value:

GlobalRegistry.distribution_summary('server.requestSize').record(10)

Gauges

A gauge is a value that is sampled at some point in time. Typical examples for gauges would be the size of a queue or number of threads in a running state. Since gauges are not updated inline when a state change occurs, there is no information about what might have occurred between samples.

Consider monitoring the behavior of a queue of tasks. If the data is being collected once a minute, then a gauge for the size will show the size when it was sampled. The size may have been much higher or lower at some point during interval, but that is not known.

Call set() with a value:

GlobalRegistry.gauge('server.queueSize').set(10)

Gauges are designed to report the last set value for 15 minutes. This done so that updates to the values do not need to be collected on a tight 1-minute schedule to ensure that Atlas shows unbroken lines in graphs.

If you wish to no longer report a Gauge value, then set it to float('nan'). This is a separate and distinct value from 'nan' or 'NaN', which are strings.

Timers

A Timer is used to measure how long (in seconds) some event is taking.

Call record() with a value:

GlobalRegistry.timer('server.requestLatency').record(0.01)

Timers will keep track of the following statistics as they are used:

count
totalTime
totalOfSquares
max

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

1.1.2

Oct 2, 2025

1.1.1

Jun 25, 2025

1.1.0

Jun 24, 2025

1.0.6

May 19, 2025

1.0.5

May 16, 2025

1.0.4

May 15, 2025

1.0.3

Mar 30, 2025

1.0.2

Mar 20, 2025

1.0.1

Aug 30, 2024

1.0.0

Aug 5, 2024

1.0.0rc5 pre-release

Aug 3, 2024

1.0.0rc4 pre-release

Jul 30, 2024

1.0.0rc3 pre-release

Jul 19, 2024

1.0.0rc2 pre-release

Jul 19, 2024

1.0.0rc1 pre-release

Jul 12, 2024

1.0.0rc0 pre-release

Jul 11, 2024

0.2.10

Jan 19, 2023

0.2.9

Aug 29, 2022

0.2.8

Aug 29, 2022

0.2.7

Aug 23, 2022

0.2.6

Apr 29, 2022

0.2.5

Apr 27, 2022

0.2.4

Apr 27, 2022

0.2.3

Apr 27, 2022

0.2.2

Apr 26, 2022

0.2.1

Apr 26, 2022

0.2.0

Apr 19, 2022

0.1.18

Mar 30, 2022

0.1.17

Oct 21, 2021

0.1.16

Mar 24, 2021

This version

0.1.15

Aug 13, 2020

0.1.14

Apr 21, 2020

0.1.13

Feb 4, 2020

0.1.12

Dec 12, 2019

0.1.11

Nov 8, 2019

0.1.10

Jun 3, 2019

0.1.9

Feb 12, 2019

0.1.8

Feb 11, 2019

0.1.7

Oct 5, 2018

0.1.6

Oct 5, 2018

0.1.5

Apr 26, 2018

0.1.4

Apr 26, 2018

0.1.3

Mar 20, 2018

0.1.2

Mar 15, 2018

0.1.1

Mar 15, 2018

0.1

Mar 15, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

netflix-spectator-py-0.1.15.tar.gz (23.8 kB view details)

Uploaded Aug 13, 2020 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

netflix_spectator_py-0.1.15-py2.py3-none-any.whl (21.1 kB view details)

Uploaded Aug 13, 2020 Python 2Python 3

File details

Details for the file netflix-spectator-py-0.1.15.tar.gz.

File metadata

Download URL: netflix-spectator-py-0.1.15.tar.gz
Upload date: Aug 13, 2020
Size: 23.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.23.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.7.5

File hashes

Hashes for netflix-spectator-py-0.1.15.tar.gz
Algorithm	Hash digest
SHA256	`794f9e341e2c2c1f312495772ddaaaf87d25121a6271aea9fef71b1cef471738`
MD5	`f26b69a4e4302b45168340d77740fa2c`
BLAKE2b-256	`71fad0e93f832a763fa1e7d02bf42d4e1288b6cae879bf983c1bee4fa036e94f`

See more details on using hashes here.

File details

Details for the file netflix_spectator_py-0.1.15-py2.py3-none-any.whl.

File metadata

Download URL: netflix_spectator_py-0.1.15-py2.py3-none-any.whl
Upload date: Aug 13, 2020
Size: 21.1 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.23.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.7.5

File hashes

Hashes for netflix_spectator_py-0.1.15-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`9b220274f3bf1f06c0ae8645e9a725721b0e93668980a8d97542ada1ec3139f9`
MD5	`86d328a793ed5a7670b0af6a4fc5cb82`
BLAKE2b-256	`73d1d4a3c452de3b8f891e4e31b0f3652e4d1e708e1fb28dbe0436ef8fa14407`

See more details on using hashes here.

netflix-spectator-py 0.1.15

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Introduction

Local Development

Usage

Installing

Importing

Standard Usage

Concurrent Usage

Gunicorn

Task Worker Forking

Generic Multiprocessing

Logging

Detecting Deadlocks

Working with IDs

Meter Types

Counters

Distribution Summaries

Gauges

Timers

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes