Skip to main content

Profile Tensorboard Plugin

Project description

Tensorboard Profiler Plugin

The profiler includes a suite of tools for JAX, TensorFlow, and PyTorch/XLA. These tools help you understand, debug and optimize programs to run on CPUs, GPUs and TPUs.

The profiler plugin offers a number of tools to analyse and visualize the performance of your model across multiple devices. Some of the tools include:

  • Overview: A high-level overview of the performance of your model. This is an aggregated overview for your host and all devices. It includes:
    • Performance summary and breakdown of step times.
    • A graph of individual step times.
    • A table of the top 10 most expensive operations.
  • Trace Viewer: Displays a timeline of the execution of your model that shows:
    • The duration of each op.
    • Which part of the system (host or device) executed an op.
    • The communication between devices.
  • Memory Profile Viewer: Monitors the memory usage of your model.
  • Graph Viewer: A visualization of the graph structure of HLOs of your model.

Demo

First time user? Come and check out this Colab Demo.

Prerequisites

  • TensorFlow >= 2.18.0
  • TensorBoard >= 2.18.0
  • tensorboard-plugin-profile >= 2.18.0

Note: The Tensorboard Profiler Plugin requires access to the Internet to load the Google Chart library. Some charts and tables may be missing if you run TensorBoard entirely offline on your local machine, behind a corporate firewall, or in a datacenter.

To profile on a single GPU system, the following NVIDIA software must be installed on your system:

  1. NVIDIA GPU drivers and CUDA Toolkit:

    • CUDA 12.5 requires 525.60.13 and higher.
  2. Ensure that CUPTI 10.1 exists on the path.

    $ /sbin/ldconfig -N -v $(sed 's/:/ /g' <<< $LD_LIBRARY_PATH) | grep libcupti
    

    If you don't see libcupti.so.12.5 on the path, prepend its installation directory to the $LD_LIBRARY_PATH environmental variable:

    $ export LD_LIBRARY_PATH=/usr/local/cuda/extras/CUPTI/lib64:$LD_LIBRARY_PATH
    

    Run the ldconfig command above again to verify that the CUPTI 12.5 library is found.

    If this doesn't work, try:

    $ sudo apt-get install libcupti-dev
    

To profile a system with multiple GPUs, see this guide for details.

To profile multi-worker GPU configurations, profile individual workers independently.

To profile cloud TPUs, you must have access to Google Cloud TPUs.

Quick Start

The profiler plugin follows the TensorFlow versioning scheme. As a result, the tensorboard-plugin-profile PyPI package can be behind the tbp-nightly PyPI package. In order to get the latest version of the profiler plugin, you can install the nightly package..

To install the nightly version of profiler:

$ pip uninstall tensorboard-plugin-profile
$ pip install tbp-nightly

Run TensorBoard:

$ tensorboard --logdir=profiler/demo

If you are behind a corporate firewall, you may need to include the --bind_all tensorboard flag.

Go to localhost:6006/#profile of your browser, you should now see the demo overview page show up. Overview Page Congratulations! You're now ready to capture a profile.

Next Steps

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tbp_testing-2.19.0.tar.gz (5.9 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tbp_testing-2.19.0-py3-none-any.whl (6.0 MB view details)

Uploaded Python 3

File details

Details for the file tbp_testing-2.19.0.tar.gz.

File metadata

  • Download URL: tbp_testing-2.19.0.tar.gz
  • Upload date:
  • Size: 5.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.10

File hashes

Hashes for tbp_testing-2.19.0.tar.gz
Algorithm Hash digest
SHA256 9f9b9fe0a35eff0b27e3fd876a09699a270b220db6c6e08c0593d82e7d77bea7
MD5 8c5a7a7938326e105d7c4bb93b0c4c6e
BLAKE2b-256 65f9d043d412c52b78223f9f3c0de192c47eadcca7c6e060d01e73b0c2240b73

See more details on using hashes here.

File details

Details for the file tbp_testing-2.19.0-py3-none-any.whl.

File metadata

  • Download URL: tbp_testing-2.19.0-py3-none-any.whl
  • Upload date:
  • Size: 6.0 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.10

File hashes

Hashes for tbp_testing-2.19.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b982cfad0b72653979f7c687bd26d864ffebc99cafc8c45f64b98790203e4004
MD5 ea367cfabe87ccb67bff5fafe3e60e85
BLAKE2b-256 1c5d1cbec0b9a19fe165e236169d5491d582b0c468882a4bd08b42a606c6f1b6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page