A profiling and performance analysis tool for TensorFlow
The profiler includes a suite of tools. These tools help you understand, debug and optimize TensorFlow programs to run on CPUs, GPUs and TPUs.
First time user? Come and check out this Colab Demo.
Note: The TensorFlow Profiler requires access to the Internet to load the Google Chart library. Some charts and tables may be missing if you run TensorBoard entirely offline on your local machine, behind a corporate firewall, or in a datacenter.
To profile on a single GPU system, the following NVIDIA software must be installed on your system:
$ /sbin/ldconfig -N -v $(sed 's/:/ /g' <<< $LD_LIBRARY_PATH) | grep libcupti
If you don't see
libcupti.so.10.1on the path, prepend its installation directory to the $LDLIBRARYPATH environmental variable:
$ export LD_LIBRARY_PATH=/usr/local/cuda/extras/CUPTI/lib64:$LD_LIBRARY_PATH
Run the ldconfig command above again to verify that the CUPTI 10.1 library is found.
If this doesn't work, try:
shell $ sudo apt-get install libcupti-dev
To profile a system with multiple GPUs, see this guide for details.
To profile multi-worker GPU configurations, profile individual workers independently.
To profile cloud TPUs, you must have access to Google Cloud TPUs.
Install nightly version of profiler by downloading and running the
install_and_run.pyscript from this directory.
$ git clone https://github.com/tensorflow/profiler.git profiler $ mkdir profile_env $ python3 profiler/install_and_run.py --envdir=profile_env --logdir=profiler/demoGo to
localhost:6006/#profileof your browser, you should now see the demo overview page show up.