Performance analysis on GPUs with NVIDIA tools (online)

Online ()


The link to the Zoom session will be communicated to registered participants a few days before the event

NHR@FAUPorting Code to the GPU promises large speedups, but there can be pitfalls on the way to realizing its potential. This course will introduce NVIDIA's profiler as a tool to spot common performance bugs that arise when porting code to the GPU.

A practical demonstration introduces the basics of GPU performance analysis. NVIDIA's profiling tool Nsight Systems ist used to analyze GPU utilization and to spot performance anomalies. The complementary tool Nsight Compute is then used to gain more insight about the performance of individual GPU kernels. The performance analysis will be guided by simple, resource-based performance models which will enable the developer to develop a concept of how far the performance is from the "target."

Attendees will be able to follow along the demos and conduct their own experiments on the NHR@FAU GPU cluster.


  • basic understanding of GPU programming with CUDA, OpenMP, or OpenACC
  • compile applications by invoking the compiler on the command line
  • editing files through ssh connection via vim, emacs, nano, ...


Participation is free of charge for attendees from German universities, academic computing centers, and research institutions. Please only register for the course if you are really going to attend. No-shows will be blacklisted and excluded from future NHR@FAU events.


All actual course attendees will receive a course certificate.


Dominik Ernst (NHR@FAU)

This course is organized by Erlangen National High Performance Computing Center (NHR@FAU)

    • 09:00 10:30
      Lecture and hands-on part 1 1h 30m
    • 10:30 10:40
      Break 10m
    • 10:40 11:40
      Lecture and hands-on part 2 1h
    • 11:40 11:50
      Break 10m
    • 11:50 13:00
      Lecture and hands-on part 3 1h 10m
Your browser is out of date!

Update your browser to view this website correctly. Update my browser now