Nvidia Training Course : Find the Bottleneck – Optimize AI Pipelines With Nsight Systems

sm-nftb
Durée : 0.5 jour
Tarif : 650€ HT

97%

Taux de satisfaction clients
(sur 1838 évaluations du 19/05/21 au 18/03/26)

Voir les avis

Objectifs

By participating in this workshop, you will be equipped to:

Profile AI applications using NVIDIA Nsight Systems to identify performance bottlenecks.

Annotate code with NVTX (NVIDIA Tools Extension) for better visualization and analysis.

Analyze and optimize data transfers between host CPU and GPU memory.

Leverage GPU-accelerated libraries to eliminate unnecessary data movement.

Use Nsight Systems plugins and multi-report analysis for advanced performance insights.

Apply optimization strategies to real-world AI pipelines for significant speedups.

Prérequis

Basic Python programming experience and familiarity with Jupyter notebooks.

Understanding of GPU computing concepts (CUDA knowledge helpful but not required).

Familiarity with deep learning frameworks, particularly PyTorch.

Basic understanding of video processing concepts.

Public

IT professionals

Formation(s) associée(s)

Dernière mise à jour

Programme mis à jour le 13 avril 2026

Cette formation vous intéresse ? Contactez-nous

Bon à savoir

Evaluez votre niveau

Pour vous aider à bien choisir votre formation, nous vous proposons soit un entretien avec le formateur soit un test d’évaluation. Cela vous assurera que vous disposez des connaissances nécessaires pour suivre la formation dans des conditions optimales.

Sessions garanties

La majorité de nos sessions proposées en distanciel sont garanties. Elles peuvent être enregistrées à la demande.

Travaux pratiques

Nos formations comprennent de nombreux travaux pratiques pour un meilleur apprentissage (60 % de pratique). Nous proposons également de travailler sur vos données pour une meilleure expérience.

Nos prestations

Nous réalisons certaines prestations IT pour vous : développement sur-mesure, refonte logicielle, TMA, interfaçage ERP.

Télécharger la fiche de cette formation

Les Modules
de formation

Module1

Introduction

Understand the video segmentation pipeline for background blurring.

Learn about the components: ffmpeg, OpenCV, and PyTorch.

Review the optimization workflow with Nsight Systems.

Module2

First Optimization Iteration

Annotate applications with NVTX for better profiling visibility.

Generate baseline profiles using Nsight Systems.

Navigate the Nsight Systems GUI and timeline views.

Identify initial performance bottlenecks.

Module3

Optimize Data Transfers

Analyze CPU-GPU data transfer patterns and bottlenecks.

Implement GPU-accelerated video decoding/encoding with PyNvVideoCodec.

Eliminate unnecessary host-device memory copies.

Verify optimizations through profiling.

Module4

Nsight Systems Plugins

Explore Nsight Systems plugin architecture.

Use built-in collector plugins for extended metrics.

Understand how to create custom data collectors.

Module5

Advanced Analysis

Learn about multi-report analysis for parallel applications.

Use recipe scripts for statistical analysis across reports.

Identify load balancing issues and communication patterns.

Generate automated performance reports.

Module6

Multi-Node Performance

Profile applications in distributed computing environments.

Use Nsight Systems with cluster schedulers like Slurm.

Apply data collection strategies for large-scale applications.

Analyze performance across multiple nodes and ranks.

Les prochaines
sessions de formation

Sur demande

Vous souhaitez organiser cette formation à une date spécifique ?Contactez-nous en remplissant le formulaire ci-dessous

10 août 2026

12 octobre 2026

30 novembre 2026

01 février 2027

Cette formation vous intéresse ? Contactez-nous !