By participating in this workshop, you will be equipped to:
Profile AI applications using NVIDIA Nsight Systems to identify performance bottlenecks.
Annotate code with NVTX (NVIDIA Tools Extension) for better visualization and analysis.
Analyze and optimize data transfers between host CPU and GPU memory.
Leverage GPU-accelerated libraries to eliminate unnecessary data movement.
Use Nsight Systems plugins and multi-report analysis for advanced performance insights.
Apply optimization strategies to real-world AI pipelines for significant speedups.