In this session, we present a DPC++ code walk-through of a simple matrix multiplication example and show how to optimize it for performance using DPC++ features such as ND-range kernels and shared local memory. We then compare the execution times of the different versions.

Agenda

DPC++ code walk-through of a simple matrix multiplication example - 20 min

Optimize the code for performance using an ND-range kernel - 20 min

Optimize the code for performance using shared local memory - 20 min

Compare the execution times of the various optimizations - 20 min

This module is a part of the Aurora Learning Paths Series.

About the Presenter

Rakshith Krishnappa is a developer evangelist at Intel, focused on oneAPI, DPC++, and high-performance computing. For the last 16 years, he has worked on various Intel products, including CPUs, graphics, GPUs, HPC products, and software solutions.