SYCL sub-groups can speed up your kernels like magic. Basically, work-items running on the same execution unit (EU) can exchange data with each other through "shuffle" operations that happen entirely inside the EU. These exchanges are fast and efficient because the EU hardware accelerates them. Example source code: SYCL Sub-groups - Google Drive: https://bit.ly/3DOhcBP
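
To make the "shuffle" idea concrete, here is a minimal host-side sketch in plain C++ that emulates what a shuffle does across the lanes of one sub-group. It is not taken from the linked sample and it does not run on a device. The function names `emulate_permute_by_xor` and `emulate_subgroup_sum` are hypothetical and exist only for illustration. In a real SYCL 2020 kernel the exchange step would roughly correspond to calling `sycl::permute_group_by_xor(sg, x, mask)` on the sub-group obtained from `item.get_sub_group()`. The sketch also assumes the sub-group size is a power of two.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Host-side emulation only: lanes[i] stands for the private value held by
// work-item i of a single sub-group. All names here are hypothetical.

// Emulates an XOR shuffle: lane i receives the value held by lane (i XOR mask).
std::vector<int> emulate_permute_by_xor(const std::vector<int>& lanes,
                                        std::size_t mask) {
    std::vector<int> out(lanes.size());
    for (std::size_t i = 0; i < lanes.size(); ++i) {
        assert((i ^ mask) < lanes.size() && "partner lane must exist");
        out[i] = lanes[i ^ mask];
    }
    return out;
}

// Butterfly sum built from XOR shuffles: after log2(N) exchange steps,
// every lane holds the sum of all lanes, with no shared buffer involved.
std::vector<int> emulate_subgroup_sum(std::vector<int> lanes) {
    const std::size_t n = lanes.size();
    // Assumption: the sub-group size is a non-zero power of two.
    assert(n != 0 && (n & (n - 1)) == 0 && "size must be a power of two");
    for (std::size_t mask = n / 2; mask > 0; mask /= 2) {
        const std::vector<int> partner = emulate_permute_by_xor(lanes, mask);
        for (std::size_t i = 0; i < n; ++i) {
            lanes[i] += partner[i];  // each lane adds its partner's value
        }
    }
    return lanes;
}
```

For example, calling `emulate_subgroup_sum` on eight lanes holding 1 through 8 leaves 36 in every lane after three exchange steps. On a device, each of those steps would be a single shuffle between work-items rather than a loop over a vector.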