Profile DPC++ and GPU Workload Performance

@IntelDevTools