Developer Guide


Kernel Variables

This section describes mechanisms available to manipulate datapath for kernel variables.


Intel® oneAPI DPC++/C++ Compiler
provides FPGA extension
that you can include in your kernel code.
function directs the compiler to insert at least one register between the operand and the return value of the function call. In general, it is not necessary to include the
function in your kernel code to achieve desired performance.
Intel® strongly recommends that you use the
function only if you are experienced in using the Intel® Quartus® Prime Pro Edition software performing advanced optimization for a specific target device. You must have sufficient knowledge about the placement of portions of the datapath on an FPGA device.
T fpga_reg(T op)
may be any sized type, such as standard DPC++ device data types, or a user-defined
containing DPC++ types.
Consider the following example:
#include <CL/sycl/INTEL/fpga_extensions.hpp> ... r[k] = cl::sycl::INTEL::fpga_reg(a[k]) + b[k]; ...
Use the
function to perform the following:
  • Break critical paths between spatially distant portions of a datapath, such as between processing elements of a large systolic array.
  • Reduce the pressure on placement and routing efforts caused by spatially distinct portions of the kernel implementation.
function directs the compiler to insert at least one hardware pipelining register on the signal path that assigns the operand to the return value. This built-in function operates as an assignment in the DPC++ programming language where the operand is assigned to the return value. The assignment has no implicit semantic or functional meaning beyond a standard-C assignment. Functionally, you can consider the
function being always optimized away by the compiler.
For additional information, refer to the FPGA tutorial sample "FPGA register" listed in the Intel® oneAPI Samples Browser on Linux* or Intel® oneAPI Samples Browser on Windows*.
The compiler does not provide feedback on where you should insert the
function calls in your code. Use the Intel® Quartus® Prime Pro Edition software to determine where you should insert the calls to address specific aspects of performance.
You may introduce nested
function calls in your kernel code to increase the minimum number of registers that the compiler inserts on the assignment path. Because each function call guarantees the insertion of at least one register stage, the number of calls provides a lower limit on the number of registers.

Product and Performance Information


Intel's compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice.

Notice revision #20110804