These options are defined specifically for the AMD GCN port.
Set architecture type or tuning for gpu. Supported values for gpu are
Compile for GCN3 Fiji devices (gfx803).
Compile for GCN5 Vega 10 devices (gfx900).
Compile for GCN5 Vega 20 devices (gfx906).
Compile for CDNA1 Instinct MI100 series devices (gfx908).
Compile for CDNA2 Instinct MI200 series devices (gfx90a).
Compile binaries suitable for devices with the SRAM-ECC feature enabled, disabled, or either mode. This feature can be enabled per-process on some devices. The compiled code must match the device mode. The default is ‘any’, for devices that support it.
Specify how many bytes of stack space will be requested for each GPU thread (wave-front). Beware that there may be many threads and limited memory available. The size of the stack allocation may also have an impact on run-time performance. The default is 32KB when using OpenACC or OpenMP, and 1MB otherwise.
Compile binaries suitable for devices with the XNACK feature enabled. Some devices always require XNACK and some allow the user to configure XNACK. The compiled code must match the device mode. The default is ‘-mno-xnack’. At present this option is a placeholder for support that is not yet implemented.