Syntax of gfx906 Instructions

Introduction

This document describes the syntax of instructions specific to gfx906.

For a description of other gfx906 instructions see Syntax of Core GFX9 Instructions.

Notation

Notation used in this document is explained here.

Overview

An overview of generic syntax and other features of AMDGPU instructions may be found in this document.

Instructions

VOP2

INSTRUCTION                    DST       SRC0      SRC1       MODIFIERS
————————————————————————————————————————————————————————————————————————————————————————————————————
v_fmac_f32                     vdst,     src0,     vsrc1
v_fmac_f32_dpp                 vdst,     vsrc0:m,  vsrc1:m    dpp_ctrl row_mask bank_mask bound_ctrl
v_xnor_b32                     vdst,     src0,     vsrc1
v_xnor_b32_dpp                 vdst,     vsrc0,    vsrc1      dpp_ctrl row_mask bank_mask bound_ctrl
v_xnor_b32_sdwa                vdst,     src0:m,   src1:m     dst_sel dst_unused src0_sel src1_sel

VOP3

INSTRUCTION                    DST       SRC0      SRC1           MODIFIERS
————————————————————————————————————————————————————————————————————————————
v_fmac_f32_e64                 vdst,     src0:m,   src1:m         clamp omod
v_xnor_b32_e64                 vdst,     src0,     src1

VOP3P

INSTRUCTION            DST      SRC0        SRC1        SRC2           MODIFIERS
—————————————————————————————————————————————————————————————————————————————————————————————————
v_dot2_f32_f16         vdst,    src0:f16x2, src1:f16x2, src2:f32       neg_lo neg_hi clamp
v_dot2_i32_i16         vdst,    src0:i16x2, src1:i16x2, src2:i32       clamp
v_dot2_u32_u16         vdst,    src0:u16x2, src1:u16x2, src2:u32       clamp
v_dot4_i32_i8          vdst,    src0:i8x4,  src1:i8x4,  src2:i32       clamp
v_dot4_u32_u8          vdst,    src0:u8x4,  src1:u8x4,  src2:u32       clamp
v_dot8_i32_i4          vdst,    src0:i4x8,  src1:i4x8,  src2:i32       clamp
v_dot8_u32_u4          vdst,    src0:u4x8,  src1:u4x8,  src2:u32       clamp
v_fma_mix_f32          vdst,    src0:m:fx,  src1:m:fx,  src2:m:fx      m_op_sel m_op_sel_hi clamp
v_fma_mixhi_f16        vdst,    src0:m:fx,  src1:m:fx,  src2:m:fx      m_op_sel m_op_sel_hi clamp
v_fma_mixlo_f16        vdst,    src0:m:fx,  src1:m:fx,  src2:m:fx      m_op_sel m_op_sel_hi clamp