Syntax of GFX10 RDNA1 Instructions¶
Introduction¶
This document describes the syntax of GFX10 RDNA1 (gfx1010) instructions.
Notation¶
Notation used in this document is explained here.
Overview¶
An overview of generic syntax and other features of AMDGPU instructions may be found in this document.
Instructions¶
DPP16¶
INSTRUCTION DST0 DST1 SRC0 SRC1 SRC2 MODIFIERS ——————————————————————————————————————————————————————————————————————————————————————————————————————————————————— v_add_co_ci_u32_dpp vdst, vcc, vsrc0, vsrc1, vcc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_add_f16_dpp vdst, vsrc0:m, vsrc1:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_add_f32_dpp vdst, vsrc0:m, vsrc1:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_add_nc_u32_dpp vdst, vsrc0, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_and_b32_dpp vdst, vsrc0, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_ashrrev_i32_dpp vdst, vsrc0:u32, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_bfrev_b32_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_ceil_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_ceil_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cndmask_b32_dpp vdst, vsrc0:m, vsrc1:m, vcc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cos_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cos_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_f16_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_f16_i16_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_f16_u16_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_f32_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_f32_i32_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_f32_u32_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_f32_ubyte0_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_f32_ubyte1_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_f32_ubyte2_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_f32_ubyte3_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_flr_i32_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_i16_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_i32_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_norm_i16_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_norm_u16_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_off_f32_i4_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_pkrtz_f16_f32_dpp vdst, vsrc0:m:f32, vsrc1:m:f32 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_rpi_i32_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_u16_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_cvt_u32_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_exp_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_exp_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_ffbh_i32_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_ffbh_u32_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_ffbl_b32_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_floor_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_floor_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_fmac_f16_dpp vdst, vsrc0:m, vsrc1:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_fmac_f32_dpp vdst, vsrc0:m, vsrc1:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_fract_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_fract_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_frexp_exp_i16_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_frexp_exp_i32_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_frexp_mant_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_frexp_mant_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_ldexp_f16_dpp vdst, vsrc0:m, vsrc1:i16 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_log_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_log_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_lshlrev_b32_dpp vdst, vsrc0:u32, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_lshrrev_b32_dpp vdst, vsrc0:u32, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_mac_f32_dpp vdst, vsrc0:m, vsrc1:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_max_f16_dpp vdst, vsrc0:m, vsrc1:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_max_f32_dpp vdst, vsrc0:m, vsrc1:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_max_i32_dpp vdst, vsrc0, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_max_u32_dpp vdst, vsrc0, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_min_f16_dpp vdst, vsrc0:m, vsrc1:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_min_f32_dpp vdst, vsrc0:m, vsrc1:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_min_i32_dpp vdst, vsrc0, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_min_u32_dpp vdst, vsrc0, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_mov_b32_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_movreld_b32_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_movrels_b32_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_movrelsd_2_b32_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_movrelsd_b32_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_mul_f16_dpp vdst, vsrc0:m, vsrc1:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_mul_f32_dpp vdst, vsrc0:m, vsrc1:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_mul_hi_i32_i24_dpp vdst, vsrc0, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_mul_hi_u32_u24_dpp vdst, vsrc0, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_mul_i32_i24_dpp vdst, vsrc0, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_mul_legacy_f32_dpp vdst, vsrc0:m, vsrc1:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_mul_u32_u24_dpp vdst, vsrc0, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_not_b32_dpp vdst, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_or_b32_dpp vdst, vsrc0, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_rcp_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_rcp_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_rcp_iflag_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_rndne_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_rndne_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_rsq_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_rsq_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_sat_pk_u8_i16_dpp vdst:u8x4, vsrc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_sin_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_sin_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_sqrt_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_sqrt_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_sub_co_ci_u32_dpp vdst, vcc, vsrc0, vsrc1, vcc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_sub_f16_dpp vdst, vsrc0:m, vsrc1:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_sub_f32_dpp vdst, vsrc0:m, vsrc1:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_sub_nc_u32_dpp vdst, vsrc0, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_subrev_co_ci_u32_dpp vdst, vcc, vsrc0, vsrc1, vcc dpp16_ctrl row_mask bank_mask bound_ctrl fi v_subrev_f16_dpp vdst, vsrc0:m, vsrc1:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_subrev_f32_dpp vdst, vsrc0:m, vsrc1:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_subrev_nc_u32_dpp vdst, vsrc0, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_trunc_f16_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_trunc_f32_dpp vdst, vsrc:m dpp16_ctrl row_mask bank_mask bound_ctrl fi v_xnor_b32_dpp vdst, vsrc0, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi v_xor_b32_dpp vdst, vsrc0, vsrc1 dpp16_ctrl row_mask bank_mask bound_ctrl fi
DPP8¶
INSTRUCTION DST0 DST1 SRC0 SRC1 SRC2 MODIFIERS ——————————————————————————————————————————————————————————————————————————————————————————————————— v_add_co_ci_u32_dpp vdst, vcc, vsrc0, vsrc1, vcc dpp8_sel fi v_add_f16_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_add_f32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_add_nc_u32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_and_b32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_ashrrev_i32_dpp vdst, vsrc0:u32, vsrc1 dpp8_sel fi v_bfrev_b32_dpp vdst, vsrc dpp8_sel fi v_ceil_f16_dpp vdst, vsrc dpp8_sel fi v_ceil_f32_dpp vdst, vsrc dpp8_sel fi v_cndmask_b32_dpp vdst, vsrc0, vsrc1, vcc dpp8_sel fi v_cos_f16_dpp vdst, vsrc dpp8_sel fi v_cos_f32_dpp vdst, vsrc dpp8_sel fi v_cvt_f16_f32_dpp vdst, vsrc dpp8_sel fi v_cvt_f16_i16_dpp vdst, vsrc dpp8_sel fi v_cvt_f16_u16_dpp vdst, vsrc dpp8_sel fi v_cvt_f32_f16_dpp vdst, vsrc dpp8_sel fi v_cvt_f32_i32_dpp vdst, vsrc dpp8_sel fi v_cvt_f32_u32_dpp vdst, vsrc dpp8_sel fi v_cvt_f32_ubyte0_dpp vdst, vsrc dpp8_sel fi v_cvt_f32_ubyte1_dpp vdst, vsrc dpp8_sel fi v_cvt_f32_ubyte2_dpp vdst, vsrc dpp8_sel fi v_cvt_f32_ubyte3_dpp vdst, vsrc dpp8_sel fi v_cvt_flr_i32_f32_dpp vdst, vsrc dpp8_sel fi v_cvt_i16_f16_dpp vdst, vsrc dpp8_sel fi v_cvt_i32_f32_dpp vdst, vsrc dpp8_sel fi v_cvt_norm_i16_f16_dpp vdst, vsrc dpp8_sel fi v_cvt_norm_u16_f16_dpp vdst, vsrc dpp8_sel fi v_cvt_off_f32_i4_dpp vdst, vsrc dpp8_sel fi v_cvt_pkrtz_f16_f32_dpp vdst, vsrc0:f32, vsrc1:f32 dpp8_sel fi v_cvt_rpi_i32_f32_dpp vdst, vsrc dpp8_sel fi v_cvt_u16_f16_dpp vdst, vsrc dpp8_sel fi v_cvt_u32_f32_dpp vdst, vsrc dpp8_sel fi v_exp_f16_dpp vdst, vsrc dpp8_sel fi v_exp_f32_dpp vdst, vsrc dpp8_sel fi v_ffbh_i32_dpp vdst, vsrc dpp8_sel fi v_ffbh_u32_dpp vdst, vsrc dpp8_sel fi v_ffbl_b32_dpp vdst, vsrc dpp8_sel fi v_floor_f16_dpp vdst, vsrc dpp8_sel fi v_floor_f32_dpp vdst, vsrc dpp8_sel fi v_fmac_f16_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_fmac_f32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_fract_f16_dpp vdst, vsrc dpp8_sel fi v_fract_f32_dpp vdst, vsrc dpp8_sel fi v_frexp_exp_i16_f16_dpp vdst, vsrc dpp8_sel fi v_frexp_exp_i32_f32_dpp vdst, vsrc dpp8_sel fi v_frexp_mant_f16_dpp vdst, vsrc dpp8_sel fi v_frexp_mant_f32_dpp vdst, vsrc dpp8_sel fi v_ldexp_f16_dpp vdst, vsrc0, vsrc1:i16 dpp8_sel fi v_log_f16_dpp vdst, vsrc dpp8_sel fi v_log_f32_dpp vdst, vsrc dpp8_sel fi v_lshlrev_b32_dpp vdst, vsrc0:u32, vsrc1 dpp8_sel fi v_lshrrev_b32_dpp vdst, vsrc0:u32, vsrc1 dpp8_sel fi v_mac_f32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_max_f16_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_max_f32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_max_i32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_max_u32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_min_f16_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_min_f32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_min_i32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_min_u32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_mov_b32_dpp vdst, vsrc dpp8_sel fi v_movreld_b32_dpp vdst, vsrc dpp8_sel fi v_movrels_b32_dpp vdst, vsrc dpp8_sel fi v_movrelsd_2_b32_dpp vdst, vsrc dpp8_sel fi v_movrelsd_b32_dpp vdst, vsrc dpp8_sel fi v_mul_f16_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_mul_f32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_mul_hi_i32_i24_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_mul_hi_u32_u24_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_mul_i32_i24_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_mul_legacy_f32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_mul_u32_u24_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_not_b32_dpp vdst, vsrc dpp8_sel fi v_or_b32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_rcp_f16_dpp vdst, vsrc dpp8_sel fi v_rcp_f32_dpp vdst, vsrc dpp8_sel fi v_rcp_iflag_f32_dpp vdst, vsrc dpp8_sel fi v_rndne_f16_dpp vdst, vsrc dpp8_sel fi v_rndne_f32_dpp vdst, vsrc dpp8_sel fi v_rsq_f16_dpp vdst, vsrc dpp8_sel fi v_rsq_f32_dpp vdst, vsrc dpp8_sel fi v_sat_pk_u8_i16_dpp vdst:u8x4, vsrc dpp8_sel fi v_sin_f16_dpp vdst, vsrc dpp8_sel fi v_sin_f32_dpp vdst, vsrc dpp8_sel fi v_sqrt_f16_dpp vdst, vsrc dpp8_sel fi v_sqrt_f32_dpp vdst, vsrc dpp8_sel fi v_sub_co_ci_u32_dpp vdst, vcc, vsrc0, vsrc1, vcc dpp8_sel fi v_sub_f16_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_sub_f32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_sub_nc_u32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_subrev_co_ci_u32_dpp vdst, vcc, vsrc0, vsrc1, vcc dpp8_sel fi v_subrev_f16_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_subrev_f32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_subrev_nc_u32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_trunc_f16_dpp vdst, vsrc dpp8_sel fi v_trunc_f32_dpp vdst, vsrc dpp8_sel fi v_xnor_b32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi v_xor_b32_dpp vdst, vsrc0, vsrc1 dpp8_sel fi
DS¶
INSTRUCTION DST SRC0 SRC1 SRC2 MODIFIERS ————————————————————————————————————————————————————————————————————————————————————————————————— ds_add_f32 vaddr, vdata offset gds ds_add_rtn_f32 vdst, vaddr, vdata offset gds ds_add_rtn_u32 vdst, vaddr, vdata offset gds ds_add_rtn_u64 vdst, vaddr, vdata offset gds ds_add_src2_f32 vaddr offset gds ds_add_src2_u32 vaddr offset gds ds_add_src2_u64 vaddr offset gds ds_add_u32 vaddr, vdata offset gds ds_add_u64 vaddr, vdata offset gds ds_and_b32 vaddr, vdata offset gds ds_and_b64 vaddr, vdata offset gds ds_and_rtn_b32 vdst, vaddr, vdata offset gds ds_and_rtn_b64 vdst, vaddr, vdata offset gds ds_and_src2_b32 vaddr offset gds ds_and_src2_b64 vaddr offset gds ds_append vdst offset gds ds_bpermute_b32 vdst, vaddr, vdata offset ds_cmpst_b32 vaddr, vdata0, vdata1 offset gds ds_cmpst_b64 vaddr, vdata0, vdata1 offset gds ds_cmpst_f32 vaddr, vdata0, vdata1 offset gds ds_cmpst_f64 vaddr, vdata0, vdata1 offset gds ds_cmpst_rtn_b32 vdst, vaddr, vdata0, vdata1 offset gds ds_cmpst_rtn_b64 vdst, vaddr, vdata0, vdata1 offset gds ds_cmpst_rtn_f32 vdst, vaddr, vdata0, vdata1 offset gds ds_cmpst_rtn_f64 vdst, vaddr, vdata0, vdata1 offset gds ds_condxchg32_rtn_b64 vdst, vaddr, vdata offset gds ds_consume vdst offset gds ds_dec_rtn_u32 vdst, vaddr, vdata offset gds ds_dec_rtn_u64 vdst, vaddr, vdata offset gds ds_dec_src2_u32 vaddr offset gds ds_dec_src2_u64 vaddr offset gds ds_dec_u32 vaddr, vdata offset gds ds_dec_u64 vaddr, vdata offset gds ds_gws_barrier vdata offset gds ds_gws_init vdata offset gds ds_gws_sema_br vdata offset gds ds_gws_sema_p offset gds ds_gws_sema_release_all offset gds ds_gws_sema_v offset gds ds_inc_rtn_u32 vdst, vaddr, vdata offset gds ds_inc_rtn_u64 vdst, vaddr, vdata offset gds ds_inc_src2_u32 vaddr offset gds ds_inc_src2_u64 vaddr offset gds ds_inc_u32 vaddr, vdata offset gds ds_inc_u64 vaddr, vdata offset gds ds_max_f32 vaddr, vdata offset gds ds_max_f64 vaddr, vdata offset gds ds_max_i32 vaddr, vdata offset gds ds_max_i64 vaddr, vdata offset gds ds_max_rtn_f32 vdst, vaddr, vdata offset gds ds_max_rtn_f64 vdst, vaddr, vdata offset gds ds_max_rtn_i32 vdst, vaddr, vdata offset gds ds_max_rtn_i64 vdst, vaddr, vdata offset gds ds_max_rtn_u32 vdst, vaddr, vdata offset gds ds_max_rtn_u64 vdst, vaddr, vdata offset gds ds_max_src2_f32 vaddr offset gds ds_max_src2_f64 vaddr offset gds ds_max_src2_i32 vaddr offset gds ds_max_src2_i64 vaddr offset gds ds_max_src2_u32 vaddr offset gds ds_max_src2_u64 vaddr offset gds ds_max_u32 vaddr, vdata offset gds ds_max_u64 vaddr, vdata offset gds ds_min_f32 vaddr, vdata offset gds ds_min_f64 vaddr, vdata offset gds ds_min_i32 vaddr, vdata offset gds ds_min_i64 vaddr, vdata offset gds ds_min_rtn_f32 vdst, vaddr, vdata offset gds ds_min_rtn_f64 vdst, vaddr, vdata offset gds ds_min_rtn_i32 vdst, vaddr, vdata offset gds ds_min_rtn_i64 vdst, vaddr, vdata offset gds ds_min_rtn_u32 vdst, vaddr, vdata offset gds ds_min_rtn_u64 vdst, vaddr, vdata offset gds ds_min_src2_f32 vaddr offset gds ds_min_src2_f64 vaddr offset gds ds_min_src2_i32 vaddr offset gds ds_min_src2_i64 vaddr offset gds ds_min_src2_u32 vaddr offset gds ds_min_src2_u64 vaddr offset gds ds_min_u32 vaddr, vdata offset gds ds_min_u64 vaddr, vdata offset gds ds_mskor_b32 vaddr, vdata0, vdata1 offset gds ds_mskor_b64 vaddr, vdata0, vdata1 offset gds ds_mskor_rtn_b32 vdst, vaddr, vdata0, vdata1 offset gds ds_mskor_rtn_b64 vdst, vaddr, vdata0, vdata1 offset gds ds_nop ds_or_b32 vaddr, vdata offset gds ds_or_b64 vaddr, vdata offset gds ds_or_rtn_b32 vdst, vaddr, vdata offset gds ds_or_rtn_b64 vdst, vaddr, vdata offset gds ds_or_src2_b32 vaddr offset gds ds_or_src2_b64 vaddr offset gds ds_ordered_count vdst, vaddr offset gds ds_permute_b32 vdst, vaddr, vdata offset ds_read2_b32 vdst:b32x2, vaddr offset0 offset1 gds ds_read2_b64 vdst:b64x2, vaddr offset0 offset1 gds ds_read2st64_b32 vdst:b32x2, vaddr offset0 offset1 gds ds_read2st64_b64 vdst:b64x2, vaddr offset0 offset1 gds ds_read_addtid_b32 vdst offset gds ds_read_b128 vdst, vaddr offset gds ds_read_b32 vdst, vaddr offset gds ds_read_b64 vdst, vaddr offset gds ds_read_b96 vdst, vaddr offset gds ds_read_i16 vdst, vaddr offset gds ds_read_i8 vdst, vaddr offset gds ds_read_i8_d16 vdst, vaddr offset gds ds_read_i8_d16_hi vdst, vaddr offset gds ds_read_u16 vdst, vaddr offset gds ds_read_u16_d16 vdst, vaddr offset gds ds_read_u16_d16_hi vdst, vaddr offset gds ds_read_u8 vdst, vaddr offset gds ds_read_u8_d16 vdst, vaddr offset gds ds_read_u8_d16_hi vdst, vaddr offset gds ds_rsub_rtn_u32 vdst, vaddr, vdata offset gds ds_rsub_rtn_u64 vdst, vaddr, vdata offset gds ds_rsub_src2_u32 vaddr offset gds ds_rsub_src2_u64 vaddr offset gds ds_rsub_u32 vaddr, vdata offset gds ds_rsub_u64 vaddr, vdata offset gds ds_sub_rtn_u32 vdst, vaddr, vdata offset gds ds_sub_rtn_u64 vdst, vaddr, vdata offset gds ds_sub_src2_u32 vaddr offset gds ds_sub_src2_u64 vaddr offset gds ds_sub_u32 vaddr, vdata offset gds ds_sub_u64 vaddr, vdata offset gds ds_swizzle_b32 vdst, vaddr pattern gds ds_wrap_rtn_b32 vdst, vaddr, vdata0, vdata1 offset gds ds_write2_b32 vaddr, vdata0, vdata1 offset0 offset1 gds ds_write2_b64 vaddr, vdata0, vdata1 offset0 offset1 gds ds_write2st64_b32 vaddr, vdata0, vdata1 offset0 offset1 gds ds_write2st64_b64 vaddr, vdata0, vdata1 offset0 offset1 gds ds_write_addtid_b32 vdata offset gds ds_write_b128 vaddr, vdata offset gds ds_write_b16 vaddr, vdata offset gds ds_write_b16_d16_hi vaddr, vdata offset gds ds_write_b32 vaddr, vdata offset gds ds_write_b64 vaddr, vdata offset gds ds_write_b8 vaddr, vdata offset gds ds_write_b8_d16_hi vaddr, vdata offset gds ds_write_b96 vaddr, vdata offset gds ds_write_src2_b32 vaddr offset gds ds_write_src2_b64 vaddr offset gds ds_wrxchg2_rtn_b32 vdst:b32x2, vaddr, vdata0, vdata1 offset0 offset1 gds ds_wrxchg2_rtn_b64 vdst:b64x2, vaddr, vdata0, vdata1 offset0 offset1 gds ds_wrxchg2st64_rtn_b32 vdst:b32x2, vaddr, vdata0, vdata1 offset0 offset1 gds ds_wrxchg2st64_rtn_b64 vdst:b64x2, vaddr, vdata0, vdata1 offset0 offset1 gds ds_wrxchg_rtn_b32 vdst, vaddr, vdata offset gds ds_wrxchg_rtn_b64 vdst, vaddr, vdata offset gds ds_xor_b32 vaddr, vdata offset gds ds_xor_b64 vaddr, vdata offset gds ds_xor_rtn_b32 vdst, vaddr, vdata offset gds ds_xor_rtn_b64 vdst, vaddr, vdata offset gds ds_xor_src2_b32 vaddr offset gds ds_xor_src2_b64 vaddr offset gds
EXP¶
INSTRUCTION DST SRC0 SRC1 SRC2 SRC3 MODIFIERS ——————————————————————————————————————————————————————————————————————————————————————————————————— exp tgt, vsrc0, vsrc1, vsrc2, vsrc3 done compr vm
FLAT¶
INSTRUCTION DST SRC0 SRC1 SRC2 MODIFIERS ——————————————————————————————————————————————————————————————————————————————————————————————————————— flat_atomic_add vdst:opt, vaddr, vdata offset11 glc slc dlc flat_atomic_add_x2 vdst:opt, vaddr, vdata offset11 glc slc dlc flat_atomic_and vdst:opt, vaddr, vdata offset11 glc slc dlc flat_atomic_and_x2 vdst:opt, vaddr, vdata offset11 glc slc dlc flat_atomic_cmpswap vdst:opt, vaddr, vdata:b32x2 offset11 glc slc dlc flat_atomic_cmpswap_x2 vdst:opt, vaddr, vdata:b64x2 offset11 glc slc dlc flat_atomic_dec vdst:opt:u32, vaddr, vdata:u32 offset11 glc slc dlc flat_atomic_dec_x2 vdst:opt:u64, vaddr, vdata:u64 offset11 glc slc dlc flat_atomic_fcmpswap vdst:opt:f32, vaddr, vdata:f32x2 offset11 glc slc dlc flat_atomic_fcmpswap_x2 vdst:opt:f64, vaddr, vdata:f64x2 offset11 glc slc dlc flat_atomic_fmax vdst:opt:f32, vaddr, vdata:f32 offset11 glc slc dlc flat_atomic_fmax_x2 vdst:opt:f64, vaddr, vdata:f64 offset11 glc slc dlc flat_atomic_fmin vdst:opt:f32, vaddr, vdata:f32 offset11 glc slc dlc flat_atomic_fmin_x2 vdst:opt:f64, vaddr, vdata:f64 offset11 glc slc dlc flat_atomic_inc vdst:opt:u32, vaddr, vdata:u32 offset11 glc slc dlc flat_atomic_inc_x2 vdst:opt:u64, vaddr, vdata:u64 offset11 glc slc dlc flat_atomic_or vdst:opt, vaddr, vdata offset11 glc slc dlc flat_atomic_or_x2 vdst:opt, vaddr, vdata offset11 glc slc dlc flat_atomic_smax vdst:opt:i32, vaddr, vdata:i32 offset11 glc slc dlc flat_atomic_smax_x2 vdst:opt:i64, vaddr, vdata:i64 offset11 glc slc dlc flat_atomic_smin vdst:opt:i32, vaddr, vdata:i32 offset11 glc slc dlc flat_atomic_smin_x2 vdst:opt:i64, vaddr, vdata:i64 offset11 glc slc dlc flat_atomic_sub vdst:opt, vaddr, vdata offset11 glc slc dlc flat_atomic_sub_x2 vdst:opt, vaddr, vdata offset11 glc slc dlc flat_atomic_swap vdst:opt, vaddr, vdata offset11 glc slc dlc flat_atomic_swap_x2 vdst:opt, vaddr, vdata offset11 glc slc dlc flat_atomic_umax vdst:opt:u32, vaddr, vdata:u32 offset11 glc slc dlc flat_atomic_umax_x2 vdst:opt:u64, vaddr, vdata:u64 offset11 glc slc dlc flat_atomic_umin vdst:opt:u32, vaddr, vdata:u32 offset11 glc slc dlc flat_atomic_umin_x2 vdst:opt:u64, vaddr, vdata:u64 offset11 glc slc dlc flat_atomic_xor vdst:opt, vaddr, vdata offset11 glc slc dlc flat_atomic_xor_x2 vdst:opt, vaddr, vdata offset11 glc slc dlc flat_load_dword vdst, vaddr offset11 glc slc dlc flat_load_dwordx2 vdst, vaddr offset11 glc slc dlc flat_load_dwordx3 vdst, vaddr offset11 glc slc dlc flat_load_dwordx4 vdst, vaddr offset11 glc slc dlc flat_load_sbyte vdst, vaddr offset11 glc slc dlc flat_load_sbyte_d16 vdst, vaddr offset11 glc slc dlc flat_load_sbyte_d16_hi vdst, vaddr offset11 glc slc dlc flat_load_short_d16 vdst, vaddr offset11 glc slc dlc flat_load_short_d16_hi vdst, vaddr offset11 glc slc dlc flat_load_sshort vdst, vaddr offset11 glc slc dlc flat_load_ubyte vdst, vaddr offset11 glc slc dlc flat_load_ubyte_d16 vdst, vaddr offset11 glc slc dlc flat_load_ubyte_d16_hi vdst, vaddr offset11 glc slc dlc flat_load_ushort vdst, vaddr offset11 glc slc dlc flat_store_byte vaddr, vdata offset11 glc slc dlc flat_store_byte_d16_hi vaddr, vdata offset11 glc slc dlc flat_store_dword vaddr, vdata offset11 glc slc dlc flat_store_dwordx2 vaddr, vdata offset11 glc slc dlc flat_store_dwordx3 vaddr, vdata offset11 glc slc dlc flat_store_dwordx4 vaddr, vdata offset11 glc slc dlc flat_store_short vaddr, vdata offset11 glc slc dlc flat_store_short_d16_hi vaddr, vdata offset11 glc slc dlc global_atomic_add vdst:opt, vaddr, vdata, saddr offset12s glc slc dlc global_atomic_add_x2 vdst:opt, vaddr, vdata, saddr offset12s glc slc dlc global_atomic_and vdst:opt, vaddr, vdata, saddr offset12s glc slc dlc global_atomic_and_x2 vdst:opt, vaddr, vdata, saddr offset12s glc slc dlc global_atomic_cmpswap vdst:opt, vaddr, vdata:b32x2, saddr offset12s glc slc dlc global_atomic_cmpswap_x2 vdst:opt, vaddr, vdata:b64x2, saddr offset12s glc slc dlc global_atomic_dec vdst:opt:u32, vaddr, vdata:u32, saddr offset12s glc slc dlc global_atomic_dec_x2 vdst:opt:u64, vaddr, vdata:u64, saddr offset12s glc slc dlc global_atomic_fcmpswap vdst:opt:f32, vaddr, vdata:f32x2, saddr offset12s glc slc dlc global_atomic_fcmpswap_x2 vdst:opt:f64, vaddr, vdata:f64x2, saddr offset12s glc slc dlc global_atomic_fmax vdst:opt:f32, vaddr, vdata:f32, saddr offset12s glc slc dlc global_atomic_fmax_x2 vdst:opt:f64, vaddr, vdata:f64, saddr offset12s glc slc dlc global_atomic_fmin vdst:opt:f32, vaddr, vdata:f32, saddr offset12s glc slc dlc global_atomic_fmin_x2 vdst:opt:f64, vaddr, vdata:f64, saddr offset12s glc slc dlc global_atomic_inc vdst:opt:u32, vaddr, vdata:u32, saddr offset12s glc slc dlc global_atomic_inc_x2 vdst:opt:u64, vaddr, vdata:u64, saddr offset12s glc slc dlc global_atomic_or vdst:opt, vaddr, vdata, saddr offset12s glc slc dlc global_atomic_or_x2 vdst:opt, vaddr, vdata, saddr offset12s glc slc dlc global_atomic_smax vdst:opt:i32, vaddr, vdata:i32, saddr offset12s glc slc dlc global_atomic_smax_x2 vdst:opt:i64, vaddr, vdata:i64, saddr offset12s glc slc dlc global_atomic_smin vdst:opt:i32, vaddr, vdata:i32, saddr offset12s glc slc dlc global_atomic_smin_x2 vdst:opt:i64, vaddr, vdata:i64, saddr offset12s glc slc dlc global_atomic_sub vdst:opt, vaddr, vdata, saddr offset12s glc slc dlc global_atomic_sub_x2 vdst:opt, vaddr, vdata, saddr offset12s glc slc dlc global_atomic_swap vdst:opt, vaddr, vdata, saddr offset12s glc slc dlc global_atomic_swap_x2 vdst:opt, vaddr, vdata, saddr offset12s glc slc dlc global_atomic_umax vdst:opt:u32, vaddr, vdata:u32, saddr offset12s glc slc dlc global_atomic_umax_x2 vdst:opt:u64, vaddr, vdata:u64, saddr offset12s glc slc dlc global_atomic_umin vdst:opt:u32, vaddr, vdata:u32, saddr offset12s glc slc dlc global_atomic_umin_x2 vdst:opt:u64, vaddr, vdata:u64, saddr offset12s glc slc dlc global_atomic_xor vdst:opt, vaddr, vdata, saddr offset12s glc slc dlc global_atomic_xor_x2 vdst:opt, vaddr, vdata, saddr offset12s glc slc dlc global_load_dword vdst:opt, vaddr, saddr offset12s glc slc dlc lds global_load_dwordx2 vdst, vaddr, saddr offset12s glc slc dlc global_load_dwordx3 vdst, vaddr, saddr offset12s glc slc dlc global_load_dwordx4 vdst, vaddr, saddr offset12s glc slc dlc global_load_sbyte vdst:opt, vaddr, saddr offset12s glc slc dlc lds global_load_sbyte_d16 vdst, vaddr, saddr offset12s glc slc dlc global_load_sbyte_d16_hi vdst, vaddr, saddr offset12s glc slc dlc global_load_short_d16 vdst, vaddr, saddr offset12s glc slc dlc global_load_short_d16_hi vdst, vaddr, saddr offset12s glc slc dlc global_load_sshort vdst:opt, vaddr, saddr offset12s glc slc dlc lds global_load_ubyte vdst:opt, vaddr, saddr offset12s glc slc dlc lds global_load_ubyte_d16 vdst, vaddr, saddr offset12s glc slc dlc global_load_ubyte_d16_hi vdst, vaddr, saddr offset12s glc slc dlc global_load_ushort vdst:opt, vaddr, saddr offset12s glc slc dlc lds global_store_byte vaddr, vdata, saddr offset12s glc slc dlc global_store_byte_d16_hi vaddr, vdata, saddr offset12s glc slc dlc global_store_dword vaddr, vdata, saddr offset12s glc slc dlc global_store_dwordx2 vaddr, vdata, saddr offset12s glc slc dlc global_store_dwordx3 vaddr, vdata, saddr offset12s glc slc dlc global_store_dwordx4 vaddr, vdata, saddr offset12s glc slc dlc global_store_short vaddr, vdata, saddr offset12s glc slc dlc global_store_short_d16_hi vaddr, vdata, saddr offset12s glc slc dlc scratch_load_dword vdst:opt, vaddr, saddr offset12s glc slc dlc lds scratch_load_dwordx2 vdst, vaddr, saddr offset12s glc slc dlc scratch_load_dwordx3 vdst, vaddr, saddr offset12s glc slc dlc scratch_load_dwordx4 vdst, vaddr, saddr offset12s glc slc dlc scratch_load_sbyte vdst:opt, vaddr, saddr offset12s glc slc dlc lds scratch_load_sbyte_d16 vdst, vaddr, saddr offset12s glc slc dlc scratch_load_sbyte_d16_hi vdst, vaddr, saddr offset12s glc slc dlc scratch_load_short_d16 vdst, vaddr, saddr offset12s glc slc dlc scratch_load_short_d16_hi vdst, vaddr, saddr offset12s glc slc dlc scratch_load_sshort vdst:opt, vaddr, saddr offset12s glc slc dlc lds scratch_load_ubyte vdst:opt, vaddr, saddr offset12s glc slc dlc lds scratch_load_ubyte_d16 vdst, vaddr, saddr offset12s glc slc dlc scratch_load_ubyte_d16_hi vdst, vaddr, saddr offset12s glc slc dlc scratch_load_ushort vdst:opt, vaddr, saddr offset12s glc slc dlc lds scratch_store_byte vaddr, vdata, saddr offset12s glc slc dlc scratch_store_byte_d16_hi vaddr, vdata, saddr offset12s glc slc dlc scratch_store_dword vaddr, vdata, saddr offset12s glc slc dlc scratch_store_dwordx2 vaddr, vdata, saddr offset12s glc slc dlc scratch_store_dwordx3 vaddr, vdata, saddr offset12s glc slc dlc scratch_store_dwordx4 vaddr, vdata, saddr offset12s glc slc dlc scratch_store_short vaddr, vdata, saddr offset12s glc slc dlc scratch_store_short_d16_hi vaddr, vdata, saddr offset12s glc slc dlc
MIMG¶
INSTRUCTION DST SRC0 SRC1 SRC2 MODIFIERS —————————————————————————————————————————————————————————————————————————————————————————————————————— image_atomic_add vdata:dst, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe image_atomic_and vdata:dst, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe image_atomic_cmpswap vdata:dst, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe image_atomic_dec vdata:dst, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe image_atomic_fcmpswap vdata:dst, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe image_atomic_fmax vdata:dst, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe image_atomic_fmin vdata:dst, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe image_atomic_inc vdata:dst, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe image_atomic_or vdata:dst, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe image_atomic_smax vdata:dst, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe image_atomic_smin vdata:dst, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe image_atomic_sub vdata:dst, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe image_atomic_swap vdata:dst, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe image_atomic_umax vdata:dst, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe image_atomic_umin vdata:dst, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe image_atomic_xor vdata:dst, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe image_gather4 vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_b vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc lwe d16 image_gather4_b_cl vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc lwe d16 image_gather4_b_cl_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc lwe d16 image_gather4_b_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc lwe d16 image_gather4_c vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_c_b vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_c_b_cl vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_c_b_cl_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_c_b_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_c_cl vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_c_cl_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_c_l vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_c_l_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_c_lz vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_c_lz_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_c_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_cl vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_cl_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_l vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_l_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_lz vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_lz_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 lwe d16 image_gather4h vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe image_get_lod vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe image_get_resinfo vdst, vaddr, srsrc dmask dim unorm glc slc dlc a16 tfe lwe image_load vdst, vaddr, srsrc dmask dim unorm glc slc dlc a16 tfe lwe d16 image_load_mip vdst, vaddr, srsrc dmask dim unorm glc slc dlc a16 tfe lwe d16 image_load_mip_pck vdst, vaddr, srsrc dmask dim unorm glc slc dlc a16 tfe lwe image_load_mip_pck_sgn vdst, vaddr, srsrc dmask dim unorm glc slc dlc a16 tfe lwe image_load_pck vdst, vaddr, srsrc dmask dim unorm glc slc dlc a16 tfe lwe image_load_pck_sgn vdst, vaddr, srsrc dmask dim unorm glc slc dlc a16 tfe lwe image_sample vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_b vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc tfe lwe d16 image_sample_b_cl vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc tfe lwe d16 image_sample_b_cl_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc tfe lwe d16 image_sample_b_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc tfe lwe d16 image_sample_c vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_b vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_b_cl vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_b_cl_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_b_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_cd vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_cd_cl vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_cd_cl_g16 vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_cd_cl_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_cd_cl_o_g16 vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_cd_g16 vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_cd_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_cd_o_g16 vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_cl vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_cl_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_d vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_d_cl vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_d_cl_g16 vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_d_cl_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_d_cl_o_g16 vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_d_g16 vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_d_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_d_o_g16 vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_l vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_l_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_lz vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_lz_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_c_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_cd vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_cd_cl vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_cd_cl_g16 vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_cd_cl_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_cd_cl_o_g16 vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_cd_g16 vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_cd_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_cd_o_g16 vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_cl vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_cl_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_d vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_d_cl vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_d_cl_g16 vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_d_cl_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_d_cl_o_g16 vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_d_g16 vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_d_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_d_o_g16 vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_l vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_l_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_lz vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_lz_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_sample_o vdst, vaddr, srsrc, ssamp dmask dim unorm glc slc dlc a16 tfe lwe d16 image_store vdata, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe d16 image_store_mip vdata, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe d16 image_store_mip_pck vdata, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe image_store_pck vdata, vaddr, srsrc dmask dim unorm glc slc dlc a16 lwe
MTBUF¶
INSTRUCTION DST SRC0 SRC1 SRC2 SRC3 MODIFIERS ————————————————————————————————————————————————————————————————————————————————————————————————————————— tbuffer_load_format_d16_x vdst, vaddr, srsrc, soffset ufmt idxen offen offset12 glc slc dlc tbuffer_load_format_d16_xy vdst, vaddr, srsrc, soffset ufmt idxen offen offset12 glc slc dlc tbuffer_load_format_d16_xyz vdst, vaddr, srsrc, soffset ufmt idxen offen offset12 glc slc dlc tbuffer_load_format_d16_xyzw vdst, vaddr, srsrc, soffset ufmt idxen offen offset12 glc slc dlc tbuffer_load_format_x vdst, vaddr, srsrc, soffset ufmt idxen offen offset12 glc slc dlc tbuffer_load_format_xy vdst, vaddr, srsrc, soffset ufmt idxen offen offset12 glc slc dlc tbuffer_load_format_xyz vdst, vaddr, srsrc, soffset ufmt idxen offen offset12 glc slc dlc tbuffer_load_format_xyzw vdst, vaddr, srsrc, soffset ufmt idxen offen offset12 glc slc dlc tbuffer_store_format_d16_x vdata, vaddr, srsrc, soffset ufmt idxen offen offset12 glc slc tbuffer_store_format_d16_xy vdata, vaddr, srsrc, soffset ufmt idxen offen offset12 glc slc tbuffer_store_format_d16_xyz vdata, vaddr, srsrc, soffset ufmt idxen offen offset12 glc slc tbuffer_store_format_d16_xyzw vdata, vaddr, srsrc, soffset ufmt idxen offen offset12 glc slc tbuffer_store_format_x vdata, vaddr, srsrc, soffset ufmt idxen offen offset12 glc slc tbuffer_store_format_xy vdata, vaddr, srsrc, soffset ufmt idxen offen offset12 glc slc tbuffer_store_format_xyz vdata, vaddr, srsrc, soffset ufmt idxen offen offset12 glc slc tbuffer_store_format_xyzw vdata, vaddr, srsrc, soffset ufmt idxen offen offset12 glc slc
MUBUF¶
INSTRUCTION DST SRC0 SRC1 SRC2 SRC3 MODIFIERS ————————————————————————————————————————————————————————————————————————————————————————————————————————————————————————— buffer_atomic_add vdata:dst, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_add_x2 vdata:dst, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_and vdata:dst, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_and_x2 vdata:dst, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_cmpswap vdata:dst:b32x2, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_cmpswap_x2 vdata:dst:b64x2, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_dec vdata:dst:u32, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_dec_x2 vdata:dst:u64, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_fcmpswap vdata:dst:f32x2, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_fcmpswap_x2 vdata:dst:f64x2, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_fmax vdata:dst:f32, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_fmax_x2 vdata:dst:f64, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_fmin vdata:dst:f32, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_fmin_x2 vdata:dst:f64, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_inc vdata:dst:u32, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_inc_x2 vdata:dst:u64, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_or vdata:dst, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_or_x2 vdata:dst, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_smax vdata:dst:i32, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_smax_x2 vdata:dst:i64, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_smin vdata:dst:i32, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_smin_x2 vdata:dst:i64, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_sub vdata:dst, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_sub_x2 vdata:dst, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_swap vdata:dst, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_swap_x2 vdata:dst, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_umax vdata:dst:u32, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_umax_x2 vdata:dst:u64, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_umin vdata:dst:u32, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_umin_x2 vdata:dst:u64, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_xor vdata:dst, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_atomic_xor_x2 vdata:dst, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_gl0_inv buffer_gl1_inv buffer_load_dword vdst:opt, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc lds tfe buffer_load_dwordx2 vdst, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc tfe buffer_load_dwordx3 vdst, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc tfe buffer_load_dwordx4 vdst, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc tfe buffer_load_format_d16_x vdst, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc tfe buffer_load_format_d16_xy vdst, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc tfe buffer_load_format_d16_xyz vdst, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc tfe buffer_load_format_d16_xyzw vdst, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc tfe buffer_load_format_x vdst:opt, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc lds tfe buffer_load_format_xy vdst, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc tfe buffer_load_format_xyz vdst, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc tfe buffer_load_format_xyzw vdst, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc tfe buffer_load_sbyte vdst:opt, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc lds tfe buffer_load_sbyte_d16 vdst, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc tfe buffer_load_sbyte_d16_hi vdst, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc tfe buffer_load_short_d16 vdst, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc tfe buffer_load_short_d16_hi vdst, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc tfe buffer_load_sshort vdst:opt, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc lds tfe buffer_load_ubyte vdst:opt, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc lds tfe buffer_load_ubyte_d16 vdst, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc tfe buffer_load_ubyte_d16_hi vdst, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc tfe buffer_load_ushort vdst:opt, vaddr, srsrc, soffset idxen offen offset12 glc slc dlc lds tfe buffer_store_byte vdata, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_store_byte_d16_hi vdata, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_store_dword vdata, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_store_dwordx2 vdata, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_store_dwordx3 vdata, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_store_dwordx4 vdata, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_store_format_d16_x vdata, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_store_format_d16_xy vdata, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_store_format_d16_xyz vdata, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_store_format_d16_xyzw vdata, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_store_format_x vdata, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_store_format_xy vdata, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_store_format_xyz vdata, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_store_format_xyzw vdata, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_store_short vdata, vaddr, srsrc, soffset idxen offen offset12 glc slc buffer_store_short_d16_hi vdata, vaddr, srsrc, soffset idxen offen offset12 glc slc
SDWA¶
INSTRUCTION DST0 DST1 SRC0 SRC1 SRC2 MODIFIERS —————————————————————————————————————————————————————————————————————————————————————————————————————————————————————— v_add_co_ci_u32_sdwa vdst, vcc, src0:m, src1:m, vcc clamp dst_sel dst_unused src0_sel src1_sel v_add_f16_sdwa vdst, src0:m, src1:m clamp omod dst_sel dst_unused src0_sel src1_sel v_add_f32_sdwa vdst, src0:m, src1:m clamp omod dst_sel dst_unused src0_sel src1_sel v_add_nc_u32_sdwa vdst, src0:m, src1:m clamp dst_sel dst_unused src0_sel src1_sel v_and_b32_sdwa vdst, src0:m, src1:m dst_sel dst_unused src0_sel src1_sel v_ashrrev_i32_sdwa vdst, src0:m:u32, src1:m dst_sel dst_unused src0_sel src1_sel v_bfrev_b32_sdwa vdst, src:m dst_sel dst_unused src0_sel v_ceil_f16_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_ceil_f32_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_cmp_class_f16_sdwa sdst, src0:m, src1:m:b32 src0_sel src1_sel v_cmp_class_f32_sdwa sdst, src0:m, src1:m:b32 src0_sel src1_sel v_cmp_eq_f16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_eq_f32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_eq_i16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_eq_i32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_eq_u16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_eq_u32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_f_f16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_f_f32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_f_i32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_f_u32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_ge_f16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_ge_f32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_ge_i16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_ge_i32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_ge_u16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_ge_u32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_gt_f16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_gt_f32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_gt_i16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_gt_i32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_gt_u16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_gt_u32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_le_f16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_le_f32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_le_i16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_le_i32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_le_u16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_le_u32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_lg_f16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_lg_f32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_lt_f16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_lt_f32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_lt_i16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_lt_i32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_lt_u16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_lt_u32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_ne_i16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_ne_i32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_ne_u16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_ne_u32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_neq_f16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_neq_f32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_nge_f16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_nge_f32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_ngt_f16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_ngt_f32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_nle_f16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_nle_f32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_nlg_f16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_nlg_f32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_nlt_f16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_nlt_f32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_o_f16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_o_f32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_t_i32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_t_u32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_tru_f16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_tru_f32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_u_f16_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmp_u_f32_sdwa sdst, src0:m, src1:m src0_sel src1_sel v_cmpx_class_f16_sdwa src0:m, src1:m:b32 src0_sel src1_sel v_cmpx_class_f32_sdwa src0:m, src1:m:b32 src0_sel src1_sel v_cmpx_eq_f16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_eq_f32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_eq_i16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_eq_i32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_eq_u16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_eq_u32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_f_f16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_f_f32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_f_i32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_f_u32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_ge_f16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_ge_f32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_ge_i16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_ge_i32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_ge_u16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_ge_u32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_gt_f16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_gt_f32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_gt_i16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_gt_i32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_gt_u16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_gt_u32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_le_f16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_le_f32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_le_i16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_le_i32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_le_u16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_le_u32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_lg_f16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_lg_f32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_lt_f16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_lt_f32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_lt_i16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_lt_i32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_lt_u16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_lt_u32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_ne_i16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_ne_i32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_ne_u16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_ne_u32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_neq_f16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_neq_f32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_nge_f16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_nge_f32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_ngt_f16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_ngt_f32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_nle_f16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_nle_f32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_nlg_f16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_nlg_f32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_nlt_f16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_nlt_f32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_o_f16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_o_f32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_t_i32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_t_u32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_tru_f16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_tru_f32_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_u_f16_sdwa src0:m, src1:m src0_sel src1_sel v_cmpx_u_f32_sdwa src0:m, src1:m src0_sel src1_sel v_cndmask_b32_sdwa vdst, src0:m, src1:m, vcc clamp dst_sel dst_unused src0_sel src1_sel v_cos_f16_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_cos_f32_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_cvt_f16_f32_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_cvt_f16_i16_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_cvt_f16_u16_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_cvt_f32_f16_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_cvt_f32_i32_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_cvt_f32_u32_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_cvt_f32_ubyte0_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_cvt_f32_ubyte1_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_cvt_f32_ubyte2_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_cvt_f32_ubyte3_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_cvt_flr_i32_f32_sdwa vdst, src:m dst_sel dst_unused src0_sel v_cvt_i16_f16_sdwa vdst, src:m clamp dst_sel dst_unused src0_sel v_cvt_i32_f32_sdwa vdst, src:m clamp dst_sel dst_unused src0_sel v_cvt_norm_i16_f16_sdwa vdst, src:m clamp dst_sel dst_unused src0_sel v_cvt_norm_u16_f16_sdwa vdst, src:m clamp dst_sel dst_unused src0_sel v_cvt_off_f32_i4_sdwa vdst, src clamp omod dst_sel dst_unused src0_sel v_cvt_pkrtz_f16_f32_sdwa vdst, src0:m:f32, src1:m:f32 clamp dst_sel dst_unused src0_sel src1_sel v_cvt_rpi_i32_f32_sdwa vdst, src:m dst_sel dst_unused src0_sel v_cvt_u16_f16_sdwa vdst, src:m clamp dst_sel dst_unused src0_sel v_cvt_u32_f32_sdwa vdst, src:m clamp dst_sel dst_unused src0_sel v_exp_f16_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_exp_f32_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_ffbh_i32_sdwa vdst, src:m dst_sel dst_unused src0_sel v_ffbh_u32_sdwa vdst, src:m dst_sel dst_unused src0_sel v_ffbl_b32_sdwa vdst, src:m dst_sel dst_unused src0_sel v_floor_f16_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_floor_f32_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_fract_f16_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_fract_f32_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_frexp_exp_i16_f16_sdwa vdst, src:m dst_sel dst_unused src0_sel v_frexp_exp_i32_f32_sdwa vdst, src:m dst_sel dst_unused src0_sel v_frexp_mant_f16_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_frexp_mant_f32_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_ldexp_f16_sdwa vdst, src0:m, src1:m:i16 clamp omod dst_sel dst_unused src0_sel src1_sel v_log_f16_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_log_f32_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_lshlrev_b32_sdwa vdst, src0:m:u32, src1:m dst_sel dst_unused src0_sel src1_sel v_lshrrev_b32_sdwa vdst, src0:m:u32, src1:m dst_sel dst_unused src0_sel src1_sel v_max_f16_sdwa vdst, src0:m, src1:m clamp omod dst_sel dst_unused src0_sel src1_sel v_max_f32_sdwa vdst, src0:m, src1:m clamp omod dst_sel dst_unused src0_sel src1_sel v_max_i32_sdwa vdst, src0:m, src1:m dst_sel dst_unused src0_sel src1_sel v_max_u32_sdwa vdst, src0:m, src1:m dst_sel dst_unused src0_sel src1_sel v_min_f16_sdwa vdst, src0:m, src1:m clamp omod dst_sel dst_unused src0_sel src1_sel v_min_f32_sdwa vdst, src0:m, src1:m clamp omod dst_sel dst_unused src0_sel src1_sel v_min_i32_sdwa vdst, src0:m, src1:m dst_sel dst_unused src0_sel src1_sel v_min_u32_sdwa vdst, src0:m, src1:m dst_sel dst_unused src0_sel src1_sel v_mov_b32_sdwa vdst, src clamp dst_sel dst_unused src0_sel v_movreld_b32_sdwa vdst, src clamp dst_sel dst_unused src0_sel v_movrels_b32_sdwa vdst, vsrc clamp dst_sel dst_unused src0_sel v_movrelsd_2_b32_sdwa vdst, vsrc clamp dst_sel dst_unused src0_sel v_movrelsd_b32_sdwa vdst, vsrc clamp dst_sel dst_unused src0_sel v_mul_f16_sdwa vdst, src0:m, src1:m clamp omod dst_sel dst_unused src0_sel src1_sel v_mul_f32_sdwa vdst, src0:m, src1:m clamp omod dst_sel dst_unused src0_sel src1_sel v_mul_hi_i32_i24_sdwa vdst, src0:m, src1:m dst_sel dst_unused src0_sel src1_sel v_mul_hi_u32_u24_sdwa vdst, src0:m, src1:m dst_sel dst_unused src0_sel src1_sel v_mul_i32_i24_sdwa vdst, src0:m, src1:m clamp dst_sel dst_unused src0_sel src1_sel v_mul_legacy_f32_sdwa vdst, src0:m, src1:m clamp omod dst_sel dst_unused src0_sel src1_sel v_mul_u32_u24_sdwa vdst, src0:m, src1:m clamp dst_sel dst_unused src0_sel src1_sel v_not_b32_sdwa vdst, src:m dst_sel dst_unused src0_sel v_or_b32_sdwa vdst, src0:m, src1:m dst_sel dst_unused src0_sel src1_sel v_rcp_f16_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_rcp_f32_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_rcp_iflag_f32_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_rndne_f16_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_rndne_f32_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_rsq_f16_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_rsq_f32_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_sat_pk_u8_i16_sdwa vdst:u8x4, src:m dst_sel dst_unused src0_sel v_sin_f16_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_sin_f32_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_sqrt_f16_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_sqrt_f32_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_sub_co_ci_u32_sdwa vdst, vcc, src0:m, src1:m, vcc clamp dst_sel dst_unused src0_sel src1_sel v_sub_f16_sdwa vdst, src0:m, src1:m clamp omod dst_sel dst_unused src0_sel src1_sel v_sub_f32_sdwa vdst, src0:m, src1:m clamp omod dst_sel dst_unused src0_sel src1_sel v_sub_nc_u32_sdwa vdst, src0:m, src1:m clamp dst_sel dst_unused src0_sel src1_sel v_subrev_co_ci_u32_sdwa vdst, vcc, src0:m, src1:m, vcc clamp dst_sel dst_unused src0_sel src1_sel v_subrev_f16_sdwa vdst, src0:m, src1:m clamp omod dst_sel dst_unused src0_sel src1_sel v_subrev_f32_sdwa vdst, src0:m, src1:m clamp omod dst_sel dst_unused src0_sel src1_sel v_subrev_nc_u32_sdwa vdst, src0:m, src1:m clamp dst_sel dst_unused src0_sel src1_sel v_trunc_f16_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_trunc_f32_sdwa vdst, src:m clamp omod dst_sel dst_unused src0_sel v_xnor_b32_sdwa vdst, src0:m, src1:m dst_sel dst_unused src0_sel src1_sel v_xor_b32_sdwa vdst, src0:m, src1:m dst_sel dst_unused src0_sel src1_sel
SMEM¶
INSTRUCTION DST SRC0 SRC1 SRC2 MODIFIERS ———————————————————————————————————————————————————————————————————————————————————————————————————— s_atc_probe probe, sbase, soffset offset21s s_atc_probe_buffer probe, sbase, soffset offset20u s_atomic_add sdata:dst, sbase, soffset offset21s glc s_atomic_add_x2 sdata:dst, sbase, soffset offset21s glc s_atomic_and sdata:dst, sbase, soffset offset21s glc s_atomic_and_x2 sdata:dst, sbase, soffset offset21s glc s_atomic_cmpswap sdata:dst:b32x2, sbase, soffset offset21s glc s_atomic_cmpswap_x2 sdata:dst:b64x2, sbase, soffset offset21s glc s_atomic_dec sdata:dst:u32, sbase, soffset offset21s glc s_atomic_dec_x2 sdata:dst:u64, sbase, soffset offset21s glc s_atomic_inc sdata:dst:u32, sbase, soffset offset21s glc s_atomic_inc_x2 sdata:dst:u64, sbase, soffset offset21s glc s_atomic_or sdata:dst, sbase, soffset offset21s glc s_atomic_or_x2 sdata:dst, sbase, soffset offset21s glc s_atomic_smax sdata:dst:i32, sbase, soffset offset21s glc s_atomic_smax_x2 sdata:dst:i64, sbase, soffset offset21s glc s_atomic_smin sdata:dst:i32, sbase, soffset offset21s glc s_atomic_smin_x2 sdata:dst:i64, sbase, soffset offset21s glc s_atomic_sub sdata:dst, sbase, soffset offset21s glc s_atomic_sub_x2 sdata:dst, sbase, soffset offset21s glc s_atomic_swap sdata:dst, sbase, soffset offset21s glc s_atomic_swap_x2 sdata:dst, sbase, soffset offset21s glc s_atomic_umax sdata:dst:u32, sbase, soffset offset21s glc s_atomic_umax_x2 sdata:dst:u64, sbase, soffset offset21s glc s_atomic_umin sdata:dst:u32, sbase, soffset offset21s glc s_atomic_umin_x2 sdata:dst:u64, sbase, soffset offset21s glc s_atomic_xor sdata:dst, sbase, soffset offset21s glc s_atomic_xor_x2 sdata:dst, sbase, soffset offset21s glc s_buffer_atomic_add sdata:dst, sbase, soffset offset20u glc s_buffer_atomic_add_x2 sdata:dst, sbase, soffset offset20u glc s_buffer_atomic_and sdata:dst, sbase, soffset offset20u glc s_buffer_atomic_and_x2 sdata:dst, sbase, soffset offset20u glc s_buffer_atomic_cmpswap sdata:dst:b32x2, sbase, soffset offset20u glc s_buffer_atomic_cmpswap_x2 sdata:dst:b64x2, sbase, soffset offset20u glc s_buffer_atomic_dec sdata:dst:u32, sbase, soffset offset20u glc s_buffer_atomic_dec_x2 sdata:dst:u64, sbase, soffset offset20u glc s_buffer_atomic_inc sdata:dst:u32, sbase, soffset offset20u glc s_buffer_atomic_inc_x2 sdata:dst:u64, sbase, soffset offset20u glc s_buffer_atomic_or sdata:dst, sbase, soffset offset20u glc s_buffer_atomic_or_x2 sdata:dst, sbase, soffset offset20u glc s_buffer_atomic_smax sdata:dst:i32, sbase, soffset offset20u glc s_buffer_atomic_smax_x2 sdata:dst:i64, sbase, soffset offset20u glc s_buffer_atomic_smin sdata:dst:i32, sbase, soffset offset20u glc s_buffer_atomic_smin_x2 sdata:dst:i64, sbase, soffset offset20u glc s_buffer_atomic_sub sdata:dst, sbase, soffset offset20u glc s_buffer_atomic_sub_x2 sdata:dst, sbase, soffset offset20u glc s_buffer_atomic_swap sdata:dst, sbase, soffset offset20u glc s_buffer_atomic_swap_x2 sdata:dst, sbase, soffset offset20u glc s_buffer_atomic_umax sdata:dst:u32, sbase, soffset offset20u glc s_buffer_atomic_umax_x2 sdata:dst:u64, sbase, soffset offset20u glc s_buffer_atomic_umin sdata:dst:u32, sbase, soffset offset20u glc s_buffer_atomic_umin_x2 sdata:dst:u64, sbase, soffset offset20u glc s_buffer_atomic_xor sdata:dst, sbase, soffset offset20u glc s_buffer_atomic_xor_x2 sdata:dst, sbase, soffset offset20u glc s_buffer_load_dword sdst, sbase, soffset offset20u glc dlc s_buffer_load_dwordx16 sdst, sbase, soffset offset20u glc dlc s_buffer_load_dwordx2 sdst, sbase, soffset offset20u glc dlc s_buffer_load_dwordx4 sdst, sbase, soffset offset20u glc dlc s_buffer_load_dwordx8 sdst, sbase, soffset offset20u glc dlc s_buffer_store_dword sdata, sbase, soffset offset20u glc s_buffer_store_dwordx2 sdata, sbase, soffset offset20u glc s_buffer_store_dwordx4 sdata, sbase, soffset offset20u glc s_dcache_discard sbase, soffset offset21s s_dcache_discard_x2 sbase, soffset offset21s s_dcache_inv s_dcache_wb s_get_waveid_in_workgroup sdst s_gl1_inv s_load_dword sdst, sbase, soffset offset21s glc dlc s_load_dwordx16 sdst, sbase, soffset offset21s glc dlc s_load_dwordx2 sdst, sbase, soffset offset21s glc dlc s_load_dwordx4 sdst, sbase, soffset offset21s glc dlc s_load_dwordx8 sdst, sbase, soffset offset21s glc dlc s_memrealtime sdst:b64 s_memtime sdst:b64 s_scratch_load_dword sdst, sbase, soffset offset21s glc dlc s_scratch_load_dwordx2 sdst, sbase, soffset offset21s glc dlc s_scratch_load_dwordx4 sdst, sbase, soffset offset21s glc dlc s_scratch_store_dword sdata, sbase, soffset offset21s glc s_scratch_store_dwordx2 sdata, sbase, soffset offset21s glc s_scratch_store_dwordx4 sdata, sbase, soffset offset21s glc s_store_dword sdata, sbase, soffset offset21s glc s_store_dwordx2 sdata, sbase, soffset offset21s glc s_store_dwordx4 sdata, sbase, soffset offset21s glc
SOP1¶
INSTRUCTION DST SRC ——————————————————————————————————————————————————— s_abs_i32 sdst, ssrc s_and_saveexec_b32 sdst, ssrc s_and_saveexec_b64 sdst, ssrc s_andn1_saveexec_b32