|
LLVM 23.0.0git
|
#include "Target/AMDGPU/GCNSubtarget.h"
Public Types | |
| enum class | TrapHandlerAbi { NONE = 0x00 , AMDHSA = 0x01 } |
| enum class | TrapID { LLVMAMDHSATrap = 0x02 , LLVMAMDHSADebugTrap = 0x03 } |
| Public Types inherited from llvm::AMDGPUSubtarget | |
| enum | Generation { INVALID = 0 , R600 = 1 , R700 = 2 , EVERGREEN = 3 , NORTHERN_ISLANDS = 4 , SOUTHERN_ISLANDS = 5 , SEA_ISLANDS = 6 , VOLCANIC_ISLANDS = 7 , GFX9 = 8 , GFX10 = 9 , GFX11 = 10 , GFX12 = 11 , GFX13 = 12 } |
Static Public Member Functions | |
| static bool | hasHalfRate64Ops (const TargetSubtargetInfo &STI) |
| Static Public Member Functions inherited from llvm::AMDGPUSubtarget | |
| static const AMDGPUSubtarget & | get (const MachineFunction &MF) |
| static const AMDGPUSubtarget & | get (const TargetMachine &TM, const Function &F) |
Protected Attributes | |
| AMDGPU::TargetID | TargetID |
| unsigned | Gen = INVALID |
| InstrItineraryData | InstrItins |
| int | LDSBankCount = 0 |
| unsigned | MaxPrivateElementSize = 0 |
| unsigned | InstCacheLineSize = 0 |
| bool | DynamicVGPR = false |
| bool | DynamicVGPRBlockSize32 = false |
| bool | ScalarizeGlobal = false |
| const bool | BufferOOBRelaxed |
| const bool | TBufferOOBRelaxed |
| unsigned | MaxHardClauseLength = 0 |
| The maximum number of instructions that may be placed within an S_CLAUSE, which is one greater than the maximum argument to S_CLAUSE. | |
| Protected Attributes inherited from llvm::AMDGPUSubtarget | |
| bool | HasMulI24 = true |
| bool | HasMulU24 = true |
| bool | HasSMulHi = false |
| bool | HasFminFmaxLegacy = true |
| unsigned | EUsPerCU = 4 |
| unsigned | MaxWavesPerEU = 10 |
| unsigned | LocalMemorySize = 0 |
| unsigned | AddressableLocalMemorySize = 0 |
| unsigned | LDSAllocationGranularity = 0 |
| char | WavefrontSizeLog2 = 0 |
| unsigned | FlatOffsetBitWidth = 0 |
Definition at line 44 of file GCNSubtarget.h.
|
strong |
| Enumerator | |
|---|---|
| NONE | |
| AMDHSA | |
Definition at line 51 of file GCNSubtarget.h.
|
strong |
| Enumerator | |
|---|---|
| LLVMAMDHSATrap | |
| LLVMAMDHSADebugTrap | |
Definition at line 56 of file GCNSubtarget.h.
| GCNSubtarget::GCNSubtarget | ( | const Triple & | TT, |
| StringRef | GPU, | ||
| StringRef | FS, | ||
| const GCNTargetMachine & | TM, | ||
| bool | BufferOOBRelaxed = false, | ||
| bool | TBufferOOBRelaxed = false ) |
Definition at line 180 of file GCNSubtarget.cpp.
References llvm::AMDGPUSubtarget::AMDGPUSubtarget(), BufferOOBRelaxed, llvm::AMDGPUSubtarget::EUsPerCU, llvm::AMDGPU::IsaInfo::getEUsPerCU(), llvm::AMDGPU::IsaInfo::getMaxWavesPerEU(), getStackAlignment(), getTargetLowering(), initializeSubtargetDependencies(), InstrItins, llvm::AMDGPUSubtarget::MaxWavesPerEU, TargetID, and TBufferOOBRelaxed.
Referenced by initializeSubtargetDependencies(), and ~GCNSubtarget().
|
overridedefault |
References F, and GCNSubtarget().
|
override |
Definition at line 762 of file GCNSubtarget.cpp.
References llvm::SDep::Artificial, llvm::SDep::Data, E(), getInstrItineraryData(), llvm::ilist_node_impl< OptionsT >::getIterator(), llvm::SDep::getKind(), llvm::SDep::getLatency(), llvm::MachineInstr::getOpcode(), llvm::MachineInstr::getParent(), llvm::SDep::getReg(), getRegisterInfo(), I, llvm::MachineBasicBlock::instr_end(), llvm::MachineInstr::isBundle(), llvm::SDep::setLatency(), llvm::SDep::setReg(), and TRI.
Diagnose inconsistent subtarget features before attempting to codegen function F.
Definition at line 171 of file GCNSubtarget.cpp.
References F.
Referenced by llvm::AMDGPUDAGToDAGISel::runOnMachineFunction(), and llvm::AMDGPUInstructionSelector::setupMF().
| std::pair< unsigned, unsigned > GCNSubtarget::computeOccupancy | ( | const Function & | F, |
| unsigned | LDSSize = 0, | ||
| unsigned | NumSGPRs = 0, | ||
| unsigned | NumVGPRs = 0 ) const |
Subtarget's minimum/maximum occupancy, in number of waves per EU, that can be achieved when the only function running on a CU is F, each workgroup uses LDSSize bytes of LDS, and each wave uses NumSGPRs SGPRs and NumVGPRs VGPRs.
The flat workgroup sizes associated to the function are a range, so this returns a range as well.
Note that occupancy can be affected by the scratch allocation as well, but we do not have enough information to compute it.
Definition at line 464 of file GCNSubtarget.cpp.
References F, llvm::AMDGPU::getDynamicVGPRBlockSize(), getDynamicVGPRBlockSize(), getOccupancyWithNumSGPRs(), getOccupancyWithNumVGPRs(), llvm::AMDGPUSubtarget::getOccupancyWithWorkGroupSizes(), and isDynamicVGPREnabled().
|
inline |
Definition at line 398 of file GCNSubtarget.h.
References hasD16LoadStore(), and TargetID.
|
inlineoverride |
Definition at line 486 of file GCNSubtarget.h.
|
inlineoverride |
Definition at line 473 of file GCNSubtarget.h.
|
inlineoverride |
Definition at line 477 of file GCNSubtarget.h.
|
inline |
Definition at line 710 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX9.
|
inline |
Definition at line 850 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getAddressableNumArchVGPRs().
Referenced by getMaxNumVectorRegs().
|
inline |
Definition at line 776 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getAddressableNumSGPRs().
Definition at line 855 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getAddressableNumVGPRs().
| unsigned GCNSubtarget::getBaseMaxNumSGPRs | ( | const Function & | F, |
| std::pair< unsigned, unsigned > | WavesPerEU, | ||
| unsigned | PreloadedSGPRs, | ||
| unsigned | ReservedNumSGPRs ) const |
Definition at line 481 of file GCNSubtarget.cpp.
References F, llvm::AMDGPU::IsaInfo::FIXED_NUM_SGPRS_FOR_INIT_BUG, getMaxNumSGPRs(), and getMinNumSGPRs().
Referenced by getMaxNumSGPRs(), and getMaxNumSGPRs().
| unsigned GCNSubtarget::getBaseMaxNumVGPRs | ( | const Function & | F, |
| std::pair< unsigned, unsigned > | NumVGPRBounds ) const |
Definition at line 565 of file GCNSubtarget.cpp.
References F.
Referenced by getMaxNumVGPRs().
Definition at line 434 of file GCNSubtarget.cpp.
References getGeneration(), llvm::AMDGPUSubtarget::GFX10, isXNACKEnabled(), llvm::AMDGPUSubtarget::SEA_ISLANDS, and llvm::AMDGPUSubtarget::VOLCANIC_ISLANDS.
Referenced by getReservedNumSGPRs(), and getReservedNumSGPRs().
|
inline |
Definition at line 923 of file GCNSubtarget.h.
References llvm::SIRegisterInfo::getBoolRC(), and getRegisterInfo().
|
inlineoverride |
Definition at line 140 of file GCNSubtarget.h.
Definition at line 214 of file GCNSubtarget.cpp.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX10.
|
inline |
Definition at line 977 of file GCNSubtarget.h.
References DynamicVGPRBlockSize32.
Referenced by computeOccupancy(), and getMaxNumVGPRs().
|
inlineoverride |
Definition at line 128 of file GCNSubtarget.h.
|
inline |
Definition at line 168 of file GCNSubtarget.h.
References Gen.
Referenced by flatScratchIsPointer(), getBaseReservedNumSGPRs(), getConstantBusLimit(), getInstPrefSizeArgs(), getMaxWaveScratchSize(), getNSAThreshold(), getSetRegWaitStates(), getSNopBits(), has12DWordStoreHazard(), hasAddr64(), hasD16Images(), hasD16LoadStore(), hasDenormModeInst(), hasDPPRowShare(), hasDsAtomicAsyncBarrierArriveB64PipeBug(), hasDsSwizzleRotateMode(), hasExpertSchedulingMode(), hasExtendedWaitCounts(), hasFlat(), hasFlatLgkmVMemCountInOrder(), hasFlatScratchHiInB64InstHazard(), hasFlatScratchSVSSwizzleBug(), hasFmaakFmamkF32Insts(), hasFPAtomicToDenormModeHazard(), hasFractBug(), hasGWSAutoReplay(), hasInstPrefetch(), hasLdsDirect(), hasLdsWaitVMSRC(), hasLegacyGeometry(), hasMad64_32(), hasMed3_16(), hasMergedShaders(), hasMin3Max3_16(), hasMultiDwordFlatScratchAddressing(), hasNonNSAEncoding(), hasOnlyRevVALUShifts(), hasPermLane64(), hasPKF32InstsReplicatingLower32BitsOfScalarInput(), hasReadM0LdsDirectHazard(), hasReadM0LdsDmaHazard(), hasReadM0MovRelInterpHazard(), hasReadM0SendMsgHazard(), hasReadVCCZBug(), hasRFEHazards(), hasRrWGMode(), hasScalarAddSub64(), hasScalarCompareEq64(), hasScalarSMulU64(), hasScalarSubwordLoads(), hasSCmpK(), hasScratchBaseForwardingHazard(), hasSignedScratchOffsets(), hasSMRDReadVALUDefHazard(), hasSplitBarriers(), hasUsableDivScaleConditionOutput(), hasUsableDSOffset(), hasVALUMaskWriteHazard(), hasVALUPartialForwardingHazard(), hasVMEMReadSGPRVALUDefHazard(), hasVOP3DPP(), initializeSubtargetDependencies(), isGFX11Plus(), ldsRequiresM0Init(), partialVCCWritesUpdateVCCZ(), privateMemoryResourceIsRangeChecked(), requiresWaitOnWorkgroupReleaseFence(), llvm::AMDGPUAsmPrinter::runOnMachineFunction(), setRegModeNeedsVNOPs(), shouldClusterStores(), supportsBPermute(), supportsGetDoorbellID(), supportsMinMaxDenormModes(), supportsWave32(), supportsWaveWideBPermute(), supportsWGP(), vmemWriteNeedsExpWaitcnt(), and zeroesHigh16BitsOfDest().
|
inlineoverride |
Definition at line 144 of file GCNSubtarget.h.
|
inline |
Instruction cache line size in bytes (64 for pre-GFX11, 128 for GFX11+).
Definition at line 198 of file GCNSubtarget.h.
References InstCacheLineSize.
Referenced by getInstPrefSizeArgs(), and llvm::SITargetLowering::SITargetLowering().
|
inline |
Definition at line 445 of file GCNSubtarget.h.
References assert(), CacheLineSize, getGeneration(), getInstCacheLineSize(), llvm::AMDGPUSubtarget::GFX11, and isGFX11Plus().
Referenced by llvm::AMDGPUAsmPrinter::endFunction().
|
inlineoverride |
Definition at line 126 of file GCNSubtarget.h.
Referenced by llvm::SITargetLowering::AddMemOpInit(), llvm::SITargetLowering::AdjustInstrPostInstrSelection(), llvm::SITargetLowering::bundleInstWithWaitcnt(), llvm::SITargetLowering::emitGWSMemViolTestLoop(), llvm::AMDGPUAsmPrinter::emitInstruction(), llvm::SITargetLowering::EmitInstrWithCustomInserter(), llvm::SIProgramInfo::getFunctionCodeSize(), llvm::SITargetLowering::getPrefLoopAlignment(), llvm::SITargetLowering::PerformDAGCombine(), llvm::SITargetLowering::PostISelFolding(), llvm::SITargetLowering::splitKillBlock(), and llvm::SITargetLowering::wrapAddr64Rsrc().
|
inlineoverride |
Definition at line 162 of file GCNSubtarget.h.
References InstrItins.
Referenced by adjustSchedDependency().
|
inlineoverride |
Definition at line 148 of file GCNSubtarget.h.
|
inline |
Return the number of high bits known to be zero for a frame index.
Definition at line 191 of file GCNSubtarget.h.
References llvm::countl_zero(), getMaxWaveScratchSize(), and llvm::AMDGPUSubtarget::getWavefrontSizeLog2().
Referenced by llvm::SITargetLowering::LowerFormalArguments().
|
inline |
Definition at line 195 of file GCNSubtarget.h.
References LDSBankCount.
|
inlineoverride |
Definition at line 152 of file GCNSubtarget.h.
|
inlineoverridevirtual |
Implements llvm::AMDGPUSubtarget.
Definition at line 939 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getMaxFlatWorkGroupSize().
| unsigned llvm::GCNSubtarget::getMaxLocalMemSizeWithWaveCount | ( | unsigned | WaveCount, |
| const Function & | ) const |
Return the amount of LDS that can be used that will not restrict the occupancy lower than WaveCount.
Definition at line 891 of file GCNSubtarget.h.
References F, and getMaxNumVGPRs().
| unsigned GCNSubtarget::getMaxNumPreloadedSGPRs | ( | ) | const |
Definition at line 535 of file GCNSubtarget.cpp.
Referenced by getMaxNumSGPRs().
F, or number of SGPRs explicitly requested using "amdgpu-num-sgpr" attribute attached to function F.Definition at line 560 of file GCNSubtarget.cpp.
References F, getBaseMaxNumSGPRs(), getMaxNumPreloadedSGPRs(), getReservedNumSGPRs(), and llvm::AMDGPUSubtarget::getWavesPerEU().
| unsigned GCNSubtarget::getMaxNumSGPRs | ( | const MachineFunction & | MF | ) | const |
MF, or number of SGPRs explicitly requested using "amdgpu-num-sgpr" attribute attached to function MF.Definition at line 528 of file GCNSubtarget.cpp.
References F, getBaseMaxNumSGPRs(), llvm::MachineFunction::getFunction(), llvm::MachineFunction::getInfo(), llvm::SIMachineFunctionInfo::getNumPreloadedSGPRs(), getReservedNumSGPRs(), and llvm::SIMachineFunctionInfo::getWavesPerEU().
Definition at line 788 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getMaxNumSGPRs().
Referenced by getBaseMaxNumSGPRs().
|
inline |
Definition at line 496 of file GCNSubtarget.h.
References llvm::AMDGPU::getMaxNumUserSGPRs().
Return a pair of maximum numbers of VGPRs and AGPRs that meet the number of waves per execution unit required for the function MF.
Definition at line 598 of file GCNSubtarget.cpp.
References llvm::alignTo(), assert(), F, getAddressableNumArchVGPRs(), llvm::AMDGPU::getIntegerPairAttribute(), and getMaxNumVGPRs().
F, or number of VGPRs explicitly requested using "amdgpu-num-vgpr" attribute attached to function F.Definition at line 580 of file GCNSubtarget.cpp.
References F, getBaseMaxNumVGPRs(), llvm::AMDGPU::getDynamicVGPRBlockSize(), getDynamicVGPRBlockSize(), getMaxNumVGPRs(), getMinNumVGPRs(), llvm::AMDGPUSubtarget::getWavesPerEU(), and isDynamicVGPREnabled().
| unsigned GCNSubtarget::getMaxNumVGPRs | ( | const MachineFunction & | MF | ) | const |
MF, or number of VGPRs explicitly requested using "amdgpu-num-vgpr" attribute attached to function MF.Definition at line 593 of file GCNSubtarget.cpp.
References llvm::MachineFunction::getFunction(), and getMaxNumVGPRs().
|
inline |
WavesPerEU. Definition at line 869 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getMaxNumVGPRs().
Referenced by getMaxNumAGPRs(), getMaxNumVectorRegs(), getMaxNumVGPRs(), and getMaxNumVGPRs().
Definition at line 200 of file GCNSubtarget.h.
References hasFlatScratchEnabled(), and MaxPrivateElementSize.
Referenced by llvm::SITargetLowering::canMergeStoresTo().
|
inline |
Definition at line 176 of file GCNSubtarget.h.
References getGeneration(), llvm::AMDGPUSubtarget::GFX11, and llvm::AMDGPUSubtarget::GFX12.
Referenced by getKnownHighZeroBitsForFrameIndex().
|
inline |
Definition at line 292 of file AMDGPUSubtarget.h.
|
inlineoverridevirtual |
FlatWorkGroupSize. Implements llvm::AMDGPUSubtarget.
Definition at line 929 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getMaxWorkGroupsPerCU().
|
inlineoverridevirtual |
Implements llvm::AMDGPUSubtarget.
Definition at line 934 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getMinFlatWorkGroupSize().
Definition at line 782 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getMinNumSGPRs().
Referenced by getBaseMaxNumSGPRs().
|
inline |
WavesPerEU. Definition at line 861 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getMinNumVGPRs().
Referenced by getMaxNumVGPRs().
|
inlineoverridevirtual |
Implements llvm::AMDGPUSubtarget.
Definition at line 952 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getMinWavesPerEU().
Definition at line 533 of file GCNSubtarget.h.
References llvm::AMDGPU::getNSAMaxSize().
| unsigned GCNSubtarget::getNSAThreshold | ( | const MachineFunction & | MF | ) | const |
Definition at line 831 of file GCNSubtarget.cpp.
References llvm::Function::getFnAttributeAsParsedInteger(), llvm::MachineFunction::getFunction(), getGeneration(), llvm::AMDGPUSubtarget::GFX12, and NSAThreshold.
Return the maximum number of waves per SIMD for kernels using SGPRs SGPRs.
Definition at line 422 of file GCNSubtarget.cpp.
References llvm::AMDGPU::IsaInfo::getOccupancyWithNumSGPRs().
Referenced by computeOccupancy().
| unsigned GCNSubtarget::getOccupancyWithNumVGPRs | ( | unsigned | VGPRs, |
| unsigned | DynamicVGPRBlockSize ) const |
Return the maximum number of waves per SIMD for kernels using VGPRs VGPRs.
Definition at line 427 of file GCNSubtarget.cpp.
References llvm::AMDGPU::IsaInfo::getNumWavesPerEUWithNumVGPRs().
Referenced by computeOccupancy().
|
inlineoverride |
Definition at line 156 of file GCNSubtarget.h.
|
inlineoverride |
Definition at line 134 of file GCNSubtarget.h.
Referenced by adjustSchedDependency(), llvm::GCNTargetMachine::convertFuncInfoToYAML(), llvm::AMDGPUAsmPrinter::emitInstruction(), getBoolRC(), llvm::SITargetLowering::insertCopiesSplitCSR(), llvm::SITargetLowering::isEligibleForTailCallOptimization(), llvm::SITargetLowering::LowerFormalArguments(), llvm::AMDGPUCallLowering::lowerFormalArgumentsKernel(), llvm::SITargetLowering::LowerReturn(), and llvm::SITargetLowering::SITargetLowering().
F. Definition at line 455 of file GCNSubtarget.cpp.
References F, and getBaseReservedNumSGPRs().
| unsigned GCNSubtarget::getReservedNumSGPRs | ( | const MachineFunction & | MF | ) | const |
MF. Definition at line 450 of file GCNSubtarget.cpp.
References getBaseReservedNumSGPRs(), llvm::MachineFunction::getInfo(), llvm::SIMachineFunctionInfo::getUserSGPRInfo(), and llvm::GCNUserSGPRUsageInfo::hasFlatScratchInit().
Referenced by getMaxNumSGPRs(), and getMaxNumSGPRs().
|
inline |
Definition at line 480 of file GCNSubtarget.h.
References ScalarizeGlobal.
|
override |
Definition at line 210 of file GCNSubtarget.cpp.
|
inline |
Number of hazard wait states for s_setreg_b32/s_setreg_imm32_b32.
Definition at line 296 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::SEA_ISLANDS.
|
inline |
Definition at line 761 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getSGPRAllocGranule().
|
inline |
Definition at line 766 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getSGPREncodingGranule().
|
inline |
Definition at line 1014 of file GCNSubtarget.h.
References getGeneration(), llvm::AMDGPUSubtarget::GFX12, and llvm::AMDGPUSubtarget::VOLCANIC_ISLANDS.
|
inline |
Definition at line 471 of file GCNSubtarget.h.
Referenced by GCNSubtarget().
|
inline |
Definition at line 160 of file GCNSubtarget.h.
References TargetID.
Referenced by llvm::AMDGPUAsmPrinter::runOnMachineFunction().
|
inlineoverride |
Definition at line 132 of file GCNSubtarget.h.
Referenced by GCNSubtarget(), and getTM().
|
inline |
Definition at line 771 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getTotalNumSGPRs().
|
inline |
Definition at line 844 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getTotalNumVGPRs().
|
inline |
Definition at line 252 of file GCNSubtarget.h.
References AMDHSA, llvm::AMDGPUSubtarget::isAmdHsaOS(), and NONE.
Definition at line 834 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getVGPRAllocGranule().
|
inline |
Definition at line 839 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getVGPREncodingGranule().
|
inlineoverridevirtual |
FlatWorkGroupSize. Implements llvm::AMDGPUSubtarget.
Definition at line 946 of file GCNSubtarget.h.
References llvm::AMDGPU::IsaInfo::getWavesPerEUForWorkGroup().
|
inline |
Definition at line 551 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::SOUTHERN_ISLANDS.
|
inline |
Definition at line 674 of file GCNSubtarget.h.
|
inline |
Definition at line 220 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::VOLCANIC_ISLANDS.
Referenced by initializeSubtargetDependencies().
|
inline |
Definition at line 250 of file GCNSubtarget.h.
|
inline |
Definition at line 375 of file GCNSubtarget.h.
|
inline |
Definition at line 429 of file GCNSubtarget.h.
|
inline |
Return true if the target's EXP instruction has the COMPR flag, which affects the meaning of the EN (enable) bits.
Definition at line 647 of file GCNSubtarget.h.
|
inline |
Definition at line 754 of file GCNSubtarget.h.
|
inline |
Definition at line 613 of file GCNSubtarget.h.
|
inline |
Definition at line 402 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::VOLCANIC_ISLANDS.
|
inline |
Definition at line 396 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX9.
Referenced by d16PreservesUnusedBits().
|
inline |
Return true if the target has the S_DELAY_ALU instruction.
Definition at line 656 of file GCNSubtarget.h.
|
inline |
Definition at line 310 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX10.
|
inline |
Definition at line 592 of file GCNSubtarget.h.
|
inline |
Definition at line 518 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX10.
|
inline |
Definition at line 319 of file GCNSubtarget.h.
|
inline |
Definition at line 989 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX12.
|
inline |
Definition at line 516 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX9.
|
inline |
Definition at line 589 of file GCNSubtarget.h.
|
inline |
Definition at line 556 of file GCNSubtarget.h.
|
inline |
Definition at line 679 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX12.
|
inline |
Definition at line 377 of file GCNSubtarget.h.
|
inline |
Definition at line 665 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX12.
|
inline |
Definition at line 224 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::SOUTHERN_ISLANDS.
Referenced by initializeSubtargetDependencies().
|
inline |
Definition at line 394 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX9.
|
inline |
Definition at line 368 of file GCNSubtarget.h.
Referenced by getMaxPrivateElementSize().
|
inline |
Definition at line 1001 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX12.
|
inline |
Definition at line 362 of file GCNSubtarget.h.
|
inline |
Definition at line 366 of file GCNSubtarget.h.
|
inline |
Definition at line 653 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX11.
|
inline |
Definition at line 357 of file GCNSubtarget.h.
|
inline |
Definition at line 525 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX10.
|
inline |
Definition at line 529 of file GCNSubtarget.h.
|
inline |
Definition at line 758 of file GCNSubtarget.h.
|
inline |
Definition at line 599 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX10.
|
inline |
Definition at line 234 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::SOUTHERN_ISLANDS.
|
inline |
Definition at line 744 of file GCNSubtarget.h.
|
inline |
Definition at line 373 of file GCNSubtarget.h.
|
inline |
Definition at line 413 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX9.
|
inline |
Definition at line 416 of file GCNSubtarget.h.
|
static |
|
inline |
Definition at line 597 of file GCNSubtarget.h.
References MaxHardClauseLength.
|
inline |
Definition at line 218 of file GCNSubtarget.h.
|
inline |
Definition at line 437 of file GCNSubtarget.h.
References getGeneration(), llvm::AMDGPUSubtarget::GFX10, and llvm::AMDGPUSubtarget::GFX11.
|
inline |
Definition at line 443 of file GCNSubtarget.h.
References isGFX11Plus().
Referenced by llvm::AMDGPUAsmPrinter::endFunction().
|
inline |
Definition at line 731 of file GCNSubtarget.h.
|
inline |
Definition at line 386 of file GCNSubtarget.h.
|
inline |
Definition at line 605 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX11.
|
inline |
Definition at line 506 of file GCNSubtarget.h.
|
inline |
Definition at line 507 of file GCNSubtarget.h.
|
inline |
Returns true if the target supports global_load_lds_dwordx3/global_load_lds_dwordx4 or buffer_load_dwordx3/buffer_load_dwordx4 with the lds bit.
Definition at line 661 of file GCNSubtarget.h.
|
inline |
Definition at line 575 of file GCNSubtarget.h.
|
inline |
Definition at line 607 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX12.
|
inline |
Definition at line 719 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX11.
|
inline |
Definition at line 619 of file GCNSubtarget.h.
|
inline |
Definition at line 427 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::SEA_ISLANDS.
| bool GCNSubtarget::hasMadF16 | ( | ) | const |
Definition at line 412 of file GCNSubtarget.cpp.
Referenced by llvm::SITargetLowering::SITargetLowering().
|
inline |
Definition at line 236 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX9.
|
inline |
Definition at line 716 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX9.
|
inline |
Definition at line 238 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX9.
Referenced by supportsMin3Max3().
|
inline |
Definition at line 390 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX9.
|
inline |
Definition at line 531 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX12.
|
inline |
Return true if the target's EXP instruction supports the NULL export target.
Definition at line 651 of file GCNSubtarget.h.
|
inline |
Definition at line 230 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::VOLCANIC_ISLANDS.
|
inline |
Definition at line 512 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX11.
|
inline |
Definition at line 670 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX12.
|
inline |
Definition at line 736 of file GCNSubtarget.h.
|
inline |
Definition at line 523 of file GCNSubtarget.h.
|
inline |
Definition at line 441 of file GCNSubtarget.h.
|
inline |
Definition at line 571 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX9.
|
inline |
Definition at line 567 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX9.
|
inline |
Definition at line 558 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX9.
|
inline |
Definition at line 562 of file GCNSubtarget.h.
References getGeneration(), llvm::AMDGPUSubtarget::GFX9, and llvm::AMDGPUSubtarget::VOLCANIC_ISLANDS.
|
inline |
Extra wait hazard is needed in some cases before s_cbranch_vccnz/s_cbranch_vccz.
Definition at line 276 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::SEA_ISLANDS.
|
inline |
Definition at line 350 of file GCNSubtarget.h.
References BufferOOBRelaxed.
|
inline |
Definition at line 351 of file GCNSubtarget.h.
References TBufferOOBRelaxed.
|
inline |
Definition at line 293 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::VOLCANIC_ISLANDS.
|
inline |
Definition at line 725 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX12.
|
inline |
Definition at line 418 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX12.
|
inline |
Definition at line 502 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::VOLCANIC_ISLANDS.
|
inline |
Definition at line 246 of file GCNSubtarget.h.
|
inline |
Definition at line 244 of file GCNSubtarget.h.
|
inline |
Definition at line 420 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX12.
|
inline |
Definition at line 248 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX12.
|
inline |
Definition at line 540 of file GCNSubtarget.h.
|
inline |
Definition at line 461 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX12.
|
inline |
Definition at line 995 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX12.
|
inline |
Definition at line 739 of file GCNSubtarget.h.
|
inline |
Definition at line 581 of file GCNSubtarget.h.
|
inline |
Definition at line 543 of file GCNSubtarget.h.
|
inline |
Definition at line 729 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX12.
|
inline |
A read of an SGPR by SMRD instruction requires 4 wait states when the SGPR was written by a VALU instruction.
Definition at line 283 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::SOUTHERN_ISLANDS.
|
inline |
Return true if the target has the S_PACK_HL_B32_B16 instruction.
Definition at line 639 of file GCNSubtarget.h.
|
inline |
Definition at line 722 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX12.
|
inline |
Definition at line 756 of file GCNSubtarget.h.
|
inline |
Definition at line 242 of file GCNSubtarget.h.
|
inline |
Definition at line 585 of file GCNSubtarget.h.
|
inline |
Definition at line 334 of file GCNSubtarget.h.
|
inline |
Definition at line 338 of file GCNSubtarget.h.
|
inline |
Definition at line 342 of file GCNSubtarget.h.
|
inline |
Condition output from div_scale is usable.
Definition at line 270 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::SOUTHERN_ISLANDS.
|
inline |
True if the offset field of DS instructions works as expected.
On SI, the offset uses a 16-bit adder and does not always wrap properly.
Definition at line 263 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::SEA_ISLANDS.
|
inline |
Definition at line 547 of file GCNSubtarget.h.
References isWave32().
|
inline |
Definition at line 625 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX11.
|
inline |
Definition at line 609 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX11.
|
inline |
Definition at line 627 of file GCNSubtarget.h.
|
inline |
Return true if the target has the V_CVT_PK_I16_F32/V_CVT_PK_U16_F32 instructions.
Definition at line 643 of file GCNSubtarget.h.
|
inline |
Definition at line 595 of file GCNSubtarget.h.
|
inline |
Definition at line 381 of file GCNSubtarget.h.
|
inline |
A read of an SGPR by a VMEM instruction requires 5 wait states when the SGPR was written by a VALU Instruction.
Definition at line 289 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::VOLCANIC_ISLANDS.
|
inline |
Definition at line 603 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX11.
|
inline |
Definition at line 733 of file GCNSubtarget.h.
|
inline |
Have v_trunc_f64, v_ceil_f64, v_rndne_f64.
Definition at line 322 of file GCNSubtarget.h.
| GCNSubtarget & GCNSubtarget::initializeSubtargetDependencies | ( | const Triple & | TT, |
| StringRef | GPU, | ||
| StringRef | FS ) |
Definition at line 57 of file GCNSubtarget.cpp.
References llvm::AMDGPUSubtarget::AddressableLocalMemorySize, llvm::Triple::AMDHSA, assert(), llvm::dbgs(), llvm::AMDGPUSubtarget::FlatOffsetBitWidth, GCNSubtarget(), Gen, getGeneration(), llvm::AMDGPU::getLdsDwGranularity(), llvm::AMDGPU::IsaInfo::getLocalMemorySize(), llvm::AMDGPUSubtarget::getWavefrontSizeLog2(), llvm::AMDGPUSubtarget::GFX10, llvm::AMDGPUSubtarget::GFX9, hasAddr64(), hasFlat(), llvm::AMDGPUSubtarget::HasFminFmaxLegacy, llvm::AMDGPUSubtarget::HasSMulHi, InstCacheLineSize, llvm::AMDGPUSubtarget::INVALID, llvm::AMDGPUSubtarget::isAmdHsaOS(), llvm::isPowerOf2_32(), llvm::AMDGPUSubtarget::LDSAllocationGranularity, LDSBankCount, LLVM_DEBUG, llvm::AMDGPUSubtarget::LocalMemorySize, MaxPrivateElementSize, ParseSubtargetFeatures(), llvm::AMDGPUSubtarget::SEA_ISLANDS, llvm::AMDGPUSubtarget::SOUTHERN_ISLANDS, TargetID, llvm::AMDGPUSubtarget::VOLCANIC_ISLANDS, and llvm::AMDGPUSubtarget::WavefrontSizeLog2.
Referenced by GCNSubtarget().
|
inline |
Definition at line 353 of file GCNSubtarget.h.
|
inline |
Definition at line 976 of file GCNSubtarget.h.
References DynamicVGPR.
Referenced by computeOccupancy(), and getMaxNumVGPRs().
|
inline |
Definition at line 170 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX11.
Referenced by getInstPrefSizeArgs(), and hasInstPrefSize().
Definition at line 423 of file GCNSubtarget.h.
References F, llvm::AMDGPUSubtarget::isMesa3DOS(), and llvm::AMDGPU::isShader().
|
inline |
Definition at line 355 of file GCNSubtarget.h.
|
inline |
Definition at line 348 of file GCNSubtarget.h.
Referenced by requiresWaitOnWorkgroupReleaseFence().
|
inline |
Definition at line 911 of file GCNSubtarget.h.
References llvm::AMDGPUSubtarget::getWavefrontSize().
Referenced by hasUserSGPRInit16BugInWave32(), mirFileLoaded(), and supportsWaveWideBPermute().
|
inline |
Definition at line 913 of file GCNSubtarget.h.
References llvm::AMDGPUSubtarget::getWavefrontSize().
Referenced by lowerWaveShuffle().
|
inline |
Returns if the wavesize of this subtarget is known reliable.
This is false only for the a default target-cpu that does not have an explicit +wavefrontsize target feature.
Definition at line 918 of file GCNSubtarget.h.
|
inline |
Definition at line 346 of file GCNSubtarget.h.
References TargetID.
Referenced by getBaseReservedNumSGPRs(), and needsConstrainedOpcode().
|
inline |
Return if most LDS instructions have an m0 use that require m0 to be initialized.
Definition at line 406 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX9.
|
inline |
Definition at line 545 of file GCNSubtarget.h.
References EnableLoadStoreOpt.
|
inline |
Definition at line 684 of file GCNSubtarget.h.
References MaxHardClauseLength.
|
override |
Definition at line 401 of file GCNSubtarget.cpp.
References isWave32(), MBB, and MI.
|
inline |
Return if operations acting on VGPR tuples require even alignment.
Definition at line 636 of file GCNSubtarget.h.
|
inline |
Definition at line 750 of file GCNSubtarget.h.
|
override |
Definition at line 364 of file GCNSubtarget.cpp.
References llvm::dbgs(), F, llvm::Attribute::getValueAsString(), llvm::Attribute::isValid(), LLVM_DEBUG, llvm::MachineSchedPolicy::OnlyBottomUp, and llvm::MachineSchedPolicy::OnlyTopDown.
|
override |
Definition at line 340 of file GCNSubtarget.cpp.
References F, llvm::AMDGPU::getSchedStrategy(), llvm::MachineSchedPolicy::OnlyBottomUp, llvm::MachineSchedPolicy::OnlyTopDown, llvm::MachineSchedPolicy::ShouldTrackLaneMasks, and llvm::MachineSchedPolicy::ShouldTrackPressure.
Referenced by initializeSubtargetDependencies().
|
inline |
Writes to VCC_LO/VCC_HI update the VCCZ flag.
Definition at line 279 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX10.
|
inline |
Definition at line 326 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX9.
|
inline |
Definition at line 621 of file GCNSubtarget.h.
|
inlineoverride |
Definition at line 981 of file GCNSubtarget.h.
|
inline |
Definition at line 970 of file GCNSubtarget.h.
|
inline |
Definition at line 974 of file GCNSubtarget.h.
|
inline |
Definition at line 1042 of file GCNSubtarget.h.
References getGeneration(), llvm::AMDGPUSubtarget::GFX10, and isTgSplitEnabled().
|
inline |
Definition at line 1008 of file GCNSubtarget.h.
|
inline |
Definition at line 631 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX12.
|
inline |
Definition at line 479 of file GCNSubtarget.h.
References ScalarizeGlobal.
|
inline |
Definition at line 962 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX11.
|
inline |
Definition at line 1022 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::VOLCANIC_ISLANDS.
|
inline |
Definition at line 256 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX9.
|
inline |
Definition at line 305 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX9.
|
inline |
Definition at line 907 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX10.
|
inline |
Definition at line 909 of file GCNSubtarget.h.
|
inline |
Definition at line 1026 of file GCNSubtarget.h.
References getGeneration(), llvm::AMDGPUSubtarget::GFX12, llvm::AMDGPUSubtarget::GFX9, and isWave32().
Referenced by lowerWaveShuffle().
|
inline |
Definition at line 212 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::GFX10.
Referenced by llvm::AMDGPU::HSAMD::MetadataStreamerMsgPackV4::getHSAKernelProps().
|
inline |
Definition at line 265 of file GCNSubtarget.h.
|
override |
Definition at line 420 of file GCNSubtarget.cpp.
References UseAA.
|
inline |
Definition at line 316 of file GCNSubtarget.h.
|
inline |
Definition at line 332 of file GCNSubtarget.h.
|
inline |
Return true if real (non-fake) variants of True16 instructions using 16-bit registers should be code-generated.
Fake True16 instructions are identical to non-fake ones except that they take 32-bit registers as operands and always use their low halves.
Definition at line 1038 of file GCNSubtarget.h.
|
inline |
Definition at line 623 of file GCNSubtarget.h.
| bool GCNSubtarget::useVGPRIndexMode | ( | ) | const |
Definition at line 416 of file GCNSubtarget.cpp.
References EnableVGPRIndexMode.
|
inline |
Definition at line 433 of file GCNSubtarget.h.
References getGeneration(), and llvm::AMDGPUSubtarget::SEA_ISLANDS.
Returns if the result of this instruction with a 16-bit result returned in a 32-bit register implicitly zeroes the high 16-bits, rather than preserve the original value.
This list was mostly derived from experimentation.
Definition at line 242 of file GCNSubtarget.cpp.
References getGeneration(), llvm::AMDGPUSubtarget::GFX9, and llvm::AMDGPUSubtarget::VOLCANIC_ISLANDS.
Definition at line 87 of file GCNSubtarget.h.
Referenced by GCNSubtarget(), and hasRelaxedBufferOOBMode().
|
protected |
Definition at line 84 of file GCNSubtarget.h.
Referenced by isDynamicVGPREnabled().
|
protected |
Definition at line 85 of file GCNSubtarget.h.
Referenced by getDynamicVGPRBlockSize().
Definition at line 75 of file GCNSubtarget.h.
Referenced by getGeneration(), and initializeSubtargetDependencies().
|
protected |
Definition at line 81 of file GCNSubtarget.h.
Referenced by getInstCacheLineSize(), and initializeSubtargetDependencies().
|
protected |
Definition at line 76 of file GCNSubtarget.h.
Referenced by GCNSubtarget(), and getInstrItineraryData().
|
protected |
Definition at line 77 of file GCNSubtarget.h.
Referenced by getLDSBankCount(), and initializeSubtargetDependencies().
|
protected |
The maximum number of instructions that may be placed within an S_CLAUSE, which is one greater than the maximum argument to S_CLAUSE.
A value of 0 indicates a lack of S_CLAUSE support.
Definition at line 93 of file GCNSubtarget.h.
Referenced by hasHardClauses(), and maxHardClauseLength().
|
protected |
Definition at line 78 of file GCNSubtarget.h.
Referenced by getMaxPrivateElementSize(), and initializeSubtargetDependencies().
|
protected |
Definition at line 86 of file GCNSubtarget.h.
Referenced by getScalarizeGlobalBehavior(), and setScalarizeGlobalBehavior().
|
protected |
Definition at line 74 of file GCNSubtarget.h.
Referenced by d16PreservesUnusedBits(), GCNSubtarget(), getTargetID(), initializeSubtargetDependencies(), and isXNACKEnabled().
Definition at line 88 of file GCNSubtarget.h.
Referenced by GCNSubtarget(), and hasRelaxedTBufferOOBMode().