More registers are used when multiple target regions are compiled together #45795

ye-luo · 2020-06-25T05:21:55Z


Bugzilla Link	46450
Resolution	FIXED
Resolved on	Jul 14, 2020 20:39
Version	unspecified
OS	Linux
CC	@hfinkel,@jdoerfert

Extended Description

I initially spotted this issue with AOMP but it seems from upstream clang.
ROCm/aomp#24

reproducer:
git clone https://github.com/ye-luo/miniqmc
cd miniqmc/build
cmake -DCMAKE_CXX_COMPILER=clang++ -DENABLE_OFFLOAD=ON
-DUSE_OBJECT_TARGET=ON -DCMAKE_EXE_LINKER_FLAGS="-v" ..
make -j32 check_spo_batched

all the 6 kernels use 254 registers.

Then I comment out "target teams" at 159, 311, 405.
make -j32 check_spo_batched
now all the 3 kernels left use 243 registers.

If I add
-DCMAKE_CXX_FLAGS="-Xcuda-ptxas -v" to cmake and print out register usage reported by ptxas. The three kernels take 146, 30, 30 registers when compiled.

I think the register usage is fine when kernels are compiled individually.
Somehow at linking, all the assembled kernels get the worst register usage among all the individual kernels.

It destroys performance completely.

ye-luo · 2020-06-25T14:02:49Z

Forgot to say, these line numbers 159, 311, 405 refers to source file src/QMCWaveFunctions/einspline_spo_omp.cpp

jdoerfert · 2020-07-15T03:39:34Z

Partially resolved with 5b0581a and D83832. For better automatic results we need LTO. I'll mark this as fixed for now as the original issue is gone (I think).

llvmbot transferred this issue from llvm/llvm-bugzilla-archive Dec 10, 2021

This issue was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More registers are used when multiple target regions are compiled together #45795

More registers are used when multiple target regions are compiled together #45795

ye-luo commented Jun 25, 2020

ye-luo commented Jun 25, 2020

jdoerfert commented Jul 15, 2020

More registers are used when multiple target regions are compiled together #45795

More registers are used when multiple target regions are compiled together #45795

Comments

ye-luo commented Jun 25, 2020

Extended Description

ye-luo commented Jun 25, 2020

jdoerfert commented Jul 15, 2020