verifyFunction spends too much time validating !dbg metadata in +asserts builds #47056

rnk · 2020-10-02T18:58:06Z


Bugzilla Link	47712
Version	trunk
OS	Windows NT
CC	@adrian-prantl,@fhahn,@zmodem,@nico,@rotateright,@vedantk

Extended Description

I was profiling LLVM for other reasons and accidentally used a build with assertions enabled. What I discovered is that, if you compile PassBuilder.cpp with assertions, most of the time is spent in the verifier, mostly due to asserts in SLPVectorizer and LoopVectorizer. There are 187,486 samples overall, and 131,698 of them (70.2%) are inside calls to verifyFunction.

If you drill into what verifyFunction is doing, all the time is spent in recursive calls to visitMDNode. These calls are rooted in visitInstruction calls to visitMDNode:
https://github.com/llvm/llvm-project/blob/master/llvm/lib/IR/Verifier.cpp#L4431

visitMDNode in turn validates each of its operands. This is undesirable when validating a function: location metadata points upwards, so validating one location will ultimately validate the entire metadata graph rooted in the compile unit. See the tree of calls in this screenshot:
https://reviews.llvm.org/file/data/u5ms43qqwl3aahxopjql/PHID-FILE-ysoomivlg3fixz52f4ny/visitmdnode-profile.png

The code I linked was added in http://github.com/llvm/llvm-project/commit/8dfe819bcd23, but I'm not sure when this regressed. We used to ship clang with assertions enabled, so we saw these non-linear assert compile time issues in the field as they arose. Now that we are no longer watching carefully, it seems like these issues can slip in.

Should we try to establish the invariant that llvm::verifyFunction runs in O(#instructions), or should we move all calls to verifyFunction to EXPENSIVE_CHECKS?

fhahn · 2021-01-09T16:52:51Z

Should we try to establish the invariant that llvm::verifyFunction runs in
O(#instructions), or should we move all calls to verifyFunction to
EXPENSIVE_CHECKS?

I think verifyFunction is too valuable for builds with assertions to put it behind EXPENSIVE_CHECKS.

Limiting assertions outside EXPENSIVE_CHECKS to be O(#instructions) seems very reasonable and a natural cut-off for non-expensive checks. (I think plenty of the current uses of EXPENSIVE_CHECKS already follow a similar reasoning). Might be good to spell it out somewhere. We probably should limit it to #instruction the pass operates on, e.g. a loop pass should not verify the whole function for each loop in a function.

As for the issue at hand in verifyFunction, I think we could limit the expensive MD verification to EXPENSIVE_CHECKS to start with.

fhahn · 2021-01-13T10:16:46Z

A potentially related issue motivated another patch: https://reviews.llvm.org/D94576

Reid, do you happen to still have the file & flags to reproduce this? I tried with a recent Clang build & current main, but couldn't reproduce it by building PassBuilder.cpp

fhahn · 2021-01-13T10:36:36Z

Reid, do you happen to still have the file & flags to reproduce this? I
tried with a recent Clang build & current main, but couldn't reproduce it by
building PassBuilder.cpp

Nevermind, this can be reproduced by building PassBuilder.cpp with -O3 and -g. Time spent in verifyFunction goes from >0.5% of total compile-time to ~11% total compile-time. Here's the trace, with total time spent & percentage of total compile-time.

8.91 s 12.7% 3.00 ms (anonymous namespace)::LoopVectorize::runOnFunction(llvm::Function&)
8.90 s 12.7% 0 s llvm::LoopVectorizePass::runImpl(llvm::Function&, llvm::ScalarEvolution&, llvm::LoopInfo&, llvm::TargetTransformInfo&, llvm::DominatorTree&, llvm::BlockFrequencyInfo&, llvm::TargetLibraryInfo*, llvm::DemandedBits&, llvm::AAResults&, llvm::AssumptionCache&, std::__1::function<llvm::LoopAccessInfo const& (llvm::Loop&)>&, llvm::OptimizationRemarkEmitter&, llvm::ProfileSummaryInfo*)
8.89 s 12.7% 2.00 ms llvm::LoopVectorizePass::processLoop(llvm::Loop*)
8.61 s 12.3% 0 s llvm::verifyFunction(llvm::Function const&, llvm::raw_ostream*)
8.54 s 12.2% 66.00 ms (anonymous namespace)::Verifier::verify(llvm::Function const&)
7.97 s 11.4% 0 s (anonymous namespace)::Verifier::visitMDNode(llvm::MDNode const&)
7.97 s 11.4% 12.00 ms (anonymous namespace)::Verifier::visitMDNode(llvm::MDNode const&)
7.96 s 11.4% 30.00 ms (anonymous namespace)::Verifier::visitMDNode(llvm::MDNode const&)
7.93 s 11.3% 71.00 ms (anonymous namespace)::Verifier::visitMDNode(llvm::MDNode const&)
1.00 ms 0.0% 0 s ml_set_interrupts_enabled
1.00 ms 0.0% 0 s (anonymous namespace)::Verifier::verifySourceDebugInfo(llvm::DICompileUnit const&, llvm::DIFile const&)
347.00 ms 0.4% 2.00 ms (anonymous namespace)::Verifier::visitCallInst(llvm::CallInst&)

I also get similar results with http://github.com/llvm/llvm-project/commit/8dfe819bcd23 reverted.

rnk · 2021-11-27T03:56:43Z

mentioned in issue llvm/llvm-bugzilla-archive#48689

The Verifier is called anyway at the beginning and end of the Pass Pipeline, and some complex transformations like LoopVectorize call the Verifier anyway in order to report errors early. It can be argued that verifying metadata after vectorization is inessential, and not doing this can significantly improve compile-times on debug builds. With this patch, although the Verifier itself verifies metadata, change the calls to the Verifier in LoopVectorize to skip this check. Fixes llvm#47056.

llvmbot transferred this issue from llvm/llvm-bugzilla-archive Dec 10, 2021

artagnon mentioned this issue May 8, 2024

LoopVectorize: don't verify metadata #91484

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

verifyFunction spends too much time validating !dbg metadata in +asserts builds #47056

verifyFunction spends too much time validating !dbg metadata in +asserts builds #47056

rnk commented Oct 2, 2020

fhahn commented Jan 9, 2021

fhahn commented Jan 13, 2021

fhahn commented Jan 13, 2021

rnk commented Nov 27, 2021

verifyFunction spends too much time validating !dbg metadata in +asserts builds #47056

verifyFunction spends too much time validating !dbg metadata in +asserts builds #47056

Comments

rnk commented Oct 2, 2020

Extended Description

fhahn commented Jan 9, 2021

fhahn commented Jan 13, 2021

fhahn commented Jan 13, 2021

rnk commented Nov 27, 2021