Doing certain kinds of IPO over comdat functions is unsound #27148

sanjoy · 2016-02-29T19:41:35Z


Bugzilla Link	26774
Version	trunk
OS	All
Attachments	Reproducer
CC	@dexonsmith,@hfinkel,@joker-eph,@rnk,@wjristow

Extended Description

This was discussed on llvm-dev: http://lists.llvm.org/pipermail/llvm-dev/2016-February/095833.html

Description copy-pasted from the thread:

"""
Let's start with an example that shows that we have a problem (direct
copy/paste from the guard intrinsics thread). Say we have:

void foo() available_externally {
  %t0 = load atomic %ptr
  %t1 = load atomic %ptr
  if (%t0 != %t1) print("X");
}
void main() {
  foo();
  print("Y");
}

The possible behaviors of the above program are {print("X"),
print("Y")} or {print("Y")}. But if we run opt then we have

void foo() available_externally readnone nounwind {
  ;; After CSE'ing the two loads and folding the condition
}
void main() {
  foo();
  print("Y");
}

and some generic reordering

void foo() available_externally readnone nounwind {
  ;; After CSE'ing the two loads and folding the condition
}
void main() {
  print("Y");
  foo();  // legal since we're moving a readnone nounwind function that
          // was guaranteed to execute (hence can't have UB)
}

If we do not inline @foo(), and instead re-link the call site in @main
to some non-optimized copy (or differently optimized copy) of @foo,
then it is possible for the program to have the behavior {print("Y");
print ("X")}, which was disallowed in the earlier program.

In other words, opt refined the semantics of @foo() (i.e. reduced the
set of behaviors it may have) in ways that would make later
optimizations invalid if we de-refine the implementation of @foo().

The above example is clearly fabricated, but such cases can come up
even if everything is optimized to the same level. E.g. one of the
atomic loads in the unrefined implementation of @foo() could have been
hidden behind a function call, whose body existed in only one module.
That module would then be able to refine @foo() to ret void but
other modules won't.
"""

The text was updated successfully, but these errors were encountered:

dexonsmith · 2016-02-29T20:51:21Z

Adding a link to a complete reproducer (also from Sanjoy) that has nothing to do with atomics:
https://github.com/sanjoy/comdat-ipo

sanjoy · 2016-02-29T20:54:49Z

Adding a link to a complete reproducer (also from Sanjoy) that has nothing
to do with atomics:
https://github.com/sanjoy/comdat-ipo

FYI: that's the zip file I've attached to this bug.

rnk · 2016-02-29T22:29:31Z

I vote that we stop inferring readnone/readonly/etc on functions with isWeakForLinker linkage. I expect that in most places where we can infer such attributes, inlining is likely to happen anyway, and be just as profitable.

Are we aware of any other IPO soundness issues with other passes that aren't functionattrs? I think IPSCP is fine for ODR linkage functions, they all have to return the same constant regardless of optimization.

joker-eph · 2016-02-29T22:37:25Z

Just a note: be careful with isWeakForLinker(), it is not the right one (as I wrote in the email thread). I think the right condition is !isStrongDefinitionForLinker().
(isWeakForLinker() does not catch available_externally)

sanjoy · 2016-02-29T23:01:29Z

Are we aware of any other IPO soundness issues with other passes that aren't
functionattrs? I think IPSCP is fine for ODR linkage functions, they all
have to return the same constant regardless of optimization.

I think both ipsccp and ipconstprop are bad. Consider a variant of
the original example:

void f(i32* %ptr) available_externally {
%v0 = load atomic i32* %ptr
%v1 = load atomic i32* %ptr
%sub = sub i32 %v0, %v1
call @print(i32 %sub)
ret i32 %sub
}

void caller() {
%t = call @f(%ptr)
print(%t)
}

function local opts ==>

void f(i32* %ptr) available_externally {
call @print(i32 0)
ret i32 0
}

void caller() {
%t = call @f(%ptr)
print(%t)
}

ipsccp or ipconstprop ==>

void f(i32* %ptr) available_externally {
call @print(i32 0)
ret i32 0
}

void caller() {
%t = call @f(%ptr)
print(i32 0)
}

Now if we link to the unoptimized @f, we can print {,
<0>}, when in the original program the two values printed would have
had to be the same.

I think (haven't tried) we can construct similar cases with undef
(no atomics) as well.

hfinkel · 2016-02-29T23:19:45Z

I vote that we stop inferring readnone/readonly/etc on functions with
isWeakForLinker linkage. I expect that in most places where we can infer
such attributes, inlining is likely to happen anyway, and be just as
profitable.

I think we need to be careful here; you could be right, but there seem to be some attributes that we do commonly prove for inline functions, even those too large to inline. We can prove 'nocapture' on the implicit this pointer, for example. readonly seems more common than readnone.

I really think we need to benchmark this against privatizing the functions to make an informed decision.

Are we aware of any other IPO soundness issues with other passes that aren't
functionattrs? I think IPSCP is fine for ODR linkage functions, they all
have to return the same constant regardless of optimization.

I suspect that GlobalsAA has the same problem as attribute propagation (in one TU, a function's access to a global could be optimized out, but the linker could pick a version of the function which still accesses the global). [And privatizing just got harder?...]

wjristow · 2016-05-18T00:57:20Z

I see a change was made at r265762 to address this issue. That's causing a
lost optimization in variadic cases, that I believe can still safely be
optimized. I've made bug 27796 for it.

wjristow · 2021-11-26T21:49:28Z

mentioned in issue llvm/llvm-bugzilla-archive#27796

llvmbot transferred this issue from llvm/llvm-bugzilla-archive Dec 10, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Doing certain kinds of IPO over comdat functions is unsound #27148

Doing certain kinds of IPO over comdat functions is unsound #27148

sanjoy commented Feb 29, 2016

dexonsmith mannequin commented Feb 29, 2016

sanjoy commented Feb 29, 2016

rnk commented Feb 29, 2016

joker-eph commented Feb 29, 2016

sanjoy commented Feb 29, 2016

hfinkel commented Feb 29, 2016

wjristow commented May 18, 2016

wjristow commented Nov 26, 2021

Doing certain kinds of IPO over comdat functions is unsound #27148

Doing certain kinds of IPO over comdat functions is unsound #27148

Comments

sanjoy commented Feb 29, 2016

Extended Description

dexonsmith mannequin commented Feb 29, 2016

sanjoy commented Feb 29, 2016

rnk commented Feb 29, 2016

joker-eph commented Feb 29, 2016

sanjoy commented Feb 29, 2016

hfinkel commented Feb 29, 2016

wjristow commented May 18, 2016

wjristow commented Nov 26, 2021