1261 – [apint] instcombine miscompilation

LLVM Bugzilla is read-only and represents the historical archive of all LLVM issues filled before November 26, 2021. Use github to submit LLVM bugs

Bug 1261 - [apint] instcombine miscompilation

Summary: [apint] instcombine miscompilation

Status:	RESOLVED FIXED

Alias:	None

Product:	libraries
Classification:	Unclassified
Component:	Scalar Optimizations (show other bugs)
Version:	trunk
Hardware:	All All

Importance:	P normal
Assignee:	Reid Spencer

URL:
Keywords:	miscompilation

Depends on:
Blocks:

Reported:	2007-03-19 13:46 PDT by Leo
Modified:	2014-07-22 09:30 PDT (History)
CC List:	1 user (show)

See Also:	20377
Fixed By Commit(s):

Attachments
the ll file (99.80 KB, text/plain) 2007-03-19 13:49 PDT, Leo	Details
The output file (99.50 KB, text/plain) 2007-03-19 13:50 PDT, Leo	Details
Add an attachment (proposed patch, testcase, etc.)

Note You need to log in before you can comment on or make changes to this bug.

Description Leo 2007-03-19 13:46:43 PDT

I found an error in instcombine pass. Please look at the b.ll file attached. At
line 956, the extension operation is sext. After instcombine, it has been
changed to zext which causes semantic errors.

Comment 1 Leo 2007-03-19 13:49:24 PDT

Created attachment 707 [details]
the ll file

The input file for opt -instcombine

Comment 2 Leo 2007-03-19 13:50:07 PDT

Created attachment 708 [details]
The output file

The output file after instcombine.

Comment 3 Nick Lewycky 2007-03-19 17:57:17 PDT

Testcase:

  define i16 @test(i31 %zzz) {
  entry:
    %A = sext i31 %zzz to i32
    %B = add i32 %A, 16384
    %C = lshr i32 %B, 15
    %D = trunc i32 %C to i16
    ret i16 %D
  }

Comment 4 Reid Spencer 2007-03-19 17:58:44 PDT

The differences are these (main ones highlighted with ***

360c360
< %newPhi117 = phi i8 [ 0, %bb586.loopexit.i ], [ %cast_tmp137, %bb95.i ]; <i8>
[#uses=1]
---
>%newPhi117 = phi i8 [ 0, %bb586.loopexit.i ], [ %cast_tmp137, %bb95.i ]; <i8>
[#uses=2]
363,364c363,364
< %cast_tmp138 = zext i8 %newPhi117 to i32      ; <i32> [#uses=42]
< %indvar39.i74.0 = trunc i32 %cast_tmp138 to i16 ; <i16> [#uses=1]
---
> %cast_tmp138 = zext i8 %newPhi117 to i32      ; <i32> [#uses=41]
> %indvar39.i74.0 = zext i8 %newPhi117 to i16   ; <i16> [#uses=1]
520c520
< %tmp470.i = sub i32 -9, %cast_tmp138          ; <i32> [#uses=1]
---
> %tmp470.i = xor i32 %cast_tmp138, -1          ; <i32> [#uses=1]
770,771c770
< %tmp639640.i = sext i16 %tmp639.i to i32      ; <i32> [#uses=1]
< %cast_tmp135 = trunc i32 %tmp639640.i to i26  ; <i26> [#uses=2]
---
> %cast_tmp135 = sext i16 %tmp639.i to i26      ; <i26> [#uses=2]
885,886c884
< %tmp910.i = sext i16 %tmp9.i to i32           ; <i32> [#uses=1]
< %cast_tmp132 = trunc i32 %tmp910.i to i28     ; <i28> [#uses=1]
---
> %cast_tmp132 = sext i16 %tmp9.i to i28        ; <i28> [#uses=1]
924,925c922
< %tmp5556.i = sext i16 %tmp55.i to i32         ; <i32> [#uses=1]
< %cast_tmp130 = trunc i32 %tmp5556.i to i30    ; <i30> [#uses=1]
---
> %cast_tmp130 = sext i16 %tmp55.i to i30       ; <i30> [#uses=1]
963,964c960
< %tmp106107.i = sext i16 %tmp106.i12 to i32    ; <i32> [#uses=1]
< %cast_tmp128 = trunc i32 %tmp106107.i to i31  ; <i31> [#uses=1]
---
> %cast_tmp128 = sext i16 %tmp106.i12 to i31    ; <i31> [#uses=1]
966,967c962,963 *******
< %cast_tmp127 = sext i31 %newbin121 to i32     ; <i32> [#uses=1]
< %tmp109.i = add i32 %cast_tmp127, 16384       ; <i32> [#uses=1]
---
> %cast_tmp1272 = zext i31 %newbin121 to i32    ; <i32> [#uses=1]
> %tmp109.i = add i32 %cast_tmp1272, 16384      ; <i32> [#uses=1]
1003,1004c999
< %tmp157158.i = sext i16 %tmp157.i to i32      ; <i32> [#uses=1]
< %cast_tmp125 = trunc i32 %tmp157158.i to i31  ; <i31> [#uses=1]
---
> %cast_tmp125 = sext i16 %tmp157.i to i31      ; <i31> [#uses=1]
1006,1007c1001,1002 ********
< %cast_tmp124 = sext i31 %newbin123 to i32     ; <i32> [#uses=1]
< %tmp160.i19 = add i32 %cast_tmp124, 16384     ; <i32> [#uses=1]
---
> %cast_tmp1241 = zext i31 %newbin123 to i32    ; <i32> [#uses=1]
> %tmp160.i19 = add i32 %cast_tmp1241, 16384    ; <i32> [#uses=1]

Comment 5 Reid Spencer 2007-03-19 18:00:21 PDT

Mine.

Comment 6 Reid Spencer 2007-03-19 18:19:04 PDT

Leo, 

Please don't file any more of these InstCombine bugs. We know that
InstructionCombining is broken for bitwidths like i31 and i26. Sheng and I are
working to resolve this. When you file a bug like this it causes me to instantly
stop working on things and check this bug out. I could have just spent the last
hour actually solving your problem by working on InstCombine. Instead I had to
investigate this bug which is a duplicate of one you have filed yourself.

This bug is merely a reflection of SimplifyDemandedBits not supporting i31.

Comment 7 Reid Spencer 2007-03-19 18:19:21 PDT


*** This bug has been marked as a duplicate of 1205 ***

Comment 8 Reid Spencer 2007-04-07 16:06:35 PDT

It seems this is a bug after all. After bug 1205 was closed (Instcombine changes
to support APInt), I reran the test case. It produced:

define i16 @test(i31 %zzz) {
entry:
        %A1 = zext i31 %zzz to i32              ; <i32> [#uses=1]
        %B = add i32 %A1, 16384         ; <i32> [#uses=1]
        %C = lshr i32 %B, 15            ; <i32> [#uses=1]
        %D = trunc i32 %C to i16                ; <i16> [#uses=1]
        ret i16 %D
}

i.e. the "sext" changed to zext without any other compensating changes.

This needs to be investigated freshly to determine if the zext is okay or if
this is truly a bug.

Comment 9 Reid Spencer 2007-04-07 16:36:58 PDT

This is clearly a bug. The sign bit in the 31st bit (extended bit) is still live
even after the lshr by 15 bits and the truncation.

Comment 10 Reid Spencer 2007-04-07 16:51:35 PDT

SimplifyDemandedBits is computing a DemandedMask that doesn't include the high
bit after the sext so it thinks this bit is not needed in the result.
Consequently it opts to replace the sext with a zext because (perhaps) that's
more efficient?  I'm investigating why the DemandedMask doesn't have the high
bit set.

Comment 11 Reid Spencer 2007-04-07 17:05:47 PDT

The Demanded Mask makes these transitions:

ret i16 %D                    0xFFFF (all bits demanded in %D)
%D = trunc i32 %C to i16      0x0000FFFF (widen to include trunc'd bits)
%C = lshr i32 %B, 15          0x7FFF8000 (get shifted bits)
%B = add i32 %A, 16384        0x7FFFFFFF (16 bit value added affects low bits)
%A = sext i31 %zzz to i32     0x7FFFFFFF 

Despite my previous comment, this looks correct to me. The high bit is not
included in the shift because the truncation occurs at 16 bits while the shift
was 15 bits. This is reflected in the DemandedMask shown above.  In this
sitaution, InstCombine opts to replace a sext with an equivalent zext because
the high bit is not preserved downstream. The code looks like:

    // If the input sign bit is known zero, or if the NewBits are not demanded
    // convert this into a zero extension.
    if (RHSKnownZero[SrcBitWidth-1] || (NewBits & ~DemandedMask) == NewBits)
    {
      // Convert to ZExt cast
      CastInst *NewCast = new ZExtInst(I->getOperand(0), VTy, I->getName(), I);
      return UpdateValueUsesWith(I, NewCast);

Presumably this is because zext is cheaper than sext on some hardware.

So, I'm claiming that the sext -> zext is correct and appropriate.

Comment 12 Chris Lattner 2007-04-08 01:50:58 PDT

This xform is definitely correct.