[x86 Disassembler] Displacement following SIB byte parsed with wrong instruction length with REX.B #19234

llvmbot · 2014-02-16T13:46:39Z


Bugzilla Link	18860
Resolution	FIXED
Resolved on	Nov 07, 2018 00:22
Version	trunk
OS	Windows NT
Blocks	#11360
Reporter	LLVM Bugzilla Contributor
CC	@topperc

Extended Description

I've been trying to get my head around the x86 instruction format lately, and have been looking at various disassemblers to understand how it fits together (as well as writing my own as an exercise).

I've noticed that the implementation in LLVM seems to disagree with other assemblers/docs in this detail.

Consider the following instruction bytes (compiled for x86-64)
43 80 04 05 78 56 34 12 00

One online disassembler (http://onlinedisassembler.com/odaweb/) gives the following:
add BYTE PTR [r8*1+0x12345678],0x0

which after much consideration I agree with.

The crux here is that the sib byte (05) corresponds to the special case where the base register is ignored and a 4 byte displacement follows the SIB byte. The docs (both Intel and AMD) are rather unclear on this, but my understanding is that this special case should apply regardless of whether the REX prefix is present; as does the special case for the MODR/M byte that indicates the SIB byte is required. My understanding is that the REX prefix shouldn't affect the length of the instruction.

However, the code in LLVM (from x86DissasemblerDecoder.c, reproduced here) takes the REX.B prefix into account before performing the test, and so does not read the displacement:

base = baseFromSIB(insn->sib) | (bFromREX(insn->rexPrefix) << 3);

switch (base) {
case 0x5:
...

I think, much like the case in the readModRM function the switch should also allow 0xd as so:

base = baseFromSIB(insn->sib) | (bFromREX(insn->rexPrefix) << 3);

switch (base) {
case 0x5:
case 0xd: /* in case REXW.b is set */
...

llvmbot · 2014-02-16T13:46:39Z

assigned to @topperc

topperc · 2014-02-17T11:11:09Z

I believe I agree with you for mod=00.

I think for mod=01 and mod=10 with rex.b we need to select r13 as the base and have a displacement.

topperc · 2014-02-17T12:04:09Z

Fixed in r201507.

llvmbot · 2014-02-17T13:09:08Z

That was quick!

I agree with your fix, so am marking the bug CLOSED.

llvmbot transferred this issue from llvm/llvm-bugzilla-archive Dec 9, 2021

This issue was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[x86 Disassembler] Displacement following SIB byte parsed with wrong instruction length with REX.B #19234

[x86 Disassembler] Displacement following SIB byte parsed with wrong instruction length with REX.B #19234

llvmbot commented Feb 16, 2014

llvmbot commented Feb 16, 2014

topperc commented Feb 17, 2014

topperc commented Feb 17, 2014

llvmbot commented Feb 17, 2014

[x86 Disassembler] Displacement following SIB byte parsed with wrong instruction length with REX.B #19234

[x86 Disassembler] Displacement following SIB byte parsed with wrong instruction length with REX.B #19234

Comments

llvmbot commented Feb 16, 2014

Extended Description

llvmbot commented Feb 16, 2014

topperc commented Feb 17, 2014

topperc commented Feb 17, 2014

llvmbot commented Feb 17, 2014