ARM integrated assembler generates incorrect nop opcode when switching from arm to thumb mode #18393

llvmbot · 2013-11-22T03:24:25Z


Bugzilla Link	18019
Resolution	FIXED
Resolved on	Sep 25, 2017 11:50
Version	trunk
OS	Windows NT
Blocks	#19300
Reporter	LLVM Bugzilla Contributor
CC	@fhahn,@rengolin

Extended Description

I am seeing a problem with the way nops are emitted in the integrated assembler for ARM. When switching from arm to thumb mode in an assembly file we still emit the arm nop opcode. Look at this small example:

$ cat align.s
.syntax unified
.code 16
foo:
add r0, r0
.align 3
add r0, r0

$ llvm-mc -triple armv7-none-linux align.s -filetype=obj -o t.o && llvm-objdump -triple thumbv7 -d t.o
t.o: file format ELF32-arm

Disassembly of section .text:
foo:
0: 00 44 add r0,
r0
2: 00 f0 20 e3 blx
#4195904
6: 00 00 movs r0,
r0
8: 00 44 add r0,
r0

This shows that we have actually emitted an arm nop (e320f000) instead of a thumb nop. Unfortunately, this encodes to a thumb branch which causes bad things to happen when compiling assembly code with align directives.

The ARMAsmBackend class is responsible for emitting these nops. It keeps track of whether it should emit arm or thumb nop. The first problem is that MCElfStreamer does not pass on the .code 16 directive to the ARMAsmBackend class (using handleAssemblerFlag). In the example above we start assembling in arm mode (because of the -triple) and so the ARMAsmBackend always thinks we are in arm mode and it emits the wrong opcode.

We actually can assemble this example correctly for darwin because the MCMachOStreamer does pass on the directives. It looks like we need to modify the MCElfStreamer to pass the assembler directives down to the ARMAsmBackend to match the behavior of the MCMachOStreamer.

Unfortunately, this change will not solve the full problem, even though the integrated assembler works correctly for MachO in this example:

$ llvm-mc -triple armv7-apple-darwin align.s -filetype=obj -o t.o && llvm-objdump -triple thumbv7 -d t.o
t.o: file format Mach-O arm

Disassembly of section __TEXT,__text:
foo:
0: 00 44 add r0,
r0
2: 00 bf nop
4: 00 bf nop
6: 00 bf nop
8: 00 44 add r0,
r0

The problem is that the nops are written after the assembly is complete when it writes the MCAlignFragment to the output. The ARMAsmBackend writes nops using the last mode it knew about. So it can write bad nop data if the nops are in a location that is a mode that does not match the bit stored in the backend. We can see the problem by simply adding a .code 32 directive to the end of the example:

$ echo ".code 32" >> align.s
$ llvm-mc -triple armv7-apple-darwin align.s -filetype=obj -o t.o && llvm-objdump -triple thumbv7 -d t.o
t.o: file format Mach-O arm

Disassembly of section __TEXT,__text:
foo:
0: 00 44 add r0,
r0
2: 00 f0 20 e3 blx
#4195904
6: 00 00 movs r0,
r0
8: 00 44 add r0,
r0

It seems that the MCAlignFragment needs to know if it is aligning in thumb mode or in arm mode. How should we solve this problem? Should we store the current mode in the fragment when assembling the file and use that mode when writing nop data?

The text was updated successfully, but these errors were encountered:

llvmbot · 2013-11-23T03:05:51Z

https://review-hexagon.quicinc.com/#change,14999

llvmbot · 2013-11-26T02:58:21Z

Partial fix committed in r195677: http://llvm.org/viewvc/llvm-project?view=revision&revision=195677

Still need a fix for code that switches modes multiple times in the same file because we just emit noops using the last mode.

llvmbot · 2014-02-21T14:09:33Z

Still need a fix for code that switches modes multiple times in the same
file because we just emit noops using the last mode.

The fixes for llvm/llvm-bugzilla-archive#18303 show how to fix this, right? We want to keep the current MCSubtargetInfo in the MCAlignFragment so that the back end can emit NOPs appropriately?

llvmbot · 2014-02-21T19:58:24Z

Still need a fix for code that switches modes multiple times in the same
file because we just emit noops using the last mode.

The fixes for llvm/llvm-bugzilla-archive#18303 show how to fix this, right? We want to keep the
current MCSubtargetInfo in the MCAlignFragment so that the back end can emit
NOPs appropriately?

Yes, that is correct. We now have a clear path on how to fix this bug, which is to keep the MCSubtargetInfo in the fragment and pass it to the MCAsmInfo when generating noop data.

I briefly attempted the fix, but I got stuck because the noops are generated not just for the MCAlignFragment, but for potentially any MCFragment because of how bundling is handled. The bundle handling caused the change to not be as simple as I'd hoped and I never got around to investigating it further for the best implementation.

fhahn · 2017-09-24T17:27:21Z

llvm-mc from the current master produces thumb nops for the example:

t.o: file format ELF32-arm-little

Disassembly of section .text:
foo:
0: 00 44 add r0, r0
2: 00 bf nop
4: 00 bf nop
6: 00 bf nop
8: 00 44 add r0, r0

rengolin · 2021-11-26T18:57:57Z

mentioned in issue #19300

rengolin mentioned this issue Feb 21, 2014

[Meta] ARM Integrated assembler support #19300

Open

llvmbot transferred this issue from llvm/llvm-bugzilla-archive Dec 9, 2021

This issue was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ARM integrated assembler generates incorrect nop opcode when switching from arm to thumb mode #18393

ARM integrated assembler generates incorrect nop opcode when switching from arm to thumb mode #18393

llvmbot commented Nov 22, 2013

llvmbot commented Nov 23, 2013

llvmbot commented Nov 26, 2013

llvmbot commented Feb 21, 2014

llvmbot commented Feb 21, 2014

fhahn commented Sep 24, 2017

rengolin commented Nov 26, 2021

ARM integrated assembler generates incorrect nop opcode when switching from arm to thumb mode #18393

ARM integrated assembler generates incorrect nop opcode when switching from arm to thumb mode #18393

Comments

llvmbot commented Nov 22, 2013

Extended Description

llvmbot commented Nov 23, 2013

llvmbot commented Nov 26, 2013

llvmbot commented Feb 21, 2014

llvmbot commented Feb 21, 2014

fhahn commented Sep 24, 2017

rengolin commented Nov 26, 2021