X86: stack realignment, dynamic allocas, and inline assembly cause conflict over ebx / esi #17204

rnk · 2013-08-08T00:39:23Z


Bugzilla Link	16830
Version	trunk
OS	Windows NT
Blocks	#21794 #24719 #13712
CC	@asl,@efriedma-quic,@duck-37

Extended Description

So far as I can tell, the only register you can't touch in MS inline asm is ebp, but LLVM's x86 backend requires a BasePtr register which is separate from the frame pointer in ebp. It happens to hard code the choice of esi in X86RegisterInfo.cpp:
// Use a callee-saved register as the base pointer. These registers must
// not conflict with any ABI requirements. For example, in 32-bit mode PIC
// requires GOT in the EBX register before function calls via PLT GOT pointer.
BasePtr = Is64Bit ? X86::RBX : X86::ESI;

This will blow up if the inline asm clobbers esi.

Test case straight from LLVM's own lib/Support/Host.cpp:

bool GetX86CpuIDAndInfo(unsigned value, unsigned *rEAX, unsigned *rEBX,
unsigned *rECX, unsigned *rEDX) {
__asm {
mov eax,value
cpuid
mov esi,rEAX
mov dword ptr [esi],eax
mov esi,rEBX
mov dword ptr [esi],ebx
mov esi,rECX
mov dword ptr [esi],ecx
mov esi,rEDX
mov dword ptr [esi],edx
}
return false;
}

This generates x86 asm like:

$ clang -cc1 -fms-compatibility t.cpp -o - -cxx-abi microsoft -S
...
"?GetX86CpuIDAndInfo@@YA_NIPAI000@Z":
pushl %ebp
movl %esp, %ebp
pushl %ebx
pushl %edi
pushl %esi
subl $28, %esp
movl %esp, %esi
...
#APP
.intel_syntax
mov eax,dword ptr [esi + 24]
cpuid
mov esi,dword ptr [esi + 20]
mov dword ptr [esi],eax
mov esi,dword ptr [esi + 16]
mov dword ptr [esi],ebx
mov esi,dword ptr [esi + 12]
mov dword ptr [esi],ecx
mov esi,dword ptr [esi + 8]
mov dword ptr [esi],edx
.att_syntax
#NO_APP

rnk · 2013-12-07T00:03:18Z

MSVC uses ebx instead of esi, for roughly the same reasons, although the roles are swapped. ebx points to the incoming arguments, and ebp points to the local variables. ebx is callee saved in most CCs, so it can be used across the body of the function.

MSVC emits a warning when modifying ebp, and ebx if it had to use it due to stack realignment:

int main() {
__declspec(align(8)) int a;
__asm {
push ebp
mov ebx, esp
and esp, -16
mov a, edi
mov esp, ebx
pop ebp
}
return a;
}

t.cpp(5) : warning C4731: 'main' : frame pointer register 'ebx' modified by inline assembly code
t.cpp(9) : warning C4731: 'main' : frame pointer register 'ebp' modified by inline assembly code

If you remove the need for stack realignment, only the ebp warning remains. You can modify ebp without warning, but only if you enable optimizations and the FP can be eliminated.

We could teach clang to emit a diagnostic like this.

rnk · 2014-12-31T02:33:36Z

*** Bug llvm/llvm-bugzilla-archive#22068 has been marked as a duplicate of this bug. ***

asl · 2015-01-01T16:56:42Z

I think we can safely use ebx on win32 - PIC is implemented w/o GOT there, so we can easily use ebx.

rnk · 2015-01-13T02:56:23Z

I think we can safely use ebx on win32 - PIC is implemented w/o GOT there,
so we can easily use ebx.

That's just a workaround, though. We will still conflict with inline asm using ebx on Windows (rare, due to MSVC's use of ebx) and esi on Linux (not uncommon due to string instructions).

I think we need to reformulate the problem, rather than trying to pick an arbitrary hardcoded register that works for all situations. Stack objects (allocas, spill slots) with low alignment can be accessed via ebp, and stack objects with large alignment requirements can be accessed via a virtual register, which can be spilled via ebp. Then we can let the register allocator solve the problem.

rnk · 2015-01-13T23:04:07Z

Re-titling since this is generic across OSs and asm style.

rnk · 2016-04-02T00:27:33Z

*** Bug llvm/llvm-bugzilla-archive#27183 has been marked as a duplicate of this bug. ***

rnk · 2016-09-14T23:11:28Z

*** Bug llvm/llvm-bugzilla-archive#30389 has been marked as a duplicate of this bug. ***

efriedma-quic · 2021-07-15T07:33:42Z

*** Bug llvm/llvm-bugzilla-archive#51100 has been marked as a duplicate of this bug. ***

efriedma-quic · 2021-07-15T08:00:42Z

I looked at this briefly since a duplicate was filed, bug 51100.

Apparently, gcc doesn't use a base pointer at all. It uses different techniques on different targets; on x86, it emits an exotic prologue that actually stores the frame pointer below the realignment gap. On ARM, gcc apparently just never realigns the stack at all.

rnk · 2021-07-19T18:33:10Z

To recap, there are three stack areas that the compiler may need to reference:

incoming parameters and fixed stack objects (return addr)
local variable stack objects (potentially highly aligned)
outgoing arguments

Overaligned variables make the offset from 1 to 2 dynamic, and allocas make the offset from 2 to 3 dynamic. LLVM's strategy today is to dedicate a physical register to all areas when needed.

Fixing this issue fully requires adding indirection to accesses to one of these areas. IMO, area #1, arguments, is the best option, especially given other LLVM design choices. To implement that, we should:

mark argument stack objects as mutable (prevent remat of argument loads)
assign virtual registers to each byval / inalloca argument
treat access to byval/inalloca arguments like accessing a dynamic alloca

efriedma-quic · 2021-07-19T19:18:59Z

If we're going to move the alignment gap so it isn't between the frame pointer and the stack pointer, we might as well just construct a single virtual register that pointers to the argument area, instead of using a separate virtual register for each argument.

I suspect adding an indirection to access overaligned locals is actually the easiest approach for most targets, at least ones that don't need overaligned spill slots. We already have LocalStackSlotAllocation, which does a similar transform.

rnk · 2021-07-19T21:45:45Z

Sure, it's easy to penalize access to overaligned locals, but I think they tend to be performance critical, so IMO it's better to penalize accesses to arguments. I assume the base pointer work was specifically done to address this issue.

rnk · 2021-11-26T18:37:24Z

mentioned in issue llvm/llvm-bugzilla-archive#17907

stephenhines · 2021-11-26T19:45:51Z

mentioned in issue #21794

rnk · 2021-11-26T19:58:39Z

mentioned in issue llvm/llvm-bugzilla-archive#22068

llvmbot · 2021-11-26T20:42:21Z

mentioned in issue #24719

rnk · 2021-11-26T21:38:15Z

mentioned in issue llvm/llvm-bugzilla-archive#27183

rnk · 2021-11-26T22:23:34Z

mentioned in issue llvm/llvm-bugzilla-archive#30389

efriedma-quic · 2021-11-27T04:34:45Z

mentioned in issue llvm/llvm-bugzilla-archive#51100

This patch fixes llvm#17204. If a base pointer is used in a function, and it is clobbered by an instruction (typically an inline asm), current register allocator can't handle this situation, so BP becomes garbage after those instructions. It can also occur to FP in theory. We can spill and reload FP/BP registers around those instructions. But normal spill/reload instructions also use FP/BP, so we can't spill them into normal spill slots, instead we spill them into the top of stack by using SP register.

stephenhines mentioned this issue Oct 31, 2014

[Meta] Android+Clang platform support #21794

Closed

llvmbot mentioned this issue Aug 4, 2015

[Meta] ChromeOs+Clang platform support #24719

Open

llvmbot transferred this issue from llvm/llvm-bugzilla-archive Dec 9, 2021

weiguozhi linked a pull request Feb 7, 2024 that will close this issue

Spill/restore FP/BP around instructions in which they are clobbered #81048

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

X86: stack realignment, dynamic allocas, and inline assembly cause conflict over ebx / esi #17204

X86: stack realignment, dynamic allocas, and inline assembly cause conflict over ebx / esi #17204

rnk commented Aug 8, 2013

rnk commented Dec 7, 2013

rnk commented Dec 31, 2014

asl commented Jan 1, 2015

rnk commented Jan 13, 2015

rnk commented Jan 13, 2015

rnk commented Apr 2, 2016

rnk commented Sep 14, 2016

efriedma-quic commented Jul 15, 2021

efriedma-quic commented Jul 15, 2021

rnk commented Jul 19, 2021

efriedma-quic commented Jul 19, 2021

rnk commented Jul 19, 2021

rnk commented Nov 26, 2021

stephenhines commented Nov 26, 2021

rnk commented Nov 26, 2021

llvmbot commented Nov 26, 2021

rnk commented Nov 26, 2021

rnk commented Nov 26, 2021

efriedma-quic commented Nov 27, 2021

X86: stack realignment, dynamic allocas, and inline assembly cause conflict over ebx / esi #17204

X86: stack realignment, dynamic allocas, and inline assembly cause conflict over ebx / esi #17204

Comments

rnk commented Aug 8, 2013

Extended Description

rnk commented Dec 7, 2013

rnk commented Dec 31, 2014

asl commented Jan 1, 2015

rnk commented Jan 13, 2015

rnk commented Jan 13, 2015

rnk commented Apr 2, 2016

rnk commented Sep 14, 2016

efriedma-quic commented Jul 15, 2021

efriedma-quic commented Jul 15, 2021

rnk commented Jul 19, 2021

efriedma-quic commented Jul 19, 2021

rnk commented Jul 19, 2021

rnk commented Nov 26, 2021

stephenhines commented Nov 26, 2021

rnk commented Nov 26, 2021

llvmbot commented Nov 26, 2021

rnk commented Nov 26, 2021

rnk commented Nov 26, 2021

efriedma-quic commented Nov 27, 2021