__builtin_ia32_readeflags_u64() unnecessarily forces a frame pointer #46875

llvmbot · 2020-09-14T22:03:34Z


Bugzilla Link	47531
Version	trunk
OS	Linux
Blocks	#4440
Reporter	LLVM Bugzilla Contributor
CC	@topperc,@majnemer,@RKSimon,@nickdesaulniers,@zygoloid,@rnk,@rotateright,@tstellar

Extended Description

This code:

unsigned long read_eflags(void)
{
   return __builtin_ia32_readeflags_u64();
}

compiles to:

read_eflags:
        pushq   %rbp
        movq    %rsp, %rbp
        pushfq
        popq    %rax
        popq    %rbp
        retq

which has an unnecessary frame pointer. (I'm somewhat puzzled as to why this happened. Sure, the redzone slot clobbered by the pushfq can't be used, but I don't see what this has to do with a frame pointer.)

gcc generates:

read_eflags:
        pushfq
        popq    %rax
        ret
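
The red-zone clobber mentioned above can be seen in a toy model. This is only a sketch, not compiler output: the `toy_stack` type and every helper name here are hypothetical, and the stack is just a byte array. It shows that a `pushfq`/`popq` pair overwrites the 8 bytes directly below `%rsp` (the top red-zone slot) and leaves the rest of the 128-byte red zone alone.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical toy stack: mem[] is memory, rsp is an index into it.
 * The stack grows toward index 0, as on x86-64. */
enum { STACK_BYTES = 256, RED_ZONE_BYTES = 128 };

struct toy_stack {
    uint8_t mem[STACK_BYTES];
    size_t rsp;
};

static void store64(struct toy_stack *s, size_t at, uint64_t v)
{
    memcpy(&s->mem[at], &v, sizeof v);
}

static uint64_t load64(const struct toy_stack *s, size_t at)
{
    uint64_t v;
    memcpy(&v, &s->mem[at], sizeof v);
    return v;
}

/* Models "pushfq; popq %rax": the flags value is stored at rsp - 8 and
 * read back, and rsp ends up where it started. */
static uint64_t pushf_pop(struct toy_stack *s, uint64_t flags)
{
    s->rsp -= 8;
    store64(s, s->rsp, flags);
    uint64_t v = load64(s, s->rsp);
    s->rsp += 8;
    return v;
}

/* Returns 1 if a local kept in the red zone at rsp - offset still holds
 * its value after a pushfq/popq pair, 0 if it was overwritten. */
static int red_zone_slot_survives(size_t offset)
{
    struct toy_stack s;
    const uint64_t local = UINT64_C(0x1122334455667788);

    assert(offset % 8 == 0 && offset >= 8 && offset <= RED_ZONE_BYTES);
    memset(&s, 0, sizeof s);
    s.rsp = STACK_BYTES;

    store64(&s, s.rsp - offset, local);
    (void)pushf_pop(&s, UINT64_C(0x202));
    return load64(&s, s.rsp - offset) == local;
}
```

In this model `red_zone_slot_survives(8)` returns 0 and every deeper slot, such as `red_zone_slot_survives(16)`, returns 1. Nothing in it involves a frame pointer.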

llvmbot · 2020-09-14T22:03:34Z

assigned to @nickdesaulniers

nickdesaulniers · 2020-09-18T18:54:45Z

Looks like:
llvm-project/llvm/lib/Target/X86/X86ISelLowering.cpp:25858

case llvm::Intrinsic::x86_flags_read_u32:
case llvm::Intrinsic::x86_flags_read_u64:
case llvm::Intrinsic::x86_flags_write_u32:
case llvm::Intrinsic::x86_flags_write_u64: {
  // We need a frame pointer because this will get lowered to a PUSH/POP
  // sequence.
  MachineFrameInfo &MFI = DAG.getMachineFunction().getFrameInfo();
  MFI.setHasCopyImplyingStackAdjustment(true);
  // Don't do anything here, we will expand these intrinsics out later
  // during FinalizeISel in EmitInstrWithCustomInserter.
  return Op;
}

That was added by 861a0ae. Commenting out the MFI.setHasCopyImplyingStackAdjustment(true) call fixes the test case in question, but doing so causes test failures in:

LLVM :: CodeGen/X86/win64_frame.ll
LLVM :: CodeGen/X86/x86-64-flags-intrinsics.ll
LLVM :: CodeGen/X86/x86-flags-intrinsics.ll

I don't exactly follow the comment in the source. Why does push/pop require a frame pointer? Also, the tests all seem to use a Windows target triple; is there some requirement of the Windows ABI there?

topperc · 2020-09-18T19:07:16Z

Here are some older comments from before some other EFLAGS copying code was removed in 19618fc:

-/// This function checks if any of the users of EFLAGS copies the EFLAGS. We
-/// know that the code that lowers COPY of EFLAGS has to use the stack, and if
-/// we don't adjust the stack we clobber the first frame index.
-/// See X86InstrInfo::copyPhysReg.
-static bool hasCopyImplyingStackAdjustment(const MachineFunction &MF) {
-  const MachineRegisterInfo &MRI = MF.getRegInfo();
-  return any_of(MRI.reg_instructions(X86::EFLAGS),
-                [](const MachineInstr &RI) { return RI.isCopy(); });
-}
-
-void X86TargetLowering::finalizeLowering(MachineFunction &MF) const {
-  if (hasCopyImplyingStackAdjustment(MF)) {
-    MachineFrameInfo &MFI = MF.getFrameInfo();
-    MFI.setHasCopyImplyingStackAdjustment(true);
-  }
-
-  TargetLoweringBase::finalizeLowering(MF);
-}
-

rnk · 2020-09-21T18:48:20Z

At the time, Jan 2016, David was working on Windows EH stuff, and I think the main concern was that if you have PUSHF or POPF instructions in a function, the stack unwind info will be incorrect after that instruction. This is true regardless of whether DWARF or Windows unwind info is in use. We had just finished going to great lengths to make the Windows unwind info accurate at every instruction boundary, and this may have represented a bug to David. Writing flags is always super slow, so we weren't really concerned about introducing extra overhead for it. I can see the argument that users may not care about perfect unwind info and may prefer to read flags more quickly.
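
The unwinding concern can be sketched with a toy model. This is not LLVM code. The `regs` struct and all function names are hypothetical, and it assumes the usual x86-64 rules: the canonical frame address (CFA) is `rsp + 8` in a frameless function body, or `rbp + 16` once a frame pointer is set up. It measures how far off the unwinder's CFA is while the flags are on the stack, assuming the unwind rule is not updated for the `pushfq`.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical register state as seen by an unwinder. */
struct regs {
    uint64_t rsp; /* stack pointer */
    uint64_t rbp; /* frame pointer, meaningful only after the prologue */
};

/* Frameless rule: only the return address sits above rsp. */
static uint64_t cfa_from_rsp(const struct regs *r) { return r->rsp + 8; }

/* Frame-pointer rule: saved rbp and the return address sit above rbp. */
static uint64_t cfa_from_rbp(const struct regs *r) { return r->rbp + 16; }

/* Any 8-byte push (pushq, pushfq) moves rsp down by 8. */
static void push8(struct regs *r) { r->rsp -= 8; }

/* Error in the CFA, in bytes, between pushfq and popq when the function
 * has no frame pointer and the rsp-based rule is left unchanged. */
static int64_t frameless_error_after_pushf(uint64_t entry_rsp)
{
    struct regs r = { entry_rsp, 0 };
    uint64_t true_cfa = entry_rsp + 8;

    assert(entry_rsp >= 16);
    push8(&r); /* pushfq */
    return (int64_t)(true_cfa - cfa_from_rsp(&r));
}

/* Same measurement with a frame pointer: rbp does not move on pushfq. */
static int64_t framed_error_after_pushf(uint64_t entry_rsp)
{
    struct regs r = { entry_rsp, 0 };
    uint64_t true_cfa = entry_rsp + 8;

    assert(entry_rsp >= 16);
    push8(&r);     /* pushq %rbp */
    r.rbp = r.rsp; /* movq %rsp, %rbp */
    push8(&r);     /* pushfq */
    return (int64_t)(true_cfa - cfa_from_rbp(&r));
}
```

For any valid entry stack pointer, `frameless_error_after_pushf` returns 8 and `framed_error_after_pushf` returns 0. That matches the trade-off described above: forcing a frame pointer keeps the unwind info right at every instruction, at the cost of the extra prologue and epilogue.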

nickdesaulniers · 2020-12-04T22:41:24Z

https://reviews.llvm.org/D92695

nickdesaulniers · 2020-12-04T22:57:31Z

Noting: see also: #46874

llvmbot · 2020-12-06T00:52:50Z

https://reviews.llvm.org/D92695

I don't have a Phabricator account, but maybe a comment here will still be useful.

As you all discovered, on x86_64, a kernel essentially can't use a red-zone. [0] That being said, I would hope that LLVM would be sensible enough to generate correct code regardless of command line options. For example, the eflags builtins could disable the redzone in the function in which they're used, or they could be even more clever and reserve the top redzone slot for themselves so that the rest of the redzone could be used.

But I bet that they will be much more common in kernel code than user code. Kernel code can and does manipulate various interesting EFLAGS bits that can't or mostly wouldn't be written from user mode, in particular IF and AC. The only reason I can think of for using these builtins from 64-bit user mode is to manipulate the EFLAGS.TF bit for single-stepping. The RF bit is also interesting but IIRC can't be written using POPF in any useful way, and the arithmetic flags bits don't make any sense to manipulate like this in C code. Authors of test cases that abuse the kernel or of DRM schemes might play with EFLAGS.TS, and maybe someone wants to fiddle with EFLAGS.AC, but those would be quite rare. (32-bit user mode intended to work on very old CPUs uses the EFLAGS.ID bit to determine whether the CPUID instruction exists. I doubt anyone cares about the performance of such code.)

[0] It's technically possible but it's horrible for all kinds of reasons. A user program that cooperates with the kernel to get direct-to-usermode delivery of interrupts also can't use a red-zone, but no one except exploit authors would do this. There are hardware features^Wbugs that have historically been interesting attacks against hypervisors that involve programming the CPU like this, but no one does it for legitimate reasons.
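
For reference, the IF, AC, TF, RF and ID bits named in this comment can be written as masks over a value returned by `__builtin_ia32_readeflags_u64()`. This is a minimal sketch: the bit positions are the architectural ones, but the macro and helper names are hypothetical. The helpers only compute on plain integers, so nothing here actually reads or writes the flags register.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Architectural EFLAGS/RFLAGS bit positions (macro names are hypothetical). */
#define EFLAGS_TF (UINT64_C(1) << 8)  /* trap flag: single-stepping */
#define EFLAGS_IF (UINT64_C(1) << 9)  /* interrupt enable flag */
#define EFLAGS_RF (UINT64_C(1) << 16) /* resume flag */
#define EFLAGS_AC (UINT64_C(1) << 18) /* alignment check / access control */
#define EFLAGS_ID (UINT64_C(1) << 21) /* toggleable if CPUID is supported */

/* True if the saved flags value has interrupts enabled. */
static bool irqs_enabled(uint64_t flags)
{
    return (flags & EFLAGS_IF) != 0;
}

/* Flags value that would request single-stepping if written back. */
static uint64_t with_single_step(uint64_t flags)
{
    return flags | EFLAGS_TF;
}

/* Flags value with the ID bit flipped, as in the classic CPUID probe. */
static uint64_t toggle_id(uint64_t flags)
{
    return flags ^ EFLAGS_ID;
}
```

For example, `irqs_enabled(0x202)` is true and `with_single_step(0x202)` is `0x302`. A real CPUID probe would write the result of `toggle_id` back, read the flags again, and check whether the ID bit kept the new value.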

nickdesaulniers · 2021-12-13T22:23:41Z

see also #20571.

nickdesaulniers · 2022-01-28T00:29:14Z

cc @gwelymernans

nickdesaulniers · 2022-02-10T01:20:56Z

@rnk and I both have patches for this:

nickdesaulniers · 2022-02-10T18:10:46Z

@tstellar can we backport f3481f4 into the 14.x release?

tstellar · 2022-02-11T01:14:43Z

/cherry-pick f3481f4

llvmbot · 2022-02-11T01:20:21Z

/branch llvmbot/llvm-project/issue46875

This ensures that the Windows unwinder will work at every instruction boundary, and allows other targets to read and write flags without setting up a frame pointer.

Fixes llvmGH-46875

Differential Revision: https://reviews.llvm.org/D119391

(cherry picked from commit f3481f4)

llvmbot · 2022-02-11T01:21:31Z

/pull-request llvmbot#42

This ensures that the Windows unwinder will work at every instruction boundary, and allows other targets to read and write flags without setting up a frame pointer.

Fixes llvmGH-46875

Differential Revision: https://reviews.llvm.org/D119391

(cherry picked from commit f3481f4)

tstellar · 2022-02-14T22:17:12Z

Merged: 66c59c0

llvmbot transferred this issue from llvm/llvm-bugzilla-archive Dec 10, 2021

llvmbot added the confirmed Verified by a second party label Jan 26, 2022

nickdesaulniers removed their assignment Feb 8, 2022

nickdesaulniers assigned nickdesaulniers and rnk Feb 10, 2022

rnk closed this as completed in f3481f4 Feb 10, 2022

nickdesaulniers reopened this Feb 10, 2022

nickdesaulniers added the release:backport label Feb 10, 2022

nickdesaulniers assigned tstellar and unassigned nickdesaulniers Feb 10, 2022

nathanchance added this to the LLVM 14.0.0 Release milestone Feb 10, 2022

llvmbot mentioned this issue Feb 11, 2022

PR for llvm/llvm-project#46875 llvmbot/llvm-project#42

Merged

tstellar added the release:reviewed label Feb 14, 2022

tstellar closed this as completed Feb 14, 2022

tstellar added the release:merged label Feb 21, 2022
