LLVM Bugzilla is read-only and represents the historical archive of all LLVM issues filled before November 26, 2021. Use github to submit LLVM bugs

Bug 51420 - Implicit instantiation of TypeID is incompatible with building with -BSymbolic
Summary: Implicit instantiation of TypeID is incompatible with building with -BSymbolic
Status: REOPENED
Alias: None
Product: MLIR
Classification: Unclassified
Component: Core (show other bugs)
Version: unspecified
Hardware: PC Linux
: P enhancement
Assignee: Mehdi Amini
URL:
Keywords:
Depends on:
Blocks: release-13.0.1
  Show dependency tree
 
Reported: 2021-08-09 13:22 PDT by Bernhard Rosenkraenzer
Modified: 2021-11-08 11:39 PST (History)
8 users (show)

See Also:
Fixed By Commit(s): c3aed0d395950c23d66c3ec0af4b1a2ec2534795 d5159b99105d


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Bernhard Rosenkraenzer 2021-08-09 13:22:25 PDT
Building LLVM 13.0-rc1 with MLIR enabled fails:

cd /home/bero/abf/llvm/BUILD/llvm-project-release-13.x/build/tools/mlir/include/mlir/Dialect/Linalg/IR && /home/bero/abf/llvm/BUILD/llvm-project-release-13.x/build/bin/mlir-linalg-ods-yaml-gen /home/bero/abf/llvm/BUILD/llvm-project-release-13.x/mlir/include/mlir/Dialect/Linalg/IR/LinalgNamedStructuredOps.yaml -o-ods-decl=/home/bero/abf/llvm/BUILD/llvm-project-release-13.x/build/tools/mlir/include/mlir/Dialect/Linalg/IR/LinalgNamedStructuredOps.yamlgen.td -o-impl=/home/bero/abf/llvm/BUILD/llvm-project-release-13.x/build/tools/mlir/include/mlir/Dialect/Linalg/IR/LinalgNamedStructuredOps.yamlgen.cpp.inc
YAML:19:16: error: could not parse as an affine map attribute
    shape_map: affine_map<()[s0, s1, s2] -> (s0, s1)>
               ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Comment 1 Bernhard Rosenkraenzer 2021-09-14 10:24:08 PDT
still happens in 13.0-rc3
Comment 2 Mehdi Amini 2021-09-14 17:54:11 PDT
Nicolas: can you look into this?
Comment 3 Tobias Gysi 2021-09-17 04:33:37 PDT
Hi, I tried to reproduce the problem using the following commands:

git clone https://github.com/llvm/llvm-project.git
cd llvm-project
git checkout tags/llvmorg-13.0.0-rc3 -b llvmorg-13.0.0-rc3
mkdir build
cmake -GNinja ../llvm \
  -DCMAKE_C_COMPILER=/usr/bin/clang \
  -DCMAKE_CXX_COMPILER=/usr/bin/clang++ \
  -DLLVM_ENABLE_PROJECTS=mlir \
  -DLLVM_TARGETS_TO_BUILD=X86 \
  -DMLIR_INCLUDE_INTEGRATION_TESTS=ON \
  -DLLVM_ENABLE_ASSERTIONS=ON \
  -DBUILD_SHARED_LIBS=ON \
  -DLLVM_INCLUDE_UTILS=ON \
  -DLLVM_BUILD_EXAMPLES=ON \
  -DLLVM_ENABLE_LLD=ON \
  -DLLVM_CCACHE_BUILD=ON \
  -DLLVM_OPTIMIZED_TABLEGEN=ON \
  -DCMAKE_BUILD_TYPE=Release
ninja all
ninja check-mlir

Building and running the mlir tests works fine for me. I hope I did catch the right commit? Also can you share your cmake configuration in case this is an artifact of a particular setup?

Best
Tobias
Comment 4 Bernhard Rosenkraenzer 2021-09-17 05:46:47 PDT
My failing build is part of a much larger build (all components, all targets). Will try to reproduce with fewer stuff and isolate what's triggering the breakage.
Comment 5 Bernhard Rosenkraenzer 2021-09-17 06:56:12 PDT
Confirmed that it builds for me in the minimal config as well
Comment 6 Mehdi Amini 2021-09-17 09:20:12 PDT
There is something fishy somewhere, the tool `build/bin/mlir-linalg-ods-yaml-gen`  fails to parse a YAML file that is committed in the repo. 
This tool does not have any dependency on other projects (I expect it to only depend on LLVM libSupport actually). 
Do you have a cmake invocation that reliably fail for you Bernhard?
If you can also provide more info about the environment (OS, arch).
Thanks!
Comment 7 Bernhard Rosenkraenzer 2021-09-18 03:57:41 PDT
I've narrowed it down -
Comment 8 Bernhard Rosenkraenzer 2021-09-18 03:58:59 PDT
it's not a serious problem after all, it happens if the tools are built with "-Wl,-Bsymbolic".

Simply optimizing a little less and not making the assumption that this is safe fixes the problem.

Still might be worth documenting somewhere. I don't see an obvious reason why -Wl,-Bsymbolic would break this.
Comment 9 Tobias Gysi 2021-09-20 08:22:11 PDT
Thanks for analyzing.

I can confirm that adding -Wl,-Bsymbolic seems problematic (to CMAKE_MODULE_LINKER_FLAGS and CMAKE_SHARED_LINKER_FLAGS). It still builds for me but almost all mlir tests fail. 

I lack the linker / cmake expertise to understand the root cause. Looking at the error messages it could be that type uniquing does not work anymore.

A solution could be documentation or even better adding a check in the cmake file to emit a warning if the flag is set. Let me know if you know a reliable way of doing that?

Tobias
Comment 10 Mehdi Amini 2021-09-20 09:37:57 PDT
Ah right, `-BSymbolic` will likely break our TypeID mechanism I believe: we still rely in a few places on template instantiated function having the same address for the entire program.
Comment 11 Fangrui Song 2021-09-20 11:30:21 PDT
cmake -GNinja -Hllvm -Bout/release -DCMAKE_C_COMPILER=~/Stable/bin/clang -DCMAKE_CXX_COMPILER=~/Stable/bin/clang++ -DLLVM_ENABLE_PROJECTS=mlir -DLLVM_TARGETS_TO_BUILD=X86 -DLLVM_ENABLE_ASSERTIONS=ON -DBUILD_SHARED_LIBS=ON -DLLVM_ENABLE_LLD=ON -DCMAKE_BUILD_TYPE=Release -DCMAKE_SHARED_LINKER_FLAGS=-Wl,-Bsymbolic  # check-mlir-unit fails
With -Wl,-Bsymbolic-functions check-mlir-unit passes.

This is a pointer equality for variables problem, likely related to TypeID
See https://maskray.me/blog/2021-05-16-elf-interposition-and-bsymbolic for details.

(I ruled out the copy relocations because with
-DCMAKE_CXX_FLAGS=-fno-direct-access-external-data -DCMAKE_C_FLAGS=-fno-direct-access-external-data -DCMAKE_SHARED_LINKER_FLAGS=-Wl,-Bsymbolic
there is copy relocation (readelf -r bin/mlir-translate | grep COPY) but check-mlir-unit still fails.
)

Just stick with -Wl,-Bsymbolic-functions.
Comment 12 Fangrui Song 2021-09-20 11:30:54 PDT
> there is copy relocation

typo. there is no copy relocation
Comment 13 Mehdi Amini 2021-09-20 11:55:10 PDT
> This is a pointer equality for variables problem

Right: the templated function returns the address of a local static variable.
The address of the function itself isn't important, only the one it returns (so the static variable)
Comment 14 Mehdi Amini 2021-09-25 10:07:21 PDT
We can't fix this in time for LLVM 13 release. It is also not a regression from LLVM 12 or 11 I believe.
Comment 15 Mehdi Amini 2021-09-25 10:08:04 PDT
However we could error at the CMake level if Bsymbolic is set.
Comment 16 Mehdi Amini 2021-09-25 10:29:35 PDT
Sent https://reviews.llvm.org/D110483 to do just that
Comment 17 Tom Stellard 2021-10-22 01:34:52 PDT
(In reply to Mehdi Amini from comment #16)
> Sent https://reviews.llvm.org/D110483 to do just that

Should we backport this to the release/13.x branch?
Comment 18 Mehdi Amini 2021-10-22 10:58:43 PDT
It would be "nice to have", feel free to cherry-pick this!
Comment 19 Tom Stellard 2021-11-08 09:08:43 PST
Merged: d5159b99105d
Comment 20 Mehdi Amini 2021-11-08 11:39:43 PST
The bug isn't fixed in the sense that we don't support it. The commit back ported is just error detection at CMake time.