-
Notifications
You must be signed in to change notification settings - Fork 12.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[x86] Failure to optimize vector shuffle in conversion #48425
Comments
Current Codegen: https://godbolt.org/z/oYn1df Probably an SLP issue? [Bug #35732] looks very similar |
This issue is to be fixed by http://reviews.llvm.org/D57059 after committing (in process of review now). We have conversion for f[1], f[2] and f[3] here, for three fps, so I have checked patch "Initial support for the vectorization of the non-power-of-2 vectors" fits here:
|
Commited llvm/test/Transforms/SLPVectorizer/X86/pr49081.ll to track fixing. |
For reference, that sequence was not optimizing in IR or backend, so added an instcombine transform to make it easier for SDAG: |
resolving - this was fixed by #65476 |
Extended Description
With -O3, GCC outputs this:
LLVM outputs this:
The text was updated successfully, but these errors were encountered: