Loop Vectorizer: Only vectorize on ARM if precision requirements allow the use of NEON #16649

tobiasgrosser · 2013-06-07T23:29:24Z


Bugzilla Link	16275
Resolution	FIXED
Resolved on	Apr 14, 2016 15:45
Version	trunk
OS	Linux
Attachments	Test case where we vectorize without considering the precision requirements
CC	@rengolin

Extended Description

The attached simple loop is vectorized under the triple 'thumbv7-linux-gnueabi'.

Due to NEON not providing IEEE 745 compatibility we should not introduce it's use under linux, if the user did not specifically allowed imprecise floating point computations. #16648 is about fixing the ARM target to only issue NEON instructions if the user (or the default compiler flags) set the precision requirements such that it is legal to do so.

This bug is about the vectorizer and its cost model to only introduce LLVM-IR vector instructions in case we know the ARM target can actually translate them into NEON instructions.

GCC had a similar issue and fixed it in this bug report:

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43703

tobiasgrosser · 2013-06-07T23:29:24Z

assigned to @rengolin

rengolin · 2016-01-18T12:34:50Z

Still pertinent, as we still vectorize using NEON without fast-math flags.

rengolin · 2016-02-12T16:22:32Z

For the loop vectorizer, here's the review: http://reviews.llvm.org/D17141

The SLP vectorizer seems to get it right, already. I need to look into it a bit better.

rengolin · 2016-02-23T19:41:42Z

Bug #21778 is an example of the SLP vectorizer getting it wrong.

rengolin · 2016-04-01T19:37:24Z

After discussion with James Greenhalgh, GCC seems to be doing what the original patch expected, so I just simplified it and rebased:

http://reviews.llvm.org/D18701

For now, fast-math is required (exactly like GCC), but we don't have an -fsubnormal-maths flag, so we can't expand on that further.

If there is enough interest in getting that flag (GCC seems to have ignored that for many years), we can create a new bug and work with them to find a common flag syntax.

rengolin · 2016-04-14T22:45:02Z

Fixed in r266363

llvmbot transferred this issue from llvm/llvm-bugzilla-archive Dec 9, 2021

This issue was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Loop Vectorizer: Only vectorize on ARM if precision requirements allow the use of NEON #16649

Loop Vectorizer: Only vectorize on ARM if precision requirements allow the use of NEON #16649

tobiasgrosser commented Jun 7, 2013

tobiasgrosser commented Jun 7, 2013

rengolin commented Jan 18, 2016

rengolin commented Feb 12, 2016

rengolin commented Feb 23, 2016

rengolin commented Apr 1, 2016

rengolin commented Apr 14, 2016

Navigation Menu

Loop Vectorizer: Only vectorize on ARM if precision requirements allow the use of NEON #16649

Loop Vectorizer: Only vectorize on ARM if precision requirements allow the use of NEON #16649

Comments

tobiasgrosser commented Jun 7, 2013

Extended Description

tobiasgrosser commented Jun 7, 2013

rengolin commented Jan 18, 2016

rengolin commented Feb 12, 2016

rengolin commented Feb 23, 2016

rengolin commented Apr 1, 2016

rengolin commented Apr 14, 2016