Bug 53947
depends on
543
bugs:
view as bug list
-
53947:
[meta-bug] vectorizer missed-optimizations
[NEW; assigned to unassigned; target: ---]
-
18437:
vectorizer failed for matrix multiplication
[NEW; assigned to unassigned; target: ---]
-
18438:
vectorizer failed for vector matrix multiplication
[NEW; assigned to unassigned; target: ---]
-
18439:
vectorizer failed for vector normalization
[NEW; assigned to unassigned; target: ---]
-
19347:
Invariant load not moved out of loop
[NEW; assigned to unassigned; target: ---]
-
21998:
(cond ? result1 : result2) is vectorized, where equivalent if-syntax isn't (store)
[NEW; assigned to unassigned; target: ---]
-
22226:
vectorization library
[NEW; assigned to unassigned; target: ---]
-
28921:
vector of a typedef applies the vector to the inner most type instead of erroring/warning out that vector does not apply
[NEW; assigned to unassigned; target: ---]
-
29268:
need to generalize realignment support in the vectorizer
[NEW; assigned to unassigned; target: ---]
-
31021:
gfortran 20% slower than ifort on CP2K computational kernel
[WAITING; assigned to unassigned; target: ---]
-
31485:
C complex numbers, amd64 SSE, missed optimization opportunity
[NEW; assigned to unassigned; target: ---]
-
31738:
Fortran dot product vectorization is restricted
[NEW; assigned to unassigned; target: ---]
-
31946:
missed vectorization due to too strict peeling-for-alignment policy
[UNCONFIRMED; assigned to unassigned; target: ---]
-
32375:
vectorized with alias check: can't determine dependence (array sections)
[NEW; assigned to unassigned; target: ---]
-
32503:
__builtin_pow[i] - vectorize for other exponents besides 2 and 0.5
[NEW; assigned to unassigned; target: ---]
-
32806:
Missing optimization to remove backward dependencies
[NEW; assigned to unassigned; target: ---]
-
33243:
Missed opportunities for vectorization due to unhandled real_type
[UNCONFIRMED; assigned to unassigned; target: ---]
-
33244:
Missed opportunities for vectorization
[NEW; assigned to unassigned; target: ---]
-
33245:
Missed opportunities for vectorization due to invariant condition
[UNCONFIRMED; assigned to unassigned; target: ---]
-
35226:
Induction with multiplication are not vectorized
[NEW; assigned to unassigned; target: ---]
-
35229:
Vectorizer doesn't support dependence created by predictive commoning or PRE
[NEW; assigned to unassigned; target: ---]
-
35272:
Loop distribution fails to distribute
[NEW; assigned to unassigned; target: ---]
-
36033:
Non-optimal vectorization of interleaved data of different types
[NEW; assigned to unassigned; target: ---]
-
36054:
bad code generation with -ftree-vectorize
[UNCONFIRMED; assigned to unassigned; target: ---]
-
36281:
vector code is not parallelized
[NEW; assigned to unassigned; target: ---]
-
36844:
Vectorizer doesn't support INT<->FP conversions with different size
[UNCONFIRMED; assigned to unassigned; target: ---]
-
37150:
basic-block vectorization misses some unrolled loops
[ASSIGNED; assigned to rguenth; target: ---]
(*)
-
40073:
Vector short/char shifts generate sub-optimal code
[NEW; assigned to unassigned; target: ---]
-
40770:
Vectorization of complex types, vectorization of sincos missing
[NEW; assigned to unassigned; target: ---]
-
41115:
Tree-vectorizer: VecCost tuning for X2: Without vectorization 30% faster
[UNCONFIRMED; assigned to unassigned; target: ---]
-
43436:
Missed vectorization: "unhandled data-ref"
[UNCONFIRMED; assigned to unassigned; target: ---]
-
43543:
Reorder the statements in the loop can vectorize it
[NEW; assigned to unassigned; target: ---]
-
44794:
pre- and post-loops should not be unrolled.
[NEW; assigned to unassigned; target: ---]
-
46006:
vectorization outside of loops starting from loads
[NEW; assigned to unassigned; target: ---]
-
46012:
256bit vectorizer failed on int->double
[NEW; assigned to unassigned; target: ---]
-
47341:
unnecessary versioning in the vectorizer, not implemented affine-affine test
[NEW; assigned to unassigned; target: ---]
-
47860:
is vectorization of "condition in nested loop" supported
[NEW; assigned to unassigned; target: ---]
-
48510:
Does not vectorize loops involving casts from floating point to unsigned integer types
[NEW; assigned to unassigned; target: ---]
-
49513:
DOM inhibits if-conversion and vectorization
[NEW; assigned to unassigned; target: ---]
-
49760:
vectorization inhibited if indices are references
[NEW; assigned to unassigned; target: ---]
-
49795:
vectorization of conditional code happens only on local variables
[NEW; assigned to unassigned; target: ---]
-
49849:
loop optimization prevents vectorization
[NEW; assigned to unassigned; target: ---]
-
49969:
not vectorized: data ref analysis failed
[NEW; assigned to unassigned; target: ---]
-
50374:
Support vectorization of min/max location pattern
[NEW; assigned to unassigned; target: ---]
-
50713:
SLP vs loop: code generated differs (SLP less efficient)
[NEW; assigned to unassigned; target: ---]
-
50789:
Gather vectorization
[ASSIGNED; assigned to jakub; target: ---]
-
51062:
SLP vectorization of dot (inner) product
[NEW; assigned to unassigned; target: ---]
-
51492:
vectorizer does not support saturated arithmetic patterns
[NEW; assigned to unassigned; target: ---]
-
51499:
-Ofast does not vectorize while -O3 does.
[NEW; assigned to unassigned; target: ---]
-
51848:
GCC is not able to vectorize when a constant value is also added to the sum of array expression inside a loop.
[NEW; assigned to unassigned; target: ---]
-
52056:
Vectorizer cost model is imprecise
[NEW; assigned to unassigned; target: ---]
-
52252:
An opportunity for x86 gcc vectorizer (gain up to 3 times)
[NEW; assigned to unassigned; target: ---]
-
53355:
Autovectorization of a simple loop could be improved.
[ASSIGNED; assigned to rguenth; target: ---]
-
53533:
[11/12/13/14 regression] vectorization causes loop unrolling test slowdown as measured by Adobe's C++Benchmark
[NEW; assigned to unassigned; target: 11.5]
-
53957:
Polyhedron 11 benchmark: MP_PROP_DESIGN twice as long as other compiler
[NEW; assigned to unassigned; target: ---]
-
54013:
Loop with control flow not vectorized
[NEW; assigned to unassigned; target: ---]
-
55213:
vectorizer ignores __restrict__
[UNCONFIRMED; assigned to unassigned; target: ---]
-
55723:
loop vectorization inefficient in presence of multiple identical conditions
[UNCONFIRMED; assigned to unassigned; target: ---]
-
55968:
Bytemark HUFFMAN 11% slower with -ftree-vectorize
[UNCONFIRMED; assigned to unassigned; target: ---]
-
56286:
vectorizer does not keep loop-closed SSA up-to-date
[ASSIGNED; assigned to rguenth; target: ---]
-
56531:
SLP load permutations cannot share the load between and inside SLP instances
[ASSIGNED; assigned to rguenth; target: ---]
-
56612:
basic-block vectorization does not replace all scalar uses
[ASSIGNED; assigned to rguenth; target: ---]
-
56624:
Vectorizer gives up on a group-access if it contains stores to the same location
[NEW; assigned to unassigned; target: ---]
-
56688:
static/saved variables prevent loop vectorization.
[NEW; assigned to unassigned; target: ---]
-
56717:
Enhance Dot-product pattern recognition to avoid mult widening.
[UNCONFIRMED; assigned to unassigned; target: ---]
-
56741:
Epilogue loop not partly vectorized
[NEW; assigned to unassigned; target: ---]
-
57162:
Ofast does not make use of avx while O3 does
[NEW; assigned to unassigned; target: ---]
-
57204:
Auto-vectorization in nested loops with non-varying indexed array access results in very poor performance (worse than no auto-vectorization)
[UNCONFIRMED; assigned to unassigned; target: ---]
-
57634:
Missed vectorization for a "fixed point multiplication" reduction
[NEW; assigned to unassigned; target: ---]
-
57705:
Non-constant step induction vars not vectorized
[UNCONFIRMED; assigned to unassigned; target: ---]
-
58280:
Missed Opportunity for Aligned Vectorized Load
[UNCONFIRMED; assigned to unassigned; target: ---]
-
58359:
__builtin_unreachable prevents vectorization
[NEW; assigned to unassigned; target: ---]
-
58497:
SLP vectorizes identical operations
[ASSIGNED; assigned to pinskia; target: ---]
-
58686:
vect_get_loop_niters() fails for some loops
[NEW; assigned to unassigned; target: ---]
-
58902:
small matrix multiplication non vectorized
[UNCONFIRMED; assigned to unassigned; target: ---]
-
58927:
Despite loop->safelen=INT_MAX / GCC ivdep: loop versioned for vectorization because of possible aliasing
[UNCONFIRMED; assigned to unassigned; target: ---]
-
59464:
Unnecessary vector register spill
[NEW; assigned to unassigned; target: ---]
-
60042:
vectorizer still does too many dependence tests for himeno:jacobi
[ASSIGNED; assigned to rguenth; target: ---]
-
60117:
simd reduction clause suppresses simd auto-vectorization when -fopenmp is set
[UNCONFIRMED; assigned to unassigned; target: ---]
-
61175:
LIM not pulling out non-aliased non-depednent load and stores (outside of loop bounds)
[NEW; assigned to unassigned; target: ---]
-
61247:
vectorization fails for unsigned is used for IV but casted to int before using as the index (and then casted for internal type)
[NEW; assigned to unassigned; target: ---]
-
61304:
Missed vectorization: control flow in loop
[NEW; assigned to unassigned; target: ---]
-
61338:
too many permutation in a vectorized "reverse loop"
[NEW; assigned to unassigned; target: ---]
-
63271:
Should commute arithmetic with vector load
[NEW; assigned to unassigned; target: ---]
-
63644:
Kahan Summation with fast-math, pattern not always recognized
[UNCONFIRMED; assigned to unassigned; target: ---]
-
63945:
Missing vectorization optimization
[UNCONFIRMED; assigned to unassigned; target: ---]
-
64716:
Missed vectorization in a hot code of SPEC2000 ammp
[NEW; assigned to unassigned; target: ---]
-
64746:
Loop with nested load/stores is not vectorized using aggressive if-conversion.
[UNCONFIRMED; assigned to unassigned; target: ---]
-
65084:
Lack of type narrowing/widening inhibits good vectorization
[NEW; assigned to unassigned; target: ---]
-
66142:
Loop is not vectorized because not sufficient support for GOMP_SIMD_LANE
[REOPENED; assigned to rguenth; target: ---]
-
66285:
failure to vectorize parallelized loop
[UNCONFIRMED; assigned to unassigned; target: ---]
-
66741:
loops not fused nor vectorized
[NEW; assigned to unassigned; target: ---]
-
67323:
Use non-unit stride loads by preference when applicable
[ASSIGNED; assigned to rguenth; target: ---]
-
67612:
Unable to vectorize DOT_PROD_EXPR (PMADDWD?)
[ASSIGNED; assigned to rguenth; target: ---]
-
67683:
Missed vectorization: shifts of an induction variable
[NEW; assigned to unassigned; target: ---]
-
68050:
SLP vectorization should negate constants to match up + vs -
[ASSIGNED; assigned to rguenth; target: ---]
-
68109:
GCC fails to vectorize popcount on x86_64
[NEW; assigned to unassigned; target: ---]
-
68365:
gfortran test case showing performance loss with vectorization
[NEW; assigned to unassigned; target: ---]
-
68494:
[ARM] Use vector multiply by lane
[NEW; assigned to unassigned; target: ---]
-
68928:
AVX loops on unaligned arrays could generate more efficient startup/cleanup code when peeling
[UNCONFIRMED; assigned to unassigned; target: ---]
-
69489:
missed vectorization for boolean loop, missed if-conversion
[NEW; assigned to unassigned; target: ---]
-
69908:
recognizing idioms that check for a buffer of all-zeros could make *much* better code
[UNCONFIRMED; assigned to unassigned; target: ---]
-
70102:
Tree re-association prevents SLP vectorization at -Ofast.
[ASSIGNED; assigned to rguenth; target: ---]
-
70193:
missed loop splitting support based on iteration space
[NEW; assigned to unassigned; target: ---]
-
70482:
Opimization opportunity to vectorize basic block for -mavx target.
[NEW; assigned to unassigned; target: ---]
-
70546:
ifconvert if(cond) ++count; to count += cond; fails because of mergephi and failed loop header copying
[NEW; assigned to unassigned; target: ---]
-
70666:
SLP vectorization opportunity to use load element + splat
[ASSIGNED; assigned to rguenth; target: ---]
-
70849:
Loop can be vectorized through gathers on AVX2 platforms.
[NEW; assigned to unassigned; target: ---]
-
71414:
2x slower than clang summing small float array, GCC should consider larger vectorization factor for "unrolling" reductions
[NEW; assigned to unassigned; target: ---]
-
71992:
Missed BB SLP vectorization in GCC
[NEW; assigned to unassigned; target: ---]
-
74881:
re-align optimization blocks vectorization on powerpc
[UNCONFIRMED; assigned to unassigned; target: ---]
-
77689:
Missing vectorization lead to huge performance loss
[NEW; assigned to unassigned; target: ---]
-
78164:
SLP vectorizer: prologue cost biased by redundancies
[ASSIGNED; assigned to rguenth; target: ---]
-
79014:
Absent vectorization with memory loads
[NEW; assigned to unassigned; target: ---]
-
79102:
gcc fails to auto-vectorise the multiplicative reduction of an array of complex floats
[NEW; assigned to unassigned; target: ---]
-
79151:
Missed BB vectorization with strided/scalar stores
[NEW; assigned to unassigned; target: ---]
-
79336:
Poor vectorisation of additive reduction of complex array, final SLP reduction step inefficient
[NEW; assigned to unassigned; target: ---]
-
79357:
Doubling a single complex float gives inefficient code
[NEW; assigned to unassigned; target: ---]
-
79726:
Missing optimisation: Type conversion not vectorised in simple additive reduction
[NEW; assigned to unassigned; target: ---]
-
79934:
Vectorization of descending-index loops can produce unnecessary permutes
[NEW; assigned to unassigned; target: ---]
-
79946:
Suboptimal code with AVX2 copying all arguments to stack
[NEW; assigned to unassigned; target: ---]
-
80015:
auto vectorization leaves scalar epilogue even if it is unreachable due to dominating n % 4 check
[NEW; assigned to unassigned; target: ---]
-
80232:
Ofast pessimizes Sparse matmult in scimark2 benchmark on avx platforms
[NEW; assigned to unassigned; target: ---]
-
80248:
sparse access to Array of structures does not vectorize using gathers
[NEW; assigned to unassigned; target: ---]
-
80430:
Vectorizer undervalues cost of alias checking for versioning
[NEW; assigned to unassigned; target: ---]
-
80570:
auto-vectorizing int->double conversion should use half-width memory operands to avoid shuffles, instead of load+extract
[NEW; assigned to unassigned; target: ---]
-
81127:
Complex division misses BB vectorisation opportunity
[NEW; assigned to unassigned; target: ---]
-
81366:
pragma omp simd reduce(max:m) not vectorizing
[ASSIGNED; assigned to rguenth; target: ---]
-
81558:
Loop not vectorized
[NEW; assigned to unassigned; target: ---]
-
81948:
vectorize exp2 using exp
[NEW; assigned to unassigned; target: ---]
-
82137:
auto-vectorizing shuffles way to much to avoid duplicate work
[NEW; assigned to unassigned; target: ---]
-
82189:
Two stage SLP needed
[NEW; assigned to unassigned; target: ---]
-
84261:
gcc fails to vectorize a function call
[UNCONFIRMED; assigned to unassigned; target: ---]
-
85050:
Vectorized function - suboptimal gather
[NEW; assigned to unassigned; target: ---]
-
85057:
GCC fails to vectorize code unless dummy loop is added
[NEW; assigned to unassigned; target: ---]
-
85186:
jump threading can rotate loops affecting loop form, and causing vectorization not to happen
[NEW; assigned to unassigned; target: ---]
-
85406:
Unnecessary blend when vectorizing short-cutted calculations
[NEW; assigned to unassigned; target: ---]
-
85466:
Performance is slow when doing 'branchless' conditional style math operations
[NEW; assigned to unassigned; target: ---]
-
85501:
missed if-conversion / phiopt trick
[UNCONFIRMED; assigned to unassigned; target: ---]
(*)
-
85919:
Incomplete transition to IFNs for scatter/gather support, drop vectorize.builtin_{gather,scatter} target hooks
[UNCONFIRMED; assigned to unassigned; target: ---]
-
85999:
416.gamess slowed down by BB vectorization
[UNCONFIRMED; assigned to unassigned; target: ---]
-
86174:
Poor vectorization/register allocation with omp simd, FMA
[NEW; assigned to unassigned; target: ---]
-
86530:
Vectorization failure for a simple loop
[ASSIGNED; assigned to tnfchris; target: ---]
-
86625:
funroll-loops doesn't unroll, producing >3x assembly and running 10x slower than manual complete unrolling
[UNCONFIRMED; assigned to unassigned; target: ---]
-
87077:
missed optimization for horizontal add for x86 SSE
[NEW; assigned to unassigned; target: ---]
-
87743:
Vectorizer doesn't support conversion of different sizes
[NEW; assigned to unassigned; target: ---]
-
88013:
can't vectorize rgb to grayscale conversion code
[NEW; assigned to unassigned; target: ---]
-
88153:
sqrt() is not vectorized
[UNCONFIRMED; assigned to unassigned; target: ---]
-
88259:
vectorization failure for a typical loop for getting max value and index
[ASSIGNED; assigned to tnfchris; target: ---]
-
88281:
SLP permutation check fails to fall back to strided loads
[ASSIGNED; assigned to rguenth; target: ---]
-
88398:
vectorization failure for a small loop to do byte comparison
[ASSIGNED; assigned to sudi; target: ---]
-
88487:
union prevents autovectorization
[ASSIGNED; assigned to rguenth; target: ---]
-
88490:
Missed autovectorization when indices are different
[NEW; assigned to unassigned; target: ---]
-
88492:
SLP optimization generates ugly code
[ASSIGNED; assigned to tnfchris; target: ---]
-
88570:
Missing or ineffective vectorization of scatter load
[NEW; assigned to unassigned; target: ---]
-
88713:
Vectorized code slow vs. flang
[REOPENED; assigned to unassigned; target: ---]
-
88873:
missing vectorization for decomposed operations on a vector type
[NEW; assigned to unassigned; target: ---]
-
89176:
Vectorizer fails to consider narrower vector width for res[i] = v1[i] < v2[i] ? v2[i] : v1[i]
[NEW; assigned to unassigned; target: ---]
-
89371:
missed vectorisation with "#pragma omp simd collapse(2)"
[UNCONFIRMED; assigned to unassigned; target: ---]
-
89582:
Suboptimal code generated for floating point struct in -O2 compare to -O1
[ASSIGNED; assigned to rguenth; target: ---]
-
89754:
Vectorizer cost model check should look at evolution of niter in outer loop(s)
[NEW; assigned to unassigned; target: ---]
-
89755:
Inefficient runtime alias check
[UNCONFIRMED; assigned to unassigned; target: ---]
-
89908:
Unnecessary rejection of dependence for outer loop vectorisation
[NEW; assigned to unassigned; target: ---]
-
89992:
Vectorizer is sensitive to guessed profile
[NEW; assigned to unassigned; target: ---]
-
90304:
-O3 vectorization gets worse when code is inlined with respect to restrict
[NEW; assigned to unassigned; target: ---]
-
90491:
simple operation with unsigned long integer and conversion to float not vectorized
[NEW; assigned to unassigned; target: ---]
-
90993:
simd integer division not optimized
[NEW; assigned to unassigned; target: ---]
-
91198:
GCC not generating AVX-512 compress/expand instructions
[NEW; assigned to unassigned; target: ---]
-
91201:
[11/12/13/14 Regression] SIMD not generated for horizontal sum of bytes in array
[NEW; assigned to unassigned; target: 11.5]
-
91246:
vectorization failure for a small loop to search array element
[NEW; assigned to unassigned; target: ---]
-
91435:
Better induction variable for vectorization
[NEW; assigned to unassigned; target: ---]
-
91594:
Missing horizontal addition auto-vectorization
[NEW; assigned to unassigned; target: ---]
-
91732:
Adding omp simd pragma prevents vectorization
[NEW; assigned to unassigned; target: ---]
-
91735:
[11/12/13/14 Regression] Runtime regression for SPEC2000 177.mesa on Haswell around the end of August 2018
[NEW; assigned to unassigned; target: 11.5]
-
91811:
256-bit vector store isn't used
[NEW; assigned to unassigned; target: ---]
-
92080:
Missed CSE of _mm512_set1_epi8(c) with _mm256_set1_epi8(c)
[NEW; assigned to unassigned; target: ---]
-
92130:
Missed vectorization for iteration dependent loads and simple multiplicative accumulators
[NEW; assigned to unassigned; target: ---]
-
92175:
x86 backend claims V4SI multiplication support, preventing more optimal pattern
[NEW; assigned to unassigned; target: ---]
-
92243:
Missing "auto-vectorization" of char array reversal using x86 scalar bswap when SIMD pshufb isn't available
[NEW; assigned to unassigned; target: ---]
-
92246:
Byte or short array reverse loop auto-vectorized with 3-uop vpermt2w instead of 1 or 2-uop vpermw (AVX512)
[NEW; assigned to unassigned; target: ---]
-
92335:
[11/12/13 Regression] sinking of loads happen too early which causes vectorization not to be done
[ASSIGNED; assigned to rguenth; target: 11.5]
-
92492:
AVX512: Missed vectorization opportunity
[NEW; assigned to unassigned; target: ---]
-
92655:
Suboptimal vectorization of variable shift
[NEW; assigned to unassigned; target: ---]
-
92772:
wrong code vectorizing masked max
[UNCONFIRMED; assigned to unassigned; target: ---]
-
92834:
misssed SLP vectorization in LightPixel
[NEW; assigned to unassigned; target: ---]
-
93440:
scalar unrolled loop makes vectorized code unreachable
[UNCONFIRMED; assigned to unassigned; target: ---]
-
93616:
Missed chance to use alias checks to vectorise invariant indirection
[NEW; assigned to unassigned; target: ---]
-
94792:
Missed SLP optimization in pr65930-2.c variation
[NEW; assigned to unassigned; target: ---]
-
95960:
GCC should re-vectorize vector code with larger VF
[UNCONFIRMED; assigned to unassigned; target: ---]
-
96053:
Miss optimization:Finding SLP sequences from reductions sometimes is better than finding from reduction chains
[NEW; assigned to unassigned; target: ---]
-
96135:
[11/12/13/14 regression] bswap not detected by bswap pass, unexpected results between optimization levels
[NEW; assigned to unassigned; target: 11.5]
-
96275:
Vectorizer doesn't take into account bitmask condition from branch conditions.
[NEW; assigned to unassigned; target: ---]
-
96373:
[11 Regression] SVE miscompilation on vectorized division loop, leading to FP exception
[ASSIGNED; assigned to rsandifo; target: 11.5]
-
96461:
[SVE] Use the HISTCNT instruction for simple histogram loops
[UNCONFIRMED; assigned to unassigned; target: ---]
-
96481:
SLP fail to vectorize VEC_COND_EXPR pattern.
[ASSIGNED; assigned to rguenth; target: ---]
-
96654:
Failure to optimize vectorized conversion to `int` with AVX
[NEW; assigned to unassigned; target: ---]
-
96738:
GCC generates worse assembly than clang and It fails to vectorized code compared to clang
[NEW; assigned to unassigned; target: ---]
-
96888:
Missing vectorization opportunity depending on integer type
[NEW; assigned to unassigned; target: ---]
-
97064:
BB vectorization behaves sub-optimal
[NEW; assigned to unassigned; target: ---]
-
97343:
AVX2 vectorizer generates extremely strange and slow code for AoSoA complex dot product
[NEW; assigned to unassigned; target: ---]
-
97707:
avx512 math function invoked even if -mprefer-vector-width=256 specified
[NEW; assigned to unassigned; target: ---]
-
97984:
[11 Regression] Worse code for -O3 than -O2 on aarch64 vector multiply-add
[NEW; assigned to unassigned; target: 11.5]
-
98138:
BB vect fail to SLP one case
[NEW; assigned to unassigned; target: ---]
-
98176:
Loop invariant memory could not be hoisted when nonpure_call in loop body
[NEW; assigned to unassigned; target: ---]
-
98339:
GCC could not vectorize loop with conditional reduced add and store
[NEW; assigned to unassigned; target: ---]
-
98542:
Redundant loads in vectorised loop
[UNCONFIRMED; assigned to unassigned; target: ---]
-
98563:
[11/12/13/14 Regression] vectorization fails while it worked on gcc 9 and earlier since since r10-2271-gd81ab49d0586fca0
[NEW; assigned to unassigned; target: 11.5]
-
98774:
gcc -O3 does not vectorize some operations
[NEW; assigned to unassigned; target: ---]
-
98792:
Fail to use SHRN instructions for narrowing shift on aarch64
[NEW; assigned to unassigned; target: ---]
-
98813:
loop is sub-optimized if index is unsigned int with offset
[NEW; assigned to unassigned; target: ---]
-
98837:
SLP discovery does not consider all lane permutes
[UNCONFIRMED; assigned to unassigned; target: ---]
-
98908:
[11 Regression] arithmetic involving struct members into operating on the entire struct fails at -O3
[NEW; assigned to unassigned; target: 11.5]
-
99100:
Inconsistent vector length used in autovectorizer for AVX-512
[ASSIGNED; assigned to jakub; target: ---]
-
99395:
s116 benchmark of TSVC is vectorized by clang and not by gcc
[NEW; assigned to unassigned; target: ---]
-
99397:
s152 benchmark of TSVC is vectorized by clang and not by gcc
[NEW; assigned to unassigned; target: ---]
-
99408:
s3251 benchmark of TSVC vectorized by clang runs about 7 times faster compared to gcc
[UNCONFIRMED; assigned to unassigned; target: ---]
-
99411:
s311, s312, s31111, s31111, s3110, vsumr benchmark of TSVC is vectorized by clang better than by gcc
[UNCONFIRMED; assigned to unassigned; target: ---]
-
99414:
s235, s2233, s275, s2275 and s233 benchmarks of TSVC is vectorized better by icc than gcc (loop interchange)
[NEW; assigned to unassigned; target: ---]
-
99415:
s115 benchmark of TSVC is vectorized by icc and not by gcc
[NEW; assigned to unassigned; target: ---]
-
99416:
s211 benchmark of TSVC is vectorized by icc and not by gcc
[ASSIGNED; assigned to rguenth; target: ---]
-
99646:
s111 benchmark of TSVC preffers -mprefer-avx128 on zen3
[NEW; assigned to unassigned; target: ---]
-
100171:
autovectorizer
[UNCONFIRMED; assigned to unassigned; target: ---]
-
100756:
[12 Regression] vect: Superfluous epilog created on s390x
[ASSIGNED; assigned to rguenth; target: 12.4]
-
101097:
Vectorizer is too eager to use vec_unpack
[NEW; assigned to unassigned; target: ---]
-
101340:
SLP discovery via vect_slp_linearize_chain is imperfect
[UNCONFIRMED; assigned to unassigned; target: ---]
-
101390:
Expand vector mod as vector div + multiply-subtract
[NEW; assigned to unassigned; target: ---]
-
101434:
vector-by-vector left shift expansion for char/short is not optimal
[NEW; assigned to unassigned; target: ---]
-
101450:
GCC doesn't vectorize loop due to evolution of base is not affine.
[NEW; assigned to unassigned; target: ---]
-
101639:
vectorization with bool reduction
[NEW; assigned to unassigned; target: ---]
-
101705:
Missed optimization opportunity when copying lots of bitfields
[ASSIGNED; assigned to pinskia; target: ---]
-
101802:
Vectorization can end up creating vector bool CTORs
[UNCONFIRMED; assigned to unassigned; target: ---]
-
101842:
Vectorizer doesn't vectorize when loop bound depends on two independent variables that are unknown
[ASSIGNED; assigned to rguenth; target: ---]
-
101895:
[11 Regression] SLP Vectorizer change pushes VEC_PERM_EXPR into bad location spoiling further optimization opportunities
[NEW; assigned to unassigned; target: 11.5]
-
101909:
[12/13/14 Regression] 73% regression on tfft benchmark for -O2 -ftree-loop-vectorize compared to -O2 on zen hardware
[UNCONFIRMED; assigned to unassigned; target: 12.4]
-
101927:
There is no vector mode popcount for aarch64
[UNCONFIRMED; assigned to unassigned; target: ---]
-
101932:
Vectorizer failed due to argument types differ for ldexp
[NEW; assigned to unassigned; target: ---]
-
101944:
suboptimal SLP for reduced case from namd_r
[UNCONFIRMED; assigned to unassigned; target: ---]
-
101956:
Miss vectorization from v4hi to v4df
[NEW; assigned to unassigned; target: ---]
-
102040:
vectorizer costing is off because of if-conversion
[UNCONFIRMED; assigned to unassigned; target: ---]
-
102160:
Too many runtime alias checks when vectorizing
[UNCONFIRMED; assigned to unassigned; target: ---]
-
102188:
Over widening detection doesn't work when no range information when doing bitwise operations (|, ^, and &)
[NEW; assigned to unassigned; target: ---]
-
102404:
Loop vectorized with 32 byte vectors actually uses 16 byte vectors
[NEW; assigned to unassigned; target: ---]
-
102435:
gcc 9: aarch64 -ftree-loop-vectorize results in wrong code
[UNCONFIRMED; assigned to unassigned; target: ---]
-
102483:
Reduction of 4 chars can be improved
[NEW; assigned to unassigned; target: ---]
-
102564:
Missed loop vectorization with reduction and ptr load/store inside loop
[NEW; assigned to unassigned; target: ---]
-
102591:
Failure to optimize search for value in vector-sized area to use SIMD
[NEW; assigned to unassigned; target: ---]
-
103495:
vectorizer bool pattern recog does not handle cycles correctly
[UNCONFIRMED; assigned to unassigned; target: ---]
-
103554:
-mavx generates worse code on scalar code
[NEW; assigned to unassigned; target: ---]
-
103592:
fatigue2 benchmarks on zen runs 43% faster with -fno-tree-vectorize -fno-tree-slp-vectorize
[NEW; assigned to unassigned; target: ---]
-
103999:
Vectorizer failed to reduce sum with conversion.
[NEW; assigned to unassigned; target: ---]
-
104116:
Optimize {FLOOR,CEIL,ROUND}_{DIV,MOD}_EXPR in tree-vect-patterns.cc
[NEW; assigned to unassigned; target: ---]
-
104265:
Missed vectorization in 526.blender_r
[NEW; assigned to unassigned; target: ---]
-
104368:
[12/13/14 Regression] Failure to vectorise conditional grouped accesses after PR102659
[NEW; assigned to unassigned; target: 12.4]
-
104408:
SLP discovery fails due to -Ofast rewriting
[ASSIGNED; assigned to tnfchris; target: ---]
-
104579:
vectorizer failed to reduce max & index search together
[NEW; assigned to unassigned; target: ---]
-
104917:
No runtime alias test required for dependent reductions
[UNCONFIRMED; assigned to unassigned; target: ---]
-
105062:
Suboptimal vectorization for reduction with several elements
[NEW; assigned to unassigned; target: ---]
-
105490:
unvectorized loop due to bool condition loaded from memory and different size data
[NEW; assigned to unassigned; target: ---]
-
105558:
simple 8-bit integer calculation fails with -O3 / march=core-avx2 on some gfortran 8/9/10 versions
[NEW; assigned to unassigned; target: ---]
-
105617:
[12/13/14 Regression] Slp is maybe too aggressive in some/many cases
[NEW; assigned to unassigned; target: 12.4]
-
105793:
Missed vectorisation with conditional-select inside loop
[NEW; assigned to unassigned; target: ---]
-
106038:
x86_64 vectorization of ALU ops using xmm registers prematurely
[NEW; assigned to unassigned; target: ---]
-
106343:
SLP does not support no-op case
[NEW; assigned to unassigned; target: ---]
-
106902:
[11/12/13/14 Regression] Program compiled with -O3 -mfma produces different result
[NEW; assigned to unassigned; target: 11.5]
-
106910:
roundss not vectorized
[NEW; assigned to unassigned; target: ---]
-
107259:
Vectorize VxHF _Float16 modes using partial vectors without -mavx512vl
[UNCONFIRMED; assigned to unassigned; target: ---]
-
107283:
conversions u/int64_t to float64/32_t are not vectorized
[NEW; assigned to unassigned; target: ---]
-
107690:
[12/13/14 Regression] vectorization fails for std::ranges::transform due to IR changes since r12-3903-g0288527f47cec669
[NEW; assigned to unassigned; target: 12.4]
-
107891:
Redudant "double" permutation from SLP vectorization (PR97832)
[NEW; assigned to unassigned; target: ---]
-
108322:
Using __restrict parameter with -ftree-vectorize (default with -O2) results in massive code bloat
[NEW; assigned to unassigned; target: ---]
-
108346:
gather/scatter loops optimized too often for znver4 (and other zens)
[UNCONFIRMED; assigned to unassigned; target: ---]
-
108410:
x264 averaging loop not optimized well for avx512
[ASSIGNED; assigned to rguenth; target: ---]
-
108785:
nested cycle vectorization too restrictive
[UNCONFIRMED; assigned to unassigned; target: ---]
-
108804:
missed vectorization in presence of conversion from uint64_t to float
[NEW; assigned to unassigned; target: ---]
-
108839:
Option for rerolling loops
[NEW; assigned to unassigned; target: ---]
-
108902:
Conversions std::float16_t<->float with FP16C are not vectorized
[UNCONFIRMED; assigned to unassigned; target: ---]
-
108916:
Miss vectorization for masked gather w/o restrict qualifier.
[NEW; assigned to unassigned; target: ---]
-
109011:
missed optimization in presence of __builtin_ctz
[UNCONFIRMED; assigned to jakub; target: ---]
-
109029:
__builtin_signbit for 64bit fp does not vectorize
[NEW; assigned to unassigned; target: ---]
-
109088:
GCC does not always vectorize conditional reduction
[NEW; assigned to unassigned; target: ---]
-
109271:
Unnecessary vectorization alias versioning check
[UNCONFIRMED; assigned to unassigned; target: ---]
-
109499:
Unnecessary zeroing in SVE loops
[UNCONFIRMED; assigned to unassigned; target: ---]
-
109603:
Vectorization failure for a small loop containing a simple branch
[UNCONFIRMED; assigned to unassigned; target: ---]
-
109605:
-fno-tree-vectorize does not disable vectorizer
[NEW; assigned to unassigned; target: ---]
-
109764:
V2SI multiply high is not vectorized on x86_64
[NEW; assigned to unassigned; target: ---]
-
109892:
SLP failure with explicit fma
[NEW; assigned to unassigned; target: ---]
-
110023:
10% performance drop on important benchmark after r247544.
[UNCONFIRMED; assigned to unassigned; target: ---]
-
110062:
missed vectorization in graphicsmagick
[ASSIGNED; assigned to rguenth; target: ---]
-
110223:
Missed optimization vectorizing booleans comparisons
[NEW; assigned to unassigned; target: ---]
-
110449:
Vect: use a small step to calculate the loop induction if the loop is unrolled during loop vectorization
[UNCONFIRMED; assigned to unassigned; target: ---]
-
110485:
vectorizing simd clone calls without loop masking applied
[UNCONFIRMED; assigned to avieira; target: ---]
-
110692:
epilogues for loop which can be also vectorized with half size can be improved.
[UNCONFIRMED; assigned to unassigned; target: ---]
-
110760:
slp introduces new overflow arithmetic
[ASSIGNED; assigned to rguenth; target: ---]
-
110838:
[14 Regression] wrong code on x365-3.5, -O3, sign extraction
[ASSIGNED; assigned to rguenth; target: 14.0]
-
110979:
Miss-optimization for O2 fully masked loop on floating point reduction.
[ASSIGNED; assigned to rguenth; target: ---]
-
111114:
RISC-V: Failed combine extend + vcond_mask when modify by vect_recog_over_widening_pattern
[NEW; assigned to unassigned; target: ---]
-
111131:
SLP of gathers incomplete
[UNCONFIRMED; assigned to unassigned; target: ---]
-
111133:
SLP of scatters not implemented
[UNCONFIRMED; assigned to unassigned; target: ---]
-
111381:
RISC-V: missed autovec MULH for signed * unsigned
[UNCONFIRMED; assigned to unassigned; target: ---]
-
111494:
Signed overflow introduced by vectorizer
[ASSIGNED; assigned to rguenth; target: ---]
-
18557:
Inefficient code generated by -ftree-vectorize on Alpha
[RESOLVED DUPLICATE; assigned to unassigned; target: ---]
-
21465:
autovectorisation conflicts with -ffast-math
[RESOLVED FIXED; assigned to unassigned; target: 5.0]
-
21997:
vectorization inhibited by gcc's choice to view an integer as a boolean
[RESOLVED FIXED; assigned to unassigned; target: 5.0]
-
22184:
tree vectorizer depends on context
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
25621:
Missed optimization when unrolling the loop (splitting up the sum) (only with -ffast-math)
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
26128:
Trivial operation not vectorized on char/short
[RESOLVED FIXED; assigned to unassigned; target: 4.5.0]
-
29533:
Ada fails to vectorize even trivial loops
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
31079:
20% difference between ifort/gfortran, missed vectorization
[RESOLVED FIXED; assigned to unassigned; target: 4.8.0]
-
33711:
Missed optimization: reduction by subtraction (vectorizer)
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
34195:
missed optimization with store motion (vectorizer)
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
34378:
[autovectorize]: missed optimization
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
37021:
Fortran Complex reduction / multiplication not vectorized
[RESOLVED FIXED; assigned to rguenth; target: 6.0]
-
38011:
vectorizer ignores alignment, useless versioning
[RESOLVED INVALID; assigned to unassigned; target: ---]
-
39300:
vectorizer confused by predictive commoning and PRE
[RESOLVED FIXED; assigned to unassigned; target: 4.5.0]
-
39821:
120% slowdown with vectorizer
[RESOLVED FIXED; assigned to unassigned; target: 12.0]
-
41464:
vector loads are unnecessarily split into high and low loads
[RESOLVED FIXED; assigned to unassigned; target: 4.9.0]
-
41499:
Scalar evolution analysis prevents vectorization
[RESOLVED WONTFIX; assigned to unassigned; target: ---]
-
42652:
vectorizer created unaligned vector insns
[RESOLVED FIXED; assigned to unassigned; target: 4.5.4]
-
43422:
reversed loop is not vectorized
[RESOLVED FIXED; assigned to unassigned; target: 5.0]
-
43423:
gcc should vectorize this loop through if-conversion
[RESOLVED FIXED; assigned to unassigned; target: 7.0]
-
43425:
gcc should vectorize this loop by substitution
[RESOLVED DUPLICATE; assigned to unassigned; target: ---]
-
43428:
vectorizer should invoke loop distribution to partially vectorize this loop
[RESOLVED DUPLICATE; assigned to unassigned; target: ---]
-
43434:
Missed vectorization: "not vectorized: data ref analysis": pointer incremented by a parameter
[RESOLVED FIXED; assigned to rguenth; target: ---]
(*)
-
44976:
reductions with short variables do not get vectorized
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
46008:
Floating point condexpr not vectorized
[RESOLVED FIXED; assigned to unassigned; target: 5.0]
-
46011:
256bit vectorizer failed on double->int
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
46391:
false dependencies are computed after vectorization (#2)
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
47298:
-O3 destroys beautifully vectorized code obtained at -O2
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
49730:
loop not vectorized if inside another loop
[RESOLVED FIXED; assigned to unassigned; target: 8.0]
-
49773:
use of class data members prevent vectorization
[RESOLVED FIXED; assigned to unassigned; target: 6.0]
-
49869:
Excessive loop versioning done by vectorization + predictive commoning
[RESOLVED WORKSFORME; assigned to unassigned; target: ---]
-
49955:
Fails to do partial basic-block SLP
[RESOLVED FIXED; assigned to rguenth; target: 14.0]
-
51179:
poor vectorization on interlagos.
[RESOLVED FIXED; assigned to unassigned; target: 4.7.2]
-
52865:
GCC can't vectorize fortran loop but able to vectorize similar c-loop
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
54400:
recognize vector reductions
[RESOLVED FIXED; assigned to rguenth; target: 12.0]
-
54803:
Rotates are not vectorized
[RESOLVED FIXED; assigned to unassigned; target: 4.9.0]
-
54939:
Very poor vectorization of loops with complex arithmetic
[RESOLVED FIXED; assigned to rguenth; target: ---]
(*)
-
55266:
vector expansion: 24 movs for 4 adds
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
56213:
strided load vectorization is unnecessarily restricted
[RESOLVED FIXED; assigned to rguenth; target: 4.9.0]
-
56541:
vectorizaton fails in conditional assignment of a constant
[RESOLVED FIXED; assigned to amker; target: 7.0]
-
56595:
Tree-ssa-pre can create loop carried dependencies which prevent loop vectorization.
[RESOLVED DUPLICATE; assigned to unassigned; target: ---]
-
56625:
After if-conversion vectorizer doesn't recognize similar loads
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
56764:
vect_prune_runtime_alias_test_list not smart enough
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
56902:
Fails to SLP with mismatched +/- and negatable constants
[RESOLVED FIXED; assigned to unassigned; target: ---]
(*)
-
57169:
fully unrolled matrix multiplication not vectorized
[RESOLVED FIXED; assigned to unassigned; target: 10.0]
-
57206:
Auto-vectorization fails when array index is an unsigned int expression
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
57223:
Auto-vectorization fails for nested multiple loops depending on type of array
[RESOLVED DUPLICATE; assigned to unassigned; target: ---]
-
57328:
Missed optimization: Unable to vectorize Fortran min and max intrinsics
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
57512:
Vectorizer: cannot handle accumulation loop of signed char type
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
57558:
Loop not vectorized if iteration count could be infinite
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
57600:
Turn 2 comparisons into 1 with the min
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
57858:
AVX2: ymm used for div, not for sqrt
[RESOLVED FIXED; assigned to unassigned; target: 8.0]
-
58508:
[Missed-Optimization] Redundant vector load of "actual" loop invariant in loop body.
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
58821:
conditional reduction does not vectorize
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
59544:
Vectorizing store with negative step
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
60012:
Vectorizer generates unnecessary loop versioning for alias
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
60510:
SLP blocks loop vectorization (with reduction)
[RESOLVED FIXED; assigned to rguenth; target: 8.0]
-
61171:
vectorization fails for a reduction in presence of subtraction
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
61194:
[4.9/5/6/7 Regression] vectorization failed with "bit-precision arithmetic not supported" even if conversion to int is requested
[RESOLVED FIXED; assigned to rguenth; target: 7.0]
-
61403:
An opportunity for x86 gcc vectorizer (~40% gain)
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
62012:
Loop is not vectorized after function inlining (SCEV)
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
62283:
basic-block vectorization fails
[RESOLVED FIXED; assigned to rguenth; target: 6.0]
-
63168:
loop header copying fails - not vectorized: latch block not empty
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
63844:
[4.8 Regression] open mp parallelization prevents vectorization
[RESOLVED FIXED; assigned to rguenth; target: 4.8.5]
-
64031:
(un-)conditional execution state is not preserved by PRE/sink
[RESOLVED FIXED; assigned to unassigned; target: 14.0]
-
64410:
gcc 25% slower than clang 3.5 for adding complex numbers
[RESOLVED FIXED; assigned to rguenth; target: 5.0]
-
64745:
Generic vectorization missed opportunities
[RESOLVED FIXED; assigned to rguenth; target: 10.0]
-
64909:
[4.8 Regression] Missed vectorization with bdver1
[RESOLVED FIXED; assigned to rguenth; target: 4.9.3]
-
65206:
vectorized version of loop is removed, dependence analysis fails for *&a[i] vs a[j]
[RESOLVED FIXED; assigned to rguenth; target: 12.0]
(*)
-
65335:
Potential optimization issue with 'tree-loop-vectorize'
[RESOLVED FIXED; assigned to unassigned; target: 7.0]
-
65930:
Reduction with sign-change not handled
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
65946:
Simple loop with if-statement not vectorized
[RESOLVED FIXED; assigned to alan.lawrence.arm; target: ---]
-
65947:
Vectorizer misses conditional assignment of constant
[RESOLVED FIXED; assigned to alan.hayward; target: ---]
-
65951:
[AArch64] Will not vectorize 64bit integer multiplication
[RESOLVED FIXED; assigned to unassigned; target: 7.0]
-
65952:
[AArch64] Will not vectorize storing induction of pointer addresses for LP64
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
65962:
Missed vectorization of strided stores
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
65965:
Straight-line memcpy/memset not vectorized when equivalent loop is
[RESOLVED FIXED; assigned to rguenth; target: 6.0]
-
66002:
paq8p benchmark 50% slower than clang on sandybridge
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
66036:
strided group loads are not vectorized
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
66051:
can't vectorize reductions inside an SLP group
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
66558:
Missed vectorization of loop with control flow
[RESOLVED FIXED; assigned to alan.hayward; target: ---]
-
67326:
[6 Regression] -ftree-loop-if-convert-stores does not vectorize conditional assignment (anymore)
[RESOLVED FIXED; assigned to vekumar; target: 7.0]
-
67681:
Missed vectorization: induction variable used after loop
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
67800:
[6 Regression] Missed vectorization opportunity on x86 (DOT_PROD_EXPR in non-reduction)
[CLOSED FIXED; assigned to rguenth; target: 6.0]
-
68483:
[5/6 Regression] gcc 5.2: suboptimal code compared to 4.9
[RESOLVED FIXED; assigned to jakub; target: 5.3]
-
68558:
Fails to SLP loop
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
68559:
Excessive peeling for gaps
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
68694:
SLP loads should be permuted until supported if possible
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
69848:
poor vectorization of a loop from SPEC2006 464.h264ref
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
71271:
SLP loop vectorized twice
[RESOLVED FIXED; assigned to unassigned; target: 9.0]
-
71488:
[6 Regression] Wrong code for vector comparisons with ivybridge and westmere targets
[RESOLVED FIXED; assigned to unassigned; target: 7.0]
-
78007:
Important loop from 482.sphinx3 is not vectorized
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
78151:
Fail to vectorize *min_element
[RESOLVED FIXED; assigned to unassigned; target: 8.0]
-
78205:
BB vectorization confused by too large load groups
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
79262:
[8/9/10 Regression] load gap with store gap causing performance regression in 462.libquantum
[RESOLVED FIXED; assigned to unassigned; target: 8.4]
-
80198:
[10/11/12/13 Regression] does not vectorize generic inplace integer operation
[RESOLVED FIXED; assigned to unassigned; target: 10.0]
-
80561:
Missed optimization: std::array data is aligned if array is aligned
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
80634:
strangely missed vectorization optimizations
[RESOLVED FIXED; assigned to unassigned; target: 8.0]
-
80846:
auto-vectorized AVX2 horizontal sum should narrow to 128b right away, to be more efficient for Ryzen and Intel
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
80928:
SLP vectorization does not handle induction in outer loop vectorization
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
82255:
Vectorizer cost model overcounts cost of some vectorized loads
[RESOLVED FIXED; assigned to linkw; target: ---]
-
82426:
Missed tree-slp-vectorization on -O2 and -O3
[RESOLVED FIXED; assigned to rguenth; target: 12.0]
-
83202:
Try joining operations on consecutive array elements during tree vectorization
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
83403:
Missed register promotion opportunities in loop
[RESOLVED FIXED; assigned to rguenth; target: 8.5]
-
84101:
[8 Regression] -O3 and -ftree-vectorize trying too hard for function returning trivial pair-of-uint64_t-structure
[RESOLVED FIXED; assigned to rguenth; target: 9.0]
(*)
-
84361:
Fails to use vfmaddsub* for complex multiplication
[RESOLVED DUPLICATE; assigned to unassigned; target: ---]
(*)
-
84362:
[8 Regression] Auto-vectorization regression when accessing member variable through getter/accessor
[RESOLVED FIXED; assigned to rguenth; target: 9.0]
-
84512:
[8 Regression] Missed optimization: should be precalculated in compile-time
[RESOLVED FIXED; assigned to rguenth; target: 8.0]
-
85283:
Generates 20 lines of assembly while only one assembly instruction is enough.
[RESOLVED FIXED; assigned to unassigned; target: 11.0]
-
85491:
[8 Regression] nbench LU Decomposition test 15% slower than GCC 7, 30% slower than peak
[RESOLVED FIXED; assigned to rguenth; target: 8.0]
-
85694:
Generation of vectorized AVG (Average) instruction
[RESOLVED FIXED; assigned to unassigned; target: 9.0]
-
85918:
Conversions to/from [unsigned] long long are not vectorized for AVX512DQ target
[RESOLVED FIXED; assigned to jakub; target: ---]
-
85920:
Incomplete transition to IFNs for scatter/gather support, drop vectorize.builtin_{gather,scatter} target hooks
[RESOLVED DUPLICATE; assigned to unassigned; target: ---]
-
86504:
vectorization failure for a nest loop
[RESOLVED FIXED; assigned to tnfchris; target: 10.0]
-
86557:
missed vectorization with std::vector compared to icc 18
[RESOLVED FIXED; assigned to unassigned; target: 12.0]
-
86877:
ICE in vectorizable_load, at tree-vect-stmts.c:8038
[RESOLVED FIXED; assigned to rsandifo; target: 8.5]
-
87062:
mis-optimized code with -O3 and std::pair
[RESOLVED DUPLICATE; assigned to unassigned; target: ---]
-
87105:
Autovectorization [X86, SSE2, AVX2, DoublePrecision]
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
87621:
outer loop auto-vectorization fails for exponentiation code
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
87914:
gcc fails to vectorize bitreverse code
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
88459:
vectorization failure for a simple sum reduction loop
[RESOLVED DUPLICATE; assigned to unassigned; target: ---]
-
88464:
AVX-512 vectorization of masked scatter failing with "not suitable for scatter store"
[RESOLVED FIXED; assigned to jakub; target: ---]
-
88531:
Index data types when targeting AVX-512 vectorization with gather/scatter
[RESOLVED FIXED; assigned to unassigned; target: 12.0]
-
88540:
Issues with vectorization of min/max operations
[RESOLVED FIXED; assigned to rguenth; target: 14.0]
-
88915:
Try smaller vectorisation factors in scalar fallback
[RESOLVED FIXED; assigned to avieira; target: 10.0]
-
89028:
8-byte loop isn't vectorized
[RESOLVED FIXED; assigned to unassigned; target: 10.0]
-
89386:
Generation of vectorized MULHRS (Multiply High with Round and Scale) instruction
[RESOLVED FIXED; assigned to unassigned; target: 10.0]
-
91094:
BB vectorization is too quick to disable itself because of possible unrolling needed
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
91460:
gcc -mpreferred-vector-width=256 is slower than -mpreferred-vector-width=128 for some loops
[RESOLVED DUPLICATE; assigned to unassigned; target: ---]
-
91514:
optimization needs fictive memory allocation
[RESOLVED INVALID; assigned to unassigned; target: ---]
-
91573:
Vectorization failure for a loop to do multiply-add because SLP loads unnecessarily require permutation
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
91790:
ICE: verify_ssa failed (error: definition in block 2 follows the use)
[RESOLVED FIXED; assigned to linkw; target: ---]
-
92098:
[9 Regression] After r262333, the following code cannot be vectorized on powerpc64le.
[RESOLVED FIXED; assigned to unassigned; target: 9.3]
-
92344:
Missing considering fre optimization of vector load in auto-vectorization
[RESOLVED DUPLICATE; assigned to unassigned; target: ---]
-
92611:
auto vectorization failed for type promotation
[RESOLVED DUPLICATE; assigned to unassigned; target: ---]
-
92645:
Hand written vector code is 450 times slower when compiled with GCC compared to Clang
[RESOLVED FIXED; assigned to rguenth; target: 11.0]
-
93183:
SVE does not use neg as conditional
[RESOLVED FIXED; assigned to prathamesh3492; target: ---]
-
93734:
[9 Regression] Invalid code generated with -O2 -march=haswell -ftree-vectorize
[RESOLVED FIXED; assigned to unassigned; target: 10.0]
-
93771:
SLP produces VEC_PERM when should have used vector generation
[RESOLVED FIXED; assigned to unassigned; target: 11.0]
-
93868:
[10 Regression] wrong-code with permuted SLP reduction
[RESOLVED FIXED; assigned to rguenth; target: 10.0]
-
94364:
505.mcf_r is 8% faster when compiled with -mprefer-vector-width=128
[RESOLVED WONTFIX; assigned to unassigned; target: ---]
-
94375:
548.exchange2_r run time is 8-18% worse than GCC 9 at -Ofast -march=native
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
95201:
Some x86 vector-extend patterns are not exercised.
[RESOLVED FIXED; assigned to unassigned; target: ---]
(*)
-
95839:
Failure to optimize addition of vector elements to vector addition
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
95866:
vectorized shift with scalar argument not correctly costed
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
96166:
[10 Regression] -O3/-ftree-slp-vectorize turns ROL into a mess
[RESOLVED FIXED; assigned to jakub; target: 11.0]
-
96208:
non-grouped load can be SLP vectorized for 2-element vectors case
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
96239:
Failure to recognize __builtin_bswap16 pattern
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
96757:
aarch64:ICE during GIMPLE pass: vect
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
96789:
x264: sub4x4_dct() improves when vectorization is disabled
[RESOLVED FIXED; assigned to linkw; target: ---]
-
96834:
[9/10/11 Regression] Segmentation fault signal terminated program cc1
[RESOLVED DUPLICATE; assigned to rguenth; target: 9.4]
-
96864:
loop not vectorized due AVX512 condition handling
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
96974:
[10/11 Regression] ICE in vect_get_vector_types_for_stmt compiling for SVE
[RESOLVED FIXED; assigned to stammark; target: 10.3]
-
97138:
cross-BB vectorization opportunity
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
97334:
inefficient vectorization of gcc.dg/vect/bb-slp-pr65935.c
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
97351:
gcc.dg/vect/bb-slp-subgroups-3.c bad vectorization with AVX
[RESOLVED FIXED; assigned to unassigned; target: 12.0]
-
97352:
gcc.dg/vect/bb-slp-pr78205.c fails to vectorize all opportunities with AVX
[RESOLVED FIXED; assigned to rguenth; target: 12.0]
-
97428:
-O3 is great for basic AoSoA packing of complex arrays, but horrible one step above the basic
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
97770:
[ICELAKE]suboptimal vectorization for vpopcntw/b/q
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
97832:
AoSoA complex caxpy-like loops: AVX2+FMA -Ofast 7 times slower than -O3
[RESOLVED FIXED; assigned to rguenth; target: 12.0]
-
98117:
[8 Regression] wrong code with "-O3 -fno-tree-scev-cprop" since r8-1163-g7078979b291419f3
[RESOLVED FIXED; assigned to rguenth; target: 8.5]
-
98291:
multiple scalar FP accumulators auto-vectorize worse than scalar, including vector load + merge instead of scalar + high-half insert
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
98365:
Miss vectoization for signed char ifcvt
[RESOLVED FIXED; assigned to unassigned; target: 12.0]
-
98674:
[10 Regression] vectorizer failed for compilation time alias
[RESOLVED FIXED; assigned to rguenth; target: 11.0]
-
98772:
Widening patterns causing missed vectorization
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
98848:
[10 regression] vectorizer failed to reduce max pattern since r9-1590
[RESOLVED FIXED; assigned to jakub; target: 11.0]
-
99394:
s254 benchmark of TSVC is vectorized by clang and not by gcc
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
99407:
s243 benchmark of TSVC is vectorized by clang and not by gcc, missed DSE
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
99409:
s252 benchmark of TSVC is vectorized by clang and not by gcc
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
99412:
s352 benchmark of TSVC is vectorized by clang and not by gcc
[RESOLVED FIXED; assigned to rguenth; target: 13.0]
-
100696:
mult_higpart is not vectorized
[RESOLVED FIXED; assigned to linkw; target: 12.0]
-
101668:
BB vectorizer doesn't handle lowpart of existing vector
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
101801:
vect_worthwhile_without_simd_p is broken
[RESOLVED FIXED; assigned to unassigned; target: 12.0]
-
101908:
[12 regression] cray regression with -O2 -ftree-slp-vectorize compared to -O2
[RESOLVED FIXED; assigned to unassigned; target: 12.0]
-
101910:
[12 Regression] tsvc regressions for -O2 -ftree-loop-vectorize at zen hardware
[RESOLVED INVALID; assigned to unassigned; target: 12.0]
-
102467:
Missed SLP discovery for gathers
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
102575:
Failure to optimize double _Complex stores to use largest loads/stores possible
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
103144:
vectorizer failed to recognize shift>>=1 in loop as shift>>i
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
103771:
[12/13/14 Regression] Missed vectorization under -mavx512f -mavx512vl after r12-5489
[RESOLVED FIXED; assigned to pinskia; target: 14.0]
-
103903:
Loops handling r,g,b values are not vectorized to use power of 2 vectors even if they can
[RESOLVED INVALID; assigned to unassigned; target: ---]
-
103941:
uavgv2qi3_ceil is not used (SLP costing and patterns vs live stmts)
[RESOLVED FIXED; assigned to rguenth; target: ---]
(*)
-
103948:
Vectorizer does not use vec_cmpMN without vcondMN pattern
[RESOLVED FIXED; assigned to unassigned; target: 12.0]
-
103995:
[11/12 Regression] conj() ignored with tree loop vectorizer
[RESOLVED FIXED; assigned to rguenth; target: 11.3]
-
104058:
[12 Regression] 6-7% x264_r regression with -march=native -Ofast -funroll-loops -flto on x86 since r12-6420-gd3ff7420e941931d32ce2e332e7968fe67ba20af
[RESOLVED FIXED; assigned to unassigned; target: 12.0]
-
104240:
SLP discovery does not consider swapping comparisons
[RESOLVED FIXED; assigned to rguenth; target: ---]
(*)
-
104595:
unvectorized loop due to bool condition loaded from memory
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
106010:
Miss vectorization for complex type copy.
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
106019:
Surprising SLP failure on trivial code
[RESOLVED FIXED; assigned to rsandifo; target: ---]
-
106081:
missed vectorization
[RESOLVED FIXED; assigned to rguenth; target: 14.0]
-
106346:
[11/12/13/14 Regression] Potential regression on vectorization of left shift with constants since r11-5160-g9fc9573f9a5e94
[RESOLVED FIXED; assigned to tnfchris; target: 14.0]
-
106475:
Loop vectorizer prevents vectorization
[RESOLVED INVALID; assigned to unassigned; target: ---]
-
106989:
GCC fail to vectorize and clang succeed
[RESOLVED DUPLICATE; assigned to unassigned; target: ---]
-
107093:
AVX512 mask operations not simplified in fully masked loop
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
107247:
SLP reduction results fail to reduce to a single accumulator
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
107647:
[12 Regression] GCC 12.2.0 may produce FMAs even with -ffp-contract=off
[RESOLVED FIXED; assigned to rguenth; target: 12.3]
-
108429:
[13 Regression] FAIL: gcc.target/i386/pr89618.c scan-tree-dump vect "LOOP VECTORIZED"
[RESOLVED FIXED; assigned to hubicka; target: 13.0]
-
109406:
Missing use of aarch64 SVE2 unpredicated integer multiply
[RESOLVED FIXED; assigned to unassigned; target: 14.0]
-
110018:
Missing vectorizable_conversion(unsigned char -> double) for BB vectorizer
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
110148:
[14 Regression] TSVC s242 regression between g:c0df96b3cda5738afbba3a65bb054183c5cd5530 and g:e4c986fde56a6248f8fbe6cf0704e1da34b055d8
[RESOLVED FIXED; assigned to unassigned; target: 14.0]
-
110310:
vector epilogue handling is inefficient
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
110630:
Missed optimization: bb-slp-pr95839.c not vectorised with V2SF targets
[RESOLVED FIXED; assigned to rguenth; target: ---]
-
110897:
RISC-V: Fail to vectorize shift
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
111023:
missing extendv4siv4hi (and friends)
[RESOLVED FIXED; assigned to unassigned; target: ---]
-
111115:
Failure to vectorize conditional grouped store
[RESOLVED FIXED; assigned to rguenth; target: 14.0]
Bug 53947
does not block any bugs.