Split off from PR21676, which is no longer a regression but where "-O2 -fno-ivopts" is still much faster than "-O2". For the testcase, see that PR.
This is pretty old. I tried latest trunk with revision r205025.

gcc -O2 -march=pentium4 [-fomit-frame-pointer]

.L7:
        movl    (%esi,%eax,4), %edx
        fldl    (%edi,%edx,8)
        fmull   (%ebx,%eax,8)
        faddp   %st, %st(1)
        addl    $1, %eax
        cmpl    %ecx, %eax
        jne     .L7

gcc -O2 -march=pentium4 [-fomit-frame-pointer] -fno-ivopts

.L7:
        movl    (%esi,%eax,4), %edx
        fldl    (%edi,%edx,8)
        fmull   (%ebx,%eax,8)
        faddp   %st, %st(1)
        addl    $1, %eax
        cmpl    %eax, %ecx
        jg      .L7

Also works for default arch in my configuration. Should this be considered fixed?
Fixed per comment 1.