This is the mail archive of the
gcc-patches@gcc.gnu.org
mailing list for the GCC project.
[PATCH ARM iWMMXt 0/5] Improve iWMMXt support
- From: Matt Turner <mattst88 at gmail dot com>
- To: gcc-patches at gcc dot gnu dot org
- Cc: Ramana Radhakrishnan <ramana dot radhakrishnan at arm dot com>, Richard Earnshaw <richard dot earnshaw at arm dot com>, Nick Clifton <nickc at redhat dot com>, Paul Brook <paul at codesourcery dot com>, Xinyu Qi <xyqi at marvell dot com>
- Date: Tue, 29 May 2012 00:13:14 -0400
- Subject: [PATCH ARM iWMMXt 0/5] Improve iWMMXt support
This series was written by Marvell and sent by Xinyu Qi <xyqi@marvell.com>
a number of times in the last year.
We (One Laptop per Child) need these patches for reasonable iWMMXt support
and performance. Without them, logical and shift intrinsics cause ICEs,
see PR 35294 and its duplicates 36798 and 36966.
The software compositing library pixman uses MMX intrinsics to optimize
various compositing routines. The following are the minimum execution times
of cairo-perf-trace graphics work loads without and with iWMMXt-optimized
pixman for the image and image16 backends (32-bpp and 16-bpp respectively).
image image16
evolution 33.492 -> 29.590 30.334 -> 24.751
firefox-planet-gnome 191.465 -> 173.835 211.297 -> 187.570
gnome-system-monitor 51.956 -> 44.549 52.272 -> 40.525
gnome-terminal-vim 53.625 -> 54.554 47.593 -> 47.341
grads-heat-map 4.439 -> 4.165 4.548 -> 4.624
midori-zoomed 38.033 -> 28.500 38.576 -> 26.937
poppler 41.096 -> 31.949 41.230 -> 31.749
swfdec-giant-steps 20.062 -> 16.912 28.294 -> 17.286
swfdec-youtube 42.281 -> 37.335 52.848 -> 47.053
xfce4-terminal-a1 64.311 -> 51.011 62.592 -> 51.191
We have cleaned up some white-space issues with the patches and fixed a
small bug in patch 4/5 since the last time they were posted in December
(added tandc,textrc,torc,torvsc to the "wtype" attribute)
Please commit them for 4.8.
For 4.7 and 4.6 please consider committing my patch
"[PATCH] arm: Fix iwmmxt shift and logical intrinsics (PR 35294)."
which only fixes the logical and shift intrinsics.
Thanks,
Matt Turner