[Bug optimization/11873] New: inefficient use of registers induces size and time overhead
willy at w dot ods dot org
gcc-bugzilla@gcc.gnu.org
Sun Aug 10 08:39:00 GMT 2003
PLEASE REPLY TO gcc-bugzilla@gcc.gnu.org ONLY, *NOT* gcc-bugs@gcc.gnu.org.
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=11873
Summary: inefficient use of registers induces size and time
overhead
Product: gcc
Version: 3.3.1
Status: UNCONFIRMED
Severity: enhancement
Priority: P2
Component: optimization
AssignedTo: unassigned at gcc dot gnu dot org
ReportedBy: willy at w dot ods dot org
CC: gcc-bugs at gcc dot gnu dot org
GCC build triplet: i586-linux-gnu
GCC host triplet: i586-linux-gnu
GCC target triplet: i586-linux-gnu
I wrote a simple test function which uses an unsigned long long arg and returns
its integer part + 1 if not 0, otherwise 0. GCC 3.3.1 passes some intermediate
values through several registers while it's unneeded. Here comes all my analysis.
I hope this can help improving the optimizer.
The following code :
int test(unsigned long long x) {
if (x) {
return (int)x + 1;
}
else {
return (int)x;
}
}
processed this way :
# 1 "bool.c"
# 1 "<built-in>"
# 1 "<command line>"
# 1 "bool.c"
int test(unsigned long long x) {
if (x) {
return (int)x + 1;
}
else {
return (int)x;
}
}
produces the following code when compiled with gcc 2.95.3 :
(gcc -c -O2 -fomit-frame-pointer bool.c)
00000000 <test>:
0: 8b 54 24 04 mov 0x4(%esp,1),%edx
4: 8b 4c 24 08 mov 0x8(%esp,1),%ecx
8: 89 d0 mov %edx,%eax
a: 09 c8 or %ecx,%eax
c: 75 03 jne 11 <test+0x11>
e: 89 d0 mov %edx,%eax
10: c3 ret
11: 8d 42 01 lea 0x1(%edx),%eax
14: c3 ret
15: 8d 76 00 lea 0x0(%esi),%esi
18:
and this one with gcc-3.3.1 :
(gcc-3.3.1 -c -O2 -fomit-frame-pointer bool.c)
00000000 <test>:
0: 53 push %ebx
1: 8b 4c 24 0c mov 0xc(%esp,1),%ecx
5: 8b 54 24 08 mov 0x8(%esp,1),%edx
9: 89 cb mov %ecx,%ebx
b: 89 d0 mov %edx,%eax
d: 09 d3 or %edx,%ebx
f: 74 03 je 14 <test+0x14>
11: 8d 42 01 lea 0x1(%edx),%eax
14: 5b pop %ebx
15: c3 ret
16: 8d 76 00 lea 0x0(%esi),%esi
19: 8d bc 27 00 00 00 00 lea 0x0(%edi,1),%edi
20:
the EBX register is used and clobbered for nothing here. The
same code could be written this way, which is fully equivalent
and saves some cycles and bytes :
00000000 <test>:
0: 8b 4c 24 0c mov 0xc(%esp,1),%ecx
4: 8b 54 24 08 mov 0x8(%esp,1),%edx
8: 89 d0 mov %edx,%eax
a: 09 d1 or %edx,%ecx
c: 74 03 je 11 <test+0x11>
e: 8d 42 01 lea 0x1(%edx),%eax
11: c3 ret
12:
Now we can also save EDX and some more bytes :
00000000 <test>:
0: 8b 4c 24 0c mov 0xc(%esp,1),%ecx
4: 8b 54 24 08 mov 0x8(%esp,1),%eax
a: 09 c1 or %eax,%ecx
c: 74 01 je f <test+0xf>
e: 41 inc %eax
f: c3 ret
10:
Here are the compilers versions :
wtap:~/dev$ gcc -v
Reading specs from /usr/lib/gcc-lib/i586-pc-linux-gnu/2.95.3/specs
gcc version 2.95.3 20010315 (release)
wtap:~/dev$ gcc-3.3.1 -v
Reading specs from /usr/lib/gcc-lib/i586-pc-linux-gnu/3.3.1/specs
Configured with: ./configure --prefix=/usr --with-cpu=i386 --host=i586-pc-linux-gnu --enable-languages=c,c++ --disable-nls --disable-locale --enable-shared --enable-target-optspace --enable-version-specific-runtime-libs --program-suffix=-3.3.1 --enable-threads
Thread model: posix
gcc version 3.3.1
More information about the Gcc-bugs
mailing list