[Bug rtl-optimization/61605] Potential optimization: Keep unclobbered argument registers live across function calls

vries at gcc dot gnu.org gcc-bugzilla@gcc.gnu.org
Mon Sep 29 10:24:00 GMT 2014


https://gcc.gnu.org/bugzilla/show_bug.cgi?id=61605

vries at gcc dot gnu.org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |NEW
   Last reconfirmed|                            |2014-09-29
                 CC|                            |vries at gcc dot gnu.org
     Ever confirmed|0                           |1

--- Comment #4 from vries at gcc dot gnu.org ---
> If a function is known to not clobber an argument register then the caller
> shouldn't have to save/reload that register across the function call.

If a function is known to not clobber an call_used_reg then the caller
can use it as a non-call_used_reg across the function call.

This diff shows the example with -fno-use-caller-save vs -fuse-caller-save:
...
 foo:
 .LFB1:
     .cfi_startproc
-    pushq    %rbx
-    .cfi_def_cfa_offset 16
-    .cfi_offset 3, -16
-    movl    %edi, %ebx
+    movl    %edi, %edx
     call    bar
-    addl    %ebx, %eax
-    popq    %rbx
-    .cfi_def_cfa_offset 8
+    addl    %edx, %eax
     ret
     .cfi_endproc
 .LFE1:
...
-fuse-caller-save removes the entry/exit save/restore pair
'pushq %rbx'/'popq %rbx'.

The 'movl %edi, %edx' is indeed non-optimal, but it's not a 'save' in the sense
of save/restore pair generated at function entry/exit or around function calls.
It's a copy at function entry of a hard reg argumument to a pseudo reg,
generated at expand, which is followed by a copy of the pseudo reg to the same
register to set the argument for the function call:
...
(insn 2 4 3 2 (set (reg/v:SI 86 [ yD.1755 ])
        (reg:SI 5 di [ yD.1755 ])) test.c:9 -1
     (nil))
(note 3 2 6 2 NOTE_INSN_FUNCTION_BEG)
(insn 6 3 7 2 (set (reg:SI 5 di)
        (reg/v:SI 86 [ yD.1755 ])) test.c:10 -1
     (nil))
...
The second insn is removed in pass_fast_rtl_dce. The reg-alloc choiche for
pseudo 86 in the first insn is dx, and the insn remains.

I think there could be two ways to address this:
1. Teach a pass after ira, like pass_cprop_hardreg or pass_gcse2 to use the
   information collected by fuse-calller-save.
2. Teach ira to prefer the dx to di in this case.

My guess would be pass_cprop_hardreg.



More information about the Gcc-bugs mailing list