This is the mail archive of the
gcc-patches@gcc.gnu.org
mailing list for the GCC project.
Re: [PATCH] PR target/66819: Allow indirect sibcall with register arguments
- From: Uros Bizjak <ubizjak at gmail dot com>
- To: "H.J. Lu" <hjl dot tools at gmail dot com>
- Cc: "gcc-patches at gcc dot gnu dot org" <gcc-patches at gcc dot gnu dot org>
- Date: Fri, 10 Jul 2015 19:21:37 +0200
- Subject: Re: [PATCH] PR target/66819: Allow indirect sibcall with register arguments
- Authentication-results: sourceware.org; auth=none
- References: <20150709105439 dot GA19247 at gmail dot com> <CAFULd4Zg3-L23+UAQHsP3j7a0iPboFyxu+ETR3cFx7eh4r0aow at mail dot gmail dot com> <CAMe9rOrbAe+AO1=GbjG6nC1xYa4mweKxN=_zTy09OC=efcbK8g at mail dot gmail dot com>
On Fri, Jul 10, 2015 at 7:10 PM, H.J. Lu <hjl.tools@gmail.com> wrote:
> On Fri, Jul 10, 2015 at 9:30 AM, Uros Bizjak <ubizjak@gmail.com> wrote:
>> On Thu, Jul 9, 2015 at 12:54 PM, H.J. Lu <hjl.tools@gmail.com> wrote:
>>> Indirect sibcall with register arguments is OK when there is register
>>> available for argument passing.
>>>
>>> OK for trunk if there is no regression?
>>>
>>>
>>> H.J.
>>> ---
>>> gcc/
>>>
>>> PR target/66819
>>> * config/i386/i386.c (ix86_function_ok_for_sibcall): Allow
>>> indirect sibcall with register arguments if register available
>>> for argument passing.
>>> (init_cumulative_args): Set cfun->machine->arg_reg_available_p
>>> to cum->nregs != 0.
Please update the above entry for nregs > 0.
>>> (function_arg_advance_32): Set cfun->machine->arg_reg_available_p
>>> to 0 when setting cum->nregs = 0.
>>
>> Do we also need similar functionality for 64bit ABIs? What happens if
>> we are out of argument regs there?
>
> 64-bit is OK since we have rax, r10 and r11 as scratch registers which
> aren't used to pass arguments.
Maybe this fact should be added as a comment in some appropriate place.
>>> * config/i386/i386.h (machine_function): Add arg_reg_available_p.
>>>
>>> gcc/testsuite/
>>>
>>> PR target/66819
>>> * gcc.target/i386/pr66819-1.c: New test.
>>> * gcc.target/i386/pr66819-2.c: Likewise.
>>> * gcc.target/i386/pr66819-3.c: Likewise.
>>> * gcc.target/i386/pr66819-4.c: Likewise.
>>> * gcc.target/i386/pr66819-5.c: Likewise.
>>> ---
>>> gcc/config/i386/i386.c | 15 +++++++++------
>>> gcc/config/i386/i386.h | 3 +++
>>> gcc/testsuite/gcc.target/i386/pr66819-1.c | 8 ++++++++
>>> gcc/testsuite/gcc.target/i386/pr66819-2.c | 8 ++++++++
>>> gcc/testsuite/gcc.target/i386/pr66819-3.c | 10 ++++++++++
>>> gcc/testsuite/gcc.target/i386/pr66819-4.c | 12 ++++++++++++
>>> gcc/testsuite/gcc.target/i386/pr66819-5.c | 10 ++++++++++
>>> 7 files changed, 60 insertions(+), 6 deletions(-)
>>> create mode 100644 gcc/testsuite/gcc.target/i386/pr66819-1.c
>>> create mode 100644 gcc/testsuite/gcc.target/i386/pr66819-2.c
>>> create mode 100644 gcc/testsuite/gcc.target/i386/pr66819-3.c
>>> create mode 100644 gcc/testsuite/gcc.target/i386/pr66819-4.c
>>> create mode 100644 gcc/testsuite/gcc.target/i386/pr66819-5.c
>>>
>>> diff --git a/gcc/config/i386/i386.c b/gcc/config/i386/i386.c
>>> index 54ee6f3..85e59a8 100644
>>> --- a/gcc/config/i386/i386.c
>>> +++ b/gcc/config/i386/i386.c
>>> @@ -5628,12 +5628,12 @@ ix86_function_ok_for_sibcall (tree decl, tree exp)
>>> if (!decl
>>> || (TARGET_DLLIMPORT_DECL_ATTRIBUTES && DECL_DLLIMPORT_P (decl)))
>>> {
>>> - if (ix86_function_regparm (type, NULL) >= 3)
>>> - {
>>> - /* ??? Need to count the actual number of registers to be used,
>>> - not the possible number of registers. Fix later. */
>>> - return false;
>>> - }
>>> + /* FIXME: The symbol indirect call doesn't need a
>>> + call-clobbered register. But we don't know if
>>> + this is a symbol indirect call or not here. */
>>> + if (ix86_function_regparm (type, NULL) >= 3
>>> + && !cfun->machine->arg_reg_available_p)
>>
>> Isn't enough to look at arg_reg_available here?
>
> We need to check ix86_function_regparm since nregs is 0 if
> -mregparm=N isn't used and pr65753.c will fail.
OK. Please add this comment, is not that obvious.
>
>>> + return false;
>>> }
>>> }
>>>
>>> @@ -6567,6 +6567,7 @@ init_cumulative_args (CUMULATIVE_ARGS *cum, /* Argument info to initialize */
>>> ? X86_64_REGPARM_MAX
>>> : X86_64_MS_REGPARM_MAX);
>>> }
>>> + cfun->machine->arg_reg_available_p = cum->nregs != 0;
>>
>> false instead of 0. This is a boolean.
>
> Updated.
>
>>> if (TARGET_SSE)
>>> {
>>> cum->sse_nregs = SSE_REGPARM_MAX;
>>> @@ -6636,6 +6637,7 @@ init_cumulative_args (CUMULATIVE_ARGS *cum, /* Argument info to initialize */
>>> else
>>> cum->nregs = ix86_function_regparm (fntype, fndecl);
>>> }
>>> + cfun->machine->arg_reg_available_p = cum->nregs != 0;
>>
>> IMO, cum->nregs > 0 would be more descriptive.
>
> Updated.
>
>>> /* Set up the number of SSE registers used for passing SFmode
>>> and DFmode arguments. Warn for mismatching ABI. */
>>> @@ -7584,6 +7586,7 @@ pass_in_reg:
>>> {
>>> cum->nregs = 0;
>>> cum->regno = 0;
>>> + cfun->machine->arg_reg_available_p = 0;
>>> }
>>> break;
>>>
>>> diff --git a/gcc/config/i386/i386.h b/gcc/config/i386/i386.h
>>> index 74334ff..0b6e304 100644
>>> --- a/gcc/config/i386/i386.h
>>> +++ b/gcc/config/i386/i386.h
>>> @@ -2479,6 +2479,9 @@ struct GTY(()) machine_function {
>>> /* If true, it is safe to not save/restore DRAP register. */
>>> BOOL_BITFIELD no_drap_save_restore : 1;
>>>
>>> + /* If true, there is register available for argument passing. */
>>> + BOOL_BITFIELD arg_reg_available_p : 1;
>>
>> This is not a predicate, but a boolean flag. Please remove _p from the name.
>
> Updated.
>
> Here is the updated patch. OK for trunk?
OK with a small comment additions.
+ /* If true, there is register available for argument passing. */
+ BOOL_BITFIELD arg_reg_available : 1;
+
Please mention here that this is for 32bit targets only.
Thanks,
Uros.