avx512 mask register spilling

Vladimir Makarov vmakarov@redhat.com
Wed Aug 7 15:55:00 GMT 2013


On 13-08-06 3:11 PM, Richard Henderson wrote:
> On 08/06/2013 03:57 AM, Kirill Yukhin wrote:
>> On 05 Aug 09:55, Richard Henderson wrote:
>>> On 08/05/2013 08:07 AM, Kirill Yukhin wrote:
>>>> Hello Richard, Vlad,
>>>>
>>>> On 31 Jul 06:26, Richard Henderson wrote:
>>>>> On 07/31/2013 05:02 AM, Kirill Yukhin wrote:
>>>>>> There's ICE (max. number of generated reload insns per insn is achieved (90)),
>>>>>> when LRA tried to save mask register before call. This was caused by fact that  split_reg function
>>>>>> in lra-constraints.c allocates memory for saved reg in SECONDARY_MEMORY_NEEDED_MODE which
>>>>> I've told you before that it's not SECONDARY_MEMORY that you want, but
>>>>> a secondary reload.  You should be patching ix86_secondary_reload, right
>>>>> below where we handle QImode spills from non-Q registers for x86-32.
>>>> Trying to do that with no success so far.
>>>> Could you pls correct me if I am wrong.
>>>> What I am trying to do is to introduce 2 new `define_expand' for load and store.
>>> Huh?  You shouldn't need this.
>>>
>>> Give me a test case and I can have a look at it.
>> Hello,
>>
>> I've squashed part 1 and 2 + rebased on recent trunk.
>> Testcase is attached.
>> To reproduce: build-x86_64-linux/gcc/xgcc -Bbuild-x86_64-linux/gcc -Ofast -mavx512f -march=core-avx2 repro.c  -S -o-  -ffixed-rsi  -ffixed-rdi  -ffixed-rbx -ffixed-rbp -m32
>>
>> Thanks a lot for help!
> You've found what I believe to be a bug in LRA.
>
> Specifically, lra-constraints.c split_reg uses SECONDARY_MEMORY_NEEDED_MODE to
> choose what mode to spill a caller-save register.  Given the existing
> definition in i386.h, this tries to spill a MASK_CLASS register in SImode.  But
> MASK_CLASS does not support SImode, only QI/HImode.  Which leads to substantial
> confusion in the allocator trying to satisfy the move.
>
> I believe the use of SECONDARY_MEMORY_NEEDED_MODE in split_reg is wrong.
> What's the history behind that, Vlad?  Surely we can spill the value in
> its current mode?

As I remember I tried to decrease number of macros used for LRA.

Just using mode of reg might not work for general case.

Reload (caller-saves.c) uses HARD_REGNO_CALLER_SAVE_MODE.   I guess we 
should use it.

I'll try to implement this and after some testing and checking on a few 
platform I'll commit it.

I guess we will have a solution at the end of this week.
> Certainly this patch fixes the crash from Kirill's reproducer...
>
Thanks for working on this, Richard.
>



More information about the Gcc-patches mailing list