[PATCH PR95254] aarch64: gcc generate inefficient code with fixed sve vector length
Yangfei (Felix)
felix.yang@huawei.com
Tue Jun 2 02:44:49 GMT 2020
Hi,
> -----Original Message-----
> From: Richard Sandiford [mailto:richard.sandiford@arm.com]
> Sent: Monday, June 1, 2020 4:47 PM
> To: Yangfei (Felix) <felix.yang@huawei.com>
> Cc: gcc-patches@gcc.gnu.org; Uros Bizjak <ubizjak@gmail.com>; Jakub
> Jelinek <jakub@redhat.com>; Hongtao Liu <crazylht@gmail.com>; H.J. Lu
> <hjl.tools@gmail.com>
> Subject: Re: [PATCH PR95254] aarch64: gcc generate inefficient code with
> fixed sve vector length
Snip...
> Sounds good. Maybe at this point the x_inner and y_inner code is getting
> complicated enough to put into a lambda too:
>
> x_inner = ... (x);
> y_inner = ... (y);
>
> Just a suggestion though.
Yes, that's a good suggestion. The code becomes cleaner with another lambda.
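As a rough illustration of the idea outside of GCC, here is a minimal, self-contained sketch. The `Node` type and the names `strip_wrapper` and `same_inner` are hypothetical stand-ins and are not taken from the patch. The only point is that one local lambda can replace two copies of the same unwrapping logic for x and y.

```cpp
#include <cassert>

// Hypothetical stand-in for an expression node; not a GCC type.
struct Node {
  bool is_wrapper;  // true if this node merely wraps another node
  Node *inner;      // the wrapped node, or nullptr
};

// Returns true if x and y unwrap to the same underlying node.
bool same_inner(Node *x, Node *y) {
  // A single lambda holds the unwrapping logic, so it is written once
  // and applied to both operands.
  auto strip_wrapper = [](Node *n) -> Node * {
    if (n && n->is_wrapper && n->inner)
      return n->inner;
    return n;
  };

  Node *x_inner = strip_wrapper(x);
  Node *y_inner = strip_wrapper(y);
  return x_inner == y_inner;
}
```

Keeping the lambda local to the function means the helper stays next to its only two uses and does not add a new file-level name.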
> Yeah, looks good.
>
> Formatting nit though: multi-line conditions should be wrapped in (...),
> i.e.:
>
> return (...
> && ...
> && ...);
>
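For reference, the wrapped layout requested above looks like this in a small, self-contained sketch. The function `in_range_and_even` and its parameters are hypothetical and only serve to show the formatting. The GNU coding standards recommend the extra parentheses so that Emacs indents the continuation lines properly.

```cpp
#include <cassert>

// Hypothetical predicate, used only to show the layout.
bool in_range_and_even(int value, int lo, int hi) {
  // The outer parentheses wrap the whole multi-line condition, and each
  // continuation line starts with the operator.
  return (value >= lo
          && value <= hi
          && value % 2 == 0);
}
```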
Done. v6 patch is based on trunk 20200601.
Bootstrapped and tested on aarch64-linux-gnu.
Also bootstrapped on x86-64-linux-gnu with --enable-multilib (for building -m32 x86 libgcc).
Regression testing on x86-64-linux-gnu looks good except for the following failures, which have been confirmed by the x86 devs:
> FAIL: gcc.target/i386/avx512f-vcvtps2ph-2.c (test for excess errors)
> UNRESOLVED: gcc.target/i386/avx512f-vcvtps2ph-2.c compilation failed to produce executable
154803c154803
Thanks,
Felix
-------------- next part --------------
A non-text attachment was scrubbed...
Name: pr95254-v6.diff
Type: application/octet-stream
Size: 5570 bytes
Desc: pr95254-v6.diff
URL: <https://gcc.gnu.org/pipermail/gcc-patches/attachments/20200602/520f7706/attachment-0001.obj>