This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH] Prefer mempcpy to memcpy on x86_64 target (PR middle-end/81657).

From: Richard Biener <rguenther at suse dot de>
To: Jakub Jelinek <jakub at redhat dot com>
Cc: Martin Liška <mliska at suse dot cz>,Uros Bizjak <ubizjak at gmail dot com>,gcc-patches at gcc dot gnu dot org,Marc Glisse <marc dot glisse at inria dot fr>,"H.J. Lu" <hjl dot tools at gmail dot com>,Jan Hubicka <hubicka at ucw dot cz>
Date: Thu, 12 Apr 2018 17:17:29 +0200
Subject: Re: [PATCH] Prefer mempcpy to memcpy on x86_64 target (PR middle-end/81657).
References: <772b1171-2321-67d9-85e7-358a5cad0efa@suse.cz> <20180329122532.GP8577@tucnak> <17bbc039-e511-4fbe-d534-3d6d21aadc00@suse.cz> <2d812eaf-8ea0-68e8-089b-0c3d89a203d8@suse.cz> <20180410091915.GA8577@tucnak> <fbd9f1ef-34c6-45e1-b5ae-5acb3b828788@suse.cz> <5b750aa0-c5f6-0e64-9a14-5667926bcf3f@suse.cz> <alpine.LSU.2.20.1804121536260.18265@zhemvz.fhfr.qr> <20180412140549.GJ8577@tucnak> <alpine.LSU.2.20.1804121614440.18265@zhemvz.fhfr.qr> <20180412143112.GK8577@tucnak>

On April 12, 2018 4:31:12 PM GMT+02:00, Jakub Jelinek <jakub@redhat.com> wrote:
>On Thu, Apr 12, 2018 at 04:19:38PM +0200, Richard Biener wrote:
>> Well, but that wouldn't be a fix for a regression and IMHO there's
>> no reason for a really lame mempcpy.  If targets disgree well,
>
>It is a regression as well, in the past we've emitted mempcpy when user
>wrote mempcpy, now we don't.
>
>E.g.
>extern void *mempcpy (void *, const void *, __SIZE_TYPE__);
>void bar (void *, void *, void *);
>
>void
>foo (void *x, void *y, void *z, void *w, __SIZE_TYPE__ n)
>{
>  bar (mempcpy (x, w, n), mempcpy (y, w, n), mempcpy (z, w, n));
>}
>
>is on x86_64-linux -O2 in 7.x using the 3 mempcpy calls and 90 bytes in
>foo, while
>on the trunk uses 3 memcpy calls and 96 bytes in foo.
>
>For -Os that is easily measurable regression, for -O2 it depends on the
>relative speed of memcpy vs. mempcpy and whether one or both of them
>are in
>I-cache or not.

Well, then simply unconditionally not generate a libcall from the move expander? 

>
>> then they get what they deserve.
>> 
>> I don't see any aarch64 specific mempcpy in glibc btw so hopefully
>> the default non-stupid one kicks in (it exactly looks like my C
>> version)
>
>	Jakub

Follow-Ups:
- Re: [PATCH] Prefer mempcpy to memcpy on x86_64 target (PR middle-end/81657).
  - From: Jakub Jelinek

References:
- Re: [PATCH] Prefer mempcpy to memcpy on x86_64 target (PR middle-end/81657).
  - From: Martin Liška
- Re: [PATCH] Prefer mempcpy to memcpy on x86_64 target (PR middle-end/81657).
  - From: Jakub Jelinek
- Re: [PATCH] Prefer mempcpy to memcpy on x86_64 target (PR middle-end/81657).
  - From: Martin Liška
- Re: [PATCH] Prefer mempcpy to memcpy on x86_64 target (PR middle-end/81657).
  - From: Martin Liška
- Re: [PATCH] Prefer mempcpy to memcpy on x86_64 target (PR middle-end/81657).
  - From: Richard Biener
- Re: [PATCH] Prefer mempcpy to memcpy on x86_64 target (PR middle-end/81657).
  - From: Jakub Jelinek
- Re: [PATCH] Prefer mempcpy to memcpy on x86_64 target (PR middle-end/81657).
  - From: Richard Biener
- Re: [PATCH] Prefer mempcpy to memcpy on x86_64 target (PR middle-end/81657).
  - From: Jakub Jelinek

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]