This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH] Reenable CSE of non-volatile inline asm (PR rtl-optimization/63637)

From: Richard Henderson <rth at redhat dot com>
To: Jakub Jelinek <jakub at redhat dot com>, Richard Biener <richard dot guenther at gmail dot com>
Cc: Jeff Law <law at redhat dot com>, Segher Boessenkool <segher at kernel dot crashing dot org>, Richard Biener <rguenther at suse dot de>, Eric Botcazou <ebotcazou at adacore dot com>, gcc-patches at gcc dot gnu dot org
Date: Fri, 23 Jan 2015 12:52:37 -0800
Subject: Re: [PATCH] Reenable CSE of non-volatile inline asm (PR rtl-optimization/63637)
Authentication-results: sourceware.org; auth=none
References: <20150113161819 dot GD1405 at tucnak dot redhat dot com> <20150113163840 dot GA4183 at gate dot crashing dot org> <54B575D7 dot 8030107 at redhat dot com> <20150113201322 dot GJ1405 at tucnak dot redhat dot com> <54B59964 dot 7070707 at redhat dot com> <20150114000315 dot GA32710 at gate dot crashing dot org> <54B609A9 dot 9090800 at redhat dot com> <20150114151906 dot GA21784 at gate dot crashing dot org> <54B74AD9 dot 4010905 at redhat dot com> <DE331E6C-39D7-4F70-9E25-A3A4F7F49640 at gmail dot com> <20150115081330 dot GB1405 at tucnak dot redhat dot com>

On 01/15/2015 12:13 AM, Jakub Jelinek wrote:
> On Thu, Jan 15, 2015 at 07:46:18AM +0100, Richard Biener wrote:
>>>> That last line means the compiler is free to delete a non-volatile
>>>> asm with a memory clobber if that asm is not needed for dataflow.  Or
>>>> that is how I read it; it is trying to indicate that if you want to
>>>> prevent the memory clobber from being deleted (together with the rest
>>>> of the asm), you need to make the asm volatile.
>>>>
>>>> So as far as I can see the compiler can CSE two identical
>>> non-volatile
>>>> asms with memory clobber just fine.  Older GCC (I tried 4.7.2) does
>>> do
>>>> this; current mainline doesn't.  I think it should.
>>> No, it should not CSE those two cases.  That's simply wrong and if an 
>>> older version did that optimization, that's a bug.
>>
>> I think segher has a point here.  If the asm with memory clobber would store to random memory and the point would be to preserve that then the whole distinction with volatile doesn't make much sense (after all without volatile we happily DCE such asm if the regular outputs are not needed).
>>
>> This doesn't mean 'memory' is a well-designed thing, of course. Just its
>> effects are effectively limited to reads without volatile(?)
> 
> Segher's mails talk about "memory" being a write but not read.
> If we even can't agree on what non-volatile "memory" means, I think
> we should treat it more conservatively, because every user (and there are
> lots of people using non-volatile asm with "memory" in the wild) treats it
> differently.  Just trying to grep for a few:
> glibc:
> ./sysdeps/alpha/bits/atomic.h:# define atomic_full_barrier()	__asm ("mb" : : : "memory");
> ./sysdeps/alpha/bits/atomic.h:# define atomic_read_barrier()	__asm ("mb" : : : "memory");
> ./sysdeps/alpha/bits/atomic.h:# define atomic_write_barrier()	__asm ("wmb" : : : "memory");
> ./sysdeps/sparc/sparc32/bits/atomic.h:# define atomic_full_barrier() __asm ("" ::: "memory")
> ./sysdeps/powerpc/powerpc32/bits/atomic.h:# define atomic_read_barrier()	__asm ("lwsync" ::: "memory")
> ./sysdeps/powerpc/powerpc32/bits/atomic.h:# define atomic_read_barrier()	__asm ("sync" ::: "memory")
> ./sysdeps/powerpc/powerpc64/bits/atomic.h:#define atomic_read_barrier()	__asm ("lwsync" ::: "memory")
> ./sysdeps/powerpc/bits/atomic.h:#define atomic_full_barrier()	__asm ("sync" ::: "memory")
> ./sysdeps/powerpc/bits/atomic.h:#define atomic_write_barrier()	__asm ("eieio" ::: "memory")
> ./sysdeps/generic/malloc-machine.h:# define atomic_full_barrier() __asm ("" ::: "memory")

I think that it's uses like these -- which may well have been written
by folks that also work on gcc -- that are proof that we have at least
intended to support a memory clobber to be a full read+write barrier,
and thus we must consider a memory clobber to be both a read and write.

(The fact that all of these are automatically volatile and would never be CSEd
is beside the point.  If the semantics of a memory clobber differ based on the
volatile flag on the asm, I think that would be too ill-defined to actually
support.)

In the interest of progressing wrt the current regression, I think that
Jakub's patch should go in as-is for now, and then we iterate on how we
think the memory cse ought (or ought not) to occur.

As for my own thoughts on whether two non-volatile asms with memory clobbers
should be CSE'd, in absence of other stores to memory in between, are
complicated and probably not well-formed.  I'll think about it some more.


r~

Follow-Ups:
- Re: [PATCH] Reenable CSE of non-volatile inline asm (PR rtl-optimization/63637)
  - From: Segher Boessenkool

References:
- [PATCH] Reenable CSE of non-volatile inline asm (PR rtl-optimization/63637)
  - From: Jakub Jelinek
- Re: [PATCH] Reenable CSE of non-volatile inline asm (PR rtl-optimization/63637)
  - From: Segher Boessenkool
- Re: [PATCH] Reenable CSE of non-volatile inline asm (PR rtl-optimization/63637)
  - From: Jeff Law
- Re: [PATCH] Reenable CSE of non-volatile inline asm (PR rtl-optimization/63637)
  - From: Jakub Jelinek
- Re: [PATCH] Reenable CSE of non-volatile inline asm (PR rtl-optimization/63637)
  - From: Jeff Law
- Re: [PATCH] Reenable CSE of non-volatile inline asm (PR rtl-optimization/63637)
  - From: Segher Boessenkool
- Re: [PATCH] Reenable CSE of non-volatile inline asm (PR rtl-optimization/63637)
  - From: Jeff Law
- Re: [PATCH] Reenable CSE of non-volatile inline asm (PR rtl-optimization/63637)
  - From: Segher Boessenkool
- Re: [PATCH] Reenable CSE of non-volatile inline asm (PR rtl-optimization/63637)
  - From: Jeff Law
- Re: [PATCH] Reenable CSE of non-volatile inline asm (PR rtl-optimization/63637)
  - From: Richard Biener
- Re: [PATCH] Reenable CSE of non-volatile inline asm (PR rtl-optimization/63637)
  - From: Jakub Jelinek

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]