This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.



Re: wide-int more performance fixes for wide multiplication.



On 12/15/2013 11:40 AM, Richard Sandiford wrote:
Kenneth Zadeck <zadeck@naturalbridge.com> writes:
It is certainly true that in order to do an unbounded set of operations
you would have to check on every operation, so my suggestion that we
should remove the checking from the infinite-precision operations would
not support this. But the reality is that there are currently no places
in the compiler that do this.

Currently all of the uses of widest-int are one or two operations, and
the style of code writing is that you do these and then deal with the
overflow at the time that you convert the widest-int to a tree. I think
it is important to maintain the style of programming where a small,
finite number of computations does not need to be checked until they
are converted back.
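
A minimal sketch of that style (the trees t0 and t1, the result type
"type", and the scale factor are illustrative assumptions, not code
from the patch):

  /* Do a couple of widest_int operations with no per-step overflow
     flags, then check once when converting back to a tree.  */
  widest_int sum = wi::add (wi::to_widest (t0), wi::to_widest (t1));
  widest_int scaled = wi::mul (sum, 3);
  if (!wi::fits_to_tree_p (scaled, type))
    return NULL_TREE;		/* would not fit the result type */
  return wide_int_to_tree (type, scaled);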

The problem with making the buffer size so tight is that we do not have
adequate reserve to allow this style for any supportable type.
I personally think that 2x + some small n is what we need to have.


I am not as familiar with how this is used (or how it will be used when
all of the offset math is converted to use wide-int), but there appear
to be two uses of multiply. One is the "harmless" multiply by 3 and the
other is where people are trying to compute the size of arrays. The
latter operations do need to be checked for overflow. The question here
is whether you want to force those operations to be checked for
overflow individually or to check when you convert out. Again, I think
2x + some small number is what we might want to consider.
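
As a rough illustration of the two styles for the array-size multiply
(nelts and elt_size are assumed INTEGER_CST trees; neither fragment is
code from the patch, and they are alternatives shown back to back only
for comparison):

  /* Style 1: check the multiply itself via its overflow flag.  */
  bool ovf;
  offset_int size = wi::mul (wi::to_offset (nelts),
                             wi::to_offset (elt_size), SIGNED, &ovf);
  if (ovf)
    return error_mark_node;
  return wide_int_to_tree (sizetype, size);

  /* Style 2: compute in a wider buffer and check only when converting
     out; a 2x + n buffer is what would make this safe for any
     supportable type.  */
  widest_int wsize = wi::mul (wi::to_widest (nelts),
                              wi::to_widest (elt_size));
  if (!wi::fits_to_tree_p (wsize, sizetype))
    return error_mark_node;
  return wide_int_to_tree (sizetype, wsize);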
It's a fair question, but personally I think checking for overflow
on the operation is much more robust.  Checking on conversion doesn't
allow you to stop thinking about overflow, it just changes the way you
think about it: rather than handling explicit overflow flags, you have
to remember to ask "is the range of the unconverted result within the
range of widest_int", which I bet is something that would easily be
forgotten once widest_int & co. are part of the furniture.

E.g. the SPARC operation (picked only because I remember it):

	  for (i = 0; i < VECTOR_CST_NELTS (arg0); ++i)
	    {
	      tree e0 = VECTOR_CST_ELT (arg0, i);
	      tree e1 = VECTOR_CST_ELT (arg1, i);

	      bool neg1_ovf, neg2_ovf, add1_ovf, add2_ovf;

	      tmp = wi::neg (e1, &neg1_ovf);
	      tmp = wi::add (e0, tmp, SIGNED, &add1_ovf);
	      if (wi::neg_p (tmp))
		tmp = wi::neg (tmp, &neg2_ovf);
	      else
		neg2_ovf = false;
	      result = wi::add (result, tmp, SIGNED, &add2_ovf);
	      overflow |= neg1_ovf | neg2_ovf | add1_ovf | add2_ovf;
	    }

	  gcc_assert (!overflow);

	  return wide_int_to_tree (rtype, result);

seems pretty natural.  If instead it was modelled as a widest_int
chain without overflow then it would be less obviously correct.
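
For comparison, a rough sketch (not from the thread) of the same
computation written as a widest_int chain, with the overflow question
deferred to the conversion; arg0, arg1 and rtype are as in the snippet
above:

	  widest_int result = 0;
	  for (unsigned i = 0; i < VECTOR_CST_NELTS (arg0); ++i)
	    {
	      widest_int e0 = wi::to_widest (VECTOR_CST_ELT (arg0, i));
	      widest_int e1 = wi::to_widest (VECTOR_CST_ELT (arg1, i));
	      widest_int diff = wi::sub (e0, e1);
	      if (wi::neg_p (diff))
		diff = wi::neg (diff);
	      result = wi::add (result, diff);	/* accumulate |e0 - e1| */
	    }

	  /* The overflow question is now implicit: does the final sum
	     still fit rtype?  */
	  gcc_assert (wi::fits_to_tree_p (result, rtype));

	  return wide_int_to_tree (rtype, result);

Note that the fits check only looks at the final value against rtype;
it cannot detect a wrap inside widest_int itself when the element
precision is close to the widest supported mode, which is exactly the
concern discussed below.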

Thanks,
Richard
Let us, for the sake of argument, assume that this was common code rather than code in a particular port, because code in a particular port can know more about the environment than common code is allowed to.

My main point is that this code is in wide-int, not widest-int, because at this level the writer of this code actually wants to model what the target will do, so doing the adds in the target precision and testing overflow at every step is perfectly fine. But this loop CANNOT be written in a style where you test the overflow at the end, because if this is common code you cannot make any assumptions about the largest mode on the machine. If the buffer were 2x + n in size, then it would be reasonably safe to assume that the number of elements in the vector could be represented in an integer, and so you could wait till the end.

I think that my point (and I feel a little uncomfortable putting words in richi's mouth, but I believe this was his point early on) was that he thinks of widest-int as an infinite-precision representation. He was the one pushing for the entire rep to be done with a large internal (or perhaps unbounded) rep, because he felt it was more natural not to have to think about overflow. He wanted you to be able to chain a mult and a divide and not see the product get truncated before the divide was done. The rep that we have now really sucks with respect to this, because widest-int truncates if you are close to the largest precision on the machine and does not if you are small with respect to that.
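
As a tiny sketch of the kind of chain richi had in mind (a, b and c are
assumed INTEGER_CST trees, not code from the thread), the intent is
that the product stays exact before the division, which the current
widest-int only guarantees when the operands are well below the largest
supported precision:

  widest_int prod = wi::mul (wi::to_widest (a), wi::to_widest (b));
  widest_int quot = wi::div_trunc (prod, wi::to_widest (c), SIGNED);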

My other point is that while you think the example above is nice, the experience with double-int is contrary to this: people will handle (and test) the normal modes, and anyone trying to use large modes will die a terrible death of a thousand cuts.



