This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.

Re: [patch] new exec-charset testcase

From: Mark Mitchell <mark at codesourcery dot com>
To: Eric Christopher <echristo at redhat dot com>
Cc: Zack Weinberg <zack at codesourcery dot com>, gcc-patches at gcc dot gnu dot org, Mark Mitchell <mitchell at codesourcery dot com>
Date: Mon, 19 Apr 2004 20:02:13 -0700
Subject: Re: [patch] new exec-charset testcase
Organization: CodeSourcery, LLC
References: <1082411714.3349.16.camel@dzur.sfbay.redhat.com> <878ygr4lr8.fsf@egil.codesourcery.com> <1082415875.3349.26.camel@dzur.sfbay.redhat.com> <874qrf4jh1.fsf@egil.codesourcery.com> <1082418791.3349.29.camel@dzur.sfbay.redhat.com> <87zn9734ik.fsf@egil.codesourcery.com> <1082419568.3349.35.camel@dzur.sfbay.redhat.com> <87vfjv30q5.fsf@egil.codesourcery.com> <1082425892.4329.4.camel@dzur.sfbay.redhat.com> <87ekqj2wry.fsf@egil.codesourcery.com> <1082429249.4329.6.camel@dzur.sfbay.redhat.com>

Eric Christopher wrote:

>>> Any other thoughts?
>>
>> Well, pushing cpp_tokens during a tentative parse would be a good
>> thing, I think, it would reduce backtracking overhead.  I don't have
>> any idea how hard it would be though.  Mark, can you comment?
>
> This is my preference too :)

I'm not sure what you're asking. I think you're asking "can we save cpp_tokens instead of cp_tokens?" The answer is definitely negative. There are situations where we must save the trees that we have generated for both correctness and efficiency reasons. A good example is template-ids:

S<int>

We in fact collapse that set of tokens into a single pseudo-token after seeing it the first time, remembering the associated *_TYPE node. That's important because if the type was bogus we only want to complain once. And, it's a big speed optimization; it avoids doing lookup again. I measured this and it was a big win.
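To make the collapsing concrete, here is a rough sketch of the idea, not the actual parser code. Every name in it (Token, TemplateIdLexer, lookup_template, the integer type_id standing in for a *_TYPE node) is invented for illustration. The first parse of the sequence does the lookup and replaces the four tokens with one pseudo-token; a re-parse after rollback just reads the cached result.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical token kinds; this is not GCC's cp_token layout.
enum class Kind { Name, Less, Greater, TemplateId };

struct Token {
    Kind kind;
    std::string text;  // spelling of the token
    int type_id;       // stand-in for the cached *_TYPE node; -1 if none
};

struct TemplateIdLexer {
    std::vector<Token> tokens;
    std::size_t pos = 0;
    int lookups = 0;      // how many times the expensive lookup ran
    int diagnostics = 0;  // how many errors were reported

    // Stand-in for name lookup: only "S" is a known template here.
    // Returns 1 for a valid type, 0 for an erroneous one.
    int lookup_template(const std::string& name) {
        ++lookups;
        if (name != "S") {
            ++diagnostics;  // complain about the bogus type
            return 0;
        }
        return 1;
    }

    // Parse `Name < Name >` at pos, or consume an already collapsed
    // pseudo-token. Returns the type id, or -1 if nothing matched.
    int parse_template_id() {
        if (pos >= tokens.size()) return -1;
        if (tokens[pos].kind == Kind::TemplateId)
            return tokens[pos++].type_id;  // cached: no lookup, no diagnostic
        if (pos + 4 > tokens.size()
            || tokens[pos].kind != Kind::Name
            || tokens[pos + 1].kind != Kind::Less
            || tokens[pos + 2].kind != Kind::Name
            || tokens[pos + 3].kind != Kind::Greater)
            return -1;
        const std::string name = tokens[pos].text;
        const std::string arg = tokens[pos + 2].text;
        const int id = lookup_template(name);
        // Collapse the four tokens into one pseudo-token carrying the result.
        const std::ptrdiff_t at = static_cast<std::ptrdiff_t>(pos);
        tokens.erase(tokens.begin() + at + 1, tokens.begin() + at + 4);
        tokens[pos] = Token{Kind::TemplateId, name + "<" + arg + ">", id};
        ++pos;
        return id;
    }
};
```

Rolling back here just means resetting pos to a saved index. Because the collapsed token stays in the buffer, the second attempt neither repeats the lookup nor repeats the complaint, which is exactly what would be lost if only raw preprocessor tokens were saved.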

In the particular case of string constants you could probably have cp_lexer_rollback_tokens mark any STRING_CSTs rolled back as dirty, and recompute their values when you encounter them again. But, you don't actually want to do that. It will be expensive. If you do that, do it only in the case where the execution character set is not the same as the host character set; we shouldn't penalize the typical case. A better approach would probably be to give the lexer enough smarts to know when it had to do the translation.
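For what the dirty-marking scheme might look like, here is a minimal sketch under loud assumptions: StrToken, StringLexer, to_exec_charset and the charsets_differ flag are all made up for this example, and the "translation" is just ASCII upper-casing standing in for a real character set conversion. It is not how cp_lexer_rollback_tokens works today.

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical string-constant token; not GCC's representation.
struct StrToken {
    std::string source;  // spelling as written in the source file
    std::string value;   // value computed the last time it was consumed
    bool dirty;          // true if value must be (re)computed
};

// Toy stand-in for host-to-execution charset conversion.
static std::string to_exec_charset(const std::string& s) {
    std::string out = s;
    for (char& c : out)
        c = static_cast<char>(std::toupper(static_cast<unsigned char>(c)));
    return out;
}

struct StringLexer {
    std::vector<StrToken> tokens;
    std::size_t pos = 0;
    bool charsets_differ;  // assumed to be known up front
    int conversions = 0;   // how many times a value was computed

    explicit StringLexer(bool differ) : charsets_differ(differ) {}

    // Consume the next string token. `translate` says whether this context
    // wants the execution-charset value or the untranslated spelling.
    const std::string& consume(bool translate) {
        StrToken& t = tokens[pos++];
        if (t.dirty) {
            ++conversions;
            t.value = (translate && charsets_differ)
                          ? to_exec_charset(t.source)
                          : t.source;
            t.dirty = false;
        }
        return t.value;
    }

    // Roll back to a saved position. Tokens are marked dirty only when the
    // two charsets differ, so the typical case pays nothing extra.
    void rollback(std::size_t saved) {
        if (charsets_differ)
            for (std::size_t i = saved; i < pos; ++i)
                tokens[i].dirty = true;
        pos = saved;
    }
};
```

When the charsets match, the conversion is the identity, so the cached value stays valid across rollbacks and is never recomputed. When they differ, each rollback forces a recomputation, which is the cost that makes the smarter-lexer approach look more attractive.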

--
Mark Mitchell
CodeSourcery, LLC
mark@codesourcery.com
