This is the mail archive of the gcc@gcc.gnu.org mailing list for the GCC project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]

Re: thoughts on martin's proposed patch for GCC and UTF-8

To: eggert at twinsun dot com
Subject: Re: thoughts on martin's proposed patch for GCC and UTF-8
From: Martin von Loewis <martin at mira dot isdn dot cs dot tu-berlin dot de>
Date: Fri, 18 Dec 1998 10:14:14 +0100
CC: rms at gnu dot org, gcc2 at gnu dot org, egcs at cygnus dot com
References: <199812100702.XAA26400@cygnus.com> <199812120323.TAA10442@shade.twinsun.com> <199812121018.LAA02558@mira.isdn.cs.tu-berlin.de> <199812131423.HAA23320@wijiji.santafe.edu> <199812131924.UAA00216@mira.isdn.cs.tu-berlin.de> <199812141022.DAA26126@wijiji.santafe.edu> <199812151846.KAA13116@shade.twinsun.com> <199812180210.TAA19043@wijiji.santafe.edu> <199812180540.VAA02653@shade.twinsun.com>

> Here is another possibility.  For identifier chars that can be
> expressed as multibyte chars in the locale's encoding, use those
> chars; otherwise, use `.uxxxx' or `.Uxxxxxxxx' where xxxx (or
> xxxxxxxx) are the Unicode position.

[...]

> I don't know how this would affect C++ mangling, though.

This won't work for C++. Consider

class Foo{
        static int u1234;
};

This currently compiles into _3Foo.u1234. With your proposal,
_3Foo.u1234.u1234 could either be Foo\u1234::u1234, or
Foo::u1234\u1234.

If people don't like converting Unicode identifiers to UTF-8 always, I
drop that proposal with regrets. It would work on assemblers that
support 8bit in identifiers, it would work for C and C++, and it would
work independently from compile-time or runtime settings (identifiers
are *not* effected by the users locale whatsoever).

Anyway, I drop that proposal. There is a proposed mangling for \u
escapes in C++ in gxxint.texi. It works for all cases and for all
assemblers, giving plain text in identifiers. It doesn't work for C,
but after this discussion, I guess I don't care about that anymore.
Somebody just tell me how it should work for C.

Kind regrets,
Martin

References:
- Re: thoughts on martin's proposed patch for GCC and UTF-8
  - From: Per Bothner
- Re: thoughts on martin's proposed patch for GCC and UTF-8
  - From: Paul Eggert
- Re: thoughts on martin's proposed patch for GCC and UTF-8
  - From: Martin von Loewis
- Re: thoughts on martin's proposed patch for GCC and UTF-8
  - From: Richard Stallman
- Re: thoughts on martin's proposed patch for GCC and UTF-8
  - From: Martin von Loewis
- Re: thoughts on martin's proposed patch for GCC and UTF-8
  - From: Richard Stallman
- Re: thoughts on martin's proposed patch for GCC and UTF-8
  - From: Paul Eggert
- Re: thoughts on martin's proposed patch for GCC and UTF-8
  - From: Richard Stallman
- Re: thoughts on martin's proposed patch for GCC and UTF-8
  - From: Paul Eggert

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]