This is the mail archive of the gcc@gcc.gnu.org mailing list for the GCC project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: Query on UTF-32 encodings for letters

From: Robert Dewar <dewar at adacore dot com>
To: Florian Weimer <fw at deneb dot enyo dot de>
Cc: Paul Koning <pkoning at equallogic dot com>, joseph at codesourcery dot com,gcc at gcc dot gnu dot org
Date: Mon, 17 Jan 2005 17:10:46 -0500
Subject: Re: Query on UTF-32 encodings for letters
References: <41E3E28D.6050506@adacore.com> <Pine.LNX.4.61.0501161942070.29730@digraph.polyomino.org.uk> <41EACFCA.7070506@adacore.com> <16875.56569.286000.776285@gargle.gargle.HOWL> <41EC0798.5020303@adacore.com> <16876.2932.32855.8813@gargle.gargle.HOWL> <41EC0D78.50201@adacore.com> <16876.4226.859818.910262@gargle.gargle.HOWL> <41EC1378.40408@adacore.com> <16876.5514.532258.388428@gargle.gargle.HOWL> <41EC320E.1010407@adacore.com> <87wtubg1ch.fsf@deneb.enyo.de>

Florian Weimer wrote:

Yes, and that's fine, both lower case i with dot and lower case i
without dot fold upper case to capital I (without dot), and so all three
are equivalent in identifiers.
No, this is not the way Turkish case conversion works. Turkish has a rule LATIN SMALL LETTER I -> LATIN CAPITAL LETTER I WITH DOT ABOVE (U+0130).


Maybe not, but I am implementing Ada, and not Turkish :-)
And the Ada rules map as I quoted. Ours not to reason why ....

I guess the point is that since we know that latin small letter i
must map to latin capital letter i (with no dot) in Ada (because
obviously that's reasonable and we cannot have case conversion in
identifiers be locale dependent. When it comes to the dotless I,
it would indeed be bizarre to map it to a dotted capital I, so they
end up being mapped the same. Makes sense, given the requirement
that case conversion (or more basically program legality) be
locale independent.

References:
- Query on UTF-32 encodings for letters
  - From: Robert Dewar
- Re: Query on UTF-32 encodings for letters
  - From: Joseph S. Myers
- Re: Query on UTF-32 encodings for letters
  - From: Robert Dewar
- Re: Query on UTF-32 encodings for letters
  - From: Paul Koning
- Re: Query on UTF-32 encodings for letters
  - From: Robert Dewar
- Re: Query on UTF-32 encodings for letters
  - From: Paul Koning
- Re: Query on UTF-32 encodings for letters
  - From: Robert Dewar
- Re: Query on UTF-32 encodings for letters
  - From: Paul Koning
- Re: Query on UTF-32 encodings for letters
  - From: Robert Dewar
- Re: Query on UTF-32 encodings for letters
  - From: Paul Koning
- Re: Query on UTF-32 encodings for letters
  - From: Robert Dewar
- Re: Query on UTF-32 encodings for letters
  - From: Florian Weimer

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]