This is the mail archive of the gcc@gcc.gnu.org mailing list for the GCC project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: Query on UTF-32 encodings for letters


Tom Tromey wrote:
"Robert" == Robert Dewar <dewar@adacore.com> writes:


Robert> though the case conversion one is pretty huge -- that's
Robert> not one that C and Java have to worry about

FWIW, libgcj does need this, since String and Character both have case
conversion methods.  This is mostly table-driven, using tables derived
from the Unicode tables via a converter program, but there are some
special cases in String for weird things like esset and dotless "i" in
the Turkish locale.  See libjava/scripts and
libjava/java/lang/{*String*,*Character*}.

Tom

Well it is possible that libgcj does exactly the right thing for Ada over all planes, but on the other hand, it is not that difficult to do exactly what is required for Ada, and we definitely do not need the fold to lower case (which is where problems occur), and Ada does NOT allow special casing of esset.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]