This is the mail archive of the java-patches@gcc.gnu.org mailing list for the Java project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: GCJ manual changed


Joseph S. Myers wrote:
>>>You say "UTF", which UTF (there are several, e.g., UTF-8, UTF-16,
>>>UTF-32)? 
>>>
>>It should be obvious to a Java programmer. This is the same text as
>>the old CNI manual.
>>
> 
> You're using a term from another field.  When doing so, you should do it
> in a way that is correct in the context of that other field rather than
> making an incorrect usage that will be understood by those Java
> programmers who aren't also too experienced in Unicode for the incorrect
> usage to grate on them.  Elsewhere you've explicitly named UTF-8.

Java does not use UTF-8, but something similar.  The differences are
that '\u0' is encoded using two bytes, and that code points about 2^16
are encoding by individually encoding each surrogate pairs.  (Java
does not handle code points above 2^16 well in any case.)
-- 
	--Per Bothner
per@bothner.com   http://www.bothner.com/per/


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]