GCJ manual changed

Joseph S. Myers jsm28@cam.ac.uk
Wed Jan 30 17:05:00 GMT 2002


On 30 Jan 2002, Tom Tromey wrote:

> I don't recall seeing text to that effect in anything I've read.  And
> I'd be willing to bet that at least some versions of the JDK from Sun
> don't reject such sequences.  For that matter, we don't reject such
> sequences.  It's unclear whether we should change our implementation
> here; this is yet another under-specified aspect of Java.

We ought to reject them (unless it is specifically specified otherwise).  
Both the Unicode and ISO 10646 standards were changed to disallow
interpretation (not just generation) of such sequences as representing the
characters they would appear to represent when a naive UTF-8 decoder is
used, because of the security issues associated with multiple
representations.

If there is some way of influencing Java standards, it would be worthwhile
to represent that the standards should be changed to make it clear such
over-long sequences must be rejected.

-- 
Joseph S. Myers
jsm28@cam.ac.uk



More information about the Java-patches mailing list