java/2319: invalid UTF-8 sequences should be rejected

Tom Tromey tromey@redhat.com
Mon Mar 19 09:16:00 GMT 2001


The following reply was made to PR java/2319; it has been noted by GNATS.

From: Tom Tromey <tromey@redhat.com>
To: "Joseph S. Myers" <jsm28@cam.ac.uk>
Cc: <gcc-gnats@gcc.gnu.org>, <gcc-bugs@gcc.gnu.org>
Subject: Re: java/2319: invalid UTF-8 sequences should be rejected
Date: 19 Mar 2001 10:19:18 -0700

 >>>>> "Joseph" == Joseph S Myers <jsm28@cam.ac.uk> writes:
 
 Joseph> Also note that the invalid sequences that should be rejected
 Joseph> include over-long sequences and UTF-8 encodings that would map
 Joseph> to values in the UTF-16 surrogate range.
 
 I agree, with the sole exception that I think we should accept the
 Java form of \0.  Java represents this as a two-byte sequence and it
 seems reasonable that a Java compiler would accept this form.
 
 Tom



More information about the Gcc-prs mailing list