java/2319: invalid UTF-8 sequences should be rejected

Joseph S. Myers jsm28@cam.ac.uk
Mon Mar 19 09:06:00 GMT 2001


The following reply was made to PR java/2319; it has been noted by GNATS.

From: "Joseph S. Myers" <jsm28@cam.ac.uk>
To: <tromey@redhat.com>
Cc: <gcc-gnats@gcc.gnu.org>,  <gcc-bugs@gcc.gnu.org>
Subject: Re: java/2319: invalid UTF-8 sequences should be rejected
Date: Mon, 19 Mar 2001 17:00:47 +0000 (GMT)

 On 19 Mar 2001 tromey@redhat.com wrote:
 
 > Currently the compiler accepts invalid UTF-8 sequences
 > when reading a file.  Instead we ought to diagnose
 > such sequences as errors.
 
 Also note that the invalid sequences that should be rejected include
 over-long sequences and UTF-8 encodings that would map to values in the
 UTF-16 surrogate range.
 
 http://www.cl.cam.ac.uk/~mgk25/unicode.html
 http://www.unicode.org/unicode/uni2errata/UTF-8_Corrigendum.html
 
 -- 
 Joseph S. Myers
 jsm28@cam.ac.uk
 



More information about the Gcc-prs mailing list