This is the mail archive of the gcc-bugs@gcc.gnu.org mailing list for the GCC project.
java/2319: invalid UTF-8 sequences should be rejected
- To: gcc-gnats at gcc dot gnu dot org
- Subject: java/2319: invalid UTF-8 sequences should be rejected
- From: tromey at redhat dot com
- Date: 19 Mar 2001 16:32:15 -0000
- Reply-To: tromey at redhat dot com
>Number: 2319
>Category: java
>Synopsis: invalid UTF-8 sequences should be rejected
>Confidential: no
>Severity: serious
>Priority: medium
>Responsible: unassigned
>State: open
>Class: sw-bug
>Submitter-Id: net
>Arrival-Date: Mon Mar 19 08:36:00 PST 2001
>Closed-Date:
>Last-Modified:
>Originator: Tom Tromey
>Release: unknown-1.0
>Organization:
>Environment:
>Description:
Currently the compiler accepts invalid UTF-8 sequences
when reading a source file. Instead it ought to diagnose
such sequences as errors.
Try compiling this Latin-1 encoded program with
--encoding=UTF-8 to see the problem:
public class Hello
{
  public static void main (String[] arguments)
  {
    System.out.println ("Liberté, égalité, fraternité !");
  }
}
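To illustrate why the bytes of this program are not valid UTF-8: in
Latin-1 the character é is the single byte 0xE9, which in UTF-8 is the
lead byte of a three-byte sequence and must be followed by two
continuation bytes (0x80 to 0xBF). Here it is followed by plain ASCII,
so a strict decoder should reject it. The following is only a minimal
sketch of such a strict check using the java.nio.charset API; it is not
the compiler's actual reader, and the class name Utf8Check and method
isValidUtf8 are hypothetical names chosen for this example.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, for illustration only.
class Utf8Check
{
  // Returns true only if the whole byte array is well-formed UTF-8.
  static boolean isValidUtf8 (byte[] bytes)
  {
    // REPORT makes the decoder throw instead of silently substituting
    // a replacement character for malformed input.
    CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder ()
      .onMalformedInput (CodingErrorAction.REPORT)
      .onUnmappableCharacter (CodingErrorAction.REPORT);
    try
      {
        decoder.decode (ByteBuffer.wrap (bytes));
        return true;
      }
    catch (CharacterCodingException e)
      {
        return false;
      }
  }

  public static void main (String[] arguments)
  {
    // Unicode escapes keep this file independent of its own encoding.
    String text = "Libert\u00e9, \u00e9galit\u00e9, fraternit\u00e9 !";
    byte[] latin1 = text.getBytes (StandardCharsets.ISO_8859_1);
    byte[] utf8 = text.getBytes (StandardCharsets.UTF_8);
    System.out.println ("Latin-1 bytes valid as UTF-8: " + isValidUtf8 (latin1));
    System.out.println ("UTF-8 bytes valid as UTF-8: " + isValidUtf8 (utf8));
  }
}
```

Run as written, the first line should report false and the second
true. A reader that behaves this way would turn the Latin-1 file above
into a diagnosable error rather than accepting it.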
>How-To-Repeat:
>Fix:
>Release-Note:
>Audit-Trail:
>Unformatted: