This is the mail archive of the
gcc-bugs@gcc.gnu.org
mailing list for the GCC project.
ada/6726: -gnaty miscounts characters in UTF-8 source text
- From: starner at okstate dot edu
- To: gcc-gnats at gcc dot gnu dot org
- Date: 19 May 2002 22:02:28 -0000
- Subject: ada/6726: -gnaty miscounts characters in UTF-8 source text
- Reply-to: starner at okstate dot edu
>Number: 6726
>Category: ada
>Synopsis: -gnaty miscounts characters in UTF-8 source text
>Confidential: no
>Severity: serious
>Priority: low
>Responsible: unassigned
>State: open
>Class: rejects-legal
>Submitter-Id: net
>Arrival-Date: Sun May 19 15:06:01 PDT 2002
>Closed-Date:
>Last-Modified:
>Originator: starner@okstate.edu
>Release: gcc-3.1
>Organization:
>Environment:
Debian woody; ix86-linux
>Description:
-gnaty only permits 80 characters per line. However, if those characters are encoded in UTF-8, they are miscounted, and shorter lines are rejected. The Cherokee_String line in the attached file is 71 characters long, but 99 bytes long, and is rejected.
>How-To-Repeat:
Run gnatmake -gnaty test_strings.ads. It will return a "(style) This line is too long." error, when in fact this line is not too long.
>Fix:
>Release-Note:
>Audit-Trail:
>Unformatted:
----gnatsweb-attachment----
Content-Type: text/plain; name="test_strings.ads"
Content-Disposition: inline; filename="test_strings.ads"
package Test_Strings is
Cherokee_String : constant Wide_String := "ᎠᏆᏖᏁᏙᎽ ᎠᎽᏍᏉᏟ ᏦᏰᎾ ";
end Test_Strings;