ada/6726: -gnaty miscounts characters in UTF-8 source text

starner@okstate.edu starner@okstate.edu
Sun May 19 15:06:00 GMT 2002


>Number:         6726
>Category:       ada
>Synopsis:       -gnaty miscounts characters in UTF-8 source text
>Confidential:   no
>Severity:       serious
>Priority:       low
>Responsible:    unassigned
>State:          open
>Class:          rejects-legal
>Submitter-Id:   net
>Arrival-Date:   Sun May 19 15:06:01 PDT 2002
>Closed-Date:
>Last-Modified:
>Originator:     starner@okstate.edu
>Release:        gcc-3.1
>Organization:
>Environment:
Debian woody; ix86-linux
>Description:
-gnaty permits at most 80 characters per line. However, when the source text is encoded in UTF-8, the characters are miscounted: the check appears to count bytes rather than characters, so lines that are within the limit are rejected. The Cherokee_String line in the attached file is 71 characters long but 99 bytes long, and it is rejected.
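
The two counts quoted above can be reproduced outside the compiler. The following is a minimal Python sketch, not GNAT code: it rebuilds the Cherokee_String line from the attachment (with seven trailing spaces inside the literal, as in the attached file) and compares its length in characters with its length in UTF-8 bytes. The names MAX_LINE_LENGTH, LINE and line_lengths are hypothetical, and the limit of 80 is taken from this report rather than from the GNAT sources.

```python
# Illustration only: compare the character count and the UTF-8 byte count
# of the Cherokee_String line from test_strings.ads.

MAX_LINE_LENGTH = 80  # limit as stated in this report (assumption)

# The line is rebuilt piece by piece so the trailing spaces are explicit.
LINE = (
    '   Cherokee_String : constant Wide_String := "'
    + "ᎠᏆᏖᏁᏙᎽ ᎠᎽᏍᏉᏟ ᏦᏰᎾ"
    + " " * 7
    + '";'
)


def line_lengths(line):
    """Return (code points, UTF-8 bytes) for one source line."""
    return len(line), len(line.encode("utf-8"))


chars, octets = line_lengths(LINE)
print(f"characters: {chars}, bytes: {octets}")
print("too long by characters:", chars > MAX_LINE_LENGTH)
print("too long by bytes:", octets > MAX_LINE_LENGTH)
```

Each of the 14 Cherokee letters occupies three bytes in UTF-8, which accounts for the 28-byte difference between the two counts. A check that counts bytes would therefore reject the line, while one that counts characters would accept it.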
>How-To-Repeat:
Run gnatmake -gnaty test_strings.ads. It reports a "(style) This line is too long." error for the Cherokee_String line, although that line is only 71 characters long and therefore within the limit.
>Fix:

>Release-Note:
>Audit-Trail:
>Unformatted:
----gnatsweb-attachment----
Content-Type: text/plain; name="test_strings.ads"
Content-Disposition: inline; filename="test_strings.ads"

package Test_Strings is

   Cherokee_String : constant Wide_String := "ᎠᏆᏖᏁᏙᎽ ᎠᎽᏍᏉᏟ ᏦᏰᎾ       ";

end Test_Strings;


