Bug 79183 - Hard coded plurals in gimple-ssa-sprintf.c:2050
Summary: Hard coded plurals in gimple-ssa-sprintf.c:2050
Status: NEW
Alias: None
Product: gcc
Classification: Unclassified
Component: translation (show other bugs)
Version: 7.0
: P3 trivial
Target Milestone: ---
Assignee: Not yet assigned to anyone
URL:
Keywords: diagnostic, easyhack
Depends on:
Blocks: trivial_translation_nits
  Show dependency treegraph
 
Reported: 2017-01-22 16:45 UTC by Frederic Marchal
Modified: 2021-10-28 02:28 UTC (History)
4 users (show)

See Also:
Host:
Target:
Build:
Known to work:
Known to fail:
Last reconfirmed: 2017-02-08 00:00:00


Attachments
gcc9-pr79183.patch (560 bytes, patch)
2019-04-17 19:45 UTC, Jakub Jelinek
Details | Diff

Note You need to log in before you can comment on or make changes to this bug.
Description Frederic Marchal 2017-01-22 16:45:37 UTC
gimple-ssa-sprintf.c contains two hard coded plurals starting from line 2050:

      const char* fmtstr
        = (info.bounded
           ? (1 < fmtres.range.min
          ? G_("%<%.*s%> directive output truncated while writing "
               "%wu bytes into a region of size %wu")
          : G_("%<%.*s%> directive output truncated while writing "
               "%wu byte into a region of size %wu"))
           : (1 < fmtres.range.min
          ? G_("%<%.*s%> directive writing %wu bytes "
               "into a region of size %wu")
          : G_("%<%.*s%> directive writing %wu byte "
               "into a region of size %wu")));

Beside, the plural rules are wrong for English. Plural should be used if fmtres.range.min is zero.
Comment 1 Frederic Marchal 2017-01-22 17:05:07 UTC
Yet another hard coded plural in the same file at line 2286:

    inform (callloc,
        (nbytes + exact == 1
         ? G_("format output %wu byte into a destination of size %wu")
         : G_("format output %wu bytes into a destination of size %wu")),
        nbytes + exact, info.objsize);
Comment 2 Martin Sebor 2017-02-08 21:37:44 UTC
I don't see anything wrong with the messages.  The plural form is used  when the byte count is greater than 1.  None of the messages should be issued when the counter is zero because writing no bytes cannot lead to truncation.  If you have a test case to the contrary please reopen the bug and provide the test case.
Comment 3 Joseph S. Myers 2017-02-08 22:38:29 UTC
Messages involving plurals need to go through ngettext (via warning_n etc. functions) because some languages have more than just singular and plural forms (e.g. separate forms to use when n = 2 from n > 2).
Comment 4 Joseph S. Myers 2017-02-08 22:44:41 UTC
See hr.po, ru.po, sr.po and uk.po for GCC translations involving more than just singular and plural (look at the Plural-Forms line in the .po file).
Comment 5 Martin Sebor 2017-02-08 23:01:19 UTC
I see, thanks for the pointer.  Looks like warning_n() is used in gcc/ipa-devirt.c and error_n() in gcc/cp/pt.c.  gimple-ssa-sprintf.c contains between 7 and 23 instances of these "%wu byte" vs "%wu bytes" forms so it will be a fun exercise to fix.
Comment 6 Roland Illig 2019-04-17 19:24:35 UTC
Also in gimple-ssa-sprintf.c:

    inform (callloc,
	    (min == 1
	     ? G_("%qE output %wu byte")
	     : G_("%qE output %wu bytes")),
	    info.func, min);

    inform (callloc,
	    (min == 1
	     ? G_("%qE output %wu byte into a destination of size %wu")
	     : G_("%qE output %wu bytes into a destination of size "
		  "%wu")),
	    info.func, min, info.objsize);
Comment 7 Jakub Jelinek 2019-04-17 19:45:57 UTC
Created attachment 46188 [details]
gcc9-pr79183.patch

Untested fix for those two cases.
Comment 8 Jakub Jelinek 2019-04-18 10:28:53 UTC
Author: jakub
Date: Thu Apr 18 10:28:21 2019
New Revision: 270438

URL: https://gcc.gnu.org/viewcvs?rev=270438&root=gcc&view=rev
Log:
	PR translation/79183
	* gimple-ssa-sprintf.c (format_directive): Use inform_n instead of
	inform where appropriate.

Modified:
    trunk/gcc/ChangeLog
    trunk/gcc/c/c-decl.c
    trunk/gcc/cp/decl.c
    trunk/gcc/gimple-ssa-sprintf.c
Comment 9 Roland Illig 2019-04-23 21:59:14 UTC
Is there already someone who wants to fix the remaining messages?

Jakub, you fixed some of them already in https://gcc.gnu.org/viewcvs?rev=258154&root=gcc&view=rev in March 2018.

There are still some messages that use fmtwarn instead of fmtwarn_n and still contain "%wu bytes" or its close relative "%wu or more bytes".

Another even trickier message is "between %wu and %wu bytes". This one uses a range of numbers, and in Arabic the rules for the correct word form are quite numerous: https://www.unicode.org/cldr/charts/33/supplemental/language_plural_rules.html#ar

If GCC wants to be the project demonstrating best practices in translation, it should even handle this case correctly. I'm not sure though whether gettext supports this at all.

Current state:
#c0 is fixed
#c1 is fixed in 2 out of 3 cases
#c6 is fixed