[Bug fortran/78534] Use a larger integer type for character lengths on 64-bit targets

jb at gcc dot gnu.org gcc-bugzilla@gcc.gnu.org
Fri Jan 13 17:06:00 GMT 2017


https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78534

--- Comment #18 from Janne Blomqvist <jb at gcc dot gnu.org> ---
Author: jb
Date: Fri Jan 13 17:05:48 2017
New Revision: 244448

URL: https://gcc.gnu.org/viewcvs?rev=244448&root=gcc&view=rev
Log:
PR 78534 Change character length from int to size_t

In order to handle large character lengths on (L)LP64 targets, switch
the GFortran character length from an int to a size_t.

This is an ABI change, as procedures with character arguments take
hidden arguments with the character length.
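
To illustrate the ABI impact: C code that calls a Fortran procedure
with a character dummy argument must pass the hidden length using the
new type. The following is a minimal sketch, not code from the patch.
A C function stands in for the Fortran procedure, and the name
count_blanks_ and its behaviour are hypothetical. Only the type of the
trailing length argument reflects this change.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for a Fortran procedure such as
     integer function count_blanks (s)
       character(len=*) :: s
   The hidden trailing argument carries the character length.  Before
   this change it was an int; with this change it is a size_t.  */
int
count_blanks_ (const char *s, size_t s_len)
{
  int n = 0;

  assert (s != NULL || s_len == 0);

  /* Fortran strings are not NUL-terminated, so only s_len bounds the loop.  */
  for (size_t i = 0; i < s_len; i++)
    if (s[i] == ' ')
      n++;

  return n;
}
```

A caller written against the old ABI would pass an int in the last
position, which is why code that declares such prototypes by hand may
need updating.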

I also changed the _size member in vtables from int to size_t, as
there were some cases where character lengths and sizes were
apparently mixed up and caused regressions otherwise. Although I
haven't tested it, this might enable very large derived types as well.

Also, because some places in the frontend use negative character
lengths as special flag values, the frontend handles the character
length as a signed variable of the same size as a size_t, although in
the runtime library it really is size_t.
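
The signed-in-frontend, unsigned-in-runtime split can be sketched as
follows. This is an illustration under assumptions, not the frontend
code: charlen_signed, CHARLEN_FLAG, and both helpers are hypothetical
names, and ptrdiff_t is used as a signed type that has the same size
as size_t on common targets.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for the frontend's signed length type.  */
typedef ptrdiff_t charlen_signed;

/* Hypothetical flag value; any negative length is treated as a flag.  */
#define CHARLEN_FLAG ((charlen_signed) -1)

/* Return nonzero if LEN is a special flag rather than a real length.  */
int
charlen_is_flag (charlen_signed len)
{
  return len < 0;
}

/* Convert a real (non-flag) length to the unsigned runtime type.  */
size_t
charlen_to_runtime (charlen_signed len)
{
  assert (!charlen_is_flag (len));
  return (size_t) len;
}
```

Keeping the flag test separate from the conversion means a negative
value never reaches the unsigned type, where it would wrap to a huge
length.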

I haven't changed the character length variables for the co-array
intrinsics, as this is something that may need to be synchronized with
OpenCoarrays.

This is v4 of the patch. v3 was applied but had to be reverted due to
breaking bootstrap. The fix is in resolve.c:resolve_charlen, where
it's necessary to check that an expression is constant before using
mpz_sgn.

Overview of v3 of the patch: All the issues pointed out by FX's review
of v2 have been fixed. In particular, there are now new functions
gfc_mpz_get_hwi and gfc_mpz_set_hwi, similar to the GMP functions
mpz_get_si and mpz_set_si, except that they get/set a HOST_WIDE_INT
instead of a long value. Similarly, gfc_get_int_expr now takes a
HOST_WIDE_INT instead of a long, gfc_extract_long is replaced by
gfc_extract_hwi. Also, the preliminary work to handle
gfc_charlen_type_node being unsigned has been removed.
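
As background for the move from long to HOST_WIDE_INT: on LLP64
targets long is only 32 bits wide, so it cannot hold every character
length that fits in a 64-bit size_t. The sketch below shows the range
check this implies. It does not use GMP or any GCC internals;
wide_int_t is a hypothetical stand-in for HOST_WIDE_INT, assumed here
to be 64 bits, and fits_long is a hypothetical helper.

```c
#include <limits.h>
#include <stdint.h>

/* Hypothetical stand-in for HOST_WIDE_INT, assumed to be 64 bits.  */
typedef int64_t wide_int_t;

/* Return nonzero if V can be represented in a long on this host.
   On LP64 hosts this holds for every 64-bit value; on LLP64 hosts,
   where long is 32 bits, it fails for large lengths.  */
int
fits_long (wide_int_t v)
{
  return v >= LONG_MIN && v <= LONG_MAX;
}
```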

Regtested on x86_64-pc-linux-gnu and i686-pc-linux-gnu.

frontend:

2017-01-13  Janne Blomqvist  <jb@gcc.gnu.org>

        PR fortran/78534
        PR fortran/66310
        * class.c (gfc_find_derived_vtab): Use gfc_size_kind instead of
        hardcoded kind.
        (find_intrinsic_vtab): Likewise.
        * expr.c (gfc_get_character_expr): Length parameter of type
        gfc_charlen_t.
        (gfc_get_int_expr): Value argument of type HOST_WIDE_INT.
        (gfc_extract_hwi): New function.
        (simplify_const_ref): Make string_len of type gfc_charlen_t.
        (gfc_simplify_expr): Use HOST_WIDE_INT for substring refs.
        * gfortran.h (gfc_mpz_get_hwi): New prototype.
        (gfc_mpz_set_hwi): Likewise.
        (gfc_charlen_t): New typedef.
        (gfc_expr): Use gfc_charlen_t for character lengths.
        (gfc_size_kind): New extern variable.
        (gfc_extract_hwi): New prototype.
        (gfc_get_character_expr): Use gfc_charlen_t for character length.
        (gfc_get_int_expr): Use HOST_WIDE_INT type for value argument.
        * iresolve.c (gfc_resolve_repeat): Pass string length directly without
        temporary, use gfc_charlen_int_kind.
        * match.c (select_intrinsic_set_tmp): Use HOST_WIDE_INT for charlen.
        * misc.c (gfc_mpz_get_hwi): New function.
        (gfc_mpz_set_hwi): New function.
        * module.c (atom_int): Change type from int to HOST_WIDE_INT.
        (parse_integer): Don't complain about large integers.
        (write_atom): Use HOST_WIDE_INT for integers.
        (mio_integer): Handle integer type mismatch.
        (mio_hwi): New function.
        (mio_intrinsic_op): Use HOST_WIDE_INT.
        (mio_array_ref): Likewise.
        (mio_expr): Likewise.
        * resolve.c (resolve_select_type): Use HOST_WIDE_INT for charlen,
        use snprintf.
        (resolve_substring_charlen): Use gfc_charlen_int_kind.
        (resolve_charlen): Use mpz_sgn to determine sign.
        * simplify.c (gfc_simplify_repeat): Use HOST_WIDE_INT/gfc_charlen_t
        instead of long.
        * target-memory.c (size_character): Length argument of type
        gfc_charlen_t.
        (gfc_encode_character): Likewise.
        (gfc_interpret_character): Use gfc_charlen_t.
        * target-memory.h (gfc_encode_character): Modify prototype.
        * trans-array.c (get_array_ctor_var_strlen): Use
        gfc_conv_mpz_to_tree_type.
        * trans-const.c (gfc_conv_mpz_to_tree_type): New function.
        * trans-const.h (gfc_conv_mpz_to_tree_type): New prototype.
        * trans-expr.c (gfc_class_len_or_zero_get): Build const of type
        gfc_charlen_type_node.
        (gfc_conv_intrinsic_to_class): Use gfc_charlen_int_kind instead of
        4, fold_convert to correct type.
        (gfc_conv_class_to_class): Build const of type size_type_node for
        size.
        (gfc_copy_class_to_class): Likewise.
        (gfc_conv_string_length): Use same type in expression.
        (gfc_conv_substring): Likewise, use HOST_WIDE_INT for charlen.
        (gfc_conv_string_tmp): Make sure len is of the right type.
        (gfc_conv_concat_op): Use same type in expression.
        (gfc_conv_procedure_call): Likewise.
        (alloc_scalar_allocatable_for_subcomponent_assignment):
        fold_convert to right type.
        (gfc_trans_subcomponent_assign): Likewise.
        (trans_class_vptr_len_assignment): Build const of correct type.
        (gfc_trans_pointer_assignment): Likewise.
        (alloc_scalar_allocatable_for_assignment): fold_convert to right
        type in expr.
        (trans_class_assignment): Build const of correct type.
        * trans-intrinsic.c (gfc_conv_associated): Likewise.
        (gfc_conv_intrinsic_repeat): Do calculation in sizetype.
        * trans-io.c (gfc_build_io_library_fndecls): Use
        gfc_charlen_type_node for character lengths.
        * trans-stmt.c (gfc_trans_label_assign): Build const of
        gfc_charlen_type_node.
        (gfc_trans_character_select): Likewise.
        (gfc_trans_allocate): Likewise, don't typecast strlen result.
        (gfc_trans_deallocate): Don't typecast strlen result.
        * trans-types.c (gfc_size_kind): New variable.
        (gfc_init_types): Determine gfc_charlen_int_kind and gfc_size_kind
        from size_type_node.

testsuite:

2017-01-13  Janne Blomqvist  <jb@gcc.gnu.org>

        PR fortran/78534
        PR fortran/66310
        * gfortran.dg/repeat_4.f90: Use integers of kind C_SIZE_T.
        * gfortran.dg/repeat_7.f90: New test for PR 66310.
        * gfortran.dg/scan_2.f90: Handle potential cast in assignment.
        * gfortran.dg/string_1.f90: Limit to ilp32 targets.
        * gfortran.dg/string_1_lp64.f90: New test.
        * gfortran.dg/string_3.f90: Limit to ilp32 targets.
        * gfortran.dg/string_3_lp64.f90: New test.

libgfortran:

2017-01-13  Janne Blomqvist  <jb@gcc.gnu.org>

        PR fortran/78534
        * intrinsics/args.c (getarg_i4): Use gfc_charlen_type.
        (get_command_argument_i4): Likewise.
        (get_command_i4): Likewise.
        * intrinsics/chmod.c (chmod_internal): Likewise.
        * intrinsics/env.c (get_environment_variable_i4): Likewise.
        * intrinsics/extends_type_of.c (struct vtype): Use size_t for size
        member.
        * intrinsics/gerror.c (gerror): Use gfc_charlen_type.
        * intrinsics/getlog.c (getlog): Likewise.
        * intrinsics/hostnm.c (hostnm_0): Likewise.
        * intrinsics/string_intrinsics_inc.c (string_len_trim): Rework to
        work if gfc_charlen_type is unsigned.
        (string_scan): Likewise.
        * io/transfer.c (transfer_character): Modify prototype.
        (transfer_character_write): Likewise.
        (transfer_character_wide): Likewise.
        (transfer_character_wide_write): Likewise.
        (transfer_array): Typecast to avoid signed-unsigned comparison.
        * io/unit.c (is_trim_ok): Use gfc_charlen_type.
        * io/write.c (namelist_write): Likewise.
        * libgfortran.h (gfc_charlen_type): Change typedef to size_t.
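
The string_len_trim entry above concerns loops that must keep working
when the length type is unsigned. The following sketch shows the
general technique and is not the libgfortran implementation;
len_trim_sketch is a hypothetical name.

```c
#include <stddef.h>

/* Return the length of S (LEN characters, not NUL-terminated) with
   trailing blanks removed.

   With a signed index one might write
     for (i = len - 1; i >= 0; i--)
   but with an unsigned type "i >= 0" is always true and "len - 1"
   wraps around when len is 0.  Counting the remaining length down
   and testing "len > 0" first avoids both problems.  */
size_t
len_trim_sketch (size_t len, const char *s)
{
  while (len > 0 && s[len - 1] == ' ')
    len--;

  return len;
}
```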


Added:
    trunk/gcc/testsuite/gfortran.dg/repeat_7.f90
    trunk/gcc/testsuite/gfortran.dg/string_1_lp64.f90
    trunk/gcc/testsuite/gfortran.dg/string_3_lp64.f90
Modified:
    trunk/gcc/fortran/ChangeLog
    trunk/gcc/fortran/class.c
    trunk/gcc/fortran/dump-parse-tree.c
    trunk/gcc/fortran/expr.c
    trunk/gcc/fortran/gfortran.h
    trunk/gcc/fortran/gfortran.texi
    trunk/gcc/fortran/iresolve.c
    trunk/gcc/fortran/match.c
    trunk/gcc/fortran/misc.c
    trunk/gcc/fortran/module.c
    trunk/gcc/fortran/resolve.c
    trunk/gcc/fortran/simplify.c
    trunk/gcc/fortran/target-memory.c
    trunk/gcc/fortran/target-memory.h
    trunk/gcc/fortran/trans-array.c
    trunk/gcc/fortran/trans-const.c
    trunk/gcc/fortran/trans-const.h
    trunk/gcc/fortran/trans-expr.c
    trunk/gcc/fortran/trans-intrinsic.c
    trunk/gcc/fortran/trans-io.c
    trunk/gcc/fortran/trans-stmt.c
    trunk/gcc/fortran/trans-types.c
    trunk/gcc/fortran/trans-types.h
    trunk/gcc/testsuite/ChangeLog
    trunk/gcc/testsuite/gfortran.dg/repeat_4.f90
    trunk/gcc/testsuite/gfortran.dg/scan_2.f90
    trunk/gcc/testsuite/gfortran.dg/string_1.f90
    trunk/gcc/testsuite/gfortran.dg/string_3.f90
    trunk/libgfortran/ChangeLog
    trunk/libgfortran/intrinsics/args.c
    trunk/libgfortran/intrinsics/chmod.c
    trunk/libgfortran/intrinsics/env.c
    trunk/libgfortran/intrinsics/extends_type_of.c
    trunk/libgfortran/intrinsics/gerror.c
    trunk/libgfortran/intrinsics/getlog.c
    trunk/libgfortran/intrinsics/hostnm.c
    trunk/libgfortran/intrinsics/string_intrinsics_inc.c
    trunk/libgfortran/io/transfer.c
    trunk/libgfortran/io/unit.c
    trunk/libgfortran/io/write.c
    trunk/libgfortran/libgfortran.h

