This is the mail archive of the gcc-patches@gcc.gnu.org mailing list for the GCC project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

[0/67] Add wrapper classes for machine_modes

From: Richard Sandiford <richard dot sandiford at arm dot com>
To: gcc-patches at gcc dot gnu dot org
Date: Fri, 09 Dec 2016 12:48:01 +0000
Subject: [0/67] Add wrapper classes for machine_modes
Authentication-results: sourceware.org; auth=none

This series includes most of the changes in group C from:

    https://gcc.gnu.org/ml/gcc/2016-11/msg00033.html

The idea is to add wrapper classes around machine_mode_enum
for specific groups of modes, such as scalar integers, scalar floats,
complex values, etc.  This has two main benefits: one specific to SVE
and one not.

The SVE-specific benefit is that it helps to introduce the concept
of variable-length vectors.  To do that we need to change the size
of a vector mode from being a known compile-time constant to being
(possibly) a run-time invariant.  We then need to do the same for
unconstrained machine_modes, which might or might not be vectors.
Introducing these new constrained types means that we can continue
to treat them as having a constant size.

The other benefit is that it uses static type checking to enforce
conditions that are easily forgotten otherwise.  The most common
sources of problems seem to be:

(a) using VOIDmode or BLKmode where a scalar integer was expected
    (e.g. when getting the number of bits in the value).

(b) simplifying vector operations in ways that only make sense for
    scalars.

The series helps with both of these, although we don't get the full
benefit of (b) until variable-sized modes are introduced.

I know of three specific cases in which the static type checking
forced fixes for things that turned out to be real bugs (although
we didn't know that at the time, otherwise we'd have posted patches).
They were later fixed for trunk by:

  https://gcc.gnu.org/ml/gcc-patches/2016-07/msg01783.html
  https://gcc.gnu.org/ml/gcc-patches/2016-11/msg02983.html
  https://gcc.gnu.org/ml/gcc-patches/2016-11/msg02896.html

The group C patches in ARM/sve-branch did slow compile time down a little.
I've since taken steps to avoid that:

- Make the tailcall pass handle aggregate parameters and return values
  (already in trunk).

- Turn some of the new wrapper functions into inline functions.

- Make all the machmode.h macros that used:

    __builtin_constant_p (M) ? foo_inline (M) : foo_array[M[

  forward to an ALWAYS_INLINE function, so that (a) M is only evaluated
  once and (b) __builtin_constant_p is applied to a variable, and so is
  deferred until later passes.  This helped the optimisation to fire in
  more cases and to continue firing when M is a class rather than a
  raw enum.

- In a similar vein, make sure that conditions like:

     SImode == DImode

  are treated as builtin_constant_p by gencondmd, so that .md patterns
  with those conditions are dropped.

With these changes the series is actually a very slight compile-time win.
That might seem unlikely, but there are several possible reasons:

1. The machmode.h macro change above might allow more constant folding.

2. The series has a tendency to evaluate modes once, rather than
   continually fetching them from (sometimes quite deep) rtx nests.
   Refetching a mode is a particular problem if call comes between
   two uses, since the compiler then has to re-evaluate the whole thing.

3. The series introduces many uses of new SCALAR_*TYPE_MODE macros,
   as alternatives to TYPE_MODE.  The new macros avoid the usual:

     (VECTOR_TYPE_P (TYPE_CHECK (NODE)) \
      ? vector_type_mode (NODE) : (NODE)->type_common.mode)

   and become direct field accesses in release builds.

   VECTOR_TYPE_P would be consistently false for these uses,
   but call-clobbered registers would usually be treated as clobbered
   by the condition as a whole.

Maybe (3) is the most likely reason.

I tested this by compiling the testsuite for:

    aarch64-linux-gnu alpha-linux-gnu arc-elf arm-linux-gnueabi
    arm-linux-gnueabihf avr-elf bfin-elf c6x-elf cr16-elf cris-elf
    epiphany-elf fr30-elf frv-linux-gnu ft32-elf h8300-elf
    hppa64-hp-hpux11.23 ia64-linux-gnu i686-pc-linux-gnu
    i686-apple-darwin iq2000-elf lm32-elf m32c-elf m32r-elf
    m68k-linux-gnu mcore-elf microblaze-elf mips-linux-gnu
    mipsisa64-linux-gnu mmix mn10300-elf moxie-rtems msp430-elf
    nds32le-elf nios2-linux-gnu nvptx-none pdp11 powerpc-linux-gnuspe
    powerpc-eabispe powerpc64-linux-gnu powerpc-ibm-aix7.0 rl78-elf
    rx-elf s390-linux-gnu s390x-linux-gnu sh-linux-gnu sparc-linux-gnu
    sparc64-linux-gnu sparc-wrs-vxworks spu-elf tilegx-elf tilepro-elf
    xstormy16-elf v850-elf vax-netbsdelf visium-elf x86_64-darwin
    x86_64-linux-gnu xtensa-elf

and checking that there were no changes in assembly.  Also tested
in the normal way on aarch64-linux-gnu and x86_64-linux-gnu.

The series depends on the already-posted:

  https://gcc.gnu.org/ml/gcc-patches/2016-11/msg01657.html

Sorry that this is so late, been distracted by other things.  Even if
we're too far into stage 3 for SVE itself to go in, I was hoping this
part (which was kind-of posted during stage 1) could go in independently.

Thanks,
Richard

Follow-Ups:
- [1/67] Add an E_ prefix to mode names and update case statements
  - From: Richard Sandiford
- [2/67] Make machine_mode a class
  - From: Richard Sandiford
- [3/67] Add GDB pretty printer for machine mode classes
  - From: Richard Sandiford
- [4/67] Add FOR_EACH iterators for modes
  - From: Richard Sandiford
- [5/67] Small tweak to array_value_type
  - From: Richard Sandiford
- [6/67] Make GET_MODE_WIDER return an opt_mode
  - From: Richard Sandiford
- [7/67] Add scalar_float_mode
  - From: Richard Sandiford
- [8/67] Simplify gen_trunc/extend_conv_libfunc
  - From: Richard Sandiford
- [9/67] Add SCALAR_FLOAT_TYPE_MODE
  - From: Richard Sandiford
- [10/67] Make assemble_real take a scalar_float_mode
  - From: Richard Sandiford
- Re: [0/67] Add wrapper classes for machine_modes
  - From: Richard Biener
- [11/67] Add a float_mode_for_size helper function
  - From: Richard Sandiford
- [12/67] Use opt_scalar_float_mode when iterating over float modes
  - From: Richard Sandiford
- [13/67] Make floatn_mode return an opt_scalar_float_mode
  - From: Richard Sandiford
- [14/67] Make libgcc_floating_mode_supported_p take a scalar_float_mode
  - From: Richard Sandiford
- [15/67] Add scalar_int_mode
  - From: Richard Sandiford
- [16/67] Add scalar_int_mode_pod
  - From: Richard Sandiford
- [17/67] Add an int_mode_for_size helper function
  - From: Richard Sandiford
- [18/67] Make int_mode_for_mode return an opt_scalar_int_mode
  - From: Richard Sandiford
- [19/67] Add a smallest_int_mode_for_size helper function
  - From: Richard Sandiford
- [20/67] Replace MODE_INT checks with is_int_mode
  - From: Richard Sandiford
- [21/67] Replace SCALAR_INT_MODE_P checks with is_a <scalar_int_mode>
  - From: Richard Sandiford
- [22/67] Replace !VECTOR_MODE_P with is_a <scalar_int_mode>
  - From: Richard Sandiford
- [23/67] Replace != VOIDmode checks with is_a <scalar_int_mode>
  - From: Richard Sandiford
- [24/67] Replace a != BLKmode check with is_a <scalar_int_mode>
  - From: Richard Sandiford
- [25/67] Use is_a <scalar_int_mode> for bitmask optimisations
  - From: Richard Sandiford
- [26/67] Use is_a <scalar_int_mode> in subreg/extract simplifications
  - From: Richard Sandiford
- [27/67] Use is_a <scalar_int_mode> before LOAD_EXTEND_OP
  - From: Richard Sandiford
- [28/67] Use is_a <scalar_int_mode> for miscellaneous types of test
  - From: Richard Sandiford
- [29/67] Make some *_loc_descriptor helpers take scalar_int_mode
  - From: Richard Sandiford
- [30/67] Use scalar_int_mode for doubleword splits
  - From: Richard Sandiford
- [31/67] Use scalar_int_mode for move2add
  - From: Richard Sandiford
- [32/67] Check is_a <scalar_int_mode> before calling valid_pointer_mode
  - From: Richard Sandiford
- [33/67] Add a NARROWEST_INT_MODE macro
  - From: Richard Sandiford
- [34/67] Add a SCALAR_INT_TYPE_MODE macro
  - From: Richard Sandiford
- [35/67] Add uses of as_a <scalar_int_mode>
  - From: Richard Sandiford
- [36/67] Use scalar_int_mode in the RTL iv routines
  - From: Richard Sandiford
- [37/67] Use scalar_int_mode when emitting cstores
  - From: Richard Sandiford
- [38/67] Move SCALAR_INT_MODE_P out of strict_volatile_bitfield_p
  - From: Richard Sandiford
- [39/67] Two changes to the get_best_mode interface
  - From: Richard Sandiford
- [40/67] Use scalar_int_mode for extraction_insn fields
  - From: Richard Sandiford
- [41/67] Split scalar integer handling out of force_to_mode
  - From: Richard Sandiford
- [42/67] Use scalar_int_mode in simplify_shift_const_1
  - From: Richard Sandiford
- [43/67] Use scalar_int_mode in simplify_comparison
  - From: Richard Sandiford
- [44/67] Make simplify_and_const_int take a scalar_int_mode
  - From: Richard Sandiford
- [45/67] Make extract_left_shift take a scalar_int_mode
  - From: Richard Sandiford
- [46/67] Make widest_int_mode_for_size return a scalar_int_mode
  - From: Richard Sandiford
- [47/67] Make subroutines of nonzero_bits operate on scalar_int_mode
  - From: Richard Sandiford
- [48/67] Make subroutines of num_sign_bit_copies operate on scalar_int_mode
  - From: Richard Sandiford
- [49/67] Simplify nonzero/num_sign_bits hooks
  - From: Richard Sandiford
- [50/67] Add helper routines for SUBREG_PROMOTED_VAR_P subregs
  - From: Richard Sandiford
- [51/67] Use opt_scalar_int_mode when iterating over integer modes
  - From: Richard Sandiford
- [52/67] Use scalar_int_mode in extract/store_bit_field
  - From: Richard Sandiford
- [53/67] Pass a mode to const_scalar_mask_from_tree
  - From: Richard Sandiford
- [54/67] Add explicit int checks for alternative optab implementations
  - From: Richard Sandiford
- [55/67] Use scalar_int_mode in simplify_const_unary_operation
  - From: Richard Sandiford
- [56/67] Use the more specific type when two modes are known to be equal
  - From: Richard Sandiford
- [57/67] Use scalar_int_mode in expand_expr_addr_expr
  - From: Richard Sandiford
- [58/67] Use scalar_int_mode in a try_combine optimisation
  - From: Richard Sandiford
- [59/67] Add a rtx_jump_table_data::get_data_mode helper
  - From: Richard Sandiford
- [60/67] Pass scalar_int_modes to do_jump_by_parts_*
  - From: Richard Sandiford
- [61/67] Use scalar_int_mode in the AArch64 port
  - From: Richard Sandiford
- [62/67] Big machine_mode to scalar_int_mode replacement
  - From: Richard Sandiford
- [63/67] Simplifications after type switch
  - From: Richard Sandiford
- [64/67] Add a scalar_mode class
  - From: Richard Sandiford
- [65/67] Use scalar_mode in the AArch64 port
  - From: Richard Sandiford
- [66/67] Add a scalar_mode_pod class
  - From: Richard Sandiford
- [67/67] Add a complex_mode class
  - From: Richard Sandiford
- Re: [0/67] Add wrapper classes for machine_modes
  - From: Sandra Loosemore

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]