[PATCH][RFC] Introduce BIT_FIELD_INSERT

Richard Biener rguenther@suse.de
Tue Jan 7 10:04:00 GMT 2020


On Tue, 7 Jan 2020, Andrew Pinski wrote:

> On Mon, Jan 6, 2020 at 11:36 PM Richard Biener <rguenther@suse.de> wrote:
> >
> > On Mon, 16 Dec 2019, Andrew Pinski wrote:
> >
> > > On Thu, Nov 15, 2018 at 12:31 AM Richard Biener <rguenther@suse.de> wrote:
> > > >
> > > > On Thu, 15 Nov 2018, Richard Biener wrote:
> > > >
> > > > > On Wed, 14 Nov 2018, Andrew Pinski wrote:
> > > > >
> > > > > > On Fri, May 13, 2016 at 3:51 AM Richard Biener <rguenther@suse.de> wrote:
> > > > > > >
> > > > > > >
> > > > > > > The following patch adds BIT_FIELD_INSERT, an operation to
> > > > > > > facilitate doing bitfield inserts on registers (as opposed
> > > > > > > to currently where we'd have a BIT_FIELD_REF store).
> > > > > > >
> > > > > > > Originally this was developed as part of bitfield lowering
> > > > > > > where bitfield stores were lowered into read-modify-write
> > > > > > > cycles and the modify part, instead of doing shifting and masking,
> > > > > > > be kept in a more high-level form to ease combining them.
> > > > > > >
> > > > > > > A second use case (the above is still valid) is vector element
> > > > > > > inserts which we currently can only do via memory or
> > > > > > > by extracting all components and re-building the vector using
> > > > > > > a CONSTRUCTOR.  For this second use case I added code
> > > > > > > re-writing the BIT_FIELD_REF stores the C family FEs produce
> > > > > > > into BIT_FIELD_INSERT when update-address-taken can otherwise
> > > > > > > re-write a decl into SSA form (the testcase shows we miss
> > > > > > > a similar opportunity with the MEM_REF form of a vector insert,
> > > > > > > I plan to fix that for the final submission).
> > > > > > >
> > > > > > > One speciality of BIT_FIELD_INSERT as opposed to BIT_FIELD_REF
> > > > > > > is that the size of the insertion is given implicitely via the
> > > > > > > type size/precision of the value to insert.  That avoids
> > > > > > > introducing ways to have quaternary ops in folding and GIMPLE stmts.
> > > > > > >
> > > > > > > Bootstrapped and tested on x86_64-unknown-linux-gnu.
> > > > > > >
> > > > > > > Richard.
> > > > > > >
> > > > > > > 2011-06-16  Richard Guenther  <rguenther@suse.de>
> > > > > > >
> > > > > > >         PR tree-optimization/29756
> > > > > > >         * tree.def (BIT_FIELD_INSERT): New tcc_expression tree code.
> > > > > > >         * expr.c (expand_expr_real_2): Handle BIT_FIELD_INSERT.
> > > > > > >         * fold-const.c (operand_equal_p): Likewise.
> > > > > > >         (fold_ternary_loc): Add constant folding of BIT_FIELD_INSERT.
> > > > > > >         * gimplify.c (gimplify_expr): Handle BIT_FIELD_INSERT.
> > > > > > >         * tree-inline.c (estimate_operator_cost): Likewise.
> > > > > > >         * tree-pretty-print.c (dump_generic_node): Likewise.
> > > > > > >         * tree-ssa-operands.c (get_expr_operands): Likewise.
> > > > > > >         * cfgexpand.c (expand_debug_expr): Likewise.
> > > > > > >         * gimple-pretty-print.c (dump_ternary_rhs): Likewise.
> > > > > > >         * gimple.c (get_gimple_rhs_num_ops): Handle BIT_FIELD_INSERT.
> > > > > > >         * tree-cfg.c (verify_gimple_assign_ternary): Verify BIT_FIELD_INSERT.
> > > > > > >
> > > > > > >         * tree-ssa.c (non_rewritable_lvalue_p): We can rewrite
> > > > > > >         vector inserts using BIT_FIELD_REF on the lhs.
> > > > > > >         (execute_update_addresses_taken): Do it.
> > > > > > >
> > > > > > >         * gcc.dg/tree-ssa/vector-6.c: New testcase.
> > > > > > >
> > > > > > > Index: trunk/gcc/expr.c
> > > > > > > ===================================================================
> > > > > > > *** trunk.orig/gcc/expr.c       2016-05-12 13:40:30.704262951 +0200
> > > > > > > --- trunk/gcc/expr.c    2016-05-12 15:40:32.481225744 +0200
> > > > > > > *************** expand_expr_real_2 (sepops ops, rtx targ
> > > > > > > *** 9358,9363 ****
> > > > > > > --- 9358,9380 ----
> > > > > > >         target = expand_vec_cond_expr (type, treeop0, treeop1, treeop2, target);
> > > > > > >         return target;
> > > > > > >
> > > > > > > +     case BIT_FIELD_INSERT:
> > > > > > > +       {
> > > > > > > +       unsigned bitpos = tree_to_uhwi (treeop2);
> > > > > > > +       unsigned bitsize;
> > > > > > > +       if (INTEGRAL_TYPE_P (TREE_TYPE (treeop1)))
> > > > > > > +         bitsize = TYPE_PRECISION (TREE_TYPE (treeop1));
> > > > > > > +       else
> > > > > > > +         bitsize = tree_to_uhwi (TYPE_SIZE (TREE_TYPE (treeop1)));
> > > > > > > +       rtx op0 = expand_normal (treeop0);
> > > > > > > +       rtx op1 = expand_normal (treeop1);
> > > > > > > +       rtx dst = gen_reg_rtx (mode);
> > > > > > > +       emit_move_insn (dst, op0);
> > > > > > > +       store_bit_field (dst, bitsize, bitpos, 0, 0,
> > > > > > > +                        TYPE_MODE (TREE_TYPE (treeop1)), op1, false);
> > > > > > > +       return dst;
> > > > > > > +       }
> > > > > > > +
> > > > > > >       default:
> > > > > > >         gcc_unreachable ();
> > > > > > >       }
> > > > > > > Index: trunk/gcc/fold-const.c
> > > > > > > ===================================================================
> > > > > > > *** trunk.orig/gcc/fold-const.c 2016-05-12 13:40:30.704262951 +0200
> > > > > > > --- trunk/gcc/fold-const.c      2016-05-13 09:41:13.509812127 +0200
> > > > > > > *************** operand_equal_p (const_tree arg0, const_
> > > > > > > *** 3163,3168 ****
> > > > > > > --- 3163,3169 ----
> > > > > > >
> > > > > > >         case VEC_COND_EXPR:
> > > > > > >         case DOT_PROD_EXPR:
> > > > > > > +       case BIT_FIELD_INSERT:
> > > > > > >           return OP_SAME (0) && OP_SAME (1) && OP_SAME (2);
> > > > > > >
> > > > > > >         default:
> > > > > > > *************** fold_ternary_loc (location_t loc, enum t
> > > > > > > *** 11870,11875 ****
> > > > > > > --- 11871,11916 ----
> > > > > > >         }
> > > > > > >         return NULL_TREE;
> > > > > > >
> > > > > > > +     case BIT_FIELD_INSERT:
> > > > > > > +       /* Perform (partial) constant folding of BIT_FIELD_INSERT.  */
> > > > > > > +       if (TREE_CODE (arg0) == INTEGER_CST
> > > > > > > +         && TREE_CODE (arg1) == INTEGER_CST)
> > > > > > > +       {
> > > > > > > +         unsigned HOST_WIDE_INT bitpos = tree_to_uhwi (op2);
> > > > > > > +         unsigned bitsize = TYPE_PRECISION (TREE_TYPE (arg1));
> > > > > > > +         wide_int tem = wi::bit_and (arg0,
> > > > > > > +                                     wi::shifted_mask (bitpos, bitsize, true,
> > > > > > > +                                                       TYPE_PRECISION (type)));
> > > > > > > +         wide_int tem2
> > > > > > > +           = wi::lshift (wi::zext (wi::to_wide (arg1, TYPE_PRECISION (type)),
> > > > > > > +                                   bitsize), bitpos);
> > > > > > > +         return wide_int_to_tree (type, wi::bit_or (tem, tem2));
> > > > > > > +       }
> > > > > >
> > > > > > This seems incorrect for the case where BYTES_BIG_ENDIAN as far as I
> > > > > > can tell.  With BYTES_BIG_ENDIAN, the bits position starts most
> > > > > > significiant rather than the least significiant.
> > > > >
> > > > > You mean the bitpos operand of BIT_FIELD_INSERT works in a different way?
> > > > > I see the BIT_FIELD_REF folding uses native_encode/interpret but only
> > > > > handles byte-aligned references.  I suppose the BIT_INSERT_EXPR case
> > > > > (you are following up an old patch) could do the same.
> > > > >
> > > > > >  Sorry I am bring
> > > > > > this up after this has been in the tree for a long time but I finally
> > > > > > got around to testing my bit-field lowering on a few big-endian
> > > > > > targets and ran into this issue.
> > > > >
> > > > > So - can you fix it please?  Also note that the VECTOR_CST case
> > > > > (as in BIT_FIELD_REF) seems to be inconsistent here and counts
> > > > > "bits" in a different way?
> > > >
> > > > And bonus points for documenting BIT_FIELD_REF and BIT_INSERT_EXPR
> > > > in generic.texi, together with those "details".
> > >
> > > This is the fix:
> > > diff --git a/gcc/fold-const.c b/gcc/fold-const.c
> > > index 8e9e299..a919b63 100644
> > > --- a/gcc/fold-const.c
> > > +++ b/gcc/fold-const.c
> > > @@ -12301,6 +12301,8 @@ fold_ternary_loc (location_t loc, enum
> > > tree_code code, tree type,
> > >         {
> > >           unsigned HOST_WIDE_INT bitpos = tree_to_uhwi (op2);
> > >           unsigned bitsize = TYPE_PRECISION (TREE_TYPE (arg1));
> > > +         if (BYTES_BIG_ENDIAN)
> > > +           bitpos = TYPE_PRECISION (type) - bitpos - bitsize;
> > >           wide_int tem = (wi::to_wide (arg0)
> > >                           & wi::shifted_mask (bitpos, bitsize, true,
> > >                                               TYPE_PRECISION (type)));
> >
> > I guess you need to guard against BYTES_BIG_ENDIAN != WORDS_BIG_ENDIAN
> > as well.
> 
> Yes I will add that check.
> 
> >  Also the above only works reliably for mode-precision
> > integers?  We might want to disallow BIT_FIELD_REF/BIT_INSERT_EXPR
> > on non-mode-precision entities in the GIMPLE/GENERIC verifiers.
> 
> You added that check already for gimple in r268332 due to PR88739.
> BIT_FIELD_REF around tree-cfg.c:3083
> BIT_INSERT_EXPR  around tree-cfg.c:4324

Ah, good ;)  Note neither BIT_FIELD_REF nor BIT_INSERT_EXPR are
documented in generic.texi and BIT_FIELD_REF is documented in tree.def
as operating on structs/unions (well, memory).  And for register args
we interpret it as storing the register to memory and interpreting
the bit positions in memory bit terms (with the store doing endian
fiddling).  But for vector (register only?) accesses we interpret
it as specifying lane numbers but at least BIT_FIELD_REF verifying
doesn't barf on bit/sizes not corresponding to exact vector lanes
(and I know we introduce non-matching ones via at least VIEW_CONVERT
"merging" into BIT_FIELD_REFs).  I have in one of my trees:

Index: gcc/tree-cfg.c
===================================================================
--- gcc/tree-cfg.c      (revision 279944)
+++ gcc/tree-cfg.c      (working copy)
@@ -3094,6 +3094,26 @@ verify_types_in_gimple_reference (tree e
                     "%qs", code_name);
              return true;
            }
+         if (0 && VECTOR_TYPE_P (TREE_TYPE (op)))
+           {
+             if ((!VECTOR_TYPE_P (TREE_TYPE (expr))
+                  && !useless_type_conversion_p (TREE_TYPE (expr),
+                                                 TREE_TYPE (TREE_TYPE 
(op))))
+                 || (VECTOR_TYPE_P (TREE_TYPE (expr))
+                     && !useless_type_conversion_p (TREE_TYPE (TREE_TYPE 
(expr)),
+                                                    TREE_TYPE (TREE_TYPE 
(op)))))
+               {
+                 error ("invalid types in vector extract");
+                 debug_generic_stmt (TREE_TYPE (expr));
+                 debug_generic_stmt (TREE_TYPE (op));
+                 return true;
+               }
+             if (!multiple_p (bit_field_offset (expr), bit_field_size 
(expr)))
+               {
+                 error ("unaligned vector extract");
+                 return true;
+               }
+           }
        }
 
       if ((TREE_CODE (expr) == REALPART_EXPR

guess extracting same mode but different sign elements should be OK,
not sure about extracting SImode from a V4SFmode vector (I guess OK
as well).


> Thanks,
> Andrew Pinski
> 
> >
> > > ---- CUT ----
> > > I will do a full test in a little bit with the other patch I attached
> > > to related bugzilla.
> > >
> > > Thanks,
> > > Andrew
> > >
> > > >
> > > > Richard.
> > > >
> > > > > Thanks,
> > > > > Richard.
> > > > >
> > > > > > Thanks,
> > > > > > Andrew Pinski
> > > > > >
> > > > > > > +       else if (TREE_CODE (arg0) == VECTOR_CST
> > > > > > > +              && CONSTANT_CLASS_P (arg1)
> > > > > > > +              && types_compatible_p (TREE_TYPE (TREE_TYPE (arg0)),
> > > > > > > +                                     TREE_TYPE (arg1)))
> > > > > > > +       {
> > > > > > > +         unsigned HOST_WIDE_INT bitpos = tree_to_uhwi (op2);
> > > > > > > +         unsigned HOST_WIDE_INT elsize
> > > > > > > +           = tree_to_uhwi (TYPE_SIZE (TREE_TYPE (arg1)));
> > > > > > > +         if (bitpos % elsize == 0)
> > > > > > > +           {
> > > > > > > +             unsigned k = bitpos / elsize;
> > > > > > > +             if (operand_equal_p (VECTOR_CST_ELT (arg0, k), arg1, 0))
> > > > > > > +               return arg0;
> > > > > > > +             else
> > > > > > > +               {
> > > > > > > +                 tree *elts = XALLOCAVEC (tree, TYPE_VECTOR_SUBPARTS (type));
> > > > > > > +                 memcpy (elts, VECTOR_CST_ELTS (arg0),
> > > > > > > +                         sizeof (tree) * TYPE_VECTOR_SUBPARTS (type));
> > > > > > > +                 elts[k] = arg1;
> > > > > > > +                 return build_vector (type, elts);
> > > > > > > +               }
> > > > > > > +           }
> > > > > > > +       }
> > > > > > > +       return NULL_TREE;
> > > > > > > +
> > > > > > >       default:
> > > > > > >         return NULL_TREE;
> > > > > > >       } /* switch (code) */
> > > > > > > Index: trunk/gcc/gimplify.c
> > > > > > > ===================================================================
> > > > > > > *** trunk.orig/gcc/gimplify.c   2016-05-12 13:40:30.704262951 +0200
> > > > > > > --- trunk/gcc/gimplify.c        2016-05-12 13:56:18.679120641 +0200
> > > > > > > *************** gimplify_expr (tree *expr_p, gimple_seq
> > > > > > > *** 10936,10941 ****
> > > > > > > --- 10936,10945 ----
> > > > > > >           /* Classified as tcc_expression.  */
> > > > > > >           goto expr_3;
> > > > > > >
> > > > > > > +       case BIT_FIELD_INSERT:
> > > > > > > +         /* Argument 3 is a constant.  */
> > > > > > > +         goto expr_2;
> > > > > > > +
> > > > > > >         case POINTER_PLUS_EXPR:
> > > > > > >           {
> > > > > > >             enum gimplify_status r0, r1;
> > > > > > > Index: trunk/gcc/tree-inline.c
> > > > > > > ===================================================================
> > > > > > > *** trunk.orig/gcc/tree-inline.c        2016-05-12 13:40:30.704262951 +0200
> > > > > > > --- trunk/gcc/tree-inline.c     2016-05-12 13:42:45.465811959 +0200
> > > > > > > *************** estimate_operator_cost (enum tree_code c
> > > > > > > *** 3941,3946 ****
> > > > > > > --- 3941,3950 ----
> > > > > > >           return weights->div_mod_cost;
> > > > > > >         return 1;
> > > > > > >
> > > > > > > +     /* Bit-field insertion needs several shift and mask operations.  */
> > > > > > > +     case BIT_FIELD_INSERT:
> > > > > > > +       return 3;
> > > > > > > +
> > > > > > >       default:
> > > > > > >         /* We expect a copy assignment with no operator.  */
> > > > > > >         gcc_assert (get_gimple_rhs_class (code) == GIMPLE_SINGLE_RHS);
> > > > > > > Index: trunk/gcc/tree-pretty-print.c
> > > > > > > ===================================================================
> > > > > > > *** trunk.orig/gcc/tree-pretty-print.c  2016-05-12 13:40:30.704262951 +0200
> > > > > > > --- trunk/gcc/tree-pretty-print.c       2016-05-12 14:30:05.781944740 +0200
> > > > > > > *************** dump_generic_node (pretty_printer *pp, t
> > > > > > > *** 1876,1881 ****
> > > > > > > --- 1876,1898 ----
> > > > > > >         pp_greater (pp);
> > > > > > >         break;
> > > > > > >
> > > > > > > +     case BIT_FIELD_INSERT:
> > > > > > > +       pp_string (pp, "BIT_FIELD_INSERT <");
> > > > > > > +       dump_generic_node (pp, TREE_OPERAND (node, 0), spc, flags, false);
> > > > > > > +       pp_string (pp, ", ");
> > > > > > > +       dump_generic_node (pp, TREE_OPERAND (node, 1), spc, flags, false);
> > > > > > > +       pp_string (pp, ", ");
> > > > > > > +       dump_generic_node (pp, TREE_OPERAND (node, 2), spc, flags, false);
> > > > > > > +       pp_string (pp, " (");
> > > > > > > +       if (INTEGRAL_TYPE_P (TREE_TYPE (TREE_OPERAND (node, 1))))
> > > > > > > +       pp_decimal_int (pp,
> > > > > > > +                       TYPE_PRECISION (TREE_TYPE (TREE_OPERAND (node, 1))));
> > > > > > > +       else
> > > > > > > +       dump_generic_node (pp, TYPE_SIZE (TREE_TYPE (TREE_OPERAND (node, 1))),
> > > > > > > +                          spc, flags, false);
> > > > > > > +       pp_string (pp, " bits)>");
> > > > > > > +       break;
> > > > > > > +
> > > > > > >       case ARRAY_REF:
> > > > > > >       case ARRAY_RANGE_REF:
> > > > > > >         op0 = TREE_OPERAND (node, 0);
> > > > > > > Index: trunk/gcc/tree-ssa-operands.c
> > > > > > > ===================================================================
> > > > > > > *** trunk.orig/gcc/tree-ssa-operands.c  2016-05-12 13:42:45.465811959 +0200
> > > > > > > --- trunk/gcc/tree-ssa-operands.c       2016-05-12 13:48:26.881736503 +0200
> > > > > > > *************** get_expr_operands (struct function *fn,
> > > > > > > *** 833,838 ****
> > > > > > > --- 833,839 ----
> > > > > > >         get_expr_operands (fn, stmt, &TREE_OPERAND (expr, 0), flags);
> > > > > > >         return;
> > > > > > >
> > > > > > > +     case BIT_FIELD_INSERT:
> > > > > > >       case COMPOUND_EXPR:
> > > > > > >       case OBJ_TYPE_REF:
> > > > > > >       case ASSERT_EXPR:
> > > > > > > Index: trunk/gcc/tree.def
> > > > > > > ===================================================================
> > > > > > > *** trunk.orig/gcc/tree.def     2016-05-12 13:40:30.704262951 +0200
> > > > > > > --- trunk/gcc/tree.def  2016-05-12 13:47:09.972852423 +0200
> > > > > > > *************** DEFTREECODE (ADDR_EXPR, "addr_expr", tcc
> > > > > > > *** 852,857 ****
> > > > > > > --- 852,868 ----
> > > > > > >      descriptor of type ptr_mode.  */
> > > > > > >   DEFTREECODE (FDESC_EXPR, "fdesc_expr", tcc_expression, 2)
> > > > > > >
> > > > > > > + /* Given a word, a value and a bitfield position within the word,
> > > > > > > +    produce the value that results if replacing the
> > > > > > > +    described parts of word with value.
> > > > > > > +    Operand 0 is a tree for the word of integral type;
> > > > > > > +    Operand 1 is a tree for the value of integral type;
> > > > > > > +    Operand 2 is a tree giving the constant position of the first referenced bit;
> > > > > > > +    The number of bits replaced is given by the precision of the value
> > > > > > > +    type if that is integral or by its size if it is non-integral.
> > > > > > > +    The replaced bits shall be fully inside the word.  */
> > > > > > > + DEFTREECODE (BIT_FIELD_INSERT, "bit_field_insert", tcc_expression, 3)
> > > > > > > +
> > > > > > >   /* Given two real or integer operands of the same type,
> > > > > > >      returns a complex value of the corresponding complex type.  */
> > > > > > >   DEFTREECODE (COMPLEX_EXPR, "complex_expr", tcc_binary, 2)
> > > > > > > Index: trunk/gcc/cfgexpand.c
> > > > > > > ===================================================================
> > > > > > > *** trunk.orig/gcc/cfgexpand.c  2016-05-12 13:42:45.469812005 +0200
> > > > > > > --- trunk/gcc/cfgexpand.c       2016-05-13 11:48:04.513407495 +0200
> > > > > > > *************** expand_debug_expr (tree exp)
> > > > > > > *** 5025,5030 ****
> > > > > > > --- 5025,5031 ----
> > > > > > >       case FIXED_CONVERT_EXPR:
> > > > > > >       case OBJ_TYPE_REF:
> > > > > > >       case WITH_SIZE_EXPR:
> > > > > > > +     case BIT_FIELD_INSERT:
> > > > > > >         return NULL;
> > > > > > >
> > > > > > >       case DOT_PROD_EXPR:
> > > > > > > Index: trunk/gcc/gimple-pretty-print.c
> > > > > > > ===================================================================
> > > > > > > *** trunk.orig/gcc/gimple-pretty-print.c        2016-05-12 11:23:09.261375157 +0200
> > > > > > > --- trunk/gcc/gimple-pretty-print.c     2016-05-12 14:57:22.096175579 +0200
> > > > > > > *************** dump_ternary_rhs (pretty_printer *buffer
> > > > > > > *** 479,484 ****
> > > > > > > --- 479,502 ----
> > > > > > >         pp_greater (buffer);
> > > > > > >         break;
> > > > > > >
> > > > > > > +     case BIT_FIELD_INSERT:
> > > > > > > +       pp_string (buffer, "BIT_FIELD_INSERT <");
> > > > > > > +       dump_generic_node (buffer, gimple_assign_rhs1 (gs), spc, flags, false);
> > > > > > > +       pp_string (buffer, ", ");
> > > > > > > +       dump_generic_node (buffer, gimple_assign_rhs2 (gs), spc, flags, false);
> > > > > > > +       pp_string (buffer, ", ");
> > > > > > > +       dump_generic_node (buffer, gimple_assign_rhs3 (gs), spc, flags, false);
> > > > > > > +       pp_string (buffer, " (");
> > > > > > > +       if (INTEGRAL_TYPE_P (TREE_TYPE (gimple_assign_rhs2 (gs))))
> > > > > > > +       pp_decimal_int (buffer,
> > > > > > > +                       TYPE_PRECISION (TREE_TYPE (gimple_assign_rhs2 (gs))));
> > > > > > > +       else
> > > > > > > +       dump_generic_node (buffer,
> > > > > > > +                          TYPE_SIZE (TREE_TYPE (gimple_assign_rhs2 (gs))),
> > > > > > > +                          spc, flags, false);
> > > > > > > +       pp_string (buffer, " bits)>");
> > > > > > > +       break;
> > > > > > > +
> > > > > > >       default:
> > > > > > >         gcc_unreachable ();
> > > > > > >       }
> > > > > > > Index: trunk/gcc/gimple.c
> > > > > > > ===================================================================
> > > > > > > *** trunk.orig/gcc/gimple.c     2016-05-12 13:40:30.704262951 +0200
> > > > > > > --- trunk/gcc/gimple.c  2016-05-12 14:49:37.066994969 +0200
> > > > > > > *************** get_gimple_rhs_num_ops (enum tree_code c
> > > > > > > *** 2044,2049 ****
> > > > > > > --- 2044,2050 ----
> > > > > > >         || (SYM) == REALIGN_LOAD_EXPR                                       \
> > > > > > >         || (SYM) == VEC_COND_EXPR                                                   \
> > > > > > >         || (SYM) == VEC_PERM_EXPR                                             \
> > > > > > > +       || (SYM) == BIT_FIELD_INSERT                                        \
> > > > > > >         || (SYM) == FMA_EXPR) ? GIMPLE_TERNARY_RHS                          \
> > > > > > >      : ((SYM) == CONSTRUCTOR                                                \
> > > > > > >         || (SYM) == OBJ_TYPE_REF                                                    \
> > > > > > > Index: trunk/gcc/tree-cfg.c
> > > > > > > ===================================================================
> > > > > > > *** trunk.orig/gcc/tree-cfg.c   2016-05-06 14:38:33.959495081 +0200
> > > > > > > --- trunk/gcc/tree-cfg.c        2016-05-13 09:25:01.670630730 +0200
> > > > > > > *************** verify_gimple_assign_ternary (gassign *s
> > > > > > > *** 4155,4160 ****
> > > > > > > --- 4155,4207 ----
> > > > > > >
> > > > > > >         return false;
> > > > > > >
> > > > > > > +     case BIT_FIELD_INSERT:
> > > > > > > +       if (! useless_type_conversion_p (lhs_type, rhs1_type))
> > > > > > > +       {
> > > > > > > +         error ("type mismatch in BIT_FIELD_INSERT");
> > > > > > > +         debug_generic_expr (lhs_type);
> > > > > > > +         debug_generic_expr (rhs1_type);
> > > > > > > +         return true;
> > > > > > > +       }
> > > > > > > +       if (! ((INTEGRAL_TYPE_P (rhs1_type)
> > > > > > > +             && INTEGRAL_TYPE_P (rhs2_type))
> > > > > > > +            || (VECTOR_TYPE_P (rhs1_type)
> > > > > > > +                && types_compatible_p (TREE_TYPE (rhs1_type), rhs2_type))))
> > > > > > > +       {
> > > > > > > +         error ("not allowed type combination in BIT_FIELD_INSERT");
> > > > > > > +         debug_generic_expr (rhs1_type);
> > > > > > > +         debug_generic_expr (rhs2_type);
> > > > > > > +         return true;
> > > > > > > +       }
> > > > > > > +       if (! tree_fits_uhwi_p (rhs3)
> > > > > > > +         || ! tree_fits_uhwi_p (TYPE_SIZE (rhs2_type)))
> > > > > > > +       {
> > > > > > > +         error ("invalid position or size in BIT_FIELD_INSERT");
> > > > > > > +         return true;
> > > > > > > +       }
> > > > > > > +       if (INTEGRAL_TYPE_P (rhs1_type))
> > > > > > > +       {
> > > > > > > +         unsigned HOST_WIDE_INT bitpos = tree_to_uhwi (rhs3);
> > > > > > > +         if (bitpos >= TYPE_PRECISION (rhs1_type)
> > > > > > > +             || (bitpos + TYPE_PRECISION (rhs2_type)
> > > > > > > +                 > TYPE_PRECISION (rhs1_type)))
> > > > > > > +           {
> > > > > > > +             error ("insertion out of range in BIT_FIELD_INSERT");
> > > > > > > +             return true;
> > > > > > > +           }
> > > > > > > +       }
> > > > > > > +       else if (VECTOR_TYPE_P (rhs1_type))
> > > > > > > +       {
> > > > > > > +         unsigned HOST_WIDE_INT bitpos = tree_to_uhwi (rhs3);
> > > > > > > +         unsigned HOST_WIDE_INT bitsize = tree_to_uhwi (TYPE_SIZE (rhs2_type));
> > > > > > > +         if (bitpos % bitsize != 0)
> > > > > > > +           {
> > > > > > > +             error ("vector insertion not at element boundary");
> > > > > > > +             return true;
> > > > > > > +           }
> > > > > > > +       }
> > > > > > > +       return false;
> > > > > > > +
> > > > > > >       case DOT_PROD_EXPR:
> > > > > > >       case REALIGN_LOAD_EXPR:
> > > > > > >         /* FIXME.  */
> > > > > > > Index: trunk/gcc/tree-ssa.c
> > > > > > > ===================================================================
> > > > > > > *** trunk.orig/gcc/tree-ssa.c   2016-05-13 09:38:02.263611726 +0200
> > > > > > > --- trunk/gcc/tree-ssa.c        2016-05-13 09:50:31.020226585 +0200
> > > > > > > *************** non_rewritable_lvalue_p (tree lhs)
> > > > > > > *** 1318,1323 ****
> > > > > > > --- 1318,1335 ----
> > > > > > >         return false;
> > > > > > >       }
> > > > > > >
> > > > > > > +   /* A vector-insert using a BIT_FIELD_REF is rewritable using
> > > > > > > +      BIT_FIELD_INSERT.  */
> > > > > > > +   if (TREE_CODE (lhs) == BIT_FIELD_REF
> > > > > > > +       && DECL_P (TREE_OPERAND (lhs, 0))
> > > > > > > +       && VECTOR_TYPE_P (TREE_TYPE (TREE_OPERAND (lhs, 0)))
> > > > > > > +       /* && bitsize % element-size == 0 */
> > > > > > > +       && types_compatible_p (TREE_TYPE (lhs),
> > > > > > > +                            TREE_TYPE (TREE_TYPE (TREE_OPERAND (lhs, 0))))
> > > > > > > +       && (tree_to_uhwi (TREE_OPERAND (lhs, 2))
> > > > > > > +         % tree_to_uhwi (TYPE_SIZE (TREE_TYPE (lhs)))) == 0)
> > > > > > > +     return false;
> > > > > > > +
> > > > > > >     return true;
> > > > > > >   }
> > > > > > >
> > > > > > > *************** execute_update_addresses_taken (void)
> > > > > > > *** 1536,1541 ****
> > > > > > > --- 1548,1576 ----
> > > > > > >                     stmt = gsi_stmt (gsi);
> > > > > > >                     unlink_stmt_vdef (stmt);
> > > > > > >                     update_stmt (stmt);
> > > > > > > +                   continue;
> > > > > > > +                 }
> > > > > > > +
> > > > > > > +               /* Rewrite a vector insert via a BIT_FIELD_REF on the LHS
> > > > > > > +                  into a BIT_FIELD_INSERT.  */
> > > > > > > +               if (TREE_CODE (lhs) == BIT_FIELD_REF
> > > > > > > +                   && DECL_P (TREE_OPERAND (lhs, 0))
> > > > > > > +                   && VECTOR_TYPE_P (TREE_TYPE (TREE_OPERAND (lhs, 0)))
> > > > > > > +                   && types_compatible_p (TREE_TYPE (lhs),
> > > > > > > +                                          TREE_TYPE (TREE_TYPE
> > > > > > > +                                                      (TREE_OPERAND (lhs, 0))))
> > > > > > > +                   && (tree_to_uhwi (TREE_OPERAND (lhs, 2))
> > > > > > > +                       % tree_to_uhwi (TYPE_SIZE (TREE_TYPE (lhs))) == 0))
> > > > > > > +                 {
> > > > > > > +                   tree var = TREE_OPERAND (lhs, 0);
> > > > > > > +                   tree val = gimple_assign_rhs1 (stmt);
> > > > > > > +                   tree bitpos = TREE_OPERAND (lhs, 2);
> > > > > > > +                   gimple_assign_set_lhs (stmt, var);
> > > > > > > +                   gimple_assign_set_rhs_with_ops
> > > > > > > +                     (&gsi, BIT_FIELD_INSERT, var, val, bitpos);
> > > > > > > +                   stmt = gsi_stmt (gsi);
> > > > > > > +                   unlink_stmt_vdef (stmt);
> > > > > > > +                   update_stmt (stmt);
> > > > > > >                     continue;
> > > > > > >                   }
> > > > > > >
> > > > > > > Index: trunk/gcc/testsuite/gcc.dg/tree-ssa/vector-6.c
> > > > > > > ===================================================================
> > > > > > > *** /dev/null   1970-01-01 00:00:00.000000000 +0000
> > > > > > > --- trunk/gcc/testsuite/gcc.dg/tree-ssa/vector-6.c      2016-05-13 09:54:16.026814995 +0200
> > > > > > > ***************
> > > > > > > *** 0 ****
> > > > > > > --- 1,34 ----
> > > > > > > + /* { dg-do compile } */
> > > > > > > + /* { dg-options "-O -fdump-tree-ccp1" } */
> > > > > > > +
> > > > > > > + typedef int v4si __attribute__((vector_size (4 * sizeof (int))));
> > > > > > > +
> > > > > > > + v4si test1 (v4si v, int i)
> > > > > > > + {
> > > > > > > +   ((int *)&v)[0] = i;
> > > > > > > +   return v;
> > > > > > > + }
> > > > > > > +
> > > > > > > + v4si test2 (v4si v, int i)
> > > > > > > + {
> > > > > > > +   int *p = (int *)&v;
> > > > > > > +   *p = i;
> > > > > > > +   return v;
> > > > > > > + }
> > > > > > > +
> > > > > > > + v4si test3 (v4si v, int i)
> > > > > > > + {
> > > > > > > +   ((int *)&v)[3] = i;
> > > > > > > +   return v;
> > > > > > > + }
> > > > > > > +
> > > > > > > + v4si test4 (v4si v, int i)
> > > > > > > + {
> > > > > > > +   int *p = (int *)&v;
> > > > > > > +   p += 3;
> > > > > > > +   *p = i;
> > > > > > > +   return v;
> > > > > > > + }
> > > > > > > +
> > > > > > > + /* { dg-final { scan-tree-dump-times "Now a gimple register: v" 2 "ccp1" } } */
> > > > > > > + /* { dg-final { scan-tree-dump-times "Now a gimple register: v" 4 "ccp1" { xfail *-*-* } } } */
> > > > > >
> > > > > >
> > > > >
> > > > >
> > > >
> > > > --
> > > > Richard Biener <rguenther@suse.de>
> > > > SUSE LINUX GmbH, GF: Felix Imendoerffer, Jane Smithard, Graham Norton, HRB 21284 (AG Nuernberg)
> > >
> >
> > --
> > Richard Biener <rguenther@suse.de>
> > SUSE Software Solutions Germany GmbH, Maxfeldstrasse 5, 90409 Nuernberg,
> > Germany; GF: Felix Imendörffer; HRB 36809 (AG Nuernberg)
> 

-- 
Richard Biener <rguenther@suse.de>
SUSE Software Solutions Germany GmbH, Maxfeldstrasse 5, 90409 Nuernberg,
Germany; GF: Felix Imendörffer; HRB 36809 (AG Nuernberg)


More information about the Gcc-patches mailing list