This is the mail archive of the gcc-bugs@gcc.gnu.org mailing list for the GCC project.


[Bug rtl-optimization/63259] Detecting byteswap sequence

From: "olegendo at gcc dot gnu.org" <gcc-bugzilla at gcc dot gnu dot org>
To: gcc-bugs at gcc dot gnu dot org
Date: Fri, 19 Dec 2014 01:52:57 +0000
Subject: [Bug rtl-optimization/63259] Detecting byteswap sequence
Auto-submitted: auto-generated
References: <bug-63259-4 at http dot gcc dot gnu dot org/bugzilla/>

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=63259

--- Comment #13 from Oleg Endo <olegendo at gcc dot gnu.org> ---
(In reply to thopre01 from comment #12)
> 
> That's good, it means the pattern is recognized. Is there an optab defined
> for bswap16?

Nope.  Just this:

(define_insn "rotlhi3_8"
  [(set (match_operand:HI 0 "arith_reg_dest" "=r")
    (rotate:HI (match_operand:HI 1 "arith_reg_operand" "r")
           (const_int 8)))]
  "TARGET_SH1"
  "swap.b    %1,%0"
  [(set_attr "type" "arith")])
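The rotate pattern can stand in for a 16-bit byte swap because rotating a 16-bit value by 8 exchanges its two bytes. A minimal C sketch of that equivalence follows; the function names are made up for illustration and are not part of the SH backend:

```c
#include <stdint.h>

/* What (rotate:HI x (const_int 8)) computes: rotate a 16-bit value by 8.  */
static uint16_t rotl16_by_8 (uint16_t x)
{
  return (uint16_t) ((x << 8) | (x >> 8));
}

/* A 16-bit byte swap written out by hand, for comparison.  */
static uint16_t bswap16_ref (uint16_t x)
{
  return (uint16_t) (((x & 0x00FFu) << 8) | ((x & 0xFF00u) >> 8));
}

/* Exhaustively compare the two over all 16-bit inputs.
   Returns 1 if they always agree, 0 otherwise.  */
static int rotate_matches_bswap (void)
{
  for (uint32_t i = 0; i <= 0xFFFFu; i++)
    if (rotl16_by_8 ((uint16_t) i) != bswap16_ref ((uint16_t) i))
      return 0;
  return 1;
}
```

For example, rotl16_by_8 (0x1234) gives 0x3412, the same as the hand-written swap.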

If I remember correctly, there was something that would check whether it can
use the rotate pattern instead of a bswaphi2.  After adding the following
pattern:

(define_expand "bswaphi2"
  [(set (match_operand:HI 0 "arith_reg_dest")
    (bswap:HI (match_operand:HI 1 "arith_reg_operand")))]
  "TARGET_SH1"
{
  if (!can_create_pseudo_p ())
    FAIL;
  else
    {
      emit_insn (gen_rotlhi3_8 (operands[0], operands[1]));
      DONE;
    }
})

it looks much better.  The cases above work, except for the signed short.  On
SH the bswap:HI HW insn actually doesn't modify the upper 16 bits of the 32-bit
register.  What it does is:

unsigned int test_0999 (unsigned int a, unsigned int b)
{
  return (a & 0xFFFF0000) | ((a & 0xFF00) >> 8) | ((a & 0xFF) << 8);
}
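To make the difference concrete, here is a rough C model of that behaviour next to what a 16-bit result would need once it is widened to 32 bits. This is only a sketch: the helper names are hypothetical, and the sign-extension case is my assumption about what a signed short result requires, not something taken from the compiler output.

```c
#include <stdint.h>

/* Model of the HW insn as described above: swap the two low bytes,
   leave bits 16..31 untouched.  */
static uint32_t swap_b_model (uint32_t a)
{
  return (a & 0xFFFF0000u) | ((a & 0xFF00u) >> 8) | ((a & 0xFFu) << 8);
}

/* Swapped low half with the upper half cleared (unsigned short result).  */
static uint32_t bswap16_zero_ext (uint32_t a)
{
  return swap_b_model (a) & 0xFFFFu;
}

/* Swapped low half, sign-extended to 32 bits (assumed requirement for a
   signed short result).  */
static int32_t bswap16_sign_ext (uint32_t a)
{
  uint32_t low = swap_b_model (a) & 0xFFFFu;
  return (low & 0x8000u) ? (int32_t) low - 0x10000 : (int32_t) low;
}
```

With a = 0xAABB1280 the model returns 0xAABB8012, while the zero-extended form is 0x8012 and the sign-extended form is negative, so the three only agree when the upper half happens to match.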

I was afraid that using a bswap:HI would result in unnecessary code around it:


    mov.l    .L6,r0  ! r0 = 0xFFFF0000
    and    r4,r0
    extu.w    r4,r4
    swap.b    r4,r4
    rts
    or    r4,r0

In my case, combine is looking for the pattern:

Failed to match this instruction:
(set (reg:SI 172 [ D.1356 ])
    (ior:SI (ior:SI (and:SI (ashift:SI (reg/v:SI 170 [ a ])
                    (const_int 8 [0x8]))
                (const_int 65280 [0xff00]))
            (zero_extract:SI (reg/v:SI 170 [ a ])
                (const_int 8 [0x8])
                (const_int 8 [0x8])))
        (and:SI (reg/v:SI 170 [ a ])
            (const_int -65536 [0xffffffffffff0000]))))

Adding a pattern for that should then work.
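Read as C, the RTL that combine tries to match is just another spelling of the swap that keeps the upper half. The sketch below assumes zero_extract bit positions count from the least significant bit; the function names are invented for illustration:

```c
#include <stdint.h>

/* The expression from the failed combine attempt, term by term.  */
static uint32_t combine_form (uint32_t a)
{
  uint32_t shifted_low = (a << 8) & 0xFF00u;  /* (and (ashift a 8) 0xff00) */
  uint32_t extracted   = (a >> 8) & 0xFFu;    /* (zero_extract a 8 8), assumed LSB-based */
  uint32_t upper       = a & 0xFFFF0000u;     /* (and a -65536) */
  return (shifted_low | extracted) | upper;
}

/* The source-level expression: swap the low two bytes, keep the upper half.  */
static uint32_t source_form (uint32_t a)
{
  return (a & 0xFFFF0000u) | ((a & 0xFF00u) >> 8) | ((a & 0xFFu) << 8);
}

/* Compare both forms on a handful of sample values.
   Returns 1 if they agree on all of them.  */
static int forms_agree (void)
{
  static const uint32_t samples[] = {
    0x00000000u, 0xFFFFFFFFu, 0xAABB1280u, 0x000000FFu, 0x0000FF00u, 0x12345678u
  };
  for (unsigned i = 0; i < sizeof samples / sizeof samples[0]; i++)
    if (combine_form (samples[i]) != source_form (samples[i]))
      return 0;
  return 1;
}
```

So a pattern that accepts this ior/and/zero_extract shape could, under that assumption, emit a single swap.b.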

> 
> Same as the other pattern, can you try to print n->n in hex with this new
> limit? My guess is that the pattern is now recognized but fails later for
> the same reason as above.

With the increased limit, the number is/was the same.  The bswap:HI expander was
missing.  Thanks!
