[ARM] Fix vget_lane for big-endian targets

Message ID	1437033404-4759-1-git-send-email-christophe.lyon@linaro.org
State	New
Headers	show Return-Path: <patchwork-forward+bncBDNJJ5OM3EFRBI6ITWWQKGQEZRKRP2Q@linaro.org> MIME-Version: 1.0 Received-SPF: pass (google.com: domain of patch+caf_=patchwork-forward=linaro.org@linaro.org designates 2a00:1450:4010:c04::22f as permitted sender) client-ip=2a00:1450:4010:c04::22f; Received-SPF: pass (google.com: domain of gcc-patches-return-403090-patch=linaro.org@gcc.gnu.org designates 209.132.180.131 as permitted sender) client-ip=209.132.180.131; Mailing-List: list patchwork-forward@linaro.org; contact patchwork-forward+owners@linaro.org Precedence: list Sender: gcc-patches-owner@gcc.gnu.org From: Christophe Lyon <christophe.lyon@linaro.org> To: gcc-patches@gcc.gnu.org Subject: [ARM] Fix vget_lane for big-endian targets Date: Thu, 16 Jul 2015 09:56:44 +0200 Message-Id: <1437033404-4759-1-git-send-email-christophe.lyon@linaro.org>

Message ID

1437033404-4759-1-git-send-email-christophe.lyon@linaro.org

State

New

Headers

MIME-Version: 1.0
Received-SPF: pass (google.com: domain of
	patch+caf_=patchwork-forward=linaro.org@linaro.org designates
	2a00:1450:4010:c04::22f as permitted sender)
	client-ip=2a00:1450:4010:c04::22f; 
Received-SPF: pass (google.com: domain of
	gcc-patches-return-403090-patch=linaro.org@gcc.gnu.org
	designates 209.132.180.131 as permitted sender)
	client-ip=209.132.180.131; 
Mailing-List: list patchwork-forward@linaro.org;
	contact patchwork-forward+owners@linaro.org
Precedence: list
Sender: gcc-patches-owner@gcc.gnu.org
From: Christophe Lyon <christophe.lyon@linaro.org>
To: gcc-patches@gcc.gnu.org
Subject: [ARM] Fix vget_lane for big-endian targets
Date: Thu, 16 Jul 2015 09:56:44 +0200
Message-Id: <1437033404-4759-1-git-send-email-christophe.lyon@linaro.org>

Commit Message

Christophe Lyon July 16, 2015, 7:56 a.m. UTC

AdvSIMD vget_lane tests currently fail on armeb targets when dealing
with vectors of 2 64-bits elements. This patches fixes it, by adding a
code fragment similar to what is dones in other cases. I could have
simplified it a bit given that the vector width is known, but I chose
to hardcode 'reg_nelts = 2' to keep the code closer to what is done
elsewhere.

OK for trunk?

Christophe

2015-07-16  Christophe Lyon  <christophe.lyon@linaro.org>

	* config/arm/neon.md (neon_vget_lanev2di): Handle big-endian
	targets.

Comments

Christophe Lyon Aug. 4, 2015, 12:09 p.m. UTC | #1

On 21 July 2015 at 16:01, Kyrill Tkachov <kyrylo.tkachov@arm.com> wrote:
>
> On 16/07/15 08:56, Christophe Lyon wrote:
>>
>> AdvSIMD vget_lane tests currently fail on armeb targets when dealing
>> with vectors of 2 64-bits elements. This patches fixes it, by adding a
>> code fragment similar to what is dones in other cases. I could have
>> simplified it a bit given that the vector width is known, but I chose
>> to hardcode 'reg_nelts = 2' to keep the code closer to what is done
>> elsewhere.
>>
>> OK for trunk?
>>
>> Christophe
>>
>> 2015-07-16  Christophe Lyon  <christophe.lyon@linaro.org>
>>
>>         * config/arm/neon.md (neon_vget_lanev2di): Handle big-endian
>>         targets.
>
>
> I see we do this for other lanewise patterns as well.
> Has this been tested on an arm big-endian target?
>
> If so, ok for trunk.

I forgot to mention that yes, I actually tested it on arm big-endian,
using QEMU.

Christophe.

>
> Thanks,
> Kyrill
>
>
>>
>> diff --git a/gcc/config/arm/neon.md b/gcc/config/arm/neon.md
>> index 654d9d5..59ddc5b 100644
>> --- a/gcc/config/arm/neon.md
>> +++ b/gcc/config/arm/neon.md
>> @@ -2736,6 +2736,19 @@
>>      (match_operand:SI 2 "immediate_operand" "")]
>>     "TARGET_NEON"
>>   {
>> +  if (BYTES_BIG_ENDIAN)
>> +    {
>> +      /* The intrinsics are defined in terms of a model where the
>> +        element ordering in memory is vldm order, whereas the generic
>> +        RTL is defined in terms of a model where the element ordering
>> +        in memory is array order.  Convert the lane number to conform
>> +        to this model.  */
>> +      unsigned int elt = INTVAL (operands[2]);
>> +      unsigned int reg_nelts = 2;
>> +      elt ^= reg_nelts - 1;
>> +      operands[2] = GEN_INT (elt);
>> +    }
>> +
>>     switch (INTVAL (operands[2]))
>>       {
>>       case 0:
>
>

diff --git a/gcc/config/arm/neon.md b/gcc/config/arm/neon.md
index 654d9d5..59ddc5b 100644
--- a/gcc/config/arm/neon.md
+++ b/gcc/config/arm/neon.md
@@ -2736,6 +2736,19 @@ 
    (match_operand:SI 2 "immediate_operand" "")]
   "TARGET_NEON"
 {
+  if (BYTES_BIG_ENDIAN)
+    {
+      /* The intrinsics are defined in terms of a model where the
+	 element ordering in memory is vldm order, whereas the generic
+	 RTL is defined in terms of a model where the element ordering
+	 in memory is array order.  Convert the lane number to conform
+	 to this model.  */
+      unsigned int elt = INTVAL (operands[2]);
+      unsigned int reg_nelts = 2;
+      elt ^= reg_nelts - 1;
+      operands[2] = GEN_INT (elt);
+    }
+
   switch (INTVAL (operands[2]))
     {
     case 0:

[ARM] Fix vget_lane for big-endian targets

Commit Message

Comments

Patch