[v2,07/68] target/arm: Simplify UMAAL

Message ID	20190819213755.26175-8-richard.henderson@linaro.org
State	Superseded
Headers	show Delivered-To: patch@linaro.org Received-SPF: pass (google.com: domain of qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as permitted sender) client-ip=209.51.188.17; From: Richard Henderson <richard.henderson@linaro.org> To: qemu-devel@nongnu.org Date: Mon, 19 Aug 2019 14:36:54 -0700 Message-Id: <20190819213755.26175-8-richard.henderson@linaro.org> In-Reply-To: <20190819213755.26175-1-richard.henderson@linaro.org> References: <20190819213755.26175-1-richard.henderson@linaro.org> Subject: [Qemu-devel] [PATCH v2 07/68] target/arm: Simplify UMAAL Precedence: list Cc: peter.maydell@linaro.org, qemu-arm@nongnu.org Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org Sender: "Qemu-devel" <qemu-devel-bounces+patch=linaro.org@nongnu.org>
Series	target/arm: Convert aa32 base isa to decodetree \| expand [v2,00/68] target/arm: Convert aa32 base isa to decodetree [v2,01/68] target/arm: Use store_reg_from_load in thumb2 code [v2,02/68] target/arm: Add stubs for aa32 decodetree [v2,03/68] target/arm: Convert Data Processing (register) [v2,04/68] target/arm: Convert Data Processing (reg-shifted-reg) [v2,05/68] target/arm: Convert Data Processing (immediate) [v2,06/68] target/arm: Convert multiply and multiply accumulate [v2,07/68] target/arm: Simplify UMAAL [v2,08/68] target/arm: Convert Saturating addition and subtraction [v2,09/68] target/arm: Convert Halfword multiply and multiply accumulate [v2,10/68] target/arm: Simplify op_smlaxxx for SMLAL* [v2,11/68] target/arm: Simplify op_smlawx for SMLAW* [v2,12/68] target/arm: Convert MSR (immediate) and hints [v2,13/68] target/arm: Convert MRS/MSR (banked, register) [v2,14/68] target/arm: Convert Cyclic Redundancy Check [v2,15/68] target/arm: Convert BX, BXJ, BLX (register) [v2,16/68] target/arm: Convert CLZ [v2,17/68] target/arm: Convert ERET [v2,18/68] target/arm: Convert the rest of A32 Miscelaneous instructions [v2,19/68] target/arm: Convert T32 ADDW/SUBW [v2,20/68] target/arm: Convert load/store (register, immediate, literal) [v2,21/68] target/arm: Convert Synchronization primitives [v2,22/68] target/arm: Convert USAD8, USADA8, SBFX, UBFX, BFC, BFI, UDF [v2,23/68] target/arm: Convert Parallel addition and subtraction [v2,24/68] target/arm: Convert Packing, unpacking, saturation, and reversal [v2,25/68] target/arm: Convert Signed multiply, signed and unsigned divide [v2,26/68] target/arm: Convert MOVW, MOVT [v2,27/68] target/arm: Convert LDM, STM [v2,28/68] target/arm: Diagnose writeback register in list for LDM for v7 [v2,29/68] target/arm: Diagnose too few registers in list for LDM/STM [v2,30/68] target/arm: Diagnose base == pc for LDM/STM [v2,31/68] target/arm: Convert B, BL, BLX (immediate) [v2,32/68] target/arm: Convert SVC [v2,33/68] target/arm: Convert RFE and SRS [v2,34/68] target/arm: Convert Clear-Exclusive, Barriers [v2,35/68] target/arm: Convert CPS (privileged) [v2,36/68] target/arm: Convert SETEND [v2,37/68] target/arm: Convert PLI, PLD, PLDW [v2,38/68] target/arm: Convert Unallocated memory hint [v2,39/68] target/arm: Convert Table Branch [v2,40/68] target/arm: Convert SG [v2,41/68] target/arm: Convert TT [v2,42/68] target/arm: Simplify disas_thumb2_insn [v2,43/68] target/arm: Simplify disas_arm_insn [v2,44/68] target/arm: Add skeleton for T16 decodetree [v2,45/68] target/arm: Convert T16 data-processing (two low regs) [v2,46/68] target/arm: Convert T16 load/store (register offset) [v2,47/68] target/arm: Convert T16 load/store (immediate offset) [v2,48/68] target/arm: Convert T16 add pc/sp (immediate) [v2,49/68] target/arm: Convert T16 load/store multiple [v2,50/68] target/arm: Convert T16 add/sub (3 low, 2 low and imm) [v2,51/68] target/arm: Convert T16 one low register and immediate [v2,52/68] target/arm: Convert T16 branch and exchange [v2,53/68] target/arm: Convert T16 add, compare, move (two high registers) [v2,54/68] target/arm: Convert T16 adjust sp (immediate) [v2,55/68] target/arm: Convert T16, extract [v2,56/68] target/arm: Convert T16, Change processor state [v2,57/68] target/arm: Convert T16, Reverse bytes [v2,58/68] target/arm: Convert T16, nop hints [v2,59/68] target/arm: Split gen_nop_hint [v2,60/68] target/arm: Convert T16, push and pop [v2,61/68] target/arm: Convert T16, Conditional branches, Supervisor call [v2,62/68] target/arm: Convert T16, Miscellaneous 16-bit instructions [v2,63/68] target/arm: Convert T16, shift immediate [v2,64/68] target/arm: Convert T16, load (literal) [v2,65/68] target/arm: Convert T16, Unconditional branch [v2,66/68] target/arm: Convert T16, long branches [v2,67/68] target/arm: Clean up disas_thumb_insn [v2,68/68] target/arm: Inline gen_bx_im into callers

Message ID

20190819213755.26175-8-richard.henderson@linaro.org

State

Superseded

Headers

Received-SPF: pass (google.com: domain of
	qemu-devel-bounces+patch=linaro.org@nongnu.org designates
	209.51.188.17 as permitted sender) client-ip=209.51.188.17; 
From: Richard Henderson <richard.henderson@linaro.org>
To: qemu-devel@nongnu.org
Date: Mon, 19 Aug 2019 14:36:54 -0700
Message-Id: <20190819213755.26175-8-richard.henderson@linaro.org>
In-Reply-To: <20190819213755.26175-1-richard.henderson@linaro.org>
References: <20190819213755.26175-1-richard.henderson@linaro.org>
Subject: [Qemu-devel] [PATCH v2 07/68] target/arm: Simplify UMAAL
Precedence: list
Cc: peter.maydell@linaro.org, qemu-arm@nongnu.org
Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org
Sender: "Qemu-devel" <qemu-devel-bounces+patch=linaro.org@nongnu.org>

Series

target/arm: Convert aa32 base isa to decodetree | expand

Commit Message

Richard Henderson Aug. 19, 2019, 9:36 p.m. UTC

Since all of the inputs and outputs are i32, dispense with
the intermediate promotion to i64 and use tcg_gen_mulu2_i32
and tcg_gen_add2_i32.

Signed-off-by: Richard Henderson <richard.henderson@linaro.org>

---
 target/arm/translate.c | 34 ++++++++++++----------------------
 1 file changed, 12 insertions(+), 22 deletions(-)

-- 
2.17.1

Comments

Peter Maydell Aug. 23, 2019, 12:20 p.m. UTC | #1

On Mon, 19 Aug 2019 at 22:38, Richard Henderson
<richard.henderson@linaro.org> wrote:
>

> Since all of the inputs and outputs are i32, dispense with

> the intermediate promotion to i64 and use tcg_gen_mulu2_i32

> and tcg_gen_add2_i32.

>

> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>8

> ---

>  target/arm/translate.c | 34 ++++++++++++----------------------

>  1 file changed, 12 insertions(+), 22 deletions(-)


Reviewed-by: Peter Maydell <peter.maydell@linaro.org>


thanks
-- PMM

diff --git a/target/arm/translate.c b/target/arm/translate.c
index 94659086c0..82bd207799 100644
--- a/target/arm/translate.c
+++ b/target/arm/translate.c
@@ -7324,21 +7324,6 @@  static void gen_storeq_reg(DisasContext *s, int rlow, int rhigh, TCGv_i64 val)
     store_reg(s, rhigh, tmp);
 }
 
-/* load a 32-bit value from a register and perform a 64-bit accumulate.  */
-static void gen_addq_lo(DisasContext *s, TCGv_i64 val, int rlow)
-{
-    TCGv_i64 tmp;
-    TCGv_i32 tmp2;
-
-    /* Load value and extend to 64 bits.  */
-    tmp = tcg_temp_new_i64();
-    tmp2 = load_reg(s, rlow);
-    tcg_gen_extu_i32_i64(tmp, tmp2);
-    tcg_temp_free_i32(tmp2);
-    tcg_gen_add_i64(val, val, tmp);
-    tcg_temp_free_i64(tmp);
-}
-
 /* load and add a 64-bit value from a register pair.  */
 static void gen_addq(DisasContext *s, TCGv_i64 val, int rlow, int rhigh)
 {
@@ -8090,8 +8075,7 @@  static bool trans_SMLAL(DisasContext *s, arg_SMLAL *a)
 
 static bool trans_UMAAL(DisasContext *s, arg_UMAAL *a)
 {
-    TCGv_i32 t0, t1;
-    TCGv_i64 t64;
+    TCGv_i32 t0, t1, t2, zero;
 
     if (s->thumb
         ? !arm_dc_feature(s, ARM_FEATURE_THUMB_DSP)
@@ -8101,11 +8085,17 @@  static bool trans_UMAAL(DisasContext *s, arg_UMAAL *a)
 
     t0 = load_reg(s, a->rm);
     t1 = load_reg(s, a->rn);
-    t64 = gen_mulu_i64_i32(t0, t1);
-    gen_addq_lo(s, t64, a->ra);
-    gen_addq_lo(s, t64, a->rd);
-    gen_storeq_reg(s, a->ra, a->rd, t64);
-    tcg_temp_free_i64(t64);
+    tcg_gen_mulu2_i32(t0, t1, t0, t1);
+    zero = tcg_const_i32(0);
+    t2 = load_reg(s, a->ra);
+    tcg_gen_add2_i32(t0, t1, t0, t1, t2, zero);
+    tcg_temp_free_i32(t2);
+    t2 = load_reg(s, a->rd);
+    tcg_gen_add2_i32(t0, t1, t0, t1, t2, zero);
+    tcg_temp_free_i32(t2);
+    tcg_temp_free_i32(zero);
+    store_reg(s, a->ra, t0);
+    store_reg(s, a->rd, t1);
     return true;
 }

[v2,07/68] target/arm: Simplify UMAAL

Commit Message

Comments

Patch