[29/43] tcg: Add tcg_reg_alloc_dup2

Message ID	20200909001647.532249-30-richard.henderson@linaro.org
State	Superseded
Headers	show Delivered-To: patch@linaro.org Received-SPF: pass (google.com: domain of qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as permitted sender) client-ip=209.51.188.17; From: Richard Henderson <richard.henderson@linaro.org> To: qemu-devel@nongnu.org Subject: [PATCH 29/43] tcg: Add tcg_reg_alloc_dup2 Date: Tue, 8 Sep 2020 17:16:33 -0700 Message-Id: <20200909001647.532249-30-richard.henderson@linaro.org> In-Reply-To: <20200909001647.532249-1-richard.henderson@linaro.org> References: <20200909001647.532249-1-richard.henderson@linaro.org> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=2607:f8b0:4864:20::444; envelope-from=richard.henderson@linaro.org; helo=mail-pf1-x444.google.com Precedence: list Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org Sender: "Qemu-devel" <qemu-devel-bounces+patch=linaro.org@nongnu.org>
Series	tcg patch queue \| expand [00/43] tcg patch queue [01/43] tcg: Adjust simd_desc size encoding [02/43] tcg: Drop union from TCGArgConstraint [03/43] tcg: Move sorted_args into TCGArgConstraint.sort_index [04/43] tcg: Remove TCG_CT_REG [05/43] tcg: Move some TCG_CT_* bits to TCGArgConstraint bitfields [06/43] tcg: Remove TCGOpDef.used [07/43] tcg/i386: Fix dupi for avx2 32-bit hosts [08/43] tcg: Fix generation of dupi_vec for 32-bit host [09/43] tcg/optimize: Fold dup2_vec [10/43] tcg: Remove TCG_TARGET_HAS_cmp_vec [11/43] tcg: Use tcg_out_dupi_vec from temp_load [12/43] tcg: Increase tcg_out_dupi_vec immediate to int64_t [13/43] tcg: Consolidate 3 bits into enum TCGTempKind [14/43] tcg: Add temp_readonly [15/43] tcg: Expand TCGTemp.val to 64-bits [16/43] tcg: Rename struct tcg_temp_info to TempOptInfo [17/43] tcg: Expand TempOptInfo to 64-bits [18/43] tcg: Introduce TYPE_CONST temporaries [19/43] tcg/optimize: Improve find_better_copy [20/43] tcg/optimize: Adjust TempOptInfo allocation [21/43] tcg/optimize: Use tcg_constant_internal with constant folding [22/43] tcg: Convert tcg_gen_dupi_vec to TCG_CONST [23/43] tcg: Use tcg_constant_i32 with icount expander [24/43] tcg: Use tcg_constant_{i32,i64} with tcg int expanders [25/43] tcg: Use tcg_constant_{i32,i64} with tcg plugins [26/43] tcg: Use tcg_constant_{i32, i64, vec} with gvec expanders [27/43] tcg/tci: Add special tci_movi_{i32,i64} opcodes [28/43] tcg: Remove movi and dupi opcodes [29/43] tcg: Add tcg_reg_alloc_dup2 [30/43] tcg/i386: Use tcg_constant_vec with tcg vec expanders [31/43] tcg: Remove tcg_gen_dup{8,16,32,64}i_vec [32/43] tcg/ppc: Use tcg_constant_vec with tcg vec expanders [33/43] tcg/aarch64: Use tcg_constant_vec with tcg vec expanders [34/43] tcg: Add tcg-constr.c.inc [35/43] tcg/i386: Convert to tcg-constr.c.inc [36/43] tcg/aarch64: Convert to tcg-constr.c.inc [37/43] tcg/arm: Convert to tcg-constr.c.inc [38/43] tcg/mips: Convert to tcg-constr.c.inc [39/43] tcg/ppc: Convert to tcg-constr.c.inc [40/43] tcg/riscv: Convert to tcg-constr.c.inc [41/43] tcg/s390: Convert to tcg-constr.c.inc [42/43] tcg/sparc: Convert to tcg-constr.c.inc [43/43] tcg/tci: Convert to tcg-constr.c.inc

Message ID

20200909001647.532249-30-richard.henderson@linaro.org

State

Superseded

Headers

Received-SPF: pass (google.com: domain of
	qemu-devel-bounces+patch=linaro.org@nongnu.org designates
	209.51.188.17 as permitted sender) client-ip=209.51.188.17; 
From: Richard Henderson <richard.henderson@linaro.org>
To: qemu-devel@nongnu.org
Subject: [PATCH 29/43] tcg: Add tcg_reg_alloc_dup2
Date: Tue,  8 Sep 2020 17:16:33 -0700
Message-Id: <20200909001647.532249-30-richard.henderson@linaro.org>
In-Reply-To: <20200909001647.532249-1-richard.henderson@linaro.org>
References: <20200909001647.532249-1-richard.henderson@linaro.org>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Received-SPF: pass client-ip=2607:f8b0:4864:20::444;
	envelope-from=richard.henderson@linaro.org;
	helo=mail-pf1-x444.google.com
X-Spam_score_int: -20
X-Spam_score: -2.1
X-Spam_bar: --
X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1,
	DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1,
	RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001,
	SPF_PASS=-0.001 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.23
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <https://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org
Sender: "Qemu-devel" <qemu-devel-bounces+patch=linaro.org@nongnu.org>

Series

tcg patch queue | expand

Commit Message

Richard Henderson Sept. 9, 2020, 12:16 a.m. UTC

There are several ways we can expand a vector dup of a 64-bit
element on a 32-bit host.

Signed-off-by: Richard Henderson <richard.henderson@linaro.org>

---
 tcg/tcg.c | 97 +++++++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 97 insertions(+)

-- 
2.25.1

diff --git a/tcg/tcg.c b/tcg/tcg.c
index f9c6450837..507c95cd39 100644
--- a/tcg/tcg.c
+++ b/tcg/tcg.c
@@ -3954,6 +3954,100 @@  static void tcg_reg_alloc_op(TCGContext *s, const TCGOp *op)
     }
 }
 
+static void tcg_reg_alloc_dup2(TCGContext *s, const TCGOp *op)
+{
+    const TCGLifeData arg_life = op->life;
+    TCGTemp *ots, *itsl, *itsh;
+    TCGType vtype = TCGOP_VECL(op) + TCG_TYPE_V64;
+
+    /* This opcode is only valid for 32-bit hosts, for 64-bit elements. */
+    tcg_debug_assert(TCG_TARGET_REG_BITS == 32);
+    tcg_debug_assert(TCGOP_VECE(op) == MO_64);
+
+    ots = arg_temp(op->args[0]);
+    itsl = arg_temp(op->args[1]);
+    itsh = arg_temp(op->args[2]);
+
+    /* ENV should not be modified.  */
+    tcg_debug_assert(!temp_readonly(ots));
+
+    /* Allocate the output register now.  */
+    if (ots->val_type != TEMP_VAL_REG) {
+        TCGRegSet allocated_regs = s->reserved_regs;
+        TCGRegSet dup_out_regs =
+            tcg_op_defs[INDEX_op_dup_vec].args_ct[0].regs;
+
+        /* Make sure to not spill the input registers. */
+        if (!IS_DEAD_ARG(1) && itsl->val_type == TEMP_VAL_REG) {
+            tcg_regset_set_reg(allocated_regs, itsl->reg);
+        }
+        if (!IS_DEAD_ARG(2) && itsh->val_type == TEMP_VAL_REG) {
+            tcg_regset_set_reg(allocated_regs, itsh->reg);
+        }
+
+        ots->reg = tcg_reg_alloc(s, dup_out_regs, allocated_regs,
+                                 op->output_pref[0], ots->indirect_base);
+        ots->val_type = TEMP_VAL_REG;
+        ots->mem_coherent = 0;
+        s->reg_to_temp[ots->reg] = ots;
+    }
+
+    /* Promote dup2 of immediates to dupi_vec. */
+    if (itsl->val_type == TEMP_VAL_CONST && itsh->val_type == TEMP_VAL_CONST) {
+        uint64_t val = deposit64(itsl->val, 32, 32, itsh->val);
+        MemOp vece = MO_64;
+
+        if (val == dup_const(MO_8, val)) {
+            vece = MO_8;
+        } else if (val == dup_const(MO_16, val)) {
+            vece = MO_16;
+        } else if (val == dup_const(MO_32, val)) {
+            vece = MO_32;
+        }
+
+        tcg_out_dupi_vec(s, vtype, vece, ots->reg, val);
+        goto done;
+    }
+
+    /* If the two inputs form one 64-bit value, try dupm_vec. */
+    if (itsl + 1 == itsh &&
+        itsl->base_type == TCG_TYPE_I64 &&
+        itsh->base_type == TCG_TYPE_I64) {
+        if (!itsl->mem_coherent) {
+            temp_sync(s, itsl, s->reserved_regs, 0, 0);
+        }
+        if (!itsl->mem_coherent) {
+            temp_sync(s, itsl, s->reserved_regs, 0, 0);
+        }
+#ifdef HOST_WORDS_BIGENDIAN
+        TCGTemp *its = itsh;
+#else
+        TCGTemp *its = itsl;
+#endif
+        if (tcg_out_dupm_vec(s, vtype, MO_64, ots->reg,
+                             its->mem_base->reg, its->mem_offset)) {
+            goto done;
+        }
+    }
+
+    /* Fall back to generic expansion. */
+    tcg_reg_alloc_op(s, op);
+    return;
+
+ done:
+    if (IS_DEAD_ARG(1)) {
+        temp_dead(s, itsl);
+    }
+    if (IS_DEAD_ARG(2)) {
+        temp_dead(s, itsh);
+    }
+    if (NEED_SYNC_ARG(0)) {
+        temp_sync(s, ots, s->reserved_regs, 0, IS_DEAD_ARG(0));
+    } else if (IS_DEAD_ARG(0)) {
+        temp_dead(s, ots);
+    }
+}
+
 #ifdef TCG_TARGET_STACK_GROWSUP
 #define STACK_DIR(x) (-(x))
 #else
@@ -4345,6 +4439,9 @@  int tcg_gen_code(TCGContext *s, TranslationBlock *tb)
         case INDEX_op_dup_vec:
             tcg_reg_alloc_dup(s, op);
             break;
+        case INDEX_op_dup2_vec:
+            tcg_reg_alloc_dup2(s, op);
+            break;
         case INDEX_op_insn_start:
             if (num_insns >= 0) {
                 size_t off = tcg_current_code_size(s);

[29/43] tcg: Add tcg_reg_alloc_dup2

Commit Message

Patch