Message ID: 20250415192515.232910-111-richard.henderson@linaro.org
State: New
Series: tcg: Convert to TCGOutOp structures
On 4/15/25 12:24, Richard Henderson wrote:
> We now produce exactly the same code via generic expansion.
>
> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>

Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org>
diff --git a/tcg/riscv/tcg-target-con-set.h b/tcg/riscv/tcg-target-con-set.h
index 5ff2c2db60..0fc26d3f98 100644
--- a/tcg/riscv/tcg-target-con-set.h
+++ b/tcg/riscv/tcg-target-con-set.h
@@ -18,7 +18,6 @@ C_O1_I2(r, r, ri)
 C_O1_I2(r, r, rI)
 C_N1_I2(r, r, rM)
 C_O1_I4(r, r, rI, rM, rM)
-C_O2_I4(r, r, rz, rz, rM, rM)
 C_O0_I2(v, r)
 C_O1_I1(v, r)
 C_O1_I1(v, v)
diff --git a/tcg/riscv/tcg-target-has.h b/tcg/riscv/tcg-target-has.h
index b2814f8ef9..c95dc1921e 100644
--- a/tcg/riscv/tcg-target-has.h
+++ b/tcg/riscv/tcg-target-has.h
@@ -10,13 +10,11 @@
 #include "host/cpuinfo.h"
 
 /* optional instructions */
-#define TCG_TARGET_HAS_add2_i32         1
-#define TCG_TARGET_HAS_sub2_i32         1
 #define TCG_TARGET_HAS_qemu_st8_i32     0
 
 #define TCG_TARGET_HAS_extr_i64_i32     1
-#define TCG_TARGET_HAS_add2_i64         1
-#define TCG_TARGET_HAS_sub2_i64         1
+#define TCG_TARGET_HAS_add2_i64         0
+#define TCG_TARGET_HAS_sub2_i64         0
 
 #define TCG_TARGET_HAS_qemu_ldst_i128   0
 
diff --git a/tcg/riscv/tcg-target.c.inc b/tcg/riscv/tcg-target.c.inc
index d74ac7587a..dce46dcba6 100644
--- a/tcg/riscv/tcg-target.c.inc
+++ b/tcg/riscv/tcg-target.c.inc
@@ -401,7 +401,7 @@ static bool tcg_target_const_match(int64_t val, int ct,
     }
     /*
      * Sign extended from 12 bits, +/- matching: [-0x7ff, 0x7ff].
-     * Used by addsub2 and movcond, which may need the negative value,
+     * Used by movcond, which may need the negative value,
      * and requires the modified constant to be representable.
      */
     if ((ct & TCG_CT_CONST_M12) && val >= -0x7ff && val <= 0x7ff) {
@@ -1073,67 +1073,6 @@ static bool tcg_out_sti(TCGContext *s, TCGType type, TCGArg val,
     return false;
 }
 
-static void tcg_out_addsub2(TCGContext *s,
-                            TCGReg rl, TCGReg rh,
-                            TCGReg al, TCGReg ah,
-                            TCGArg bl, TCGArg bh,
-                            bool cbl, bool cbh, bool is_sub, bool is32bit)
-{
-    const RISCVInsn opc_add = is32bit ? OPC_ADDW : OPC_ADD;
-    const RISCVInsn opc_addi = is32bit ? OPC_ADDIW : OPC_ADDI;
-    const RISCVInsn opc_sub = is32bit ? OPC_SUBW : OPC_SUB;
-    TCGReg th = TCG_REG_TMP1;
-
-    /* If we have a negative constant such that negating it would
-       make the high part zero, we can (usually) eliminate one insn. */
-    if (cbl && cbh && bh == -1 && bl != 0) {
-        bl = -bl;
-        bh = 0;
-        is_sub = !is_sub;
-    }
-
-    /* By operating on the high part first, we get to use the final
-       carry operation to move back from the temporary. */
-    if (!cbh) {
-        tcg_out_opc_reg(s, (is_sub ? opc_sub : opc_add), th, ah, bh);
-    } else if (bh != 0 || ah == rl) {
-        tcg_out_opc_imm(s, opc_addi, th, ah, (is_sub ? -bh : bh));
-    } else {
-        th = ah;
-    }
-
-    /* Note that tcg optimization should eliminate the bl == 0 case. */
-    if (is_sub) {
-        if (cbl) {
-            tcg_out_opc_imm(s, OPC_SLTIU, TCG_REG_TMP0, al, bl);
-            tcg_out_opc_imm(s, opc_addi, rl, al, -bl);
-        } else {
-            tcg_out_opc_reg(s, OPC_SLTU, TCG_REG_TMP0, al, bl);
-            tcg_out_opc_reg(s, opc_sub, rl, al, bl);
-        }
-        tcg_out_opc_reg(s, opc_sub, rh, th, TCG_REG_TMP0);
-    } else {
-        if (cbl) {
-            tcg_out_opc_imm(s, opc_addi, rl, al, bl);
-            tcg_out_opc_imm(s, OPC_SLTIU, TCG_REG_TMP0, rl, bl);
-        } else if (al == bl) {
-            /*
-             * If the input regs overlap, this is a simple doubling
-             * and carry-out is the input msb. This special case is
-             * required when the output reg overlaps the input,
-             * but we might as well use it always.
-             */
-            tcg_out_opc_imm(s, OPC_SLTI, TCG_REG_TMP0, al, 0);
-            tcg_out_opc_reg(s, opc_add, rl, al, al);
-        } else {
-            tcg_out_opc_reg(s, opc_add, rl, al, bl);
-            tcg_out_opc_reg(s, OPC_SLTU, TCG_REG_TMP0,
-                            rl, (rl == bl ? al : bl));
-        }
-        tcg_out_opc_reg(s, opc_add, rh, th, TCG_REG_TMP0);
-    }
-}
-
 static bool tcg_out_dup_vec(TCGContext *s, TCGType type, unsigned vece,
                             TCGReg dst, TCGReg src)
 {
@@ -2608,23 +2547,6 @@ static void tcg_out_op(TCGContext *s, TCGOpcode opc, TCGType type,
         tcg_out_ldst(s, OPC_SD, a0, a1, a2);
         break;
 
-    case INDEX_op_add2_i32:
-        tcg_out_addsub2(s, a0, a1, a2, args[3], args[4], args[5],
-                        const_args[4], const_args[5], false, true);
-        break;
-    case INDEX_op_add2_i64:
-        tcg_out_addsub2(s, a0, a1, a2, args[3], args[4], args[5],
-                        const_args[4], const_args[5], false, false);
-        break;
-    case INDEX_op_sub2_i32:
-        tcg_out_addsub2(s, a0, a1, a2, args[3], args[4], args[5],
-                        const_args[4], const_args[5], true, true);
-        break;
-    case INDEX_op_sub2_i64:
-        tcg_out_addsub2(s, a0, a1, a2, args[3], args[4], args[5],
-                        const_args[4], const_args[5], true, false);
-        break;
-
     case INDEX_op_qemu_ld_i32:
         tcg_out_qemu_ld(s, a0, a1, a2, TCG_TYPE_I32);
         break;
@@ -2897,12 +2819,6 @@ tcg_target_op_def(TCGOpcode op, TCGType type, unsigned flags)
     case INDEX_op_st_i64:
         return C_O0_I2(rz, r);
 
-    case INDEX_op_add2_i32:
-    case INDEX_op_add2_i64:
-    case INDEX_op_sub2_i32:
-    case INDEX_op_sub2_i64:
-        return C_O2_I4(r, r, rz, rz, rM, rM);
-
     case INDEX_op_qemu_ld_i32:
     case INDEX_op_qemu_ld_i64:
         return C_O1_I1(r, r);
We now produce exactly the same code via generic expansion.

Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
 tcg/riscv/tcg-target-con-set.h |  1 -
 tcg/riscv/tcg-target-has.h     |  6 +--
 tcg/riscv/tcg-target.c.inc     | 86 +---------------------------------
 3 files changed, 3 insertions(+), 90 deletions(-)
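
For readers unfamiliar with the deleted helper: RISC-V has no flags register, so the carry out of the low-half addition is recovered with an unsigned compare (SLTU) and then folded into the high half. Below is a minimal C sketch of that idea; the function name is illustrative and this is not QEMU code, just the register-register add path of the removed tcg_out_addsub2() written out as ordinary C (per the commit message, the generic TCG expansion now emits the same instruction sequence).

#include <stdint.h>

/*
 * Double-word add: (rl,rh) = (al,ah) + (bl,bh).
 * Mirrors the removed register-register path:
 *   ADD  th,   ah, bh    high halves first, into a temp
 *   ADD  rl,   al, bl    low halves
 *   SLTU tmp0, rl, bl    carry out of the low-half add
 *   ADD  rh,   th, tmp0  fold the carry into the high half
 */
static void add2_sketch(uint64_t *rl, uint64_t *rh,
                        uint64_t al, uint64_t ah,
                        uint64_t bl, uint64_t bh)
{
    uint64_t th = ah + bh;       /* high halves first */
    uint64_t lo = al + bl;       /* low halves */
    uint64_t carry = lo < bl;    /* 1 iff the low-half add wrapped */

    *rl = lo;
    *rh = th + carry;            /* final carry moves back from the temp */
}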