[26/67] target/arm: Convert FSQRT (scalar) to decodetree

Message ID	20241201150607.12812-27-richard.henderson@linaro.org
State	Superseded
Headers	show Delivered-To: patch@linaro.org Received-SPF: pass (google.com: domain of qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as permitted sender) client-ip=209.51.188.17; From: Richard Henderson <richard.henderson@linaro.org> To: qemu-devel@nongnu.org Cc: qemu-arm@nongnu.org Subject: [PATCH 26/67] target/arm: Convert FSQRT (scalar) to decodetree Date: Sun, 1 Dec 2024 09:05:25 -0600 Message-ID: <20241201150607.12812-27-richard.henderson@linaro.org> In-Reply-To: <20241201150607.12812-1-richard.henderson@linaro.org> References: <20241201150607.12812-1-richard.henderson@linaro.org> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=2607:f8b0:4864:20::c32; envelope-from=richard.henderson@linaro.org; helo=mail-oo1-xc32.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action Precedence: list Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org Sender: qemu-devel-bounces+patch=linaro.org@nongnu.org
Series	target/arm: AArch64 decodetree conversion, final part \| expand [00/67] target/arm: AArch64 decodetree conversion, final part [01/67] target/arm: Use ### to separate 3rd-level sections in a64.decode [02/67] target/arm: Convert UDIV, SDIV to decodetree [03/67] target/arm: Convert LSLV, LSRV, ASRV, RORV to decodetree [04/67] target/arm: Convert CRC32, CRC32C to decodetree [05/67] target/arm: Convert SUBP, IRG, GMI to decodetree [06/67] target/arm: Convert PACGA to decodetree [07/67] target/arm: Convert RBIT, REV16, REV32, REV64 to decodetree [08/67] target/arm: Convert CLZ, CLS to decodetree [09/67] target/arm: Convert PAC[ID], AUT[ID] to decodetree [10/67] target/arm: Convert XPAC[ID] to decodetree [11/67] target/arm: Convert disas_logic_reg to decodetree [12/67] target/arm: Convert disas_add_sub_ext_reg to decodetree [13/67] target/arm: Convert disas_add_sub_reg to decodetree [14/67] target/arm: Convert disas_data_proc_3src to decodetree [15/67] target/arm: Convert disas_adc_sbc to decodetree [16/67] target/arm: Convert RMIF to decodetree [17/67] target/arm: Convert SETF8, SETF16 to decodetree [18/67] target/arm: Convert CCMP, CCMN to decodetree [19/67] target/arm: Convert disas_cond_select to decodetree [20/67] target/arm: Introduce fp_access_check_scalar_hsd [21/67] target/arm: Introduce fp_access_check_vector_hsd [22/67] target/arm: Convert FCMP, FCMPE, FCCMP, FCCMPE to decodetree [23/67] target/arm: Convert FMOV, FABS, FNEG (scalar) to decodetree [24/67] target/arm: Pass fpstatus to vfp_sqrt* [25/67] target/arm: Remove helper_sqrt_f16 [26/67] target/arm: Convert FSQRT (scalar) to decodetree [27/67] target/arm: Convert FRINT[NPMSAXI] (scalar) to decodetree [28/67] target/arm: Convert BFCVT to decodetree [29/67] target/arm: Convert FRINT{32, 64}[ZX] (scalar) to decodetree [30/67] target/arm: Convert FCVT (scalar) to decodetree [31/67] target/arm: Convert handle_fpfpcvt to decodetree [32/67] target/arm: Convert FJCVTZS to decodetree [33/67] target/arm: Convert handle_fmov to decodetree [34/67] target/arm: Convert SQABS, SQNEG to decodetree [35/67] target/arm: Convert ABS, NEG to decodetree [36/67] target/arm: Introduce gen_gvec_cls, gen_gvec_clz [37/67] target/arm: Convert CLS, CLZ (vector) to decodetree [38/67] target/arm: Introduce gen_gvec_cnt, gen_gvec_rbit [39/67] target/arm: Convert CNT, NOT, RBIT (vector) to decodetree [40/67] target/arm: Convert CMGT, CMGE, GMLT, GMLE, CMEQ (zero) to decodetree [41/67] target/arm: Introduce gen_gvec_rev{16,32,64} [42/67] target/arm: Convert handle_rev to decodetree [43/67] target/arm: Move helper_neon_addlp_{s8, s16} to neon_helper.c [44/67] target/arm: Introduce gen_gvec_{s,u}{add,ada}lp [45/67] target/arm: Convert handle_2misc_pairwise to decodetree [46/67] target/arm: Remove helper_neon_{add,sub}l_u{16,32} [47/67] target/arm: Introduce clear_vec [48/67] target/arm: Convert XTN, SQXTUN, SQXTN, UQXTN to decodetree [49/67] target/arm: Convert FCVTN, BFCVTN to decodetree [50/67] target/arm: Convert FCVTXN to decodetree [51/67] target/arm: Convert SHLL to decodetree [52/67] target/arm: Convert FABS, FNEG (vector) to decodetree [53/67] target/arm: Convert FSQRT (vector) to decodetree [54/67] target/arm: Convert FRINT* (vector) to decodetree [55/67] target/arm: Convert FCVT* (vector, integer) scalar to decodetree [56/67] target/arm: Convert FCVT* (vector, fixed-point) scalar to decodetree [57/67] target/arm: Convert [US]CVTF (vector, integer) scalar to decodetree [58/67] target/arm: Convert [US]CVTF (vector, fixed-point) scalar to decodetree [59/67] target/arm: Rename helper_gvec_vcvt_[hf][su] with _rz [60/67] target/arm: Convert [US]CVTF (vector) to decodetree [61/67] target/arm: Convert FCVTZ[SU] (vector, fixed-point) to decodetree [62/67] target/arm: Convert FCVT* (vector, integer) to decodetree [63/67] target/arm: Convert handle_2misc_fcmp_zero to decodetree [64/67] target/arm: Convert FRECPE, FRECPX, FRSQRTE to decodetree [65/67] target/arm: Introduce gen_gvec_urecpe, gen_gvec_ursqrte [66/67] target/arm: Convert URECPE and URSQRTE to decodetree [67/67] target/arm: Convert FCVTL to decodetree

Message ID

20241201150607.12812-27-richard.henderson@linaro.org

State

Superseded

Headers

Received-SPF: pass (google.com: domain of
 qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as
 permitted sender) client-ip=209.51.188.17;
From: Richard Henderson <richard.henderson@linaro.org>
To: qemu-devel@nongnu.org
Cc: qemu-arm@nongnu.org
Subject: [PATCH 26/67] target/arm: Convert FSQRT (scalar) to decodetree
Date: Sun,  1 Dec 2024 09:05:25 -0600
Message-ID: <20241201150607.12812-27-richard.henderson@linaro.org>
In-Reply-To: <20241201150607.12812-1-richard.henderson@linaro.org>
References: <20241201150607.12812-1-richard.henderson@linaro.org>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Received-SPF: pass client-ip=2607:f8b0:4864:20::c32;
 envelope-from=richard.henderson@linaro.org; helo=mail-oo1-xc32.google.com
X-Spam_score_int: -20
X-Spam_score: -2.1
X-Spam_bar: --
X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1,
 DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1,
 RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001,
 SPF_PASS=-0.001 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <https://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org
Sender: qemu-devel-bounces+patch=linaro.org@nongnu.org

Series

target/arm: AArch64 decodetree conversion, final part | expand

Signed-off-by: Richard Henderson <richard.henderson@linaro.org> --- target/arm/tcg/translate-a64.c | 70 +++++++++++++++++++++++++++++----- target/arm/tcg/a64.decode | 1 + 2 files changed, 61 insertions(+), 10 deletions(-)

Comments

Peter Maydell Dec. 6, 2024, 1:19 p.m. UTC | #1

On Sun, 1 Dec 2024 at 15:14, Richard Henderson
<richard.henderson@linaro.org> wrote:
>
> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
> ---
>  target/arm/tcg/translate-a64.c | 70 +++++++++++++++++++++++++++++-----
>  target/arm/tcg/a64.decode      |  1 +
>  2 files changed, 61 insertions(+), 10 deletions(-)

Other than the missing code in the old decoder mentioned
on the comments on the earlier patch,
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>

thanks
-- PMM

diff --git a/target/arm/tcg/translate-a64.c b/target/arm/tcg/translate-a64.c
index 4d945f2d5b..750db921cd 100644
--- a/target/arm/tcg/translate-a64.c
+++ b/target/arm/tcg/translate-a64.c
@@ -8348,6 +8348,63 @@  static const FPScalar1Int f_scalar_fneg = {
 };
 TRANS(FNEG_s, do_fp1_scalar_int, a, &f_scalar_fneg)
 
+typedef struct FPScalar1 {
+    void (*gen_h)(TCGv_i32, TCGv_i32, TCGv_ptr);
+    void (*gen_s)(TCGv_i32, TCGv_i32, TCGv_ptr);
+    void (*gen_d)(TCGv_i64, TCGv_i64, TCGv_ptr);
+} FPScalar1;
+
+static bool do_fp1_scalar(DisasContext *s, arg_rr_e *a,
+                          const FPScalar1 *f, int rmode)
+{
+    TCGv_i32 tcg_rmode = NULL;
+    TCGv_ptr fpst;
+    TCGv_i64 t64;
+    TCGv_i32 t32;
+    int check = fp_access_check_scalar_hsd(s, a->esz);
+
+    if (check <= 0) {
+        return check == 0;
+    }
+
+    fpst = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
+    if (rmode >= 0) {
+        tcg_rmode = gen_set_rmode(rmode, fpst);
+    }
+
+    switch (a->esz) {
+    case MO_64:
+        t64 = read_fp_dreg(s, a->rn);
+        f->gen_d(t64, t64, fpst);
+        write_fp_dreg(s, a->rd, t64);
+        break;
+    case MO_32:
+        t32 = read_fp_sreg(s, a->rn);
+        f->gen_s(t32, t32, fpst);
+        write_fp_sreg(s, a->rd, t32);
+        break;
+    case MO_16:
+        t32 = read_fp_hreg(s, a->rn);
+        f->gen_h(t32, t32, fpst);
+        write_fp_sreg(s, a->rd, t32);
+        break;
+    default:
+        g_assert_not_reached();
+    }
+
+    if (rmode >= 0) {
+        gen_restore_rmode(tcg_rmode, fpst);
+    }
+    return true;
+}
+
+static const FPScalar1 f_scalar_fsqrt = {
+    gen_helper_vfp_sqrth,
+    gen_helper_vfp_sqrts,
+    gen_helper_vfp_sqrtd,
+};
+TRANS(FSQRT_s, do_fp1_scalar, a, &f_scalar_fsqrt, -1)
+
 /* Floating-point data-processing (1 source) - half precision */
 static void handle_fp_1src_half(DisasContext *s, int opcode, int rd, int rn)
 {
@@ -8356,10 +8413,6 @@  static void handle_fp_1src_half(DisasContext *s, int opcode, int rd, int rn)
     TCGv_i32 tcg_res = tcg_temp_new_i32();
 
     switch (opcode) {
-    case 0x3: /* FSQRT */
-        fpst = fpstatus_ptr(FPST_FPCR_F16);
-        gen_helper_vfp_sqrth(tcg_res, tcg_op, fpst);
-        break;
     case 0x8: /* FRINTN */
     case 0x9: /* FRINTP */
     case 0xa: /* FRINTM */
@@ -8386,6 +8439,7 @@  static void handle_fp_1src_half(DisasContext *s, int opcode, int rd, int rn)
     case 0x0: /* FMOV */
     case 0x1: /* FABS */
     case 0x2: /* FNEG */
+    case 0x3: /* FSQRT */
         g_assert_not_reached();
     }
 
@@ -8404,9 +8458,6 @@  static void handle_fp_1src_single(DisasContext *s, int opcode, int rd, int rn)
     tcg_res = tcg_temp_new_i32();
 
     switch (opcode) {
-    case 0x3: /* FSQRT */
-        gen_fpst = gen_helper_vfp_sqrts;
-        break;
     case 0x6: /* BFCVT */
         gen_fpst = gen_helper_bfcvt;
         break;
@@ -8442,6 +8493,7 @@  static void handle_fp_1src_single(DisasContext *s, int opcode, int rd, int rn)
     case 0x0: /* FMOV */
     case 0x1: /* FABS */
     case 0x2: /* FNEG */
+    case 0x3: /* FSQRT */
         g_assert_not_reached();
     }
 
@@ -8469,9 +8521,6 @@  static void handle_fp_1src_double(DisasContext *s, int opcode, int rd, int rn)
     tcg_res = tcg_temp_new_i64();
 
     switch (opcode) {
-    case 0x3: /* FSQRT */
-        gen_fpst = gen_helper_vfp_sqrtd;
-        break;
     case 0x8: /* FRINTN */
     case 0x9: /* FRINTP */
     case 0xa: /* FRINTM */
@@ -8504,6 +8553,7 @@  static void handle_fp_1src_double(DisasContext *s, int opcode, int rd, int rn)
     case 0x0: /* FMOV */
     case 0x1: /* FABS */
     case 0x2: /* FNEG */
+    case 0x3: /* FSQRT */
         g_assert_not_reached();
     }
 
diff --git a/target/arm/tcg/a64.decode b/target/arm/tcg/a64.decode
index fca64c63c3..9e5ea14683 100644
--- a/target/arm/tcg/a64.decode
+++ b/target/arm/tcg/a64.decode
@@ -1327,6 +1327,7 @@  FMINV_s         0110 1110 10 11000 01111 10 ..... .....     @rr_q1e2
 FMOV_s          00011110 .. 1 000000 10000 ..... .....      @rr_hsd
 FABS_s          00011110 .. 1 000001 10000 ..... .....      @rr_hsd
 FNEG_s          00011110 .. 1 000010 10000 ..... .....      @rr_hsd
+FSQRT_s         00011110 .. 1 000011 10000 ..... .....      @rr_hsd
 
 # Floating-point Immediate

[26/67] target/arm: Convert FSQRT (scalar) to decodetree

Commit Message

Comments

Patch