[v3,50/69] target/arm: Convert FCVTN, BFCVTN to decodetree

Message ID	20241211163036.2297116-51-richard.henderson@linaro.org
State	New
Headers	show Delivered-To: patch@linaro.org Received-SPF: pass (google.com: domain of qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as permitted sender) client-ip=209.51.188.17; From: Richard Henderson <richard.henderson@linaro.org> To: qemu-devel@nongnu.org Cc: qemu-arm@nongnu.org, Peter Maydell <peter.maydell@linaro.org> Subject: [PATCH v3 50/69] target/arm: Convert FCVTN, BFCVTN to decodetree Date: Wed, 11 Dec 2024 10:30:17 -0600 Message-ID: <20241211163036.2297116-51-richard.henderson@linaro.org> In-Reply-To: <20241211163036.2297116-1-richard.henderson@linaro.org> References: <20241211163036.2297116-1-richard.henderson@linaro.org> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=2607:f8b0:4864:20::f30; envelope-from=richard.henderson@linaro.org; helo=mail-qv1-xf30.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action Precedence: list Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org Sender: qemu-devel-bounces+patch=linaro.org@nongnu.org
Series	target/arm: AArch64 decodetree conversion, final part \| expand [v3,00/69] target/arm: AArch64 decodetree conversion, final part [v3,01/69] target/arm: Add section labels for "Data Processing (register)" [v3,02/69] target/arm: Convert UDIV, SDIV to decodetree [v3,03/69] target/arm: Convert LSLV, LSRV, ASRV, RORV to decodetree [v3,04/69] target/arm: Convert CRC32, CRC32C to decodetree [v3,05/69] target/arm: Convert SUBP, IRG, GMI to decodetree [v3,06/69] target/arm: Convert PACGA to decodetree [v3,07/69] target/arm: Convert RBIT, REV16, REV32, REV64 to decodetree [v3,08/69] target/arm: Convert CLZ, CLS to decodetree [v3,09/69] target/arm: Convert PAC[ID], AUT[ID] to decodetree [v3,10/69] target/arm: Convert XPAC[ID] to decodetree [v3,11/69] target/arm: Convert disas_logic_reg to decodetree [v3,12/69] target/arm: Convert disas_add_sub_ext_reg to decodetree [v3,13/69] target/arm: Convert disas_add_sub_reg to decodetree [v3,14/69] target/arm: Convert disas_data_proc_3src to decodetree [v3,15/69] target/arm: Convert disas_adc_sbc to decodetree [v3,16/69] target/arm: Convert RMIF to decodetree [v3,17/69] target/arm: Convert SETF8, SETF16 to decodetree [v3,18/69] target/arm: Convert CCMP, CCMN to decodetree [v3,19/69] target/arm: Convert disas_cond_select to decodetree [v3,20/69] target/arm: Introduce fp_access_check_scalar_hsd [v3,21/69] target/arm: Introduce fp_access_check_vector_hsd [v3,22/69] target/arm: Convert FCMP, FCMPE, FCCMP, FCCMPE to decodetree [v3,23/69] target/arm: Fix decode of fp16 vector fabs, fneg, fsqrt [v3,24/69] target/arm: Convert FMOV, FABS, FNEG (scalar) to decodetree [v3,25/69] target/arm: Pass fpstatus to vfp_sqrt* [v3,26/69] target/arm: Remove helper_sqrt_f16 [v3,27/69] target/arm: Convert FSQRT (scalar) to decodetree [v3,28/69] target/arm: Convert FRINT[NPMSAXI] (scalar) to decodetree [v3,29/69] target/arm: Convert BFCVT to decodetree [v3,30/69] target/arm: Convert FRINT{32, 64}[ZX] (scalar) to decodetree [v3,31/69] target/arm: Convert FCVT (scalar) to decodetree [v3,32/69] target/arm: Convert handle_fpfpcvt to decodetree [v3,33/69] target/arm: Convert FJCVTZS to decodetree [v3,34/69] target/arm: Convert handle_fmov to decodetree [v3,35/69] target/arm: Convert SQABS, SQNEG to decodetree [v3,36/69] target/arm: Convert ABS, NEG to decodetree [v3,37/69] target/arm: Introduce gen_gvec_cls, gen_gvec_clz [v3,38/69] target/arm: Convert CLS, CLZ (vector) to decodetree [v3,39/69] target/arm: Introduce gen_gvec_cnt, gen_gvec_rbit [v3,40/69] target/arm: Convert CNT, NOT, RBIT (vector) to decodetree [v3,41/69] target/arm: Convert CMGT, CMGE, GMLT, GMLE, CMEQ (zero) to decodetree [v3,42/69] target/arm: Introduce gen_gvec_rev{16,32,64} [v3,43/69] target/arm: Convert handle_rev to decodetree [v3,44/69] target/arm: Move helper_neon_addlp_{s8, s16} to neon_helper.c [v3,45/69] target/arm: Introduce gen_gvec_{s,u}{add,ada}lp [v3,46/69] target/arm: Convert handle_2misc_pairwise to decodetree [v3,47/69] target/arm: Remove helper_neon_{add,sub}l_u{16,32} [v3,48/69] target/arm: Introduce clear_vec [v3,49/69] target/arm: Convert XTN, SQXTUN, SQXTN, UQXTN to decodetree [v3,50/69] target/arm: Convert FCVTN, BFCVTN to decodetree [v3,51/69] target/arm: Convert FCVTXN to decodetree [v3,52/69] target/arm: Convert SHLL to decodetree [v3,53/69] target/arm: Implement gen_gvec_fabs, gen_gvec_fneg [v3,54/69] target/arm: Convert FABS, FNEG (vector) to decodetree [v3,55/69] target/arm: Convert FSQRT (vector) to decodetree [v3,56/69] target/arm: Convert FRINT* (vector) to decodetree [v3,57/69] target/arm: Convert FCVT* (vector, integer) scalar to decodetree [v3,58/69] target/arm: Convert FCVT* (vector, fixed-point) scalar to decodetree [v3,59/69] target/arm: Convert [US]CVTF (vector, integer) scalar to decodetree [v3,60/69] target/arm: Convert [US]CVTF (vector, fixed-point) scalar to decodetree [v3,61/69] target/arm: Rename helper_gvec_vcvt_[hf][su] with _rz [v3,62/69] target/arm: Convert [US]CVTF (vector) to decodetree [v3,63/69] target/arm: Convert FCVTZ[SU] (vector, fixed-point) to decodetree [v3,64/69] target/arm: Convert FCVT* (vector, integer) to decodetree [v3,65/69] target/arm: Convert handle_2misc_fcmp_zero to decodetree [v3,66/69] target/arm: Convert FRECPE, FRECPX, FRSQRTE to decodetree [v3,67/69] target/arm: Introduce gen_gvec_urecpe, gen_gvec_ursqrte [v3,68/69] target/arm: Convert URECPE and URSQRTE to decodetree [v3,69/69] target/arm: Convert FCVTL to decodetree

Message ID

20241211163036.2297116-51-richard.henderson@linaro.org

State

New

Headers

Received-SPF: pass (google.com: domain of
 qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as
 permitted sender) client-ip=209.51.188.17;
From: Richard Henderson <richard.henderson@linaro.org>
To: qemu-devel@nongnu.org
Cc: qemu-arm@nongnu.org,
	Peter Maydell <peter.maydell@linaro.org>
Subject: [PATCH v3 50/69] target/arm: Convert FCVTN, BFCVTN to decodetree
Date: Wed, 11 Dec 2024 10:30:17 -0600
Message-ID: <20241211163036.2297116-51-richard.henderson@linaro.org>
In-Reply-To: <20241211163036.2297116-1-richard.henderson@linaro.org>
References: <20241211163036.2297116-1-richard.henderson@linaro.org>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Received-SPF: pass client-ip=2607:f8b0:4864:20::f30;
 envelope-from=richard.henderson@linaro.org; helo=mail-qv1-xf30.google.com
X-Spam_score_int: -20
X-Spam_score: -2.1
X-Spam_bar: --
X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1,
 DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1,
 RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001,
 SPF_PASS=-0.001 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <https://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org
Sender: qemu-devel-bounces+patch=linaro.org@nongnu.org

Series

target/arm: AArch64 decodetree conversion, final part | expand

Commit Message

Richard Henderson Dec. 11, 2024, 4:30 p.m. UTC

Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
 target/arm/tcg/translate-a64.c | 89 ++++++++++++++++++----------------
 target/arm/tcg/a64.decode      |  5 ++
 2 files changed, 52 insertions(+), 42 deletions(-)

diff --git a/target/arm/tcg/translate-a64.c b/target/arm/tcg/translate-a64.c
index 7b76945b0a..d4d19c9caa 100644
--- a/target/arm/tcg/translate-a64.c
+++ b/target/arm/tcg/translate-a64.c
@@ -9051,6 +9051,49 @@  TRANS(SQXTUN_v, do_2misc_narrow_vector, a, f_scalar_sqxtun)
 TRANS(SQXTN_v, do_2misc_narrow_vector, a, f_scalar_sqxtn)
 TRANS(UQXTN_v, do_2misc_narrow_vector, a, f_scalar_uqxtn)
 
+static void gen_fcvtn_hs(TCGv_i64 d, TCGv_i64 n)
+{
+    TCGv_i32 tcg_lo = tcg_temp_new_i32();
+    TCGv_i32 tcg_hi = tcg_temp_new_i32();
+    TCGv_ptr fpst = fpstatus_ptr(FPST_FPCR);
+    TCGv_i32 ahp = get_ahp_flag();
+
+    tcg_gen_extr_i64_i32(tcg_lo, tcg_hi, n);
+    gen_helper_vfp_fcvt_f32_to_f16(tcg_lo, tcg_lo, fpst, ahp);
+    gen_helper_vfp_fcvt_f32_to_f16(tcg_hi, tcg_hi, fpst, ahp);
+    tcg_gen_deposit_i32(tcg_lo, tcg_lo, tcg_hi, 16, 16);
+    tcg_gen_extu_i32_i64(d, tcg_lo);
+}
+
+static void gen_fcvtn_sd(TCGv_i64 d, TCGv_i64 n)
+{
+    TCGv_i32 tmp = tcg_temp_new_i32();
+    gen_helper_vfp_fcvtsd(tmp, n, tcg_env);
+    tcg_gen_extu_i32_i64(d, tmp);
+}
+
+static ArithOneOp * const f_vector_fcvtn[] = {
+    NULL,
+    gen_fcvtn_hs,
+    gen_fcvtn_sd,
+};
+TRANS(FCVTN_v, do_2misc_narrow_vector, a, f_vector_fcvtn)
+
+static void gen_bfcvtn_hs(TCGv_i64 d, TCGv_i64 n)
+{
+    TCGv_ptr fpst = fpstatus_ptr(FPST_FPCR);
+    TCGv_i32 tmp = tcg_temp_new_i32();
+    gen_helper_bfcvt_pair(tmp, n, fpst);
+    tcg_gen_extu_i32_i64(d, tmp);
+}
+
+static ArithOneOp * const f_vector_bfcvtn[] = {
+    NULL,
+    gen_bfcvtn_hs,
+    NULL,
+};
+TRANS_FEAT(BFCVTN_v, aa64_bf16, do_2misc_narrow_vector, a, f_vector_bfcvtn)
+
 /* Common vector code for handling integer to FP conversion */
 static void handle_simd_intfp_conv(DisasContext *s, int rd, int rn,
                                    int elements, int is_signed,
@@ -9633,33 +9676,6 @@  static void handle_2misc_narrow(DisasContext *s, bool scalar,
         tcg_res[pass] = tcg_temp_new_i64();
 
         switch (opcode) {
-        case 0x16: /* FCVTN, FCVTN2 */
-            /* 32 bit to 16 bit or 64 bit to 32 bit float conversion */
-            if (size == 2) {
-                TCGv_i32 tmp = tcg_temp_new_i32();
-                gen_helper_vfp_fcvtsd(tmp, tcg_op, tcg_env);
-                tcg_gen_extu_i32_i64(tcg_res[pass], tmp);
-            } else {
-                TCGv_i32 tcg_lo = tcg_temp_new_i32();
-                TCGv_i32 tcg_hi = tcg_temp_new_i32();
-                TCGv_ptr fpst = fpstatus_ptr(FPST_FPCR);
-                TCGv_i32 ahp = get_ahp_flag();
-
-                tcg_gen_extr_i64_i32(tcg_lo, tcg_hi, tcg_op);
-                gen_helper_vfp_fcvt_f32_to_f16(tcg_lo, tcg_lo, fpst, ahp);
-                gen_helper_vfp_fcvt_f32_to_f16(tcg_hi, tcg_hi, fpst, ahp);
-                tcg_gen_deposit_i32(tcg_lo, tcg_lo, tcg_hi, 16, 16);
-                tcg_gen_extu_i32_i64(tcg_res[pass], tcg_lo);
-            }
-            break;
-        case 0x36: /* BFCVTN, BFCVTN2 */
-            {
-                TCGv_ptr fpst = fpstatus_ptr(FPST_FPCR);
-                TCGv_i32 tmp = tcg_temp_new_i32();
-                gen_helper_bfcvt_pair(tmp, tcg_op, fpst);
-                tcg_gen_extu_i32_i64(tcg_res[pass], tmp);
-            }
-            break;
         case 0x56:  /* FCVTXN, FCVTXN2 */
             {
                 /*
@@ -9675,6 +9691,8 @@  static void handle_2misc_narrow(DisasContext *s, bool scalar,
         default:
         case 0x12: /* XTN, SQXTUN */
         case 0x14: /* SQXTN, UQXTN */
+        case 0x16: /* FCVTN, FCVTN2 */
+        case 0x36: /* BFCVTN, BFCVTN2 */
             g_assert_not_reached();
         }
 
@@ -10088,21 +10106,6 @@  static void disas_simd_two_reg_misc(DisasContext *s, uint32_t insn)
                 unallocated_encoding(s);
                 return;
             }
-            /* fall through */
-        case 0x16: /* FCVTN, FCVTN2 */
-            /* handle_2misc_narrow does a 2*size -> size operation, but these
-             * instructions encode the source size rather than dest size.
-             */
-            if (!fp_access_check(s)) {
-                return;
-            }
-            handle_2misc_narrow(s, false, opcode, 0, is_q, size - 1, rn, rd);
-            return;
-        case 0x36: /* BFCVTN, BFCVTN2 */
-            if (!dc_isar_feature(aa64_bf16, s) || size != 2) {
-                unallocated_encoding(s);
-                return;
-            }
             if (!fp_access_check(s)) {
                 return;
             }
@@ -10155,6 +10158,8 @@  static void disas_simd_two_reg_misc(DisasContext *s, uint32_t insn)
             }
             break;
         default:
+        case 0x16: /* FCVTN, FCVTN2 */
+        case 0x36: /* BFCVTN, BFCVTN2 */
             unallocated_encoding(s);
             return;
         }
diff --git a/target/arm/tcg/a64.decode b/target/arm/tcg/a64.decode
index 295329448f..456912cd7c 100644
--- a/target/arm/tcg/a64.decode
+++ b/target/arm/tcg/a64.decode
@@ -21,6 +21,7 @@ 
 
 %rd             0:5
 %esz_sd         22:1 !function=plus_2
+%esz_hs         22:1 !function=plus_1
 %esz_hsd        22:2 !function=xor_2
 %hl             11:1 21:1
 %hlm            11:1 20:2
@@ -74,6 +75,7 @@ 
 @qrr_b          . q:1 ...... .. ...... ...... rn:5 rd:5  &qrr_e esz=0
 @qrr_h          . q:1 ...... .. ...... ...... rn:5 rd:5  &qrr_e esz=1
 @qrr_bh         . q:1 ...... . esz:1 ...... ...... rn:5 rd:5  &qrr_e
+@qrr_hs         . q:1 ...... .. ...... ...... rn:5 rd:5  &qrr_e esz=%esz_hs
 @qrr_e          . q:1 ...... esz:2 ...... ...... rn:5 rd:5  &qrr_e
 
 @qrrr_b         . q:1 ...... ... rm:5 ...... rn:5 rd:5  &qrrr_e esz=0
@@ -1676,3 +1678,6 @@  XTN             0.00 1110 ..1 00001 00101 0 ..... .....     @qrr_e
 SQXTUN_v        0.10 1110 ..1 00001 00101 0 ..... .....     @qrr_e
 SQXTN_v         0.00 1110 ..1 00001 01001 0 ..... .....     @qrr_e
 UQXTN_v         0.10 1110 ..1 00001 01001 0 ..... .....     @qrr_e
+
+FCVTN_v         0.00 1110 0.1 00001 01101 0 ..... .....     @qrr_hs
+BFCVTN_v        0.00 1110 101 00001 01101 0 ..... .....     @qrr_h

[v3,50/69] target/arm: Convert FCVTN, BFCVTN to decodetree

Commit Message

Patch