[57/76] target/arm: Handle FPCR.AH in SVE FABD

Message ID	20250124162836.2332150-58-peter.maydell@linaro.org
State	Superseded
Headers	show Delivered-To: patch@linaro.org Received-SPF: pass (google.com: domain of qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as permitted sender) client-ip=209.51.188.17; From: Peter Maydell <peter.maydell@linaro.org> To: qemu-arm@nongnu.org, qemu-devel@nongnu.org Subject: [PATCH 57/76] target/arm: Handle FPCR.AH in SVE FABD Date: Fri, 24 Jan 2025 16:28:17 +0000 Message-Id: <20250124162836.2332150-58-peter.maydell@linaro.org> In-Reply-To: <20250124162836.2332150-1-peter.maydell@linaro.org> References: <20250124162836.2332150-1-peter.maydell@linaro.org> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=2a00:1450:4864:20::332; envelope-from=peter.maydell@linaro.org; helo=mail-wm1-x332.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01 autolearn=unavailable autolearn_force=no X-Spam_action: no action Precedence: list Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org Sender: qemu-devel-bounces+patch=linaro.org@nongnu.org
Series	target/arm: Implement FEAT_AFP and FEAT_RPRES \| expand [00/76] target/arm: Implement FEAT_AFP and FEAT_RPRES [01/76] target/i386: Do not raise Invalid for 0 * Inf + QNaN [02/76] tests/tcg/x86_64/fma: Test some x86 fused-multiply-add cases [03/76] target/arm: arm_reset_sve_state() should set FPSR, not FPCR [04/76] target/arm: Use FPSR_ constants in vfp_exceptbits_from_host() [05/76] target/arm: Use uint32_t in vfp_exceptbits_from_host() [06/76] target/arm: Define new fp_status_a32 and fp_status_a64 [07/76] target/arm: Use vfp.fp_status_a64 in A64-only helper functions [08/76] target/arm: Use fp_status_a32 in vjvct helper [09/76] target/arm: Use fp_status_a32 in vfp_cmp helpers [10/76] target/arm: Use FPST_FPCR_A32 in A32 decoder [11/76] target/arm: Use FPST_FPCR_A64 in A64 decoder [12/76] target/arm: Remove now-unused vfp.fp_status and FPST_FPCR [13/76] target/arm: Define new fp_status_f16_a32 and fp_status_f16_a64 [14/76] target/arm: Use fp_status_f16_a32 in AArch32-only helpers [15/76] target/arm: Use fp_status_f16_a64 in AArch64-only helpers [16/76] target/arm: Use FPST_FPCR_F16_A32 in A32 decoder [17/76] target/arm: Use FPST_FPCR_F16_A64 in A64 decoder [18/76] target/arm: Remove now-unused vfp.fp_status_f16 and FPST_FPCR_F16 [19/76] fpu: Rename float_flag_input_denormal to float_flag_input_denormal_flushed [20/76] fpu: Rename float_flag_output_denormal to float_flag_output_denormal_flushed [21/76] fpu: Fix a comment in softfloat-types.h [22/76] fpu: Add float_class_denormal [23/76] fpu: Implement float_flag_input_denormal_used [24/76] fpu: allow flushing of output denormals to be after rounding [25/76] target/arm: Remove redundant advsimd float16 helpers [26/76] target/arm: Use FPST_FPCR_F16_A64 for halfprec-to-other conversions [27/76] target/arm: Define FPCR AH, FIZ, NEP bits [28/76] target/arm: Implement FPCR.FIZ handling [29/76] target/arm: Adjust FP behaviour for FPCR.AH = 1 [30/76] target/arm: Adjust exception flag handling for AH = 1 [31/76] target/arm: Add FPCR.AH to tbflags [32/76] target/arm: Set up float_status to use for FPCR.AH=1 behaviour [33/76] target/arm: Use FPST_FPCR_AH for FRECPE, FRECPS, FRECPX, FRSQRTE, FRSQRTS [34/76] target/arm: Use FPST_FPCR_AH for BFCVT* insns [35/76] target/arm: Use FPST_FPCR_AH for BFMLAL, BFMLSL insns [36/76] target/arm: Add FPCR.NEP to TBFLAGS [37/76] target/arm: Define and use new write_fp_*reg_merging() functions [38/76] target/arm: Handle FPCR.NEP for 3-input scalar operations [39/76] target/arm: Handle FPCR.NEP for BFCVT scalar [40/76] target/arm: Handle FPCR.NEP for 1-input scalar operations [41/76] target/arm: Handle FPCR.NEP in do_cvtf_scalar() [42/76] target/arm: Handle FPCR.NEP for scalar FABS and FNEG [43/76] target/arm: Handle FPCR.NEP for FCVTXN (scalar) [44/76] target/arm: Handle FPCR.NEP for NEP for FMUL, FMULX scalar by element [45/76] target/arm: Implement FPCR.AH semantics for scalar FMIN/FMAX [46/76] target/arm: Implement FPCR.AH semantics for vector FMIN/FMAX [47/76] target/arm: Implement FPCR.AH semantics for FMAXV and FMINV [48/76] target/arm: Implement FPCR.AH semantics for FMINP and FMAXP [49/76] target/arm: Implement FPCR.AH semantics for SVE FMAXV and FMINV [50/76] target/arm: Implement FPCR.AH semantics for SVE FMIN/FMAX immediate [51/76] target/arm: Implement FPCR.AH semantics for SVE FMIN/FMAX vector [52/76] target/arm: Implement FPCR.AH handling of negation of NaN [53/76] target/arm: Implement FPCR.AH handling for scalar FABS and FABD [54/76] target/arm: Handle FPCR.AH in vector FABD [55/76] target/arm: Handle FPCR.AH in SVE FNEG [56/76] target/arm: Handle FPCR.AH in SVE FABS [57/76] target/arm: Handle FPCR.AH in SVE FABD [58/76] target/arm: Handle FPCR.AH in negation steps in FCADD [59/76] target/arm: Handle FPCR.AH in negation steps in SVE FCADD [60/76] target/arm: Handle FPCR.AH in FMLSL [61/76] target/arm: Handle FPCR.AH in FRECPS and FRSQRTS scalar insns [62/76] target/arm: Handle FPCR.AH in FRECPS and FRSQRTS vector insns [63/76] target/arm: Handle FPCR.AH in negation step in FMLS (indexed) [64/76] target/arm: Handle FPCR.AH in negation in FMLS (vector) [65/76] target/arm: Handle FPCR.AH in negation step in SVE FMLS (vector) [66/76] target/arm: Handle FPCR.AH in SVE FTSSEL [67/76] target/arm: Handle FPCR.AH in SVE FTMAD [68/76] target/arm: Enable FEAT_AFP for '-cpu max' [69/76] target/arm: Plumb FEAT_RPRES frecpe and frsqrte through to new helper [70/76] target/arm: Implement increased precision FRECPE [71/76] target/arm: Implement increased precision FRSQRTE [72/76] target/arm: Enable FEAT_RPRES for -cpu max [73/76] target/i386: Detect flush-to-zero after rounding [74/76] target/i386: Use correct type for get_float_exception_flags() values [75/76] target/i386: Wire up MXCSR.DE and FPUS.DE correctly [76/76] tests/tcg/x86_64/fma: add test for exact-denormal output

Message ID

20250124162836.2332150-58-peter.maydell@linaro.org

State

Superseded

Headers

Received-SPF: pass (google.com: domain of
 qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as
 permitted sender) client-ip=209.51.188.17;
From: Peter Maydell <peter.maydell@linaro.org>
To: qemu-arm@nongnu.org,
	qemu-devel@nongnu.org
Subject: [PATCH 57/76] target/arm: Handle FPCR.AH in SVE FABD
Date: Fri, 24 Jan 2025 16:28:17 +0000
Message-Id: <20250124162836.2332150-58-peter.maydell@linaro.org>
In-Reply-To: <20250124162836.2332150-1-peter.maydell@linaro.org>
References: <20250124162836.2332150-1-peter.maydell@linaro.org>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Received-SPF: pass client-ip=2a00:1450:4864:20::332;
 envelope-from=peter.maydell@linaro.org; helo=mail-wm1-x332.google.com
X-Spam_score_int: -20
X-Spam_score: -2.1
X-Spam_bar: --
X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1,
 DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1,
 RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001,
 T_SCC_BODY_TEXT_LINE=-0.01 autolearn=unavailable autolearn_force=no
X-Spam_action: no action
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <https://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org
Sender: qemu-devel-bounces+patch=linaro.org@nongnu.org

Series

target/arm: Implement FEAT_AFP and FEAT_RPRES | expand

Commit Message

Peter Maydell Jan. 24, 2025, 4:28 p.m. UTC

Make the SVE FABD insn honour the FPCR.AH "don't negate the sign
of a NaN" semantics.

Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 target/arm/tcg/helper-sve.h    |  7 +++++++
 target/arm/tcg/sve_helper.c    | 22 ++++++++++++++++++++++
 target/arm/tcg/translate-sve.c |  2 +-
 3 files changed, 30 insertions(+), 1 deletion(-)

Comments

Richard Henderson Jan. 26, 2025, 1:06 p.m. UTC | #1

On 1/24/25 08:28, Peter Maydell wrote:
> Make the SVE FABD insn honour the FPCR.AH "don't negate the sign
> of a NaN" semantics.
> 
> Signed-off-by: Peter Maydell<peter.maydell@linaro.org>
> ---
>   target/arm/tcg/helper-sve.h    |  7 +++++++
>   target/arm/tcg/sve_helper.c    | 22 ++++++++++++++++++++++
>   target/arm/tcg/translate-sve.c |  2 +-
>   3 files changed, 30 insertions(+), 1 deletion(-)

Reviewed-by: Richard Henderson <richard.henderson@linaro.org>

r~

diff --git a/target/arm/tcg/helper-sve.h b/target/arm/tcg/helper-sve.h
index ff12f650c87..29c70f054af 100644
--- a/target/arm/tcg/helper-sve.h
+++ b/target/arm/tcg/helper-sve.h
@@ -1183,6 +1183,13 @@  DEF_HELPER_FLAGS_6(sve_fabd_s, TCG_CALL_NO_RWG,
 DEF_HELPER_FLAGS_6(sve_fabd_d, TCG_CALL_NO_RWG,
                    void, ptr, ptr, ptr, ptr, fpst, i32)
 
+DEF_HELPER_FLAGS_6(sve_ah_fabd_h, TCG_CALL_NO_RWG,
+                   void, ptr, ptr, ptr, ptr, fpst, i32)
+DEF_HELPER_FLAGS_6(sve_ah_fabd_s, TCG_CALL_NO_RWG,
+                   void, ptr, ptr, ptr, ptr, fpst, i32)
+DEF_HELPER_FLAGS_6(sve_ah_fabd_d, TCG_CALL_NO_RWG,
+                   void, ptr, ptr, ptr, ptr, fpst, i32)
+
 DEF_HELPER_FLAGS_6(sve_fscalbn_h, TCG_CALL_NO_RWG,
                    void, ptr, ptr, ptr, ptr, fpst, i32)
 DEF_HELPER_FLAGS_6(sve_fscalbn_s, TCG_CALL_NO_RWG,
diff --git a/target/arm/tcg/sve_helper.c b/target/arm/tcg/sve_helper.c
index 5ce7d736475..8527a7495a6 100644
--- a/target/arm/tcg/sve_helper.c
+++ b/target/arm/tcg/sve_helper.c
@@ -4394,9 +4394,31 @@  static inline float64 abd_d(float64 a, float64 b, float_status *s)
     return float64_abs(float64_sub(a, b, s));
 }
 
+/* ABD when FPCR.AH = 1: avoid flipping sign bit of a NaN result */
+static float16 ah_abd_h(float16 op1, float16 op2, float_status *stat)
+{
+    float16 r = float16_sub(op1, op2, stat);
+    return float16_is_any_nan(r) ? r : float16_abs(r);
+}
+
+static float32 ah_abd_s(float32 op1, float32 op2, float_status *stat)
+{
+    float32 r = float32_sub(op1, op2, stat);
+    return float32_is_any_nan(r) ? r : float32_abs(r);
+}
+
+static float64 ah_abd_d(float64 op1, float64 op2, float_status *stat)
+{
+    float64 r = float64_sub(op1, op2, stat);
+    return float64_is_any_nan(r) ? r : float64_abs(r);
+}
+
 DO_ZPZZ_FP(sve_fabd_h, uint16_t, H1_2, abd_h)
 DO_ZPZZ_FP(sve_fabd_s, uint32_t, H1_4, abd_s)
 DO_ZPZZ_FP(sve_fabd_d, uint64_t, H1_8, abd_d)
+DO_ZPZZ_FP(sve_ah_fabd_h, uint16_t, H1_2, ah_abd_h)
+DO_ZPZZ_FP(sve_ah_fabd_s, uint32_t, H1_4, ah_abd_s)
+DO_ZPZZ_FP(sve_ah_fabd_d, uint64_t, H1_8, ah_abd_d)
 
 static inline float64 scalbn_d(float64 a, int64_t b, float_status *s)
 {
diff --git a/target/arm/tcg/translate-sve.c b/target/arm/tcg/translate-sve.c
index c234a4910dd..9200f7f8a49 100644
--- a/target/arm/tcg/translate-sve.c
+++ b/target/arm/tcg/translate-sve.c
@@ -3789,7 +3789,7 @@  DO_ZPZZ_AH_FP(FMIN_zpzz, aa64_sve, sve_fmin, sve_ah_fmin)
 DO_ZPZZ_AH_FP(FMAX_zpzz, aa64_sve, sve_fmax, sve_ah_fmax)
 DO_ZPZZ_FP(FMINNM_zpzz, aa64_sve, sve_fminnum)
 DO_ZPZZ_FP(FMAXNM_zpzz, aa64_sve, sve_fmaxnum)
-DO_ZPZZ_FP(FABD, aa64_sve, sve_fabd)
+DO_ZPZZ_AH_FP(FABD, aa64_sve, sve_fabd, sve_ah_fabd)
 DO_ZPZZ_FP(FSCALE, aa64_sve, sve_fscalbn)
 DO_ZPZZ_FP(FDIV, aa64_sve, sve_fdiv)
 DO_ZPZZ_FP(FMULX, aa64_sve, sve_fmulx)

[57/76] target/arm: Handle FPCR.AH in SVE FABD

Commit Message

Comments

Patch