[PULL,36/44] target/mips/mxu: Add Q8SAD instruction

Message ID	20230710222611.50978-37-philmd@linaro.org
State	Accepted
Commit	8aedfb64cdcfa60a077c66e802f6c65a419631de
Headers	show Delivered-To: patch@linaro.org Received-SPF: pass (google.com: domain of qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as permitted sender) client-ip=209.51.188.17; From: =?utf-8?q?Philippe_Mathieu-Daud=C3=A9?= <philmd@linaro.org> To: qemu-devel@nongnu.org Cc: Siarhei Volkau <lis8215@gmail.com>, Huacai Chen <chenhuacai@kernel.org>, =?utf-8?q?Philippe_Mathieu-Daud=C3=A9?= <philmd@linaro.org>, Jiaxun Yang <jiaxun.yang@flygoat.com> Subject: [PULL 36/44] target/mips/mxu: Add Q8SAD instruction Date: Tue, 11 Jul 2023 00:26:03 +0200 Message-Id: <20230710222611.50978-37-philmd@linaro.org> In-Reply-To: <20230710222611.50978-1-philmd@linaro.org> References: <20230710222611.50978-1-philmd@linaro.org> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=2a00:1450:4864:20::42e; envelope-from=philmd@linaro.org; helo=mail-wr1-x42e.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01 autolearn=ham autolearn_force=no X-Spam_action: no action Precedence: list Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org Sender: qemu-devel-bounces+patch=linaro.org@nongnu.org
Series	[PULL,01/44] target/mips: Rework cp0_timer with clock API \| expand [PULL,01/44] target/mips: Rework cp0_timer with clock API [PULL,02/44] target/mips: Implement Loongson CSR instructions [PULL,03/44] hw/mips/loongson3_virt: Relax CPU restrictions for TCG [PULL,04/44] target/mips: Add emulation of MXU instructions for 32-bit load/store [PULL,05/44] target/mips: Add support of two XBurst CPUs [PULL,06/44] target/mips/mxu: Add LXW LXB LXH LXBU LXHU instructions [PULL,07/44] target/mips/mxu: Add S32MADD/MADDU/MSUB/MSUBU instructions [PULL,08/44] target/mips/mxu: Add Q8SLT Q8SLTU instructions [PULL,09/44] target/mips/mxu: Fix D16MAX D16MIN Q8MAX Q8MIN instructions [PULL,10/44] target/mips/mxu: Add S32SLT D16SLT D16AVG[R] Q8AVG[R] insns [PULL,11/44] target/mips/mxu: Add Q8ADD instruction [PULL,12/44] target/mips/mxu: Add S32CPS D16CPS Q8ABD Q16SAT insns [PULL,13/44] target/mips/mxu: Add D16MULF D16MULE instructions [PULL,14/44] target/mips/mxu: Add D16MACF D16MACE instructions [PULL,15/44] target/mips/mxu: Add D16MADL instruction [PULL,16/44] target/mips/mxu: Add S16MAD instruction [PULL,17/44] target/mips/mxu: Add Q16ADD instruction [PULL,18/44] target/mips/mxu: Add D32ADD instruction [PULL,19/44] target/mips/mxu: Add D32ACC D32ACCM D32ASUM instructions [PULL,20/44] target/mips/mxu: Add D32ADDC instruction [PULL,21/44] target/mips/mxu: Add Q16ACC Q16ACCM D16ASUM instructions [PULL,22/44] target/mips/mxu: Add Q8ADDE Q8ACCE D8SUM D8SUMC instructions [PULL,23/44] target/mips/mxu: Add S8STD S8LDI S8SDI instructions [PULL,24/44] target/mips/mxu: Add S16LDD S16STD S16LDI S16SDI instructions [PULL,25/44] target/mips/mxu: Add S32MUL S32MULU S32EXTR S32EXTRV insns [PULL,26/44] target/mips/mxu: Add S32ALN S32LUI insns [PULL,27/44] target/mips/mxu: Add D32SARL D32SARW instructions [PULL,28/44] target/mips/mxu: Add D32SLL D32SLR D32SAR instructions [PULL,29/44] target/mips/mxu: Add Q16SLL Q16SLR Q16SAR instructions [PULL,30/44] target/mips/mxu: Add D32/Q16- SLLV/SLRV/SARV instructions [PULL,31/44] target/mips/mxu: Add S32/D16/Q8- MOVZ/MOVN instructions [PULL,32/44] target/mips/mxu: Add Q8MAC Q8MACSU instructions [PULL,33/44] target/mips/mxu: Add Q16SCOP instruction [PULL,34/44] target/mips/mxu: Add Q8MADL instruction [PULL,35/44] target/mips/mxu: Add S32SFL instruction [PULL,36/44] target/mips/mxu: Add Q8SAD instruction [PULL,37/44] target/mips: enable GINVx support for I6400 and I6500 [PULL,38/44] hw/ide/pci: Expose legacy interrupts as named GPIOs [PULL,39/44] hw/ide/via: Wire up IDE legacy interrupts in host device [PULL,40/44] hw/isa/vt82c686: Remove via_isa_set_irq() [PULL,41/44] hw/ide: Extract IDEBus assignment into bmdma_init() [PULL,42/44] hw/ide: Extract bmdma_status_writeb() [PULL,43/44] hw/ide/pci: Replace some magic numbers by constants [PULL,44/44] hw/ide/piix: Move registration of VMStateDescription to DeviceClass

Message ID

20230710222611.50978-37-philmd@linaro.org

State

Accepted

Commit

8aedfb64cdcfa60a077c66e802f6c65a419631de

Headers

Received-SPF: pass (google.com: domain of
 qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as
 permitted sender) client-ip=209.51.188.17;
From: =?utf-8?q?Philippe_Mathieu-Daud=C3=A9?= <philmd@linaro.org>
To: qemu-devel@nongnu.org
Cc: Siarhei Volkau <lis8215@gmail.com>, Huacai Chen <chenhuacai@kernel.org>,
	=?utf-8?q?Philippe_Mathieu-Daud=C3=A9?= <philmd@linaro.org>,
 Jiaxun Yang <jiaxun.yang@flygoat.com>
Subject: [PULL 36/44] target/mips/mxu: Add Q8SAD instruction
Date: Tue, 11 Jul 2023 00:26:03 +0200
Message-Id: <20230710222611.50978-37-philmd@linaro.org>
In-Reply-To: <20230710222611.50978-1-philmd@linaro.org>
References: <20230710222611.50978-1-philmd@linaro.org>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Received-SPF: pass client-ip=2a00:1450:4864:20::42e;
 envelope-from=philmd@linaro.org; helo=mail-wr1-x42e.google.com
X-Spam_score_int: -20
X-Spam_score: -2.1
X-Spam_bar: --
X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1,
 DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1,
 RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001,
 T_SCC_BODY_TEXT_LINE=-0.01 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <https://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org
Sender: qemu-devel-bounces+patch=linaro.org@nongnu.org

Series

[PULL,01/44] target/mips: Rework cp0_timer with clock API | expand

Commit Message

Philippe Mathieu-Daudé July 10, 2023, 10:26 p.m. UTC

From: Siarhei Volkau <lis8215@gmail.com>

The instruction implements SAD (sum-absolute-difference) operation which
is used in motion estimation algorithms. The instruction handles four
8-bit data in parallel.

Signed-off-by: Siarhei Volkau <lis8215@gmail.com>
Message-Id: <20230608104222.1520143-34-lis8215@gmail.com>
Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
---
 target/mips/tcg/mxu_translate.c | 45 +++++++++++++++++++++++++++++++++
 1 file changed, 45 insertions(+)

diff --git a/target/mips/tcg/mxu_translate.c b/target/mips/tcg/mxu_translate.c
index c60404f739..deb8060a17 100644
--- a/target/mips/tcg/mxu_translate.c
+++ b/target/mips/tcg/mxu_translate.c
@@ -408,6 +408,7 @@  enum {
     OPC_MXU_Q16SCOP  = 0x3B,
     OPC_MXU_Q8MADL   = 0x3C,
     OPC_MXU_S32SFL   = 0x3D,
+    OPC_MXU_Q8SAD    = 0x3E,
 };
 
 
@@ -4039,6 +4040,47 @@  static void gen_mxu_s32sfl(DisasContext *ctx)
     gen_store_mxu_gpr(t3, XRd);
 }
 
+/*
+ *  Q8SAD XRa, XRd, XRb, XRc
+ *    Typical SAD opration for motion estimation.
+ */
+static void gen_mxu_q8sad(DisasContext *ctx)
+{
+    uint32_t XRd, XRc, XRb, XRa;
+
+    XRd = extract32(ctx->opcode, 18, 4);
+    XRc = extract32(ctx->opcode, 14, 4);
+    XRb = extract32(ctx->opcode, 10, 4);
+    XRa = extract32(ctx->opcode,  6, 4);
+
+    TCGv t0 = tcg_temp_new();
+    TCGv t1 = tcg_temp_new();
+    TCGv t2 = tcg_temp_new();
+    TCGv t3 = tcg_temp_new();
+    TCGv t4 = tcg_temp_new();
+    TCGv t5 = tcg_temp_new();
+
+    gen_load_mxu_gpr(t2, XRb);
+    gen_load_mxu_gpr(t3, XRc);
+    gen_load_mxu_gpr(t5, XRd);
+    tcg_gen_movi_tl(t4, 0);
+
+    for (int i = 0; i < 4; i++) {
+        tcg_gen_andi_tl(t0, t2, 0xff);
+        tcg_gen_andi_tl(t1, t3, 0xff);
+        tcg_gen_sub_tl(t0, t0, t1);
+        tcg_gen_abs_tl(t0, t0);
+        tcg_gen_add_tl(t4, t4, t0);
+        if (i < 3) {
+            tcg_gen_shri_tl(t2, t2, 8);
+            tcg_gen_shri_tl(t3, t3, 8);
+        }
+    }
+    tcg_gen_add_tl(t5, t5, t4);
+    gen_store_mxu_gpr(t4, XRa);
+    gen_store_mxu_gpr(t5, XRd);
+}
+
 /*
  *                 MXU instruction category: align
  *                 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
@@ -5040,6 +5082,9 @@  bool decode_ase_mxu(DisasContext *ctx, uint32_t insn)
         case OPC_MXU_S32SFL:
             gen_mxu_s32sfl(ctx);
             break;
+        case OPC_MXU_Q8SAD:
+            gen_mxu_q8sad(ctx);
+            break;
         default:
             return false;
         }

[PULL,36/44] target/mips/mxu: Add Q8SAD instruction

Commit Message

Patch