[v3,25/30] target/ppc: Move ADDI, ADDIS to decodetree, implement PADDI

Message ID	20210430011543.1017113-26-richard.henderson@linaro.org
State	New
Headers	show Delivered-To: patch@linaro.org Received-SPF: pass (google.com: domain of qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as permitted sender) client-ip=209.51.188.17; From: Richard Henderson <richard.henderson@linaro.org> To: qemu-devel@nongnu.org Subject: [PATCH v3 25/30] target/ppc: Move ADDI, ADDIS to decodetree, implement PADDI Date: Thu, 29 Apr 2021 18:15:38 -0700 Message-Id: <20210430011543.1017113-26-richard.henderson@linaro.org> In-Reply-To: <20210430011543.1017113-1-richard.henderson@linaro.org> References: <20210430011543.1017113-1-richard.henderson@linaro.org> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=2607:f8b0:4864:20::1032; envelope-from=richard.henderson@linaro.org; helo=mail-pj1-x1032.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=unavailable autolearn_force=no X-Spam_action: no action Precedence: list Cc: f4bug@amsat.org, luis.pires@eldorado.org.br, qemu-ppc@nongnu.org, lagarcia@br.ibm.com, bruno.larsen@eldorado.org.br, matheus.ferst@eldorado.org.br, david@gibson.dropbear.id.au Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org Sender: "Qemu-devel" <qemu-devel-bounces+patch=linaro.org@nongnu.org>
Series	Base for adding PowerPC 64-bit instructions \| expand [v3,00/30] Base for adding PowerPC 64-bit instructions [v3,01/30] decodetree: Introduce whex and whexC helpers [v3,02/30] decodetree: More use of f-strings [v3,03/30] decodetree: Add support for 64-bit instructions [v3,04/30] decodetree: Extend argument set syntax to allow types [v3,05/30] target/ppc: Add cia field to DisasContext [v3,06/30] target/ppc: Split out decode_legacy [v3,07/30] target/ppc: Move DISAS_NORETURN setting into gen_exception* [v3,08/30] target/ppc: Remove special case for POWERPC_SYSCALL [v3,09/30] target/ppc: Remove special case for POWERPC_EXCP_TRAP [v3,10/30] target/ppc: Simplify gen_debug_exception [v3,11/30] target/ppc: Introduce DISAS_{EXIT,CHAIN}{,_UPDATE} [v3,12/30] target/ppc: Replace POWERPC_EXCP_SYNC with DISAS_EXIT [v3,13/30] target/ppc: Remove unnecessary gen_io_end calls [v3,14/30] target/ppc: Introduce gen_icount_io_start [v3,15/30] target/ppc: Replace POWERPC_EXCP_STOP with DISAS_EXIT_UPDATE [v3,16/30] target/ppc: Replace POWERPC_EXCP_BRANCH with DISAS_NORETURN [v3,17/30] target/ppc: Remove DisasContext.exception [v3,18/30] target/ppc: Move single-step check to ppc_tr_tb_stop [v3,19/30] target/ppc: Tidy exception vs exit_tb [v3,20/30] target/ppc: Mark helper_raise_exception* as noreturn [v3,21/30] target/ppc: Use translator_loop_temp_check [v3,22/30] target/ppc: Introduce macros to check isa extensions [v3,23/30] target/ppc: Add infrastructure for prefixed insns [v3,24/30] target/ppc: Move page crossing check to ppc_tr_translate_insn [v3,25/30] target/ppc: Move ADDI, ADDIS to decodetree, implement PADDI [v3,26/30] target/ppc: Implement PNOP [v3,27/30] target/ppc: Move D/DS/X-form integer loads to decodetree [v3,28/30] target/ppc: Implement prefixed integer load instructions [v3,29/30] target/ppc: Move D/DS/X-form integer stores to decodetree [v3,30/30] target/ppc: Implement prefixed integer store instructions

Richard Henderson April 30, 2021, 1:15 a.m. UTC

Signed-off-by: Richard Henderson <richard.henderson@linaro.org>

---
 target/ppc/insn32.decode                   | 12 +++++++
 target/ppc/insn64.decode                   | 15 +++++++++
 target/ppc/translate.c                     | 29 ----------------
 target/ppc/translate/fixedpoint-impl.c.inc | 39 ++++++++++++++++++++++
 4 files changed, 66 insertions(+), 29 deletions(-)

-- 
2.25.1

Luis Fernando Fujita Pires April 30, 2021, 11:23 a.m. UTC | #1

From: Richard Henderson <richard.henderson@linaro.org>

> +&D              rt ra si

> +@D              ...... rt:5 ra:5 si:s16                 &D

> +

> +# If a prefix is allowed, decode with default values.

> +&PLS_D          rt ra si:int64_t r:bool

> +@PLS_D          ...... rt:5 ra:5 si:s16                 &PLS_D r=0

> +

> +### Fixed-Point Arithmetic Instructions

> +

> +ADDI            001110 ..... ..... ................     @PLS_D

> +ADDIS           001111 ..... ..... ................     @D

> diff --git a/target/ppc/insn64.decode b/target/ppc/insn64.decode index

> 9fc45d0614..f4272df724 100644

> --- a/target/ppc/insn64.decode

> +++ b/target/ppc/insn64.decode

> @@ -16,3 +16,18 @@

>  # You should have received a copy of the GNU Lesser General Public  # License

> along with this library; if not, see <http://www.gnu.org/licenses/>.

>  #

> +

> +# Many all of these instruction names would be prefixed by "P", # but

> +we share code with the non-prefixed instruction.

> +

> +# Format MLS:D and 8LS:D

> +&PLS_D          rt ra si:int64_t r:bool  !extern

> +%pls_si         32:s18 0:16

> +@PLS_D          ...... .. ... r:1 .. .................. \

> +                ...... rt:5 ra:5 ................       \

> +                &PLS_D si=%pls_si

> +

> +### Fixed-Point Arithmetic Instructions

> +

> +ADDI            000001 10 0--.-- ..................     \

> +                001110 ..... ..... ................     @PLS_D


I think we should reconsider using the same .decode file for both 32- and 64-bit instructions, to avoid duplicating argument set definitions, and to keep the prefixed instructions close to their non-prefixed counterparts. For ADDI/PADDI, something along the lines of:

&PLS_D          rt ra si:int64_t r:bool

%pls_si         32:s18 0:16
@PLS_D          ...... .. ... r:1 .. .................. \
                ...... rt:5 ra:5 ................       \
                &PLS_D si=%pls_si

@PLS_D_32       ...... rt:5 ra:5 si:s16                 &PLS_D r=0

PADDI           000001 10 0--.-- ..................     \
                001110 ..... ..... ................     @PLS_D
ADDI            001110 ..... ..... ................     @PLS_D_32
ADDIS           001111 ..... ..... ................     @D

That's where I was going with the original patch, using the varinsnwidth support from decodetree.py.

And, in order to share the trans_PADDI/ADDI implementation, maybe add something to decodetree.py to allow us to specify that an instruction shares the trans_XX() implementation from another one, such as:
ADDI            001110 ..... ..... ................     @PLS_D_32 !impl=PADDI

This way, we could (and would need to, in fact) keep the 'P' in the prefixed instruction names, but at the same time avoid having extra trans_XX functions just calling another one without any additional code.

For the load functions, we would then have:

%ds_si          2:s14  !function=times_4
@PLS_DS_32      ...... rt:5 ra:5 .............. ..      &PLS_D si=%ds_si r=0

&X              rt ra rb
@X              ...... rt:5 ra:5 rb:5 .......... .      &X

PLBZ            000001 10 0--.-- .................. \
                100010 ..... ..... ................     @PLS_D
LBZ             100010 ..... ..... ................     @PLS_D_32 !impl=PLBZ
LBZU            100011 ..... ..... ................     @PLS_D_32
LBZX            011111 ..... ..... ..... 0001010111 -   @X
LBZUX           011111 ..... ..... ..... 0001110111 -   @X

PLHZ            000001 10 0--.-- .................. \
                101000 ..... ..... ................     @PLS_D
LHZ             101000 ..... ..... ................     @PLS_D_32 !impl=PLHZ
LHZU            101001 ..... ..... ................     @PLS_D_32
LHZX            011111 ..... ..... ..... 0100010111 -   @X
LHZUX           011111 ..... ..... ..... 0100110111 -   @X

PLHA            000001 10 0--.-- .................. \
                101010 ..... ..... ................     @PLS_D
LHA             101010 ..... ..... ................     @PLS_D_32 !impl=PLHA
LHAU            101011 ..... ..... ................     @PLS_D_32
LHAX            011111 ..... ..... ..... 0101010111 -   @X
LHAXU           011111 ..... ..... ..... 0101110111 -   @X

PLWZ            000001 10 0--.-- .................. \
                100000 ..... ..... ................     @PLS_D
LWZ             100000 ..... ..... ................     @PLS_D_32 !impl=PLWZ
LWZU            100001 ..... ..... ................     @PLS_D_32
LWZX            011111 ..... ..... ..... 0000010111 -   @X
LWZUX           011111 ..... ..... ..... 0000110111 -   @X

PLWA            000001 00 0--.-- .................. \
                101001 ..... ..... ................     @PLS_D
LWA             111010 ..... ..... ..............10     @PLS_DS_32 !impl=PLWA
LWAX            011111 ..... ..... ..... 0101010101 -   @X
LWAUX           011111 ..... ..... ..... 0101110101 -   @X

PLD             000001 00 0--.-- .................. \
                111001 ..... ..... ................     @PLS_D
LD              111010 ..... ..... ..............00     @PLS_DS_32 !impl=PLD
LDU             111010 ..... ..... ..............01     @PLS_DS_32
LDX             011111 ..... ..... ..... 0000010101 -   @X
LDUX            011111 ..... ..... ..... 0000110101 -   @X

--
Luis

Matheus K. Ferst April 30, 2021, 2:05 p.m. UTC | #2

On 29/04/2021 22:15, Richard Henderson wrote:
> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>

> ---

>   target/ppc/insn32.decode                   | 12 +++++++

>   target/ppc/insn64.decode                   | 15 +++++++++

>   target/ppc/translate.c                     | 29 ----------------

>   target/ppc/translate/fixedpoint-impl.c.inc | 39 ++++++++++++++++++++++

>   4 files changed, 66 insertions(+), 29 deletions(-)

> 

> diff --git a/target/ppc/insn32.decode b/target/ppc/insn32.decode

> index b175441209..52d9b355d4 100644

> --- a/target/ppc/insn32.decode

> +++ b/target/ppc/insn32.decode

> @@ -16,3 +16,15 @@

>   # You should have received a copy of the GNU Lesser General Public

>   # License along with this library; if not, see <http://www.gnu.org/licenses/>.

>   #

> +

> +&D              rt ra si

> +@D              ...... rt:5 ra:5 si:s16                 &D

> +

> +# If a prefix is allowed, decode with default values.

> +&PLS_D          rt ra si:int64_t r:bool

> +@PLS_D          ...... rt:5 ra:5 si:s16                 &PLS_D r=0

> +

> +### Fixed-Point Arithmetic Instructions

> +

> +ADDI            001110 ..... ..... ................     @PLS_D

> +ADDIS           001111 ..... ..... ................     @D

> diff --git a/target/ppc/insn64.decode b/target/ppc/insn64.decode

> index 9fc45d0614..f4272df724 100644

> --- a/target/ppc/insn64.decode

> +++ b/target/ppc/insn64.decode

> @@ -16,3 +16,18 @@

>   # You should have received a copy of the GNU Lesser General Public

>   # License along with this library; if not, see <http://www.gnu.org/licenses/>.

>   #

> +

> +# Many all of these instruction names would be prefixed by "P",

> +# but we share code with the non-prefixed instruction.

> +

> +# Format MLS:D and 8LS:D

> +&PLS_D          rt ra si:int64_t r:bool  !extern

> +%pls_si         32:s18 0:16

> +@PLS_D          ...... .. ... r:1 .. .................. \

> +                ...... rt:5 ra:5 ................       \

> +                &PLS_D si=%pls_si

> +

> +### Fixed-Point Arithmetic Instructions

> +

> +ADDI            000001 10 0--.-- ..................     \

> +                001110 ..... ..... ................     @PLS_D


I'm not sure about this. It's a bit surprising to find ADDI here, and 
the comment that explains why is likely to be ignored after the big 
copyright header.

<snip>

> diff --git a/target/ppc/translate/fixedpoint-impl.c.inc b/target/ppc/translate/fixedpoint-impl.c.inc

> index b740083605..7af1b3bcf5 100644

> --- a/target/ppc/translate/fixedpoint-impl.c.inc

> +++ b/target/ppc/translate/fixedpoint-impl.c.inc

> @@ -16,3 +16,42 @@

>    * You should have received a copy of the GNU Lesser General Public

>    * License along with this library; if not, see <http://www.gnu.org/licenses/>.

>    */

> +

> +/*

> + * Incorporate CIA into the constant when R=1.

> + * Validate that when R=1, RA=0.

> + */

> +static bool resolve_PLS_D(DisasContext *ctx, arg_PLS_D *a)

> +{

> +    if (a->r) {

> +        if (unlikely(a->ra != 0)) {

> +            gen_invalid(ctx);

> +            return false;

> +        }

> +        a->si += ctx->cia;

> +    }

> +    return true;

> +}

> +

> +static bool trans_ADDI(DisasContext *ctx, arg_PLS_D *a)

> +{

> +    if (resolve_PLS_D(ctx, a)) {

> +        if (a->ra) {

> +            tcg_gen_addi_tl(cpu_gpr[a->rt], cpu_gpr[a->ra], a->si);

> +        } else {

> +            tcg_gen_movi_tl(cpu_gpr[a->rt], a->si);

> +        }

> +    }

> +    return true;

> +}

> +


I'd prefer to keep a trans_PADDI like

 > static bool trans_PADDI(DisasContext *ctx, arg_PLS_D *a)

 > {

 >     if(!resolve_PLS_D(ctx, a)) {

 >         return false;

 >     }

 >     return trans_ADDI(ctx, a);

 > }


It's the middle way between v2 and v3. trans_ADDI code is reused, it'll 
probably be optimized as a tail call, and resolve_PLS_D is not called 
when it's not needed.

> +static bool trans_ADDIS(DisasContext *ctx, arg_D *a)

> +{

> +    int si = a->si << 16;

> +    if (a->ra) {

> +        tcg_gen_addi_tl(cpu_gpr[a->rt], cpu_gpr[a->ra], si);

> +    } else {

> +        tcg_gen_movi_tl(cpu_gpr[a->rt], si);

> +    }

> +    return true;

> +}

> 


I'd also keep this as in the last version, where trans_ADDI is called.

Thanks,
Matheus K. Ferst
Instituto de Pesquisas ELDORADO <http://www.eldorado.org.br/>
Analista de Software Júnior
Aviso Legal - Disclaimer <https://www.eldorado.org.br/disclaimer.html>

Richard Henderson April 30, 2021, 2:23 p.m. UTC | #3

On 4/30/21 4:23 AM, Luis Fernando Fujita Pires wrote:
> I think we should reconsider using the same .decode file for both 32- and

> 64-bit instructions, to avoid duplicating argument set definitions, and to

> keep the prefixed instructions close to their non-prefixed counterparts.

varinsnwidth assumes there is no easy way to determine, before decoding, the 
width of the instruction.  The way this is implemented in decodetree is vastly 
less optimal than what we can do with a few lines for ppc.

In addition, there's a rough spot in %field definitions.  You can't share those 
between patterns of different sizes, which can get confusing.  Have a look at 
target/rx, and the definitions of %b[23]_r_0, which is the same field for 2 and 
3-byte insns.

The replication of argument set definitions is unfortunate, but in the end will 
only be a handful of lines.  We could probably come up with a way to avoid that 
too, via a decodetree extension, if you really insist.  (My vague idea there 
would put the argument set definitions into a 3rd file, included on the 
decodetree command-line.)

> And, in order to share the trans_PADDI/ADDI implementation, maybe add something to decodetree.py to allow us to specify that an instruction shares the trans_XX() implementation from another one, such as:

> ADDI            001110 ..... ..... ................     @PLS_D_32 !impl=PADDI

This is done by using the same name up front.
If you like, add a comment to give the real instruction name.

PADDI   001110 ..... ..... ................     @PLS_D_32 # ADDI

> This way, we could (and would need to, in fact) keep the 'P' in the prefixed instruction names, but at the same time avoid having extra trans_XX functions just calling another one without any additional code.

I don't understand this at all.

r~

Richard Henderson April 30, 2021, 2:31 p.m. UTC | #4

On 4/30/21 7:05 AM, Matheus K. Ferst wrote:
>> +ADDI            000001 10 0--.-- ..................     \

>> +                001110 ..... ..... ................     @PLS_D

> 

> I'm not sure about this. It's a bit surprising to find ADDI here, and the 

> comment that explains why is likely to be ignored after the big copyright header.


You could move the comment closer, and replicate, e.g.

ADDI .... \
      .... @PLS_D # PADDI


> I'd prefer to keep a trans_PADDI like

> 

>  > static bool trans_PADDI(DisasContext *ctx, arg_PLS_D *a)

>  > {

>  >     if(!resolve_PLS_D(ctx, a)) {

>  >         return false;

>  >     }

>  >     return trans_ADDI(ctx, a);

>  > }


But in this case ADDI probably doesn't use PLS_D.  You could use

static bool trans_PADDI(DisasContext *ctx, arg_PLS_D *a)
{
     arg_D d;
     if (!resolve_PLS_D(ctx, &d, a)) {
         return false;
     }
     return trans_ADDI(ctx, &d);
}

making sure to use int64_t in the offset for arg_D.

> It's the middle way between v2 and v3. trans_ADDI code is reused, it'll 

> probably be optimized as a tail call, and resolve_PLS_D is not called when it's 

> not needed.


My version won't tail-call, because of the escaping local storage, but I don't 
see how you can avoid it.


r~

Matheus K. Ferst April 30, 2021, 6:02 p.m. UTC | #5

On 30/04/2021 11:31, Richard Henderson wrote:
> On 4/30/21 7:05 AM, Matheus K. Ferst wrote:

>>> +ADDI            000001 10 0--.-- ..................     \

>>> +                001110 ..... ..... ................     @PLS_D

>>

>> I'm not sure about this. It's a bit surprising to find ADDI here, and 

>> the comment that explains why is likely to be ignored after the big 

>> copyright header.

> 

> You could move the comment closer, and replicate, e.g.

> 

> ADDI .... \

>       .... @PLS_D # PADDI

> 

> 


If we keep this naming, IMHO moving the comment closer looks better.

>> I'd prefer to keep a trans_PADDI like

>>

>>  > static bool trans_PADDI(DisasContext *ctx, arg_PLS_D *a)

>>  > {

>>  >     if(!resolve_PLS_D(ctx, a)) {

>>  >         return false;

>>  >     }

>>  >     return trans_ADDI(ctx, a);

>>  > }

> 

> But in this case ADDI probably doesn't use PLS_D.  You could use

> 

> static bool trans_PADDI(DisasContext *ctx, arg_PLS_D *a)

> {

>      arg_D d;

>      if (!resolve_PLS_D(ctx, &d, a)) {

>          return false;

>      }

>      return trans_ADDI(ctx, &d);

> }

> 

> making sure to use int64_t in the offset for arg_D.

> 


We'd keep trans_ADDI with the same signature to avoid creating an arg_D 
on the stack. Patch 4 added type specification, maybe we can define an 
arg_D within arg_PLD_D? I'll play a bit to see if it works.

>> It's the middle way between v2 and v3. trans_ADDI code is reused, 

>> it'll probably be optimized as a tail call, and resolve_PLS_D is not 

>> called when it's not needed.

> 

> My version won't tail-call, because of the escaping local storage, but I 

> don't see how you can avoid it.

> 

> 

> r~


I haven't been able to test it properly yet, but at least on godbolt it 
seems that the compiler prefers to inline over tail call, so maybe 
that's not a problem.

Thanks,
Matheus K. Ferst
Instituto de Pesquisas ELDORADO <http://www.eldorado.org.br/>
Analista de Software Júnior
Aviso Legal - Disclaimer <https://www.eldorado.org.br/disclaimer.html>

Richard Henderson April 30, 2021, 6:43 p.m. UTC | #6

On 4/30/21 11:02 AM, Matheus K. Ferst wrote:
>> But in this case ADDI probably doesn't use PLS_D.  You could use

>>

>> static bool trans_PADDI(DisasContext *ctx, arg_PLS_D *a)

>> {

>>      arg_D d;

>>      if (!resolve_PLS_D(ctx, &d, a)) {

>>          return false;

>>      }

>>      return trans_ADDI(ctx, &d);

>> }

>>

>> making sure to use int64_t in the offset for arg_D.

>>

> 

> We'd keep trans_ADDI with the same signature to avoid creating an arg_D on the 

> stack. Patch 4 added type specification, maybe we can define an arg_D within 

> arg_PLD_D? I'll play a bit to see if it works.


That starts to creep, with e.g. ADDIS now requiring arg_PLD_D.  You'll want to 
audit the other D-form insns to see what other special cases there are.

r~

Luis Fernando Fujita Pires April 30, 2021, 6:45 p.m. UTC | #7

From: Richard Henderson <richard.henderson@linaro.org>

> On 4/30/21 4:23 AM, Luis Fernando Fujita Pires wrote:

> > I think we should reconsider using the same .decode file for both 32-

> > and 64-bit instructions, to avoid duplicating argument set

> > definitions, and to keep the prefixed instructions close to their non-prefixed

> counterparts.

> varinsnwidth assumes there is no easy way to determine, before decoding, the

> width of the instruction.  The way this is implemented in decodetree is vastly less

> optimal than what we can do with a few lines for ppc.

I tried to solve this with one of the previous decodetree patches ("decodetree: Allow custom var width load functions"), whose goal was to allow us to implement a custom instruction load function (in reality, the only effect it had inside decodetree.py was to not generate the _load function).
So the instruction load would still be handled by a simple function inside translate.c, but we would use the auto-generated decode() function to call the trans_XX() functions.

> In addition, there's a rough spot in %field definitions.  You can't share those

> between patterns of different sizes, which can get confusing.  Have a look at

> target/rx, and the definitions of %b[23]_r_0, which is the same field for 2 and 3-

> byte insns.

Right. In the current patch we're already using separate definitions for 'si' depending on the format (%pls_si and %ds_si below):

&PLS_D          rt ra si:int64_t r:bool

%pls_si         32:s18 0:16
@PLS_D          ...... .. ... r:1 .. .................. \
                ...... rt:5 ra:5 ................       \
                &PLS_D si=%pls_si

@PLS_D_32       ...... rt:5 ra:5 si:s16                 &PLS_D r=0

%ds_si          2:s14  !function=times_4
@PLS_DS_32      ...... rt:5 ra:5 .............. ..      &PLS_D si=%ds_si r=0

And I also had to create separate @formats for 32- and 64-bit versions (@PLS_D, @PLS_D_32, etc.), which isn't that nice either.

> The replication of argument set definitions is unfortunate, but in the end will

> only be a handful of lines.  We could probably come up with a way to avoid that

> too, via a decodetree extension, if you really insist.  (My vague idea there would

> put the argument set definitions into a 3rd file, included on the decodetree

> command-line.)

I think we can already pass multiple files to decodetree.py and it will handle them correctly. I just didn't find a way to do that from the meson build files, which assume decodetree will always use a single input file.
Another option would be to allow files to be included from inside other .decode files.

> > And, in order to share the trans_PADDI/ADDI implementation, maybe add

> something to decodetree.py to allow us to specify that an instruction shares the

> trans_XX() implementation from another one, such as:

> > ADDI            001110 ..... ..... ................     @PLS_D_32 !impl=PADDI

> 

> This is done by using the same name up front.

> If you like, add a comment to give the real instruction name.

> 

> PADDI   001110 ..... ..... ................     @PLS_D_32 # ADDI

> 

> 

> > This way, we could (and would need to, in fact) keep the 'P' in the prefixed

> instruction names, but at the same time avoid having extra trans_XX functions

> just calling another one without any additional code.

> 

> I don't understand this at all.

Not a big deal. I was just referring to the fact that, in the current patch, you noted that the instruction names in insn64.decode were not prefixed by "P" due to the code sharing with the 32-bit instructions.

Thanks!
Luis

Richard Henderson April 30, 2021, 7:11 p.m. UTC | #8

On 4/30/21 11:45 AM, Luis Fernando Fujita Pires wrote:
> I think we can already pass multiple files to decodetree.py and it will handle them correctly. I just didn't find a way to do that from the meson build files, which assume decodetree will always use a single input file.


Oh, riscv does this via extra_args:.

r~

Luis Fernando Fujita Pires April 30, 2021, 8:32 p.m. UTC | #9

From: Richard Henderson <richard.henderson@linaro.org>

> On 4/30/21 11:45 AM, Luis Fernando Fujita Pires wrote:

> > I think we can already pass multiple files to decodetree.py and it will handle

> them correctly. I just didn't find a way to do that from the meson build files,

> which assume decodetree will always use a single input file.

> 

> Oh, riscv does this via extra_args:.

> 

> r~


The build system probably will fail to detect that a rebuild is needed if the file passed in through extra_args is changed though, right?

Richard Henderson April 30, 2021, 10:29 p.m. UTC | #10

On 4/30/21 1:32 PM, Luis Fernando Fujita Pires wrote:
> From: Richard Henderson <richard.henderson@linaro.org>

>> On 4/30/21 11:45 AM, Luis Fernando Fujita Pires wrote:

>>> I think we can already pass multiple files to decodetree.py and it will handle

>> them correctly. I just didn't find a way to do that from the meson build files,

>> which assume decodetree will always use a single input file.

>>

>> Oh, riscv does this via extra_args:.

>>

>> r~

> 

> The build system probably will fail to detect that a rebuild is needed if the file passed in through extra_args is changed though, right?


Oh, true.  Good thing there are patches for riscv to remove that.  :-P


r~

Matheus K. Ferst April 30, 2021, 11:29 p.m. UTC | #11

On 30/04/2021 15:43, Richard Henderson wrote:
> On 4/30/21 11:02 AM, Matheus K. Ferst wrote:

>>> But in this case ADDI probably doesn't use PLS_D.  You could use

>>> 

>>> static bool trans_PADDI(DisasContext *ctx, arg_PLS_D *a) { arg_D 

>>> d; if (!resolve_PLS_D(ctx, &d, a)) { return false; } return 

>>> trans_ADDI(ctx, &d); }

>>> 

>>> making sure to use int64_t in the offset for arg_D.

>>> 

>> 

>> We'd keep trans_ADDI with the same signature to avoid creating an 

>> arg_D on the stack. Patch 4 added type specification, maybe we can 

>> define an arg_D within arg_PLD_D? I'll play a bit to see if it 

>> works.

> 

> That starts to creep, with e.g. ADDIS now requiring arg_PLD_D. You'll

> want to audit the other D-form insns to see what other special cases

> there are.

> 

> r~

Well, anything that shares implementation with a prefixed instruction
would use the prefixed arg.

I got arg_D within arg_PLD_D using !function and calling
decode_insn32_extract_D. A bit on the ugly side, and probably not
worth the effort. Maybe changing decodetree would help, but that's 
another patch, for another series.

Anyway, my compiler (GCC 9.3) is inlining trans_ADDI calls with -O3, so 
I'd say that your trans_PADDI from the previous message, with trans_ADDI 
and trans_ADDIS using arg_D, would be the cleaner solution.

Thanks,
Matheus K. Ferst
Instituto de Pesquisas ELDORADO <http://www.eldorado.org.br/>
Analista de Software Júnior
Aviso Legal - Disclaimer <https://www.eldorado.org.br/disclaimer.html>

[v3,25/30] target/ppc: Move ADDI, ADDIS to decodetree, implement PADDI

Commit Message

Comments

Patch