[10/11] argp: Fix shift bug

Message ID	20250507142110.3452012-11-adhemerval.zanella@linaro.org
State	New
Headers
Delivered-To: patch@linaro.org
Received-SPF: pass (google.com: domain of libc-alpha-bounces~patch=linaro.org@sourceware.org designates 2620:52:3:1:0:246e:9693:128c as permitted sender) client-ip=2620:52:3:1:0:246e:9693:128c;
DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 8FF533858D34
From: Adhemerval Zanella <adhemerval.zanella@linaro.org>
To: libc-alpha@sourceware.org
Cc: Carlos O'Donell <carlos@redhat.com>
Subject: [PATCH 10/11] argp: Fix shift bug
Date: Wed, 7 May 2025 11:17:28 -0300
Message-ID: <20250507142110.3452012-11-adhemerval.zanella@linaro.org>
In-Reply-To: <20250507142110.3452012-1-adhemerval.zanella@linaro.org>
References: <20250507142110.3452012-1-adhemerval.zanella@linaro.org>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Precedence: list
Errors-To: libc-alpha-bounces~patch=linaro.org@sourceware.org
Series	Add initial support for --enable-ubsan
[00/11] Add initial support for --enable-ubsan
[01/11] ubsan: Add initial support for -fsanitize=undefined
[02/11] riscv: Fix --enable-ubsan build failure on riscv
[03/11] locale: Fix --enable-ubsan build failure on some ABIs
[04/11] elf: Adjust DT_EXTRATAGIDX to avoid undefined shifts
[05/11] locate: Fix UB on memcpy call
[06/11] locale: Fix UB on insert_weights
[07/11] localte: Fix UB on collate_finish
[08/11] locale: Fix UB in elem_hash
[09/11] locale: Fix UB on add_locale_uint32_array
[10/11] argp: Fix shift bug
[11/11] elf: Fix UB on _dl_map_object_from_fd

Comments

Florian Weimer May 7, 2025, 7:42 p.m. UTC | #1

* Adhemerval Zanella:

> From gnulib commits 06094e390b0 and 88033d3779362a.
> ---
>  argp/argp-parse.c | 15 +++++++++------
>  1 file changed, 9 insertions(+), 6 deletions(-)
>
> diff --git a/argp/argp-parse.c b/argp/argp-parse.c
> index 82c7b784de..99f8d9ecd4 100644
> --- a/argp/argp-parse.c
> +++ b/argp/argp-parse.c
> @@ -735,12 +735,15 @@ parser_parse_opt (struct parser *parser, int opt, char *val)
>  	    }
>      }
>    else
> -    /* A long option.  We use shifts instead of masking for extracting
> -       the user value in order to preserve the sign.  */
> -    err =
> -      group_parse (&parser->groups[group_key - 1], &parser->state,
> -		   (opt << GROUP_BITS) >> GROUP_BITS,
> -		   parser->opt_data.optarg);
> +    /* A long option.  Preserve the sign in the user key, without
> +       invoking undefined behavior.  Assume two's complement.  */
> +    {
> +      int user_key =
> +        ((opt & (1 << (USER_BITS - 1))) ? ~USER_MASK : 0) | (opt & USER_MASK);

Would this be clearer?

		   (int) ((unsigned int) opt << GROUP_BITS) >> GROUP_BITS,

Or does ubsan flag that as well?  Conversion to negative int is a GCC
extension:

| For conversion to a type of width N, the value is reduced modulo 2^N
| to be within range of the type; no signal is raised.

<https://gcc.gnu.org/onlinedocs/gcc-15.1.0/gcc/Integers-implementation.html>
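For reference, the two candidate expressions can be compared side by side. This is only a sketch: the macro definitions are assumed to mirror the ones in argp/argp-parse.c (they are not part of the quoted hunk), the function names `user_key_mask` and `user_key_cast` are made up for illustration, and a 32-bit `int` is assumed. The original `(opt << GROUP_BITS)` is undefined in ISO C whenever the shifted value is not representable in `int`, which is the case for any long option with a nonzero group in the high bits.

```c
#include <assert.h>
#include <limits.h>

/* Assumed to mirror the definitions in argp/argp-parse.c; they are not
   part of the quoted hunk.  */
#define GROUP_BITS CHAR_BIT
#define USER_BITS ((int) (sizeof (int) * CHAR_BIT) - GROUP_BITS)
#define USER_MASK ((1 << USER_BITS) - 1)

/* Patch version: sign-extend the low USER_BITS by hand.  No signed
   value is ever shifted out of range.  */
static int
user_key_mask (int opt)
{
  return ((opt & (1 << (USER_BITS - 1))) ? ~USER_MASK : 0) | (opt & USER_MASK);
}

/* Suggested alternative: do the left shift on unsigned int.  The
   conversion back to int and the right shift of a negative value are
   implementation-defined in ISO C; GCC documents both (modulo
   reduction and sign extension).  */
static int
user_key_cast (int opt)
{
  return (int) ((unsigned int) opt << GROUP_BITS) >> GROUP_BITS;
}
```

With GCC on a two's complement target, both functions should return -5 for `0x03FFFFFB` (group key 3, user key -5) and 300 for `0x0300012C`.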

Adhemerval Zanella May 7, 2025, 8:44 p.m. UTC | #2

On 07/05/25 16:42, Florian Weimer wrote:
> * Adhemerval Zanella:
> 
>> From gnulib commits 06094e390b0 and 88033d3779362a.
>> ---
>>  argp/argp-parse.c | 15 +++++++++------
>>  1 file changed, 9 insertions(+), 6 deletions(-)
>>
>> diff --git a/argp/argp-parse.c b/argp/argp-parse.c
>> index 82c7b784de..99f8d9ecd4 100644
>> --- a/argp/argp-parse.c
>> +++ b/argp/argp-parse.c
>> @@ -735,12 +735,15 @@ parser_parse_opt (struct parser *parser, int opt, char *val)
>>  	    }
>>      }
>>    else
>> -    /* A long option.  We use shifts instead of masking for extracting
>> -       the user value in order to preserve the sign.  */
>> -    err =
>> -      group_parse (&parser->groups[group_key - 1], &parser->state,
>> -		   (opt << GROUP_BITS) >> GROUP_BITS,
>> -		   parser->opt_data.optarg);
>> +    /* A long option.  Preserve the sign in the user key, without
>> +       invoking undefined behavior.  Assume two's complement.  */
>> +    {
>> +      int user_key =
>> +        ((opt & (1 << (USER_BITS - 1))) ? ~USER_MASK : 0) | (opt & USER_MASK);
> 
> Would this be clearer?
> 
> 		   (int) ((unsigned int) opt << GROUP_BITS) >> GROUP_BITS,

This would be another gnulib deviation, and the code comes from gnulib.
I think we should keep it simple to make the sync more straightforward.

> 
> Or does ubsan flag that as well?  Conversion to negative int is a GCC
> extension:
> 
> | For conversion to a type of width N, the value is reduced modulo 2^N
> | to be within range of the type; no signal is raised.

Yeah, but -fsanitize=undefined still flags this as UB.  I can add an
option to suppress this kind of shift, but that would add a bit more
complexity to the handler.

> 
> <https://gcc.gnu.org/onlinedocs/gcc-15.1.0/gcc/Integers-implementation.html>

diff --git a/argp/argp-parse.c b/argp/argp-parse.c
index 82c7b784de..99f8d9ecd4 100644
--- a/argp/argp-parse.c
+++ b/argp/argp-parse.c
@@ -735,12 +735,15 @@  parser_parse_opt (struct parser *parser, int opt, char *val)
 	    }
     }
   else
-    /* A long option.  We use shifts instead of masking for extracting
-       the user value in order to preserve the sign.  */
-    err =
-      group_parse (&parser->groups[group_key - 1], &parser->state,
-		   (opt << GROUP_BITS) >> GROUP_BITS,
-		   parser->opt_data.optarg);
+    /* A long option.  Preserve the sign in the user key, without
+       invoking undefined behavior.  Assume two's complement.  */
+    {
+      int user_key =
+        ((opt & (1 << (USER_BITS - 1))) ? ~USER_MASK : 0) | (opt & USER_MASK);
+      err =
+        group_parse (&parser->groups[group_key - 1], &parser->state,
+                     user_key, parser->opt_data.optarg);
+    }
 
   if (err == EBADKEY)
     /* At least currently, an option not recognized is an error in the
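To see why the sign has to be restored at all, here is a small round-trip sketch of the encoding the hunk decodes. It assumes that a long option value carries the group number plus one in the top GROUP_BITS bits (which the `group_key - 1` index above suggests) and the user's key truncated to the low USER_BITS bits. The helper names are hypothetical and do not exist in argp; the macros are assumed to match argp/argp-parse.c, and a 32-bit `int` is assumed.

```c
#include <assert.h>
#include <limits.h>

/* Assumed to mirror argp/argp-parse.c.  */
#define GROUP_BITS CHAR_BIT
#define USER_BITS ((int) (sizeof (int) * CHAR_BIT) - GROUP_BITS)
#define USER_MASK ((1 << USER_BITS) - 1)

/* Hypothetical helper: pack a group index and a user key into one
   option value.  The range check keeps the left shift representable
   in int, so the shift itself is well defined.  */
static int
pack_long_opt (int group_index, int key)
{
  int group_key = group_index + 1;
  assert (group_key > 0 && group_key < (1 << (GROUP_BITS - 1)));
  return (group_key << USER_BITS) | (key & USER_MASK);
}

/* Hypothetical helper: recover the group key (group index plus one).  */
static int
unpack_group_key (int opt)
{
  return opt >> USER_BITS;
}

/* Same expression as in the patch: restore the sign of the user key
   that pack_long_opt truncated to USER_BITS bits.  */
static int
unpack_user_key (int opt)
{
  return ((opt & (1 << (USER_BITS - 1))) ? ~USER_MASK : 0) | (opt & USER_MASK);
}
```

A negative key such as -5 loses its upper bits when packed, so a plain `opt & USER_MASK` would hand the user parser 0xFFFFFB instead of -5. The round trip only holds for keys that fit in a signed USER_BITS-bit value.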
