diff mbox series

pinctrl: qcom: Clear latched interrupt status when changing IRQ type

Message ID 20250312-pinctrl-msm-type-latch-v1-1-ce87c561d3d7@linaro.org
State New
Headers show
Series pinctrl: qcom: Clear latched interrupt status when changing IRQ type | expand

Commit Message

Stephan Gerhold March 12, 2025, 1:19 p.m. UTC
When submitting the TLMM test driver, Bjorn reported that some of the test
cases are failing for GPIOs that not are backed by PDC (i.e. "non-wakeup"
GPIOs that are handled directly in pinctrl-msm). Basically, lingering
latched interrupt state is still being delivered at IRQ request time, e.g.:

  ok 1 tlmm_test_silent_rising
  tlmm_test_silent_falling: ASSERTION FAILED at drivers/pinctrl/qcom/tlmm-test.c:178
  Expected atomic_read(&priv->intr_count) == 0, but
      atomic_read(&priv->intr_count) == 1 (0x1)
  not ok 2 tlmm_test_silent_falling
  tlmm_test_silent_low: ASSERTION FAILED at drivers/pinctrl/qcom/tlmm-test.c:178
  Expected atomic_read(&priv->intr_count) == 0, but
      atomic_read(&priv->intr_count) == 1 (0x1)
  not ok 3 tlmm_test_silent_low
  ok 4 tlmm_test_silent_high

Whether to report interrupts that came in while the IRQ was unclaimed
doesn't seem to be well-defined in the Linux IRQ API. However, looking
closer at these specific cases, we're actually reporting events that do not
match the interrupt type requested by the driver:

 1. After "ok 1 tlmm_test_silent_rising", the GPIO is in low state and
    configured for IRQF_TRIGGER_RISING.

 2. (a) In preparation for "tlmm_test_silent_falling", the GPIO is switched
        to high state. The rising interrupt gets latched.
    (b) The GPIO is re-configured for IRQF_TRIGGER_FALLING, but the latched
        interrupt isn't cleared.
    (c) The IRQ handler is called for the latched interrupt, but there
        wasn't any falling edge.

 3. (a) For "tlmm_test_silent_low", the GPIO remains in high state.
    (b) The GPIO is re-configured for IRQF_TRIGGER_LOW. This seems to
        result in a phantom interrupt that gets latched.
    (c) The IRQ handler is called for the latched interrupt, but the GPIO
        isn't in low state.

 4. (a) For "tlmm_test_silent_high", the GPIO is switched to low state.
    (b) This doesn't result in a latched interrupt, because RAW_STATUS_EN
        was cleared when masking the level-triggered interrupt.

Fix this by clearing the interrupt state whenever making any changes to the
interrupt configuration. This includes previously disabled interrupts, but
also any changes to interrupt polarity or detection type.

With this change, all 16 test cases are now passing for the non-wakeup
GPIOs in the TLMM.

Cc: stable@vger.kernel.org
Fixes: cf9d052aa600 ("pinctrl: qcom: Don't clear pending interrupts when enabling")
Reported-by: Bjorn Andersson <bjorn.andersson@oss.qualcomm.com>
Closes: https://lore.kernel.org/r/20250227-tlmm-test-v1-1-d18877b4a5db@oss.qualcomm.com/
Signed-off-by: Stephan Gerhold <stephan.gerhold@linaro.org>
---
 drivers/pinctrl/qcom/pinctrl-msm.c | 12 ++++++------
 1 file changed, 6 insertions(+), 6 deletions(-)


---
base-commit: e058c5f49ceff38bf1579a679a5ca20842718579
change-id: 20250311-pinctrl-msm-type-latch-6099aede7d92

Best regards,

Comments

Bjorn Andersson March 13, 2025, 9:05 p.m. UTC | #1
On Wed, Mar 12, 2025 at 02:19:27PM +0100, Stephan Gerhold wrote:
> When submitting the TLMM test driver, Bjorn reported that some of the test
> cases are failing for GPIOs that not are backed by PDC (i.e. "non-wakeup"
> GPIOs that are handled directly in pinctrl-msm). Basically, lingering
> latched interrupt state is still being delivered at IRQ request time, e.g.:
> 
>   ok 1 tlmm_test_silent_rising
>   tlmm_test_silent_falling: ASSERTION FAILED at drivers/pinctrl/qcom/tlmm-test.c:178
>   Expected atomic_read(&priv->intr_count) == 0, but
>       atomic_read(&priv->intr_count) == 1 (0x1)
>   not ok 2 tlmm_test_silent_falling
>   tlmm_test_silent_low: ASSERTION FAILED at drivers/pinctrl/qcom/tlmm-test.c:178
>   Expected atomic_read(&priv->intr_count) == 0, but
>       atomic_read(&priv->intr_count) == 1 (0x1)
>   not ok 3 tlmm_test_silent_low
>   ok 4 tlmm_test_silent_high
> 
> Whether to report interrupts that came in while the IRQ was unclaimed
> doesn't seem to be well-defined in the Linux IRQ API. However, looking
> closer at these specific cases, we're actually reporting events that do not
> match the interrupt type requested by the driver:
> 
>  1. After "ok 1 tlmm_test_silent_rising", the GPIO is in low state and
>     configured for IRQF_TRIGGER_RISING.
> 
>  2. (a) In preparation for "tlmm_test_silent_falling", the GPIO is switched
>         to high state. The rising interrupt gets latched.
>     (b) The GPIO is re-configured for IRQF_TRIGGER_FALLING, but the latched
>         interrupt isn't cleared.
>     (c) The IRQ handler is called for the latched interrupt, but there
>         wasn't any falling edge.
> 
>  3. (a) For "tlmm_test_silent_low", the GPIO remains in high state.
>     (b) The GPIO is re-configured for IRQF_TRIGGER_LOW. This seems to
>         result in a phantom interrupt that gets latched.
>     (c) The IRQ handler is called for the latched interrupt, but the GPIO
>         isn't in low state.
> 
>  4. (a) For "tlmm_test_silent_high", the GPIO is switched to low state.
>     (b) This doesn't result in a latched interrupt, because RAW_STATUS_EN
>         was cleared when masking the level-triggered interrupt.
> 
> Fix this by clearing the interrupt state whenever making any changes to the
> interrupt configuration. This includes previously disabled interrupts, but
> also any changes to interrupt polarity or detection type.
> 
> With this change, all 16 test cases are now passing for the non-wakeup
> GPIOs in the TLMM.
> 
> Cc: stable@vger.kernel.org
> Fixes: cf9d052aa600 ("pinctrl: qcom: Don't clear pending interrupts when enabling")
> Reported-by: Bjorn Andersson <bjorn.andersson@oss.qualcomm.com>
> Closes: https://lore.kernel.org/r/20250227-tlmm-test-v1-1-d18877b4a5db@oss.qualcomm.com/
> Signed-off-by: Stephan Gerhold <stephan.gerhold@linaro.org>

Tested-by: Bjorn Andersson <andersson@kernel.org>
Reviewed-by: Bjorn Andersson <andersson@kernel.org>

Regards,
Bjorn

> ---
>  drivers/pinctrl/qcom/pinctrl-msm.c | 12 ++++++------
>  1 file changed, 6 insertions(+), 6 deletions(-)
> 
> diff --git a/drivers/pinctrl/qcom/pinctrl-msm.c b/drivers/pinctrl/qcom/pinctrl-msm.c
> index 47daa47153c970190b0d469dc8d245b3cbeace5e..82f0cc43bbf4f4d24f078af2d0a515d3a03b961a 100644
> --- a/drivers/pinctrl/qcom/pinctrl-msm.c
> +++ b/drivers/pinctrl/qcom/pinctrl-msm.c
> @@ -1045,8 +1045,7 @@ static int msm_gpio_irq_set_type(struct irq_data *d, unsigned int type)
>  	const struct msm_pingroup *g;
>  	u32 intr_target_mask = GENMASK(2, 0);
>  	unsigned long flags;
> -	bool was_enabled;
> -	u32 val;
> +	u32 val, oldval;
>  
>  	if (msm_gpio_needs_dual_edge_parent_workaround(d, type)) {
>  		set_bit(d->hwirq, pctrl->dual_edge_irqs);
> @@ -1108,8 +1107,7 @@ static int msm_gpio_irq_set_type(struct irq_data *d, unsigned int type)
>  	 * internal circuitry of TLMM, toggling the RAW_STATUS
>  	 * could cause the INTR_STATUS to be set for EDGE interrupts.
>  	 */
> -	val = msm_readl_intr_cfg(pctrl, g);
> -	was_enabled = val & BIT(g->intr_raw_status_bit);
> +	val = oldval = msm_readl_intr_cfg(pctrl, g);
>  	val |= BIT(g->intr_raw_status_bit);
>  	if (g->intr_detection_width == 2) {
>  		val &= ~(3 << g->intr_detection_bit);
> @@ -1162,9 +1160,11 @@ static int msm_gpio_irq_set_type(struct irq_data *d, unsigned int type)
>  	/*
>  	 * The first time we set RAW_STATUS_EN it could trigger an interrupt.
>  	 * Clear the interrupt.  This is safe because we have
> -	 * IRQCHIP_SET_TYPE_MASKED.
> +	 * IRQCHIP_SET_TYPE_MASKED. When changing the interrupt type, we could
> +	 * also still have a non-matching interrupt latched, so clear whenever
> +	 * making changes to the interrupt configuration.
>  	 */
> -	if (!was_enabled)
> +	if (val != oldval)
>  		msm_ack_intr_status(pctrl, g);
>  
>  	if (test_bit(d->hwirq, pctrl->dual_edge_irqs))
> 
> ---
> base-commit: e058c5f49ceff38bf1579a679a5ca20842718579
> change-id: 20250311-pinctrl-msm-type-latch-6099aede7d92
> 
> Best regards,
> -- 
> Stephan Gerhold <stephan.gerhold@linaro.org>
>
Linus Walleij March 14, 2025, 11:04 a.m. UTC | #2
On Wed, Mar 12, 2025 at 2:19 PM Stephan Gerhold
<stephan.gerhold@linaro.org> wrote:

> When submitting the TLMM test driver, Bjorn reported that some of the test
> cases are failing for GPIOs that not are backed by PDC (i.e. "non-wakeup"
> GPIOs that are handled directly in pinctrl-msm). Basically, lingering
> latched interrupt state is still being delivered at IRQ request time, e.g.:
>
>   ok 1 tlmm_test_silent_rising
>   tlmm_test_silent_falling: ASSERTION FAILED at drivers/pinctrl/qcom/tlmm-test.c:178
>   Expected atomic_read(&priv->intr_count) == 0, but
>       atomic_read(&priv->intr_count) == 1 (0x1)
>   not ok 2 tlmm_test_silent_falling
>   tlmm_test_silent_low: ASSERTION FAILED at drivers/pinctrl/qcom/tlmm-test.c:178
>   Expected atomic_read(&priv->intr_count) == 0, but
>       atomic_read(&priv->intr_count) == 1 (0x1)
>   not ok 3 tlmm_test_silent_low
>   ok 4 tlmm_test_silent_high
>
> Whether to report interrupts that came in while the IRQ was unclaimed
> doesn't seem to be well-defined in the Linux IRQ API. However, looking
> closer at these specific cases, we're actually reporting events that do not
> match the interrupt type requested by the driver:
>
>  1. After "ok 1 tlmm_test_silent_rising", the GPIO is in low state and
>     configured for IRQF_TRIGGER_RISING.
>
>  2. (a) In preparation for "tlmm_test_silent_falling", the GPIO is switched
>         to high state. The rising interrupt gets latched.
>     (b) The GPIO is re-configured for IRQF_TRIGGER_FALLING, but the latched
>         interrupt isn't cleared.
>     (c) The IRQ handler is called for the latched interrupt, but there
>         wasn't any falling edge.
>
>  3. (a) For "tlmm_test_silent_low", the GPIO remains in high state.
>     (b) The GPIO is re-configured for IRQF_TRIGGER_LOW. This seems to
>         result in a phantom interrupt that gets latched.
>     (c) The IRQ handler is called for the latched interrupt, but the GPIO
>         isn't in low state.
>
>  4. (a) For "tlmm_test_silent_high", the GPIO is switched to low state.
>     (b) This doesn't result in a latched interrupt, because RAW_STATUS_EN
>         was cleared when masking the level-triggered interrupt.
>
> Fix this by clearing the interrupt state whenever making any changes to the
> interrupt configuration. This includes previously disabled interrupts, but
> also any changes to interrupt polarity or detection type.
>
> With this change, all 16 test cases are now passing for the non-wakeup
> GPIOs in the TLMM.
>
> Cc: stable@vger.kernel.org
> Fixes: cf9d052aa600 ("pinctrl: qcom: Don't clear pending interrupts when enabling")
> Reported-by: Bjorn Andersson <bjorn.andersson@oss.qualcomm.com>
> Closes: https://lore.kernel.org/r/20250227-tlmm-test-v1-1-d18877b4a5db@oss.qualcomm.com/
> Signed-off-by: Stephan Gerhold <stephan.gerhold@linaro.org>

Patch applied!

Yours,
Linus Walleij
Stephen Boyd March 15, 2025, 7:07 a.m. UTC | #3
Quoting Stephan Gerhold (2025-03-12 06:19:27)
> When submitting the TLMM test driver, Bjorn reported that some of the test
> cases are failing for GPIOs that not are backed by PDC (i.e. "non-wakeup"
> GPIOs that are handled directly in pinctrl-msm). Basically, lingering
> latched interrupt state is still being delivered at IRQ request time, e.g.:
>
>   ok 1 tlmm_test_silent_rising
>   tlmm_test_silent_falling: ASSERTION FAILED at drivers/pinctrl/qcom/tlmm-test.c:178

I wish it was called pinctrl-msm-test.c but oh well!

>   Expected atomic_read(&priv->intr_count) == 0, but
>       atomic_read(&priv->intr_count) == 1 (0x1)
>   not ok 2 tlmm_test_silent_falling
>   tlmm_test_silent_low: ASSERTION FAILED at drivers/pinctrl/qcom/tlmm-test.c:178
>   Expected atomic_read(&priv->intr_count) == 0, but
>       atomic_read(&priv->intr_count) == 1 (0x1)
>   not ok 3 tlmm_test_silent_low
>   ok 4 tlmm_test_silent_high
>
> Whether to report interrupts that came in while the IRQ was unclaimed
> doesn't seem to be well-defined in the Linux IRQ API. However, looking
> closer at these specific cases, we're actually reporting events that do not
> match the interrupt type requested by the driver:
>
>  1. After "ok 1 tlmm_test_silent_rising", the GPIO is in low state and
>     configured for IRQF_TRIGGER_RISING.
>
>  2. (a) In preparation for "tlmm_test_silent_falling", the GPIO is switched
>         to high state. The rising interrupt gets latched.

Is the interrupt unmasked here while the test is driving the GPIO line
high and the interrupt trigger is IRQF_TRIGGER_RISING? If so, this is
correct behavior.

Why wouldn't the trigger be set to IRQF_TRIGGER_FALLING, then the GPIO
driven high, and then the GPIO driven low for the test to confirm
falling edges work?

Have you seen the big comment in msm_gpio_irq_mask() and how it says we
want to latch edge interrupts even when the interrupt is masked?

>     (b) The GPIO is re-configured for IRQF_TRIGGER_FALLING, but the latched
>         interrupt isn't cleared.
>     (c) The IRQ handler is called for the latched interrupt, but there
>         wasn't any falling edge.
>
>  3. (a) For "tlmm_test_silent_low", the GPIO remains in high state.
>     (b) The GPIO is re-configured for IRQF_TRIGGER_LOW. This seems to
>         result in a phantom interrupt that gets latched.
>     (c) The IRQ handler is called for the latched interrupt, but the GPIO
>         isn't in low state.

Is the test causing phantom behavior by writing to the interrupt
hardware?

>
>  4. (a) For "tlmm_test_silent_high", the GPIO is switched to low state.
>     (b) This doesn't result in a latched interrupt, because RAW_STATUS_EN
>         was cleared when masking the level-triggered interrupt.
>
> Fix this by clearing the interrupt state whenever making any changes to the
> interrupt configuration. This includes previously disabled interrupts, but
> also any changes to interrupt polarity or detection type.

How do we avoid the case where an interrupt happens to come in while the
polarity is being changed? Won't we ignore such an interrupt now? If
these are edge interrupts that's quite bad because we may never see the
interrupt again.

I think we erred on the side of caution here and let extra edge
interrupts through because a rising or falling edge usually means the
interrupt handler just wants to run when there's some event and it will
do the work to find out if it was spurious or not. It's been years
though so I may have forgotten how this hardware works. It just makes me
very nervous that we're going to miss edges now that we always clear the
interrupt.
Bjorn Andersson March 17, 2025, 2:49 a.m. UTC | #4
On Sat, Mar 15, 2025 at 12:07:14AM -0700, Stephen Boyd wrote:
> Quoting Stephan Gerhold (2025-03-12 06:19:27)
> > When submitting the TLMM test driver, Bjorn reported that some of the test
> > cases are failing for GPIOs that not are backed by PDC (i.e. "non-wakeup"
> > GPIOs that are handled directly in pinctrl-msm). Basically, lingering
> > latched interrupt state is still being delivered at IRQ request time, e.g.:
> >
> >   ok 1 tlmm_test_silent_rising
> >   tlmm_test_silent_falling: ASSERTION FAILED at drivers/pinctrl/qcom/tlmm-test.c:178
> 
> I wish it was called pinctrl-msm-test.c but oh well!
> 
> >   Expected atomic_read(&priv->intr_count) == 0, but
> >       atomic_read(&priv->intr_count) == 1 (0x1)
> >   not ok 2 tlmm_test_silent_falling
> >   tlmm_test_silent_low: ASSERTION FAILED at drivers/pinctrl/qcom/tlmm-test.c:178
> >   Expected atomic_read(&priv->intr_count) == 0, but
> >       atomic_read(&priv->intr_count) == 1 (0x1)
> >   not ok 3 tlmm_test_silent_low
> >   ok 4 tlmm_test_silent_high
> >
> > Whether to report interrupts that came in while the IRQ was unclaimed
> > doesn't seem to be well-defined in the Linux IRQ API. However, looking
> > closer at these specific cases, we're actually reporting events that do not
> > match the interrupt type requested by the driver:
> >
> >  1. After "ok 1 tlmm_test_silent_rising", the GPIO is in low state and
> >     configured for IRQF_TRIGGER_RISING.
> >
> >  2. (a) In preparation for "tlmm_test_silent_falling", the GPIO is switched
> >         to high state. The rising interrupt gets latched.
> 
> Is the interrupt unmasked here while the test is driving the GPIO line
> high and the interrupt trigger is IRQF_TRIGGER_RISING? If so, this is
> correct behavior.
> 
> Why wouldn't the trigger be set to IRQF_TRIGGER_FALLING, then the GPIO
> driven high, and then the GPIO driven low for the test to confirm
> falling edges work?
> 

So you're saying that the interrupt consumer needs to take into
consideration any previous interrupt handler being setup for this GPIO?

Test #1 request the interrupt as rising then releases the interrupt,
then before initiating test #2 the GPIO line is driven high, the
interrupt is requested FALLING and the test is that we don't get any
interrupts.

> Have you seen the big comment in msm_gpio_irq_mask() and how it says we
> want to latch edge interrupts even when the interrupt is masked?
> 

So if the bootloader (or hardware default?) configures an interrupt for
e.g. RISING, and sometime during boot there's a rising edge, then a
client driver should expect to get a spurious interrupt?

> >     (b) The GPIO is re-configured for IRQF_TRIGGER_FALLING, but the latched
> >         interrupt isn't cleared.
> >     (c) The IRQ handler is called for the latched interrupt, but there
> >         wasn't any falling edge.
> >
> >  3. (a) For "tlmm_test_silent_low", the GPIO remains in high state.
> >     (b) The GPIO is re-configured for IRQF_TRIGGER_LOW. This seems to
> >         result in a phantom interrupt that gets latched.
> >     (c) The IRQ handler is called for the latched interrupt, but the GPIO
> >         isn't in low state.
> 
> Is the test causing phantom behavior by writing to the interrupt
> hardware?
> 
> >
> >  4. (a) For "tlmm_test_silent_high", the GPIO is switched to low state.
> >     (b) This doesn't result in a latched interrupt, because RAW_STATUS_EN
> >         was cleared when masking the level-triggered interrupt.
> >
> > Fix this by clearing the interrupt state whenever making any changes to the
> > interrupt configuration. This includes previously disabled interrupts, but
> > also any changes to interrupt polarity or detection type.
> 
> How do we avoid the case where an interrupt happens to come in while the
> polarity is being changed? Won't we ignore such an interrupt now? If
> these are edge interrupts that's quite bad because we may never see the
> interrupt again.
> 

Are you referring to the "both edge"-detection dance we're doing in the
driver?

> I think we erred on the side of caution here and let extra edge
> interrupts through because a rising or falling edge usually means the
> interrupt handler just wants to run when there's some event and it will
> do the work to find out if it was spurious or not. It's been years
> though so I may have forgotten how this hardware works. It just makes me
> very nervous that we're going to miss edges now that we always clear the
> interrupt.

I'm not saying that you're wrong, or that your concerns are unwarranted.
I wrote some simple tests to validate the behavior I expected to see. I
didn't expect to see different results between wakeup (PDC) GPIOs and
non-wakeup GPIOs, and now both passes the test case as written.

That said, I'm not finding documentation on what the exact expectations
are for these corner cases. If the expected behavior differs from what I
put in the test cases, I'd be happy to see them updated - and the driver
adjusted accordingly (for both types of GPIOs). Getting these things
clarified was part of the reason for introducing the test cases.

It certainly sounds like we need some test case for the dual-edge case
though.

Regards,
Bjorn
Stephen Boyd March 20, 2025, 10:20 p.m. UTC | #5
Quoting Bjorn Andersson (2025-03-16 19:49:01)
> On Sat, Mar 15, 2025 at 12:07:14AM -0700, Stephen Boyd wrote:
> > Quoting Stephan Gerhold (2025-03-12 06:19:27)
> > >
> > > Whether to report interrupts that came in while the IRQ was unclaimed
> > > doesn't seem to be well-defined in the Linux IRQ API. However, looking
> > > closer at these specific cases, we're actually reporting events that do not
> > > match the interrupt type requested by the driver:
> > >
> > >  1. After "ok 1 tlmm_test_silent_rising", the GPIO is in low state and
> > >     configured for IRQF_TRIGGER_RISING.
> > >
> > >  2. (a) In preparation for "tlmm_test_silent_falling", the GPIO is switched
> > >         to high state. The rising interrupt gets latched.
> >
> > Is the interrupt unmasked here while the test is driving the GPIO line
> > high and the interrupt trigger is IRQF_TRIGGER_RISING? If so, this is
> > correct behavior.
> >
> > Why wouldn't the trigger be set to IRQF_TRIGGER_FALLING, then the GPIO
> > driven high, and then the GPIO driven low for the test to confirm
> > falling edges work?
> >
>
> So you're saying that the interrupt consumer needs to take into
> consideration any previous interrupt handler being setup for this GPIO?

No.

>
> Test #1 request the interrupt as rising then releases the interrupt,
> then before initiating test #2 the GPIO line is driven high, the
> interrupt is requested FALLING and the test is that we don't get any
> interrupts.

Ok. In this case maybe we should mask the irq in struct
irq_chip::irq_shutdown() and clear out any interrupt that comes in
because we touched the TLMM hardware. I don't see why we want to
continue to monitor the GPIO when the interrupt handler is removed.

>
> > Have you seen the big comment in msm_gpio_irq_mask() and how it says we
> > want to latch edge interrupts even when the interrupt is masked?
> >
>
> So if the bootloader (or hardware default?) configures an interrupt for
> e.g. RISING, and sometime during boot there's a rising edge, then a
> client driver should expect to get a spurious interrupt?

No? If there isn't an interrupt handler registered we shouldn't be
latching the interrupt.

>
> > >     (b) The GPIO is re-configured for IRQF_TRIGGER_FALLING, but the latched
> > >         interrupt isn't cleared.
> > >     (c) The IRQ handler is called for the latched interrupt, but there
> > >         wasn't any falling edge.
> > >
> > >  3. (a) For "tlmm_test_silent_low", the GPIO remains in high state.
> > >     (b) The GPIO is re-configured for IRQF_TRIGGER_LOW. This seems to
> > >         result in a phantom interrupt that gets latched.
> > >     (c) The IRQ handler is called for the latched interrupt, but the GPIO
> > >         isn't in low state.
> >
> > Is the test causing phantom behavior by writing to the interrupt
> > hardware?
> >
> > >
> > >  4. (a) For "tlmm_test_silent_high", the GPIO is switched to low state.
> > >     (b) This doesn't result in a latched interrupt, because RAW_STATUS_EN
> > >         was cleared when masking the level-triggered interrupt.
> > >
> > > Fix this by clearing the interrupt state whenever making any changes to the
> > > interrupt configuration. This includes previously disabled interrupts, but
> > > also any changes to interrupt polarity or detection type.
> >
> > How do we avoid the case where an interrupt happens to come in while the
> > polarity is being changed? Won't we ignore such an interrupt now? If
> > these are edge interrupts that's quite bad because we may never see the
> > interrupt again.
> >
>
> Are you referring to the "both edge"-detection dance we're doing in the
> driver?

No I'm thinking of gpio-keys driver where it changes the irq type during
suspend to get the wakeup event. I worry that the gpio-keys driver is
going to miss the edge now that there's a possibility the interrupt is
going to be ignored and we'll never wakeup. But maybe that isn't a
problem because PDC behavior works per your tests?

>
> > I think we erred on the side of caution here and let extra edge
> > interrupts through because a rising or falling edge usually means the
> > interrupt handler just wants to run when there's some event and it will
> > do the work to find out if it was spurious or not. It's been years
> > though so I may have forgotten how this hardware works. It just makes me
> > very nervous that we're going to miss edges now that we always clear the
> > interrupt.
>
> I'm not saying that you're wrong, or that your concerns are unwarranted.
> I wrote some simple tests to validate the behavior I expected to see. I
> didn't expect to see different results between wakeup (PDC) GPIOs and
> non-wakeup GPIOs, and now both passes the test case as written.
>

I suspect PDC based GPIOs are working better because PDC sits in between
the GIC and the TLMM hardware and we actually mask the interrupts in PDC
properly instead of letting the summary line toggle all the time. Thanks
for writing hardware tests in KUnit, it's nice.
diff mbox series

Patch

diff --git a/drivers/pinctrl/qcom/pinctrl-msm.c b/drivers/pinctrl/qcom/pinctrl-msm.c
index 47daa47153c970190b0d469dc8d245b3cbeace5e..82f0cc43bbf4f4d24f078af2d0a515d3a03b961a 100644
--- a/drivers/pinctrl/qcom/pinctrl-msm.c
+++ b/drivers/pinctrl/qcom/pinctrl-msm.c
@@ -1045,8 +1045,7 @@  static int msm_gpio_irq_set_type(struct irq_data *d, unsigned int type)
 	const struct msm_pingroup *g;
 	u32 intr_target_mask = GENMASK(2, 0);
 	unsigned long flags;
-	bool was_enabled;
-	u32 val;
+	u32 val, oldval;
 
 	if (msm_gpio_needs_dual_edge_parent_workaround(d, type)) {
 		set_bit(d->hwirq, pctrl->dual_edge_irqs);
@@ -1108,8 +1107,7 @@  static int msm_gpio_irq_set_type(struct irq_data *d, unsigned int type)
 	 * internal circuitry of TLMM, toggling the RAW_STATUS
 	 * could cause the INTR_STATUS to be set for EDGE interrupts.
 	 */
-	val = msm_readl_intr_cfg(pctrl, g);
-	was_enabled = val & BIT(g->intr_raw_status_bit);
+	val = oldval = msm_readl_intr_cfg(pctrl, g);
 	val |= BIT(g->intr_raw_status_bit);
 	if (g->intr_detection_width == 2) {
 		val &= ~(3 << g->intr_detection_bit);
@@ -1162,9 +1160,11 @@  static int msm_gpio_irq_set_type(struct irq_data *d, unsigned int type)
 	/*
 	 * The first time we set RAW_STATUS_EN it could trigger an interrupt.
 	 * Clear the interrupt.  This is safe because we have
-	 * IRQCHIP_SET_TYPE_MASKED.
+	 * IRQCHIP_SET_TYPE_MASKED. When changing the interrupt type, we could
+	 * also still have a non-matching interrupt latched, so clear whenever
+	 * making changes to the interrupt configuration.
 	 */
-	if (!was_enabled)
+	if (val != oldval)
 		msm_ack_intr_status(pctrl, g);
 
 	if (test_bit(d->hwirq, pctrl->dual_edge_irqs))