From patchwork Sun Mar 10 06:47:51 2019 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Huazhong Tan X-Patchwork-Id: 160023 Delivered-To: patch@linaro.org Received: by 2002:a02:5cc1:0:0:0:0:0 with SMTP id w62csp10588529jad; Sat, 9 Mar 2019 22:48:53 -0800 (PST) X-Google-Smtp-Source: APXvYqxmxfzUIM7pyTugtRP2b9GYCMrJ+CJlyoAfJMtihl7UshjEXe+jQVJsSimMZOOZTfIHJHCj X-Received: by 2002:a65:5c07:: with SMTP id u7mr24901575pgr.320.1552200533383; Sat, 09 Mar 2019 22:48:53 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1552200533; cv=none; d=google.com; s=arc-20160816; b=aYnPnaKfxQblOZE24fdWXNuavFNlKQN7hni9GOTb/su5fM1Poza+yx0j6qpj8LY8nu MlKUZrdYXtC8ytLfsIutEloeHS7dBCWD6uQCvwzLR11gpwv/gGENEwKKleql3YzQOHJ1 GuLjPKQL1eCCtud+OVAZzFkG0qdcpFYTOi4xGryVzyRDh/rBRyra2KWVe6PuUEv3S3iT /tjzMYlSBCi9SUvEoMDvCgKz8mm+cfteRSTmgJVWWPAU/q43BkzounltuNahYqLVG5Ai a5pbSCPkJBqLCKHROwQKzU9/nr8gv41S48UwqV30+SPKPiDYB87fYumm8T1ASNYhrh9Q edXw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:message-id:date:subject:cc :to:from; bh=+Fcr02tGKSG/2M1yFmN27gd1fIyzMjYNP/mVj0j1clM=; b=H7e1WHicW2lVMJ6c07m7g5xERofLQYDyuQAHgkrl1A2PxfsfPix1KvJeYhZsI+8EGj 2/75wxdWOhr8LkUNZIkWGGwrbtgTy8ZyxbkigS8DGtOATsW0FdU5HcbMZEeT/HK3q8vB iYFPIaT2A2OtkvyTi+3fDggrLQbFeZ5RAomdiNOBDarnw0IuY2SkrdbuCmw7UHFXpOb/ M7GPEOqrrYSz7PwGM/tSg90fgiaFmacQ7kiXuEeLmQ2b8oIzLy095DxuZrXXmBWKpXwg DVkRCjrPYNiW1Git1Xnk/TnF1mmTB2eHjC73ub9y19iBM0ufWXwfvUAWag6vX0VxJZuk /dfg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 75si2194621pgb.230.2019.03.09.22.48.53; Sat, 09 Mar 2019 22:48:53 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726280AbfCJGss (ORCPT + 31 others); Sun, 10 Mar 2019 01:48:48 -0500 Received: from szxga07-in.huawei.com ([45.249.212.35]:34440 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1725778AbfCJGss (ORCPT ); Sun, 10 Mar 2019 01:48:48 -0500 Received: from DGGEMS405-HUB.china.huawei.com (unknown [172.30.72.60]) by Forcepoint Email with ESMTP id 9AFA95C5316F36977C78; Sun, 10 Mar 2019 14:48:43 +0800 (CST) Received: from localhost.localdomain (10.67.212.132) by DGGEMS405-HUB.china.huawei.com (10.3.19.205) with Microsoft SMTP Server id 14.3.408.0; Sun, 10 Mar 2019 14:48:37 +0800 From: Huazhong Tan To: CC: , , , , , Shiju Jose , Huazhong Tan Subject: [PATCH net] net: hns3: fix to stop multiple HNS reset due to the AER changes Date: Sun, 10 Mar 2019 14:47:51 +0800 Message-ID: <1552200471-47492-1-git-send-email-tanhuazhong@huawei.com> X-Mailer: git-send-email 2.7.4 MIME-Version: 1.0 X-Originating-IP: [10.67.212.132] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Shiju Jose The commit bfcb79fca19d ("PCI/ERR: Run error recovery callbacks for all affected devices") affected the non-fatal error recovery logic for the HNS and RDMA devices. This is because each HNS PF under PCIe bus receive callbacks from the AER driver when an error is reported for one of the PF. This causes unwanted PF resets because the HNS decides which PF to reset based on the reset type set. The HNS error handling code sets the reset type based on the hw error type detected. This patch provides fix for the above issue for the recovery of the hw errors in the HNS and RDMA devices. This patch needs backporting to the kernel v5.0+ Fixes: 332fbf576579 ("net: hns3: add handling of hw ras errors using new set of commands") Reported-by: Xiaofei Tan Signed-off-by: Shiju Jose Signed-off-by: Huazhong Tan --- drivers/net/ethernet/hisilicon/hns3/hnae3.h | 1 + drivers/net/ethernet/hisilicon/hns3/hns3_enet.c | 4 +++- drivers/net/ethernet/hisilicon/hns3/hns3pf/hclge_err.c | 9 +++++++-- 3 files changed, 11 insertions(+), 3 deletions(-) -- 2.7.4 diff --git a/drivers/net/ethernet/hisilicon/hns3/hnae3.h b/drivers/net/ethernet/hisilicon/hns3/hnae3.h index 66d7a8b..38b430f 100644 --- a/drivers/net/ethernet/hisilicon/hns3/hnae3.h +++ b/drivers/net/ethernet/hisilicon/hns3/hnae3.h @@ -194,6 +194,7 @@ struct hnae3_ae_dev { const struct hnae3_ae_ops *ops; struct list_head node; u32 flag; + u8 override_pci_need_reset; /* fix to stop multiple reset happening */ enum hnae3_dev_type dev_type; enum hnae3_reset_type reset_type; void *priv; diff --git a/drivers/net/ethernet/hisilicon/hns3/hns3_enet.c b/drivers/net/ethernet/hisilicon/hns3/hns3_enet.c index 0d1ae15..1c1f17e 100644 --- a/drivers/net/ethernet/hisilicon/hns3/hns3_enet.c +++ b/drivers/net/ethernet/hisilicon/hns3/hns3_enet.c @@ -1850,7 +1850,9 @@ static pci_ers_result_t hns3_slot_reset(struct pci_dev *pdev) /* request the reset */ if (ae_dev->ops->reset_event) { - ae_dev->ops->reset_event(pdev, NULL); + if (!ae_dev->override_pci_need_reset) + ae_dev->ops->reset_event(pdev, NULL); + return PCI_ERS_RESULT_RECOVERED; } diff --git a/drivers/net/ethernet/hisilicon/hns3/hns3pf/hclge_err.c b/drivers/net/ethernet/hisilicon/hns3/hns3pf/hclge_err.c index 1feceff..1f52d11 100644 --- a/drivers/net/ethernet/hisilicon/hns3/hns3pf/hclge_err.c +++ b/drivers/net/ethernet/hisilicon/hns3/hns3pf/hclge_err.c @@ -1317,8 +1317,10 @@ pci_ers_result_t hclge_handle_hw_ras_error(struct hnae3_ae_dev *ae_dev) hclge_handle_all_ras_errors(hdev); } else { if (test_bit(HCLGE_STATE_RST_HANDLING, &hdev->state) || - hdev->pdev->revision < 0x21) + hdev->pdev->revision < 0x21) { + ae_dev->override_pci_need_reset = 1; return PCI_ERS_RESULT_RECOVERED; + } } if (status & HCLGE_RAS_REG_ROCEE_ERR_MASK) { @@ -1327,8 +1329,11 @@ pci_ers_result_t hclge_handle_hw_ras_error(struct hnae3_ae_dev *ae_dev) } if (status & HCLGE_RAS_REG_NFE_MASK || - status & HCLGE_RAS_REG_ROCEE_ERR_MASK) + status & HCLGE_RAS_REG_ROCEE_ERR_MASK) { + ae_dev->override_pci_need_reset = 0; return PCI_ERS_RESULT_NEED_RESET; + } + ae_dev->override_pci_need_reset = 1; return PCI_ERS_RESULT_RECOVERED; }