From patchwork Mon Jun 26 13:38:45 2017 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: "Leizhen \(ThunderTown\)" X-Patchwork-Id: 106331 Delivered-To: patch@linaro.org Received: by 10.182.135.102 with SMTP id pr6csp1323693obb; Mon, 26 Jun 2017 06:41:39 -0700 (PDT) X-Received: by 10.84.224.206 with SMTP id k14mr306494pln.72.1498484405119; Mon, 26 Jun 2017 06:40:05 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1498484405; cv=none; d=google.com; s=arc-20160816; b=hgVWfjdkMV8mW8FIROpAxpDbCKGzF97jqKstJl+aM0pndTvpQWiB3m81EC9MhjSPXZ tKyPkwS8lSmPkPk/T0GffIzjyCsE5TXblfQS3AtWZvna5GAZCfDOl1oIC90UNWRneLoX v27rEOzy0R+76yNxQ+6PtT2ieQXzjug+yPEyBXa4iZ5qvQ7/V9vjL3Ey/X2ZMqefED9d ul4YJXDgkjk1cM6xBk+V9sW14gswjRX8f0aHlmTjRd9o0n2HAK3PMKnx794M46BNPJvC 9ZWyBIpokt8UGpYfv12mEImcYnJPk/MKOUWMDnNj3/G6vf4YW/JgSjl2ZNYymqQv41a8 Zk2w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:message-id:date:subject:cc :to:from:arc-authentication-results; bh=eDfWoXl1ZwtYIXbIJ+rqcGOeCUce+GAuOj44CP95k4A=; b=fdnIjSs7CLJRdd0vtxJ7tPqxJkvNQq97985EKx4OMYRoqyQDbBvS5L6fcOVAAAlDeo zVuWKG6Un0AiJCi8/e3DU2um5FqnMwyvQLpO7Dm06MGmHeupWUwqzbDgVh0aQQXEPK3N V+YvA7/es/iflSAWnqZlcrialCQjgTxsDCZW+Q5l/4yqM1J6yiuYPocvBmL+OHYpebqI fpSKxb4LTO7rs/S0WmgT4+t+hPiAnYOKto8JnqZW2K8hOnZuddP9i73MR/zGfDwkGfg7 KNKdxPbhRJ09BX2zwsB8X6GYNtILTD4E+Sg4thzsxL/bGHI94QSbVi9xC3ChuJkuE7nm y7XA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id h24si80902pfk.157.2017.06.26.06.40.04; Mon, 26 Jun 2017 06:40:05 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752532AbdFZNj4 (ORCPT + 25 others); Mon, 26 Jun 2017 09:39:56 -0400 Received: from szxga02-in.huawei.com ([45.249.212.188]:8857 "EHLO szxga02-in.huawei.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752165AbdFZNj2 (ORCPT ); Mon, 26 Jun 2017 09:39:28 -0400 Received: from 172.30.72.53 (EHLO DGGEML401-HUB.china.huawei.com) ([172.30.72.53]) by dggrg02-dlp.huawei.com (MOS 4.4.6-GA FastPath queued) with ESMTP id AQA47789; Mon, 26 Jun 2017 21:39:25 +0800 (CST) Received: from localhost (10.177.23.164) by DGGEML401-HUB.china.huawei.com (10.3.17.32) with Microsoft SMTP Server id 14.3.301.0; Mon, 26 Jun 2017 21:39:16 +0800 From: Zhen Lei To: Will Deacon , Joerg Roedel , linux-arm-kernel , iommu , Robin Murphy , linux-kernel CC: Zefan Li , Xinwei Hu , "Tianhong Ding" , Hanjun Guo , Zhen Lei , John Garry Subject: [PATCH 0/5] arm-smmu: performance optimization Date: Mon, 26 Jun 2017 21:38:45 +0800 Message-ID: <1498484330-10840-1-git-send-email-thunder.leizhen@huawei.com> X-Mailer: git-send-email 1.9.5.msysgit.0 MIME-Version: 1.0 X-Originating-IP: [10.177.23.164] X-CFilter-Loop: Reflected X-Mirapoint-Virus-RAPID-Raw: score=unknown(0), refid=str=0001.0A020204.59510E8D.01A5, ss=1, re=0.000, recu=0.000, reip=0.000, cl=1, cld=1, fgs=0, ip=0.0.0.0, so=2014-11-16 11:51:01, dmn=2013-03-21 17:37:32 X-Mirapoint-Loop-Id: ed632fea4c77095e8867deb8f74b4e94 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org I described the optimization more detail in patch 1 and 2, and patch 3-5 are the implementation on arm-smmu/arm-smmu-v3 of patch 2. Patch 1 is v2. In v1, I directly replaced writel with writel_relaxed in queue_inc_prod. But Robin figured that it may lead SMMU consume stale memory contents. I thought more than 3 whole days and got this one. This patchset is based on Robin Murphy's [PATCH v2 0/8] io-pgtable lock removal. Zhen Lei (5): iommu/arm-smmu-v3: put off the execution of TLBI* to reduce lock confliction iommu: add a new member unmap_tlb_sync into struct iommu_ops iommu/arm-smmu-v3: add support for unmap an iova range with only one tlb sync iommu/arm-smmu: add support for unmap a memory range with only one tlb sync iommu/io-pgtable: delete member tlb_sync_pending of struct io_pgtable drivers/iommu/arm-smmu-v3.c | 52 ++++++++++++++++++++++++++++++++++---- drivers/iommu/arm-smmu.c | 10 ++++++++ drivers/iommu/io-pgtable-arm-v7s.c | 32 +++++++++++++++-------- drivers/iommu/io-pgtable-arm.c | 30 ++++++++++++++-------- drivers/iommu/io-pgtable.h | 9 ++----- drivers/iommu/iommu.c | 3 +++ include/linux/iommu.h | 1 + 7 files changed, 104 insertions(+), 33 deletions(-) -- 2.5.0