From patchwork Mon May 15 03:34:48 2017
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: Jianbo Liu
X-Patchwork-Id: 99760
From: Jianbo Liu
To: dev@dpdk.org, tomasz.kantecki@intel.com, jerin.jacob@caviumnetworks.com, ashwin.sekhar@caviumnetworks.com
Cc: Jianbo Liu
Date: Mon, 15 May 2017 11:34:48 +0800
Message-Id: <1494819296-23049-1-git-send-email-jianbo.liu@linaro.org>
X-Mailer: git-send-email 1.8.3.1
In-Reply-To: <1493709255-8887-1-git-send-email-jianbo.liu@linaro.org>
References: <1493709255-8887-1-git-send-email-jianbo.liu@linaro.org>
Subject: [dpdk-dev] [PATCH v4 0/8] accelerate examples/l3fwd with NEON on ARM64 platform
List-Id: DPDK patches and discussions

v4:
  - add vcopyq_laneq_u32 for older versions of gcc

v3:
  - remove unnecessary prefetch for rte_mbuf
  - fix typo in git log
  - Ashwin's suggestions for performance on ThunderX

v2:
  - change name of l3fwd_em_sse.h to l3fwd_em_sequential.h
  - add the times of hash multi-lookup for different Archs
  - performance tuning on ThunderX: prefetching, set NO_HASH_LOOKUP_MULTI ...

Jianbo Liu (8):
  examples/l3fwd: extract arch independent code from multi hash lookup
  examples/l3fwd: rename l3fwd_em_sse.h to l3fwd_em_sequential.h
  examples/l3fwd: extract common code from multi packet send
  examples/l3fwd: rearrange the code for lpm_l3fwd
  arch/arm: add vcopyq_laneq_u32 for old version of gcc
  examples/l3fwd: add neon support for l3fwd
  examples/l3fwd: add the times of hash multi-lookup for different Archs
  examples/l3fwd: change the guard macro name for header file

 examples/l3fwd/l3fwd_common.h                     | 293 +++++++++++++++++++++
 examples/l3fwd/l3fwd_em.c                         |   8 +-
 examples/l3fwd/l3fwd_em_hlm.h                     | 218 +++++++++++++++
 examples/l3fwd/l3fwd_em_hlm_neon.h                |  74 ++++++
 examples/l3fwd/l3fwd_em_hlm_sse.h                 | 280 +-------------------
 .../{l3fwd_em_sse.h => l3fwd_em_sequential.h}     |  24 +-
 examples/l3fwd/l3fwd_lpm.c                        |  87 +++++-
 examples/l3fwd/l3fwd_lpm.h                        |  26 +-
 examples/l3fwd/l3fwd_lpm_neon.h                   | 193 ++++++++++++++
 examples/l3fwd/l3fwd_lpm_sse.h                    |  66 -----
 examples/l3fwd/l3fwd_neon.h                       | 259 ++++++++++++++++++
 examples/l3fwd/l3fwd_sse.h                        | 255 +-----------------
 lib/librte_eal/common/include/arch/arm/rte_vect.h |   9 +
 13 files changed, 1166 insertions(+), 626 deletions(-)
 create mode 100644 examples/l3fwd/l3fwd_common.h
 create mode 100644 examples/l3fwd/l3fwd_em_hlm.h
 create mode 100644 examples/l3fwd/l3fwd_em_hlm_neon.h
 rename examples/l3fwd/{l3fwd_em_sse.h => l3fwd_em_sequential.h} (88%)
 create mode 100644 examples/l3fwd/l3fwd_lpm_neon.h
 create mode 100644 examples/l3fwd/l3fwd_neon.h

-- 
1.8.3.1
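
Note on the v4 item: vcopyq_laneq_u32(a, lane_a, b, lane_b) is the NEON intrinsic that returns a copy of vector a with one 32-bit lane replaced by a chosen lane of vector b. The sketch below is only a portable scalar model of that behaviour, so it can be read and run without an ARM toolchain; it is not the code from patch 5/8. The type model_u32x4 and the function model_vcopyq_laneq_u32 are hypothetical names introduced here for illustration. On real hardware, one possible fallback (an assumption, not necessarily what the patch does) would combine the existing intrinsics vgetq_lane_u32 and vsetq_lane_u32 in the same way.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-in for NEON's uint32x4_t: four 32-bit lanes. */
typedef struct {
	uint32_t lane[4];
} model_u32x4;

/*
 * Scalar model of vcopyq_laneq_u32(a, lane_a, b, lane_b):
 * return a copy of 'a' in which lane 'lane_a' has been replaced
 * by lane 'lane_b' of 'b'. All other lanes of 'a' are unchanged.
 * The name and the struct type are illustrative assumptions.
 */
static inline model_u32x4
model_vcopyq_laneq_u32(model_u32x4 a, int lane_a, model_u32x4 b, int lane_b)
{
	/* A 128-bit vector of 32-bit elements has exactly four lanes. */
	assert(lane_a >= 0 && lane_a < 4);
	assert(lane_b >= 0 && lane_b < 4);

	/* 'a' was passed by value, so the caller's vector is not modified. */
	a.lane[lane_a] = b.lane[lane_b];
	return a;
}
```

In this model, copying lane 3 of b into lane 1 of a leaves lanes 0, 2 and 3 of a untouched, which is the property a compatibility wrapper for older compilers needs to preserve.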