From patchwork Wed Jun 12 04:22:44 2013 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: John Stultz X-Patchwork-Id: 17804 Return-Path: X-Original-To: linaro@patches.linaro.org Delivered-To: linaro@patches.linaro.org Received: from mail-ve0-f200.google.com (mail-ve0-f200.google.com [209.85.128.200]) by ip-10-151-82-157.ec2.internal (Postfix) with ESMTPS id 2584425DFB for ; Wed, 12 Jun 2013 04:23:18 +0000 (UTC) Received: by mail-ve0-f200.google.com with SMTP id m1sf9428533ves.3 for ; Tue, 11 Jun 2013 21:23:17 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:x-beenthere:x-forwarded-to:x-forwarded-for :delivered-to:from:to:cc:subject:date:message-id:x-mailer :in-reply-to:references:x-gm-message-state:x-original-sender :x-original-authentication-results:precedence:mailing-list:list-id :x-google-group-id:list-post:list-help:list-archive:list-unsubscribe; bh=VdP1ZEESbcWho4wMfoiijoh/HUEwTJ9Sr1bmRwz4GGc=; b=S6ZOKKNxlV0SSCwic6PX4GqXmmEgxpXwfolr8zGxNN6zTFg6419TcNH5/PMgH+3s5w TMyOqOQDEnjRyDotH6SB2GE2crSgHPQnT3wHCAN8chHmMv0dHw3Y9Ja4yRaMNo2x6R3p nTRsQDgZ1yfJAEWRJbpU3q1Ckeb6gRW09uo1rnDX8LHzSYpfA3a26vj+749HuOAn8oTG nAhVR9Z/bwXi93aNZCfriAVjDZ3FrEudXuf1TRKgIAZ8D1+6g+U510/feOPWXG7AVmFt FuA2Fi9MIoX+eKLx4vVlhcWwoUanveqSbcmpe/T7nY4WEQTt7W4rdh6XRk9SUMVoGETU sm4A== X-Received: by 10.224.129.196 with SMTP id p4mr1407871qas.6.1371010997759; Tue, 11 Jun 2013 21:23:17 -0700 (PDT) MIME-Version: 1.0 X-BeenThere: patchwork-forward@linaro.org Received: by 10.49.0.78 with SMTP id 14ls3712234qec.10.gmail; Tue, 11 Jun 2013 21:23:17 -0700 (PDT) X-Received: by 10.58.221.134 with SMTP id qe6mr9159060vec.2.1371010997449; Tue, 11 Jun 2013 21:23:17 -0700 (PDT) Received: from mail-vb0-x229.google.com (mail-vb0-x229.google.com [2607:f8b0:400c:c02::229]) by mx.google.com with ESMTPS id p8si8019615vdv.122.2013.06.11.21.23.17 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Tue, 11 Jun 2013 21:23:17 -0700 (PDT) Received-SPF: neutral (google.com: 2607:f8b0:400c:c02::229 is neither permitted nor denied by best guess record for domain of patch+caf_=patchwork-forward=linaro.org@linaro.org) client-ip=2607:f8b0:400c:c02::229; Received: by mail-vb0-f41.google.com with SMTP id p13so4551964vbe.28 for ; Tue, 11 Jun 2013 21:23:17 -0700 (PDT) X-Received: by 10.59.2.199 with SMTP id bq7mr8922487ved.51.1371010997272; Tue, 11 Jun 2013 21:23:17 -0700 (PDT) X-Forwarded-To: patchwork-forward@linaro.org X-Forwarded-For: patch@linaro.org patchwork-forward@linaro.org Delivered-To: patches@linaro.org Received: by 10.58.191.99 with SMTP id gx3csp130067vec; Tue, 11 Jun 2013 21:23:16 -0700 (PDT) X-Received: by 10.66.4.10 with SMTP id g10mr22303921pag.217.1371010995789; Tue, 11 Jun 2013 21:23:15 -0700 (PDT) Received: from mail-pa0-x236.google.com (mail-pa0-x236.google.com [2607:f8b0:400e:c03::236]) by mx.google.com with ESMTPS id wu4si8508679pbc.108.2013.06.11.21.23.15 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Tue, 11 Jun 2013 21:23:15 -0700 (PDT) Received-SPF: neutral (google.com: 2607:f8b0:400e:c03::236 is neither permitted nor denied by best guess record for domain of john.stultz@linaro.org) client-ip=2607:f8b0:400e:c03::236; Received: by mail-pa0-f54.google.com with SMTP id kx10so3888089pab.41 for ; Tue, 11 Jun 2013 21:23:15 -0700 (PDT) X-Received: by 10.66.118.104 with SMTP id kl8mr21445150pab.182.1371010995241; Tue, 11 Jun 2013 21:23:15 -0700 (PDT) Received: from localhost.localdomain (c-67-170-153-23.hsd1.or.comcast.net. [67.170.153.23]) by mx.google.com with ESMTPSA id xe9sm17439221pbc.21.2013.06.11.21.23.13 for (version=TLSv1.1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Tue, 11 Jun 2013 21:23:14 -0700 (PDT) From: John Stultz To: LKML Cc: Minchan Kim , Andrew Morton , Android Kernel Team , Robert Love , Mel Gorman , Hugh Dickins , Dave Hansen , Rik van Riel , Dmitry Adamushko , Dave Chinner , Neil Brown , Andrea Righi , Andrea Arcangeli , "Aneesh Kumar K.V" , Mike Hommey , Taras Glek , Dhaval Giani , Jan Kara , KOSAKI Motohiro , Michel Lespinasse , "linux-mm@kvack.org" , John Stultz Subject: [PATCH 1/8] vrange: Add basic data structure and functions Date: Tue, 11 Jun 2013 21:22:44 -0700 Message-Id: <1371010971-15647-2-git-send-email-john.stultz@linaro.org> X-Mailer: git-send-email 1.8.1.2 In-Reply-To: <1371010971-15647-1-git-send-email-john.stultz@linaro.org> References: <1371010971-15647-1-git-send-email-john.stultz@linaro.org> X-Gm-Message-State: ALoCoQnRwOYhfRVC2VIxZoVnRetPkDxtLW3ouXmO9rSir9VwE1zMEe+iZ80rVHOXVhn9aZirvUph X-Original-Sender: john.stultz@linaro.org X-Original-Authentication-Results: mx.google.com; spf=neutral (google.com: 2607:f8b0:400c:c02::229 is neither permitted nor denied by best guess record for domain of patch+caf_=patchwork-forward=linaro.org@linaro.org) smtp.mail=patch+caf_=patchwork-forward=linaro.org@linaro.org Precedence: list Mailing-list: list patchwork-forward@linaro.org; contact patchwork-forward+owners@linaro.org List-ID: X-Google-Group-Id: 836684582541 List-Post: , List-Help: , List-Archive: List-Unsubscribe: , From: Minchan Kim This patch adds vrange data structure(interval tree) and related functions. The vrange uses generic interval tree as main data structure because it handles address range so generic interval tree fits well for the purpose. The add_vrange/remove_vrange are core functions for system call will be introduced next patch. 1. add_vrange inserts new address range into interval tree. If new address range crosses over existing volatile range, existing volatile range will be expanded to cover new range. Then, if existing volatile range has purged state, new range will have a purged state. It's not good and we need more fine-grained purged state handling in a vrange(TODO) If new address range is inside existing range, we ignore it 2. remove_vrange removes address range Then, return a purged state of the address ranges. This patch copied some part from John Stultz's work but different semantic. Cc: Andrew Morton Cc: Android Kernel Team Cc: Robert Love Cc: Mel Gorman Cc: Hugh Dickins Cc: Dave Hansen Cc: Rik van Riel Cc: Dmitry Adamushko Cc: Dave Chinner Cc: Neil Brown Cc: Andrea Righi Cc: Andrea Arcangeli Cc: Aneesh Kumar K.V Cc: Mike Hommey Cc: Taras Glek Cc: Dhaval Giani Cc: Jan Kara Cc: KOSAKI Motohiro Cc: Michel Lespinasse Cc: Minchan Kim Cc: linux-mm@kvack.org Signed-off-by: Minchan Kim [jstultz: Heavy rework and cleanups to make this infrastructure more easily reused for both file and anonymous pages] Signed-off-by: John Stultz --- include/linux/vrange.h | 44 +++++++++++ include/linux/vrange_types.h | 19 +++++ init/main.c | 2 + lib/Makefile | 2 +- mm/Makefile | 2 +- mm/vrange.c | 181 +++++++++++++++++++++++++++++++++++++++++++ 6 files changed, 248 insertions(+), 2 deletions(-) create mode 100644 include/linux/vrange.h create mode 100644 include/linux/vrange_types.h create mode 100644 mm/vrange.c diff --git a/include/linux/vrange.h b/include/linux/vrange.h new file mode 100644 index 0000000..2064cb0 --- /dev/null +++ b/include/linux/vrange.h @@ -0,0 +1,44 @@ +#ifndef _LINUX_VRANGE_H +#define _LINUX_VRANGE_H + +#include +#include + +#define vrange_entry(ptr) \ + container_of(ptr, struct vrange, node.rb) + +#ifdef CONFIG_MMU + +static inline void vrange_root_init(struct vrange_root *vroot, int type) +{ + vroot->type = type; + vroot->v_rb = RB_ROOT; + mutex_init(&vroot->v_lock); +} + +static inline void vrange_lock(struct vrange_root *vroot) +{ + mutex_lock(&vroot->v_lock); +} + +static inline void vrange_unlock(struct vrange_root *vroot) +{ + mutex_unlock(&vroot->v_lock); +} + +static inline int vrange_type(struct vrange *vrange) +{ + return vrange->owner->type; +} + +void vrange_init(void); +extern void vrange_root_cleanup(struct vrange_root *vroot); + +#else + +static inline void vrange_init(void) {}; +static inline void vrange_root_init(struct vrange_root *vroot, int type) {}; +static inline void vrange_root_cleanup(struct vrange_root *vroot) {}; + +#endif +#endif /* _LINIUX_VRANGE_H */ diff --git a/include/linux/vrange_types.h b/include/linux/vrange_types.h new file mode 100644 index 0000000..7f44c01 --- /dev/null +++ b/include/linux/vrange_types.h @@ -0,0 +1,19 @@ +#ifndef _LINUX_VRANGE_TYPES_H +#define _LINUX_VRANGE_TYPES_H + +#include +#include + +struct vrange_root { + struct rb_root v_rb; /* vrange rb tree */ + struct mutex v_lock; /* Protect v_rb */ + enum {VRANGE_MM, VRANGE_FILE} type; /* range root type */ +}; + +struct vrange { + struct interval_tree_node node; + struct vrange_root *owner; + int purged; +}; +#endif + diff --git a/init/main.c b/init/main.c index 9484f4b..9cf08ba 100644 --- a/init/main.c +++ b/init/main.c @@ -74,6 +74,7 @@ #include #include #include +#include #include #include @@ -601,6 +602,7 @@ asmlinkage void __init start_kernel(void) calibrate_delay(); pidmap_init(); anon_vma_init(); + vrange_init(); #ifdef CONFIG_X86 if (efi_enabled(EFI_RUNTIME_SERVICES)) efi_enter_virtual_mode(); diff --git a/lib/Makefile b/lib/Makefile index c55a037..ccd15ff 100644 --- a/lib/Makefile +++ b/lib/Makefile @@ -13,7 +13,7 @@ lib-y := ctype.o string.o vsprintf.o cmdline.o \ sha1.o md5.o irq_regs.o reciprocal_div.o argv_split.o \ proportions.o flex_proportions.o prio_heap.o ratelimit.o show_mem.o \ is_single_threaded.o plist.o decompress.o kobject_uevent.o \ - earlycpio.o + earlycpio.o interval_tree.o obj-$(CONFIG_ARCH_HAS_DEBUG_STRICT_USER_COPY_CHECKS) += usercopy.o lib-$(CONFIG_MMU) += ioremap.o diff --git a/mm/Makefile b/mm/Makefile index 72c5acb..b67fcf5 100644 --- a/mm/Makefile +++ b/mm/Makefile @@ -5,7 +5,7 @@ mmu-y := nommu.o mmu-$(CONFIG_MMU) := fremap.o highmem.o madvise.o memory.o mincore.o \ mlock.o mmap.o mprotect.o mremap.o msync.o rmap.o \ - vmalloc.o pagewalk.o pgtable-generic.o + vmalloc.o pagewalk.o pgtable-generic.o vrange.o ifdef CONFIG_CROSS_MEMORY_ATTACH mmu-$(CONFIG_MMU) += process_vm_access.o diff --git a/mm/vrange.c b/mm/vrange.c new file mode 100644 index 0000000..e3042e0 --- /dev/null +++ b/mm/vrange.c @@ -0,0 +1,181 @@ +/* + * mm/vrange.c + */ + +#include +#include + +static struct kmem_cache *vrange_cachep; + +void __init vrange_init(void) +{ + vrange_cachep = KMEM_CACHE(vrange, SLAB_PANIC); +} + +static struct vrange *__vrange_alloc(gfp_t flags) +{ + struct vrange *vrange = kmem_cache_alloc(vrange_cachep, flags); + if (!vrange) + return vrange; + vrange->owner = NULL; + return vrange; +} + +static void __vrange_free(struct vrange *range) +{ + WARN_ON(range->owner); + kmem_cache_free(vrange_cachep, range); +} + +static void __vrange_add(struct vrange *range, struct vrange_root *vroot) +{ + range->owner = vroot; + interval_tree_insert(&range->node, &vroot->v_rb); +} + +static void __vrange_remove(struct vrange *range) +{ + interval_tree_remove(&range->node, &range->owner->v_rb); + range->owner = NULL; +} + +static inline void __vrange_set(struct vrange *range, + unsigned long start_idx, unsigned long end_idx, + bool purged) +{ + range->node.start = start_idx; + range->node.last = end_idx; + range->purged = purged; +} + +static inline void __vrange_resize(struct vrange *range, + unsigned long start_idx, unsigned long end_idx) +{ + struct vrange_root *vroot = range->owner; + bool purged = range->purged; + + __vrange_remove(range); + __vrange_set(range, start_idx, end_idx, purged); + __vrange_add(range, vroot); +} + +static int vrange_add(struct vrange_root *vroot, + unsigned long start_idx, unsigned long end_idx) +{ + struct vrange *new_range, *range; + struct interval_tree_node *node, *next; + int purged = 0; + + new_range = __vrange_alloc(GFP_KERNEL); + if (!new_range) + return -ENOMEM; + + vrange_lock(vroot); + + node = interval_tree_iter_first(&vroot->v_rb, start_idx, end_idx); + while (node) { + next = interval_tree_iter_next(node, start_idx, end_idx); + range = container_of(node, struct vrange, node); + /* old range covers new range fully */ + if (node->start <= start_idx && node->last >= end_idx) { + __vrange_free(new_range); + goto out; + } + + start_idx = min_t(unsigned long, start_idx, node->start); + end_idx = max_t(unsigned long, end_idx, node->last); + purged |= range->purged; + + __vrange_remove(range); + __vrange_free(range); + + node = next; + } + + __vrange_set(new_range, start_idx, end_idx, purged); + __vrange_add(new_range, vroot); +out: + vrange_unlock(vroot); + return 0; +} + +static int vrange_remove(struct vrange_root *vroot, + unsigned long start_idx, unsigned long end_idx, + int *purged) +{ + struct vrange *new_range, *range; + struct interval_tree_node *node, *next; + bool used_new = false; + + if (!purged) + return -EINVAL; + + *purged = 0; + + new_range = __vrange_alloc(GFP_KERNEL); + if (!new_range) + return -ENOMEM; + + vrange_lock(vroot); + + node = interval_tree_iter_first(&vroot->v_rb, start_idx, end_idx); + while (node) { + next = interval_tree_iter_next(node, start_idx, end_idx); + range = container_of(node, struct vrange, node); + + *purged |= range->purged; + + if (start_idx <= node->start && end_idx >= node->last) { + /* argumented range covers the range fully */ + __vrange_remove(range); + __vrange_free(range); + } else if (node->start >= start_idx) { + /* + * Argumented range covers over the left of the + * range + */ + __vrange_resize(range, end_idx + 1, node->last); + } else if (node->last <= end_idx) { + /* + * Argumented range covers over the right of the + * range + */ + __vrange_resize(range, node->start, start_idx - 1); + } else { + /* + * Argumented range is middle of the range + */ + used_new = true; + __vrange_resize(range, node->start, start_idx - 1); + __vrange_set(new_range, end_idx + 1, node->last, + range->purged); + __vrange_add(new_range, vroot); + break; + } + + node = next; + } + vrange_unlock(vroot); + + if (!used_new) + __vrange_free(new_range); + + return 0; +} + +void vrange_root_cleanup(struct vrange_root *vroot) +{ + struct vrange *range; + struct rb_node *next; + + vrange_lock(vroot); + next = rb_first(&vroot->v_rb); + while (next) { + range = vrange_entry(next); + next = rb_next(next); + __vrange_remove(range); + __vrange_free(range); + } + vrange_unlock(vroot); +} +