From: Lee Schermerhorn <Lee.Schermerhorn@hp.com>
To: Randy Dunlap <randy.dunlap@oracle.com>
Cc: linux-mm@kvack.org, linux-numa@vger.kernel.org,
akpm@linux-foundation.org, Mel Gorman <mel@csn.ul.ie>,
Nishanth Aravamudan <nacc@us.ibm.com>,
David Rientjes <rientjes@google.com>, Adam Litke <agl@us.ibm.com>,
Andy Whitcroft <apw@canonical.com>,
eric.whitney@hp.com
Subject: Re: [PATCH 7/10] hugetlb: update hugetlb documentation for NUMA controls.
Date: Fri, 02 Oct 2009 07:43:57 -0400 [thread overview]
Message-ID: <1254483837.7951.30.camel@useless.americas.hpqcorp.net> (raw)
In-Reply-To: <20091001124742.cb6ca371.randy.dunlap@oracle.com>
On Thu, 2009-10-01 at 12:47 -0700, Randy Dunlap wrote:
> On Thu, 01 Oct 2009 12:58:51 -0400 Lee Schermerhorn wrote:
>
> > [PATCH 7/10] hugetlb: update hugetlb documentation for NUMA controls
> >
> > Against: 2.6.31-mmotm-090925-1435
> >
> >
> > This patch updates the kernel huge tlb documentation to describe the
> > numa memory policy based huge page management. Additionaly, the patch
> > includes a fair amount of rework to improve consistency, eliminate
> > duplication and set the context for documenting the memory policy
> > interaction.
> >
> > Signed-off-by: Lee Schermerhorn <lee.schermerhorn@hp.com>
> > Acked-by: David Rientjes <rientjes@google.com>
> > Acked-by: Mel Gorman <mel@csn.ul.ie>
> >
> > Documentation/vm/hugetlbpage.txt | 267 ++++++++++++++++++++++++++-------------
> > 1 file changed, 179 insertions(+), 88 deletions(-)
> >
> > Index: linux-2.6.31-mmotm-090925-1435/Documentation/vm/hugetlbpage.txt
> > ===================================================================
> > --- linux-2.6.31-mmotm-090925-1435.orig/Documentation/vm/hugetlbpage.txt 2009-09-30 15:04:40.000000000 -0400
> > +++ linux-2.6.31-mmotm-090925-1435/Documentation/vm/hugetlbpage.txt 2009-09-30 15:05:22.000000000 -0400
> > @@ -159,6 +163,101 @@ Inside each of these directories, the sa
> >
> > which function as described above for the default huge page-sized case.
> >
> > +
> > +Interaction of Task Memory Policy with Huge Page Allocation/Freeing:
>
> Preferable not to end section "title" with a colon.
Thanks for the quick review, Randy. I'll fix these in an incremental or
respun patch.
Lee
>
> > +
> > +Whether huge pages are allocated and freed via the /proc interface or
> > +the /sysfs interface using the nr_hugepages_mempolicy attribute, the NUMA
> > +nodes from which huge pages are allocated or freed are controlled by the
> > +NUMA memory policy of the task that modifies the nr_hugepages_mempolicy
> > +sysctl or attribute. When the nr_hugepages attribute is used, mempolicy
> > +is ignored
>
> ignored.
>
> > +
> > +The recommended method to allocate or free huge pages to/from the kernel
> > +huge page pool, using the nr_hugepages example above, is:
> > +
> > + numactl --interleave <node-list> echo 20 \
> > + >/proc/sys/vm/nr_hugepages_mempolicy
> > +
> > +or, more succinctly:
> > +
> > + numactl -m <node-list> echo 20 >/proc/sys/vm/nr_hugepages_mempolicy
> > +
> > +This will allocate or free abs(20 - nr_hugepages) to or from the nodes
> > +specified in <node-list>, depending on whether number of persistent huge pages
> > +is initially less than or greater than 20, respectively. No huge pages will be
> > +allocated nor freed on any node not included in the specified <node-list>.
> > +
> > +When adjusting the persistent hugepage count via nr_hugepages_mempolicy, any
> > +memory policy mode--bind, preferred, local or interleave--may be used. The
> > +resulting effect on persistent huge page allocation is as follows:
> > +
> ...
> > +
> > +Per Node Hugepages Attributes
> > +
> > +A subset of the contents of the root huge page control directory in sysfs,
> > +described above, has been replicated under each "node" system device in:
> > +
> > + /sys/devices/system/node/node[0-9]*/hugepages/
> > +
> > +Under this directory, the subdirectory for each supported huge page size
> > +contains the following attribute files:
> > +
> > + nr_hugepages
> > + free_hugepages
> > + surplus_hugepages
> > +
> > +The free_' and surplus_' attribute files are read-only. They return the number
> > +of free and surplus [overcommitted] huge pages, respectively, on the parent
> > +node.
> > +
> > +The nr_hugepages attribute will return the total number of huge pages on the
>
> s/will return/returns/ [just a preference]
>
> > +specified node. When this attribute is written, the number of persistent huge
> > +pages on the parent node will be adjusted to the specified value, if sufficient
> > +resources exist, regardless of the task's mempolicy or cpuset constraints.
> > +
> > +Note that the number of overcommit and reserve pages remain global quantities,
> > +as we don't know until fault time, when the faulting task's mempolicy is
> > +applied, from which node the huge page allocation will be attempted.
> > +
> > +
> > +Using Huge Pages:
>
> Drop ':'.
>
> > +
> > If the user applications are going to request huge pages using mmap system
> > call, then it is required that system administrator mount a file system of
> > type hugetlbfs:
>
>
> ---
> ~Randy
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2009-10-02 11:30 UTC|newest]
Thread overview: 30+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-10-01 16:57 [PATCH 0/10] hugetlb: V8 numa control of persistent huge pages alloc/free Lee Schermerhorn
2009-10-01 16:57 ` [PATCH 1/10] hugetlb: rework hstate_next_node_* functions Lee Schermerhorn
2009-10-01 16:57 ` [PATCH 2/10] hugetlb: add nodemask arg to huge page alloc, free and surplus adjust fcns Lee Schermerhorn
2009-10-01 16:58 ` [PATCH 3/10] hugetlb: factor init_nodemask_of_node Lee Schermerhorn
2009-10-02 9:48 ` Mel Gorman
2009-10-02 11:38 ` Lee Schermerhorn
2009-10-03 17:25 ` Andrew Morton
2009-10-01 16:58 ` [PATCH 4/10] hugetlb: derive huge pages nodes allowed from task mempolicy Lee Schermerhorn
2009-10-02 10:11 ` Mel Gorman
2009-10-02 22:41 ` David Rientjes
2009-10-02 22:16 ` [patch] nodemask: make NODEMASK_ALLOC more general David Rientjes
2009-10-02 22:48 ` Christoph Lameter
2009-10-03 1:03 ` KAMEZAWA Hiroyuki
2009-10-06 3:46 ` David Rientjes
2009-10-03 0:59 ` KAMEZAWA Hiroyuki
2009-10-02 22:16 ` [PATCH 4/10] hugetlb: derive huge pages nodes allowed from task mempolicy David Rientjes
2009-10-02 23:23 ` Christoph Lameter
2009-10-05 11:15 ` Lee Schermerhorn
2009-10-05 20:58 ` David Rientjes
2009-10-06 2:54 ` Lee Schermerhorn
2009-10-06 3:33 ` David Rientjes
2009-10-01 16:58 ` [PATCH 5/10] hugetlb: add generic definition of NUMA_NO_NODE Lee Schermerhorn
2009-10-01 16:58 ` [PATCH 6/10] hugetlb: add per node hstate attributes Lee Schermerhorn
2009-10-01 16:58 ` [PATCH 7/10] hugetlb: update hugetlb documentation for NUMA controls Lee Schermerhorn
2009-10-01 19:47 ` Randy Dunlap
2009-10-02 11:43 ` Lee Schermerhorn [this message]
2009-10-01 16:58 ` [PATCH 8/10] hugetlb: use only nodes with memory for huge pages Lee Schermerhorn
2009-10-01 16:59 ` [PATCH 9/10] hugetlb: handle memory hot-plug events Lee Schermerhorn
2009-10-01 16:59 ` [PATCH 10/10] hugetlb: offload per node attribute registrations Lee Schermerhorn
2010-03-31 11:23 ` [APPLIED] [PATCH 0/10] hugetlb: V8 numa control of persistent huge pages alloc/free Andy Whitcroft
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1254483837.7951.30.camel@useless.americas.hpqcorp.net \
--to=lee.schermerhorn@hp.com \
--cc=agl@us.ibm.com \
--cc=akpm@linux-foundation.org \
--cc=apw@canonical.com \
--cc=eric.whitney@hp.com \
--cc=linux-mm@kvack.org \
--cc=linux-numa@vger.kernel.org \
--cc=mel@csn.ul.ie \
--cc=nacc@us.ibm.com \
--cc=randy.dunlap@oracle.com \
--cc=rientjes@google.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox