From: Hugh Dickins <hugh@veritas.com>
To: Andrew Morton <akpm@linux-foundation.org>
Cc: mel@csn.ul.ie, andi@firstfloor.org, davem@davemloft.net,
netdev@vger.kernel.org, linux-kernel@vger.kernel.org,
linux-mm@kvack.org
Subject: Re: [PATCH mmotm] mm: alloc_large_system_hash check order
Date: Fri, 1 May 2009 14:40:38 +0100 (BST) [thread overview]
Message-ID: <Pine.LNX.4.64.0905011354560.19012@blonde.anvils> (raw)
In-Reply-To: <20090429142825.6dcf233d.akpm@linux-foundation.org>
On Wed, 29 Apr 2009, Andrew Morton wrote:
>
> yes, the code is a bit odd:
>
> : do {
> : size = bucketsize << log2qty;
> : if (flags & HASH_EARLY)
> : table = alloc_bootmem_nopanic(size);
> : else if (hashdist)
> : table = __vmalloc(size, GFP_ATOMIC, PAGE_KERNEL);
> : else {
> : unsigned long order = get_order(size);
> : table = (void*) __get_free_pages(GFP_ATOMIC, order);
> : /*
> : * If bucketsize is not a power-of-two, we may free
> : * some pages at the end of hash table.
> : */
> : if (table) {
> : unsigned long alloc_end = (unsigned long)table +
> : (PAGE_SIZE << order);
> : unsigned long used = (unsigned long)table +
> : PAGE_ALIGN(size);
> : split_page(virt_to_page(table), order);
> : while (used < alloc_end) {
> : free_page(used);
> : used += PAGE_SIZE;
> : }
> : }
> : }
> : } while (!table && size > PAGE_SIZE && --log2qty);
>
> In the case where it does the __vmalloc(), the order-11 allocation will
> succeed. But in the other cases, the allocation attempt will need to
> be shrunk and we end up with a smaller hash table. Is that sensible?
It is a little odd, but the __vmalloc() route is used by default on
64-bit with CONFIG_NUMA, and this route otherwise. (The hashdist
Doc isn't up-to-date on that, I'll send a patch.)
>
> If we want to regularise all three cases, doing
>
> size = min(size, MAX_ORDER);
If I take you literally, the resulting hash tables are going to
be rather small ;) but I know what you mean.
>
> before starting the loop would be suitable, although the huge
> __get_free_pages() might still fail.
Oh, I don't feel a great urge to regularize these cases in such
a way. I particularly don't feel like limiting 64-bit NUMA to
MAX_ORDER-1 size, if netdev have been happy with more until now.
Could consider a __vmalloc fallback when order is too large,
but let's not do so unless someone actually needs that.
> (But it will then warn, won't it?
> And nobody is reporting that).
Well, it was hard to report it while mmotm's WARN_ON_ONCE was itself
oopsing. With that fixed, I've reported it on x86_64 with 4GB
(without CONFIG_NUMA).
>
> I was a bit iffy about adding the warning in the first place, let it go
> through due to its potential to lead us to code which isn't doing what
> it thinks it's doing, or is being generally peculiar.
DaveM has confirmed that the code is doing what they want it to do.
So I think mmotm wants this patch (for alloc_large_system_hash to
keep away from that warning), plus Mel's improvement on top of it.
Hugh
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2009-05-01 13:40 UTC|newest]
Thread overview: 18+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-04-29 21:09 Hugh Dickins
2009-04-29 21:28 ` Andrew Morton
2009-05-01 13:40 ` Hugh Dickins [this message]
2009-05-01 13:45 ` [PATCH 2.6.30] Doc: hashdist defaults on for 64bit Hugh Dickins
2009-05-01 14:29 ` Mel Gorman
2009-05-01 17:20 ` David Miller
2009-04-30 0:25 ` [PATCH mmotm] mm: alloc_large_system_hash check order David Miller
2009-04-30 13:25 ` Mel Gorman
2009-05-01 11:30 ` Hugh Dickins
2009-05-01 11:46 ` Eric Dumazet
2009-05-01 12:05 ` Hugh Dickins
2009-05-01 14:00 ` Mel Gorman
2009-05-01 13:59 ` Christoph Lameter
2009-05-01 15:09 ` Mel Gorman
2009-05-01 15:14 ` Christoph Lameter
2009-05-01 14:12 ` Mel Gorman
2009-05-01 14:28 ` Hugh Dickins
2009-05-01 14:43 ` Mel Gorman
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=Pine.LNX.4.64.0905011354560.19012@blonde.anvils \
--to=hugh@veritas.com \
--cc=akpm@linux-foundation.org \
--cc=andi@firstfloor.org \
--cc=davem@davemloft.net \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mel@csn.ul.ie \
--cc=netdev@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox