From: Nicholas Piggin <npiggin@gmail.com>
To: Andrew Morton <akpm@linux-foundation.org>
Cc: linux-kernel@vger.kernel.org, linux-mm@kvack.org,
Linus Torvalds <torvalds@linux-foundation.org>
Subject: Re: [PATCH 1/2] mm/large system hash: use vmalloc for size > MAX_ORDER when !hashdist
Date: Thu, 06 Jun 2019 12:27:01 +1000 [thread overview]
Message-ID: <1559787457.x4yxr4e2tw.astroid@bobo.none> (raw)
In-Reply-To: <20190605142209.eb30cd883551a5bd81b09f00@linux-foundation.org>
Andrew Morton's on June 6, 2019 7:22 am:
> On Thu, 6 Jun 2019 00:48:13 +1000 Nicholas Piggin <npiggin@gmail.com> wrote:
>
>> The kernel currently clamps large system hashes to MAX_ORDER when
>> hashdist is not set, which is rather arbitrary.
>>
>> vmalloc space is limited on 32-bit machines, but this shouldn't
>> result in much more used because of small physical memory limiting
>> system hash sizes.
>>
>> Include "vmalloc" or "linear" in the kernel log message.
>>
>> Signed-off-by: Nicholas Piggin <npiggin@gmail.com>
>> ---
>>
>> This is a better solution than the previous one for the case of !NUMA
>> systems running on CONFIG_NUMA kernels, we can clear the default
>> hashdist early and have everything allocated out of the linear map.
>>
>> The hugepage vmap series I will post later, but it's quite
>> independent from this improvement.
>>
>> ...
>>
>> --- a/mm/page_alloc.c
>> +++ b/mm/page_alloc.c
>> @@ -7966,6 +7966,7 @@ void *__init alloc_large_system_hash(const char *tablename,
>> unsigned long log2qty, size;
>> void *table = NULL;
>> gfp_t gfp_flags;
>> + bool virt;
>>
>> /* allow the kernel cmdline to have a say */
>> if (!numentries) {
>> @@ -8022,6 +8023,7 @@ void *__init alloc_large_system_hash(const char *tablename,
>>
>> gfp_flags = (flags & HASH_ZERO) ? GFP_ATOMIC | __GFP_ZERO : GFP_ATOMIC;
>> do {
>> + virt = false;
>> size = bucketsize << log2qty;
>> if (flags & HASH_EARLY) {
>> if (flags & HASH_ZERO)
>> @@ -8029,26 +8031,26 @@ void *__init alloc_large_system_hash(const char *tablename,
>> else
>> table = memblock_alloc_raw(size,
>> SMP_CACHE_BYTES);
>> - } else if (hashdist) {
>> + } else if (get_order(size) >= MAX_ORDER || hashdist) {
>> table = __vmalloc(size, gfp_flags, PAGE_KERNEL);
>> + virt = true;
>> } else {
>> /*
>> * If bucketsize is not a power-of-two, we may free
>> * some pages at the end of hash table which
>> * alloc_pages_exact() automatically does
>> */
>> - if (get_order(size) < MAX_ORDER) {
>> - table = alloc_pages_exact(size, gfp_flags);
>> - kmemleak_alloc(table, size, 1, gfp_flags);
>> - }
>> + table = alloc_pages_exact(size, gfp_flags);
>> + kmemleak_alloc(table, size, 1, gfp_flags);
>> }
>> } while (!table && size > PAGE_SIZE && --log2qty);
>>
>> if (!table)
>> panic("Failed to allocate %s hash table\n", tablename);
>>
>> - pr_info("%s hash table entries: %ld (order: %d, %lu bytes)\n",
>> - tablename, 1UL << log2qty, ilog2(size) - PAGE_SHIFT, size);
>> + pr_info("%s hash table entries: %ld (order: %d, %lu bytes, %s)\n",
>> + tablename, 1UL << log2qty, ilog2(size) - PAGE_SHIFT, size,
>> + virt ? "vmalloc" : "linear");
>
> Could remove `bool virt' and use is_vmalloc_addr() in the printk?
>
It can run before mem_init() and it looks like some archs set
VMALLOC_START/END (high_memory) there (e.g., x86-32, ppc32).
Thanks,
Nick
prev parent reply other threads:[~2019-06-06 2:28 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-06-05 14:48 Nicholas Piggin
2019-06-05 14:48 ` [PATCH 2/2] mm/large system hash: clear hashdist when only one node with memory is booted Nicholas Piggin
2019-06-05 21:22 ` [PATCH 1/2] mm/large system hash: use vmalloc for size > MAX_ORDER when !hashdist Andrew Morton
2019-06-06 2:27 ` Nicholas Piggin [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1559787457.x4yxr4e2tw.astroid@bobo.none \
--to=npiggin@gmail.com \
--cc=akpm@linux-foundation.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=torvalds@linux-foundation.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox