From: Michal Hocko <mhocko@suse.com>
To: Florian Westphal <fw@strlen.de>
Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org, vbabka@suse.cz,
akpm@linux-foundation.org, urezki@gmail.com,
netdev@vger.kernel.org, netfilter-devel@vger.kernel.org,
Martin Zaharinov <micron10@gmail.com>
Subject: Re: [PATCH mm] mm: fix BUG with kvzalloc+GFP_ATOMIC
Date: Mon, 26 Sep 2022 10:57:59 +0200
Message-ID: <YzFplwSxwwsLpzzX@dhcp22.suse.cz>
In-Reply-To: <20220926075639.GA908@breakpoint.cc>
On Mon 26-09-22 09:56:39, Florian Westphal wrote:
> Michal Hocko <mhocko@suse.com> wrote:
> > > kvzalloc(GFP_ATOMIC) was perfectly fine, is this illegal again?
> >
> > kvmalloc has never really supported GFP_ATOMIC semantic.
>
> It did, you added it:
> ce91f6ee5b3b ("mm: kvmalloc does not fallback to vmalloc for incompatible gfp flags")
Yes, I am very well aware of this commit and I have to say I wasn't
really super happy about it TBH. Linus argued that this would result in
saner code, and in some cases this was true.
Later on we really had to add support for some extensions beyond
GFP_KERNEL. Your change would break those GFP_NOFAIL and NOFS
use cases. GFP_NOWAIT and GFP_ATOMIC are explicitly documented as
unsupported. One thing we can do to continue in the spirit of
ce91f6ee5b3b is to do this instead:
diff --git a/mm/util.c b/mm/util.c
index 0837570c9225..a27b3fce1f0e 100644
--- a/mm/util.c
+++ b/mm/util.c
@@ -618,6 +618,10 @@ void *kvmalloc_node(size_t size, gfp_t flags, int node)
 	 */
 	if (ret || size <= PAGE_SIZE)
 		return ret;
+
+	/* non-sleeping allocations are not supported by vmalloc */
+	if (!gfpflags_allow_blocking(flags))
+		return NULL;
 
 	/* Don't even allow crazy sizes */
 	if (unlikely(size > INT_MAX)) {
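
To make the effect of that check concrete, here is a small userspace model of the control flow. It is only a sketch: every model_* and MODEL_* name is hypothetical, the flag values are made up, and malloc() stands in for both kmalloc and vmalloc. The one thing it tries to mirror is that a request which cannot block never reaches the fallback path.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical stand-ins for gfp bits; the numeric values are invented. */
typedef unsigned int model_gfp_t;
#define MODEL_DIRECT_RECLAIM 0x01u /* caller may sleep in direct reclaim */
#define MODEL_KSWAPD_RECLAIM 0x02u /* caller may wake background reclaim */
#define MODEL_IO             0x04u
#define MODEL_FS             0x08u
#define MODEL_HIGH           0x10u

#define MODEL_GFP_KERNEL (MODEL_DIRECT_RECLAIM | MODEL_KSWAPD_RECLAIM | \
			  MODEL_IO | MODEL_FS)
#define MODEL_GFP_ATOMIC (MODEL_HIGH | MODEL_KSWAPD_RECLAIM)
#define MODEL_GFP_NOWAIT (MODEL_KSWAPD_RECLAIM)

#define MODEL_PAGE_SIZE 4096u

/* Models gfpflags_allow_blocking(): true only if direct reclaim is allowed. */
static bool model_allow_blocking(model_gfp_t flags)
{
	return (flags & MODEL_DIRECT_RECLAIM) != 0;
}

/*
 * Models the kvmalloc_node() flow with the proposed check added.
 * kmalloc_fails lets the caller force the first attempt to fail;
 * *used_fallback reports whether the vmalloc-like path was taken.
 */
static void *model_kvmalloc(size_t size, model_gfp_t flags,
			    bool kmalloc_fails, bool *used_fallback)
{
	void *ret;

	assert(used_fallback != NULL);
	*used_fallback = false;

	/* First attempt: the kmalloc-like path. */
	ret = kmalloc_fails ? NULL : malloc(size);
	if (ret || size <= MODEL_PAGE_SIZE)
		return ret;

	/* Non-sleeping requests must not reach the vmalloc-like path. */
	if (!model_allow_blocking(flags))
		return NULL;

	*used_fallback = true;
	return malloc(size);
}
```

With these definitions, a large MODEL_GFP_ATOMIC request whose first attempt fails returns NULL without touching the fallback, while the same request with MODEL_GFP_KERNEL falls back as before.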
A better option, it seems to me, would be to rework
rhashtable_insert_rehash so that it does not rely on an atomic
allocation. I am not familiar with that code, but it seems to me that
the only reason this allocation mode is used is the rcu locking around
rhashtable_try_insert. Is there any reason we cannot drop the rcu lock,
allocate with the full GFP_KERNEL allocation power, and retry with the
preallocated object? rhashtable_insert_slow is already doing that to
implement its never-fail semantics.
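
The general shape of "drop the lock, allocate, retry with the preallocated object" could look roughly like the userspace model below. This is not the rhashtable code and makes no claim about how it is structured: all model_* names are hypothetical, a plain counter stands in for the rcu read-side section, and calloc() stands in for a GFP_KERNEL allocation.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical growable table; a stand-in, not the rhashtable layout. */
struct model_table {
	int *slots;
	size_t capacity;
	size_t used;
};

/* Non-zero while we are "inside" the modelled non-sleeping section. */
static int model_atomic_depth;

/* Stand-in for a sleeping allocation; must not run in the atomic section. */
static void *model_alloc_blocking(size_t nmemb, size_t size)
{
	assert(model_atomic_depth == 0);
	return calloc(nmemb, size);
}

/*
 * Runs inside the atomic section and never allocates. If the table is
 * full it consumes *spare (setting it to NULL), or returns -EAGAIN when
 * no suitable spare buffer was handed in.
 */
static int model_try_insert(struct model_table *t, int value,
			    int **spare, size_t spare_cap)
{
	if (t->used == t->capacity) {
		if (*spare == NULL || spare_cap <= t->capacity)
			return -EAGAIN;
		if (t->used)
			memcpy(*spare, t->slots, t->used * sizeof(int));
		free(t->slots);
		t->slots = *spare;
		t->capacity = spare_cap;
		*spare = NULL;
	}
	t->slots[t->used++] = value;
	return 0;
}

/* Insert that preallocates outside the atomic section and then retries. */
static int model_insert(struct model_table *t, int value)
{
	int *spare = NULL;
	size_t spare_cap = 0;
	int err;

	for (;;) {
		model_atomic_depth++;	/* models rcu_read_lock() */
		err = model_try_insert(t, value, &spare, spare_cap);
		model_atomic_depth--;	/* models rcu_read_unlock() */
		if (err != -EAGAIN)
			break;

		/* Outside the section a sleeping allocation is fine. */
		free(spare);
		spare_cap = t->capacity ? t->capacity * 2 : 4;
		spare = model_alloc_blocking(spare_cap, sizeof(int));
		if (!spare)
			return -ENOMEM;
	}

	free(spare);	/* spare was not needed after all */
	return err;
}
```

The retry loop rechecks the capacity under the modelled lock, so a spare buffer that has become too small in the meantime is simply discarded and reallocated rather than installed.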
--
Michal Hocko
SUSE Labs