From: Heiko Carstens <heiko.carstens@de.ibm.com>
To: Andrew Morton <akpm@linux-foundation.org>
Cc: linux-mm@kvack.org, dm-devel@redhat.com,
Daniel Kobras <kobras@linux.de>,
Alasdair G Kergon <agk@redhat.com>,
Stefan Weinhuber <wein@de.ibm.com>,
Stefan Bader <shbader@de.ibm.com>,
Linus Torvalds <torvalds@linux-foundation.org>
Subject: Re: [PATCH] dm: Fix deadlock under high i/o load in raid1 setup.
Date: Thu, 16 Aug 2007 01:59:56 +0200 [thread overview]
Message-ID: <20070815235956.GD8741@osiris.ibm.com> (raw)
In-Reply-To: <20070815155604.87318305.akpm@linux-foundation.org>
On Wed, Aug 15, 2007 at 03:56:04PM -0700, Andrew Morton wrote:
> On Mon, 13 Aug 2007 13:33:40 +0200
> Heiko Carstens <heiko.carstens@de.ibm.com> wrote:
> > the patch below went into 2.6.18. Now my question is: why doesn't it check
> > if kmalloc(..., GFP_NOIO) returns with a NULL pointer?
> > Did I miss anything that guarentees that this will always succeed or is it
> > just a bug?
> > --- a/drivers/md/dm-raid1.c
> > +++ b/drivers/md/dm-raid1.c
> > @@ -255,7 +255,9 @@ static struct region *__rh_alloc(struct region_hash *rh, region_t region)
> > struct region *reg, *nreg;
> >
> > read_unlock(&rh->hash_lock);
> > - nreg = mempool_alloc(rh->region_pool, GFP_NOIO);
> > + nreg = mempool_alloc(rh->region_pool, GFP_ATOMIC);
> > + if (unlikely(!nreg))
> > + nreg = kmalloc(sizeof(struct region), GFP_NOIO);
> > nreg->state = rh->log->type->in_sync(rh->log, region, 1) ?
> > RH_CLEAN : RH_NOSYNC;
> > nreg->rh = rh;
> >
>
> Yeah, that's a bug.
>
> kmalloc(small_amount, GFP_NOIO) can fail if the calling process gets
> oom-killed, and it can fail if the system is using fault-injection.
>
> One could say "don't use fault injection" and, perhaps, "this is only
> ever called by a kernel thread and kernel threads don't get oom-killed".
> But the former is lame and the latter assumes current implementation
> details which could change (and indeed have in the past).
Thanks for clarifying!
> So yes, I'd say this is a bug in DM.
>
> Also, __rh_alloc() is called under read_lock(), via __rh_find(). If
> __rh_alloc()'s mempool_alloc() fails, it will perform a sleeping allocation
> under read_lock(), which is deadlockable and will generate might_sleep()
> warnings
The read_lock() is unlocked at the beginning of the function. Unless
you're talking of a different lock, but I couldn't find any.
So at least _currently_ this should work unless somebody uses fault
injection. Would it make sense then to add the __GFP_NOFAIL flag to
the kmalloc call?
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2007-08-16 0:00 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2007-08-13 11:33 Heiko Carstens
2007-08-15 22:56 ` Andrew Morton
2007-08-15 23:59 ` Heiko Carstens [this message]
2007-08-16 3:10 ` Andrew Morton
2007-08-16 7:09 ` [dm-devel] " Stefan Bader
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20070815235956.GD8741@osiris.ibm.com \
--to=heiko.carstens@de.ibm.com \
--cc=agk@redhat.com \
--cc=akpm@linux-foundation.org \
--cc=dm-devel@redhat.com \
--cc=kobras@linux.de \
--cc=linux-mm@kvack.org \
--cc=shbader@de.ibm.com \
--cc=torvalds@linux-foundation.org \
--cc=wein@de.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox