From: Doug Thompson <norsk5@yahoo.com>
To: Tim Small <tim@buttersideup.com>,
bluesmoke-devel@lists.sourceforge.net, linux-mm@kvack.org
Subject: Re: Failing memory auto-hotremove support?
Date: Thu, 3 Jul 2008 10:42:57 -0700 (PDT) [thread overview]
Message-ID: <90872.19606.qm@web50110.mail.re2.yahoo.com> (raw)
In-Reply-To: <486CC533.6080302@buttersideup.com>
--- Tim Small <tim@buttersideup.com> wrote:
> Hello,
>
> I just noticed that there is memory hotplug / hotremove support in the
> kernel.org kernel now.
cool, good to hear. Now I (or others) need some cycles to review it and mod EDAC to utilize it if
possible and/or provide feedback to the memory guys
>
> I was thinking that it may be desirable (e.g. on large NUMA systems) to
> automatically trigger the removal of memory modules (or just take a
> section of the memory module out of use, if applicable), if a memory
> module exceeded a pre-set correctable error rate (or RIGHT-NOW, if an
> uncorrectable memory error was detected).
THAT is exactly what one of the goals of EDAC (then bluesmoke) had in mind years ago, but there
was no easy mechanism, within the kernel, to perform those types of controls (take a section of
memory out of commision).
When you have a NUMA node with 64 or 128 gigbabytes of memory and have 5,000 such nodes, rebooting
in not a very good thing to do.
BUT being able to detect a bad DIMM (or a pair) via EDAC and then notify the memory subsystem to
de-activate that DIMM (pair) from active use is GREAT feature to have. The node graciously handles
the downed memory and stays UP running that big cluster task, all the while notifying the admin
that a DIMM needs replacement at the next maintaince cycle.
doug t
>
> Tim.
>
W1DUG
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2008-07-03 17:42 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-07-03 12:25 Tim Small
2008-07-03 17:42 ` Doug Thompson [this message]
2008-07-04 5:24 ` Yasunori Goto
2008-07-09 23:34 ` Badari Pulavarty
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=90872.19606.qm@web50110.mail.re2.yahoo.com \
--to=norsk5@yahoo.com \
--cc=bluesmoke-devel@lists.sourceforge.net \
--cc=linux-mm@kvack.org \
--cc=tim@buttersideup.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox