From: ebiederm@xmission.com (Eric W. Biederman)
To: Amerigo Wang <amwang@redhat.com>
Cc: linux-kernel@vger.kernel.org, tony.luck@intel.com,
linux-ia64@vger.kernel.org, linux-mm@kvack.org,
Neil Horman <nhorman@redhat.com>,
Andi Kleen <andi@firstfloor.org>,
akpm@linux-foundation.org, bernhard.walle@gmx.de,
Fenghua Yu <fenghua.yu@intel.com>, Ingo Molnar <mingo@elte.hu>,
Anton Vorontsov <avorontsov@ru.mvista.com>
Subject: Re: [Patch 8/8] kexec: allow to shrink reserved memory
Date: Wed, 12 Aug 2009 23:18:33 -0700 [thread overview]
Message-ID: <m1eirg5j9i.fsf@fess.ebiederm.org> (raw)
In-Reply-To: <4A83893D.50707@redhat.com> (Amerigo Wang's message of "Thu\, 13 Aug 2009 11\:32\:13 +0800")
Amerigo Wang <amwang@redhat.com> writes:
> Eric W. Biederman wrote:
>> Amerigo Wang <amwang@redhat.com> writes:
>>
>>
>>> This patch implements shrinking the reserved memory for crash kernel,
>>> if it is more than enough.
>>>
>>> For example, if you have already reserved 128M, now you just want 100M,
>>> you can do:
>>>
>>> # echo $((100*1024*1024)) > /sys/kernel/kexec_crash_size
>>>
>>
>> Getting closer (comments inline)
>>
>> Semantically this patch is non-contriversial and pretty
>> simple, but still needs a fair amount of review. Can
>> you put this patch at the front of your patch set.
>>
>>
>
> Sure, I will do it when I resend them next time.
>
> I add mm people into Cc.
>>> Index: linux-2.6/kernel/kexec.c
>>> ===================================================================
>>> --- linux-2.6.orig/kernel/kexec.c
>>> +++ linux-2.6/kernel/kexec.c
>>> @@ -1083,6 +1083,76 @@ void crash_kexec(struct pt_regs *regs)
>>> }
>>> }
>>> +int kexec_crash_kernel_loaded(void)
>>> +{
>>> + int ret;
>>> + if (!mutex_trylock(&kexec_mutex))
>>> + return 1;
>>>
>>
>> We don't need trylock on this code path
>
> OK.
>
>>
>>> + ret = kexec_crash_image != NULL;
>>> + mutex_unlock(&kexec_mutex);
>>> + return ret;
>>> +}
>>> +
>>> +size_t get_crash_memory_size(void)
>>> +{
>>> + size_t size;
>>> + if (!mutex_trylock(&kexec_mutex))
>>> + return 1;
>>>
>>
>> We don't need trylock on this code path
>>
>>
>
> Hmm, crashk_res is a global struct, so other process can also
> change it... but currently no process does that, right?
>
We still need the lock. Just doing trylock doesn't instead
of just sleeping doesn't seem to make any sense on these
code paths.
>>> + size = crashk_res.end - crashk_res.start + 1;
>>> + mutex_unlock(&kexec_mutex);
>>> + return size;
>>> +}
>>> +
>>> +int shrink_crash_memory(unsigned long new_size)
>>> +{
>>> + struct page **pages;
>>> + int ret = 0;
>>> + int npages, i;
>>> + unsigned long addr;
>>> + unsigned long start, end;
>>> + void *vaddr;
>>> +
>>> + if (!mutex_trylock(&kexec_mutex))
>>> + return -EBUSY;
>>>
>>
>> We don't need trylock on this code path
>>
>> We are missing the check to see if the crash_kernel is loaded
>> under this lock instance. So I please move the kexec_crash_image != NULL
>> test inline here and kill the kexec_crash_kernel_loaded function.
>>
>
> Ok, no problem.
>
>>
>>> + start = crashk_res.start;
>>> + end = crashk_res.end;
>>> +
>>> + if (new_size >= end - start + 1) {
>>> + ret = -EINVAL;
>>> + if (new_size == end - start + 1)
>>> + ret = 0;
>>> + goto unlock;
>>> + }
>>> +
>>> + start = roundup(start, PAGE_SIZE);
>>> + end = roundup(start + new_size, PAGE_SIZE) - 1;
>>> + npages = (end + 1 - start ) / PAGE_SIZE;
>>> +
>>> + pages = kmalloc(sizeof(struct page *) * npages, GFP_KERNEL);
>>> + if (!pages) {
>>> + ret = -ENOMEM;
>>> + goto unlock;
>>> + }
>>> + for (i = 0; i < npages; i++) {
>>> + addr = end + 1 + i * PAGE_SIZE;
>>> + pages[i] = virt_to_page(addr);
>>> + }
>>> +
>>> + vaddr = vm_map_ram(pages, npages, 0, PAGE_KERNEL);
>>>
>>
>> This is the wrong kernel call to use. I expect this needs to look
>> like a memory hotplug event. This does not put the pages into the
>> free page pool.
>>
>
> Well, I also wanted to use an memory-hotplug API, but that will make the code
> depend on memory-hotplug, which certainly is not what we want...
>
> I checked the mm code, actually what I need is an API which is similar to
> add_active_range(), but add_active_range() can't be used here since it is marked
> as "__init".
>
> Do we have that kind of API in mm? I can't find one.
Perhaps we will need to remove __init from add_active_range. I know the logic
but I'm not up to speed on the mm pieces at the moment.
Eric
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2009-08-13 6:18 UTC|newest]
Thread overview: 19+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <20090812081731.5757.25254.sendpatchset@localhost.localdomain>
[not found] ` <20090812081906.5757.39417.sendpatchset@localhost.localdomain>
[not found] ` <m1bpmk8l1g.fsf@fess.ebiederm.org>
2009-08-13 3:32 ` Amerigo Wang
2009-08-13 6:18 ` Eric W. Biederman [this message]
2009-08-13 8:23 ` Amerigo Wang
2009-08-14 22:17 ` Eric W. Biederman
2009-08-17 9:50 ` Amerigo Wang
2009-08-18 0:29 ` KAMEZAWA Hiroyuki
2009-08-18 6:31 ` Amerigo Wang
2009-08-18 8:25 ` KAMEZAWA Hiroyuki
2009-08-18 8:51 ` Amerigo Wang
2009-08-18 10:35 ` Amerigo Wang
2009-08-18 23:57 ` KAMEZAWA Hiroyuki
2009-08-19 2:41 ` Amerigo Wang
2009-08-19 8:13 ` KAMEZAWA Hiroyuki
2009-08-19 10:47 ` Amerigo Wang
2009-08-20 9:15 ` Amerigo Wang
2009-08-21 0:34 ` KAMEZAWA Hiroyuki
2009-08-21 1:59 ` Amerigo Wang
2009-08-21 2:03 ` KAMEZAWA Hiroyuki
2009-08-21 2:47 ` Amerigo Wang
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=m1eirg5j9i.fsf@fess.ebiederm.org \
--to=ebiederm@xmission.com \
--cc=akpm@linux-foundation.org \
--cc=amwang@redhat.com \
--cc=andi@firstfloor.org \
--cc=avorontsov@ru.mvista.com \
--cc=bernhard.walle@gmx.de \
--cc=fenghua.yu@intel.com \
--cc=linux-ia64@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mingo@elte.hu \
--cc=nhorman@redhat.com \
--cc=tony.luck@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox