From: mawupeng <mawupeng1@huawei.com>
To: <david@redhat.com>, <anshuman.khandual@arm.com>, <will@kernel.org>
Cc: <mawupeng1@huawei.com>, <catalin.marinas@arm.com>,
<akpm@linux-foundation.org>, <sudaraja@codeaurora.org>,
<linux-mm@kvack.org>, <linux-kernel@vger.kernel.org>,
<wangkefeng.wang@huawei.com>,
<linux-arm-kernel@lists.infradead.org>, <mark.rutland@arm.com>
Subject: Re: [RFC PATCH] arm64: mm: Fix kernel page tables incorrectly deleted during memory removal
Date: Fri, 28 Jul 2023 09:06:16 +0800 [thread overview]
Message-ID: <b2aaeed0-26b4-bb00-de17-6a2f2cb183ff@huawei.com> (raw)
In-Reply-To: <a8775e0b-a206-3ec8-7499-a3c3cfd782e2@redhat.com>
On 2023/7/26 15:50, David Hildenbrand wrote:
> On 26.07.23 08:20, mawupeng wrote:
>>
>>
>> On 2023/7/24 14:11, David Hildenbrand wrote:
>>> On 24.07.23 07:54, Anshuman Khandual wrote:
>>>>
>>>>
>>>> On 7/24/23 06:55, mawupeng wrote:
>>>>>
>>>>> On 2023/7/21 18:36, Will Deacon wrote:
>>>>>> On Mon, Jul 17, 2023 at 07:51:50PM +0800, Wupeng Ma wrote:
>>>>>>> From: Ma Wupeng <mawupeng1@huawei.com>
>>>>>>>
>>>>>>> During our test, we found that kernel page table may be unexpectedly
>>>>>>> cleared with rodata off. The root cause is that the kernel page is
>>>>>>> initialized with pud size(1G block mapping) while offline is memory
>>>>>>> block size(MIN_MEMORY_BLOCK_SIZE 128M), eg, if 2G memory is hot-added,
>>>>>>> when offline a memory block, the call trace is shown below,
>>>
>>> Is someone adding memory in 2 GiB granularity and then removing parts of it in 128 MiB granularity? That would be against what we support using the add_memory() / offline_and_remove_memory() API and that driver should be fixed instead.
>>
>> Yes, this kind of situation.
>>
>> The problem occurs in the following scenarios:
>> 1. use mem=xxG to reserve memory.
>> 2. add_momory to online memory.
>> 3. offline part of the memroy via offline_and_remove_memory.
>>
>> During my research, ACPI memory removal use memory_subsys_offline to offline memory section and
>> this will not delete page table entry which do not trigger this kind of problem.
>>
>> So I understand what you are talking about.
>> 1. 3rd-party driver shouldn't use add_memory/offline_and_remove_memory to online/offline memory.
>> If it have to use, this can be achieved by driver.
>> 2. memory_subsys_offline is perfered to do such thing.
>
> No, my point is that
>
> 1) If you use add_memory() and offline_and_remove_memory() in the *same
> granularity* it has to be working, otherwise it has to be fixed.
>
> 2) If you use add_memory() and offline_and_remove_memory() in different
> granularity (especially, add_memory() in bigger granularity) , then
> change your code to do add_memory() in the same granularity.
>
>
> If you run into 1), then we populated a PUD for boot memory that also covers yet unpopulated physical memory ranges that are later populated by add_memory(). If that's the case, then we can either fix it by
>
> a) Not doing that. Use PMD tables instead for that piece of memory.
>
> b) Detecting that that PUD still covers memory and refusing to remove
> that PUD.
>
> c) Rejecting to hotadd memory in this situation at that location. We
> have mhp_get_pluggable_range() -> arch_get_mappable_range() to kind-
> of handle something like that.
Thank you for your patient answer.
This I do understand and answer my question.
>
next prev parent reply other threads:[~2023-07-28 1:06 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-07-17 11:51 Wupeng Ma
2023-07-21 10:36 ` Will Deacon
2023-07-24 1:25 ` mawupeng
2023-07-24 5:54 ` Anshuman Khandual
2023-07-24 6:11 ` David Hildenbrand
2023-07-26 6:20 ` mawupeng
2023-07-26 7:50 ` David Hildenbrand
2023-07-28 1:06 ` mawupeng [this message]
2023-07-28 4:01 ` Anshuman Khandual
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=b2aaeed0-26b4-bb00-de17-6a2f2cb183ff@huawei.com \
--to=mawupeng1@huawei.com \
--cc=akpm@linux-foundation.org \
--cc=anshuman.khandual@arm.com \
--cc=catalin.marinas@arm.com \
--cc=david@redhat.com \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mark.rutland@arm.com \
--cc=sudaraja@codeaurora.org \
--cc=wangkefeng.wang@huawei.com \
--cc=will@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox