* [PATCH] mm/compaction: fix the total_isolated in strict mode
@ 2024-11-02 20:16 Qiang Liu
2024-11-12 0:22 ` Andrew Morton
2024-11-12 1:24 ` Baolin Wang
0 siblings, 2 replies; 6+ messages in thread
From: Qiang Liu @ 2024-11-02 20:16 UTC (permalink / raw)
To: akpm; +Cc: linux-mm, linux-kernel, Qiang Liu
If the last cycle reads bogus compound_order() and blockpfn > end_pfn occurs,
it is possible that total_isolated will be less than nr_scanned. In this case,
strict mode should return 0, but the “if (strict && blockpfn < end_pfn)”
statement cannot recognize this situation
Signed-off-by: Qiang Liu <liuq131@chinatelecom.cn>
---
mm/compaction.c | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/mm/compaction.c b/mm/compaction.c
index a2b16b08cbbf..6009f5d1021a 100644
--- a/mm/compaction.c
+++ b/mm/compaction.c
@@ -699,7 +699,7 @@ static unsigned long isolate_freepages_block(struct compact_control *cc,
* pages requested were isolated. If there were any failures, 0 is
* returned and CMA will fail.
*/
- if (strict && blockpfn < end_pfn)
+ if (strict && (blockpfn < end_pfn || total_isolated != nr_scanned))
total_isolated = 0;
cc->total_free_scanned += nr_scanned;
--
2.27.0
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: [PATCH] mm/compaction: fix the total_isolated in strict mode
2024-11-02 20:16 [PATCH] mm/compaction: fix the total_isolated in strict mode Qiang Liu
@ 2024-11-12 0:22 ` Andrew Morton
2024-11-12 1:24 ` Baolin Wang
1 sibling, 0 replies; 6+ messages in thread
From: Andrew Morton @ 2024-11-12 0:22 UTC (permalink / raw)
To: Qiang Liu; +Cc: linux-mm, linux-kernel, Mel Gorman, Laura Abbott
On Sat, 2 Nov 2024 20:16:21 +0000 Qiang Liu <liuq131@chinatelecom.cn> wrote:
> If the last cycle reads bogus compound_order() and blockpfn > end_pfn occurs,
> it is possible that total_isolated will be less than nr_scanned. In this case,
> strict mode should return 0, but the “if (strict && blockpfn < end_pfn)”
> statement cannot recognize this situation
>
> ...
>
> --- a/mm/compaction.c
> +++ b/mm/compaction.c
> @@ -699,7 +699,7 @@ static unsigned long isolate_freepages_block(struct compact_control *cc,
> * pages requested were isolated. If there were any failures, 0 is
> * returned and CMA will fail.
> */
> - if (strict && blockpfn < end_pfn)
> + if (strict && (blockpfn < end_pfn || total_isolated != nr_scanned))
> total_isolated = 0;
>
> cc->total_free_scanned += nr_scanned;
That's really old code. What userspace-visible effects might this
have? Is this from code inspection, or was some misbehaviour observed?
Thanks.
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: [PATCH] mm/compaction: fix the total_isolated in strict mode
2024-11-02 20:16 [PATCH] mm/compaction: fix the total_isolated in strict mode Qiang Liu
2024-11-12 0:22 ` Andrew Morton
@ 2024-11-12 1:24 ` Baolin Wang
[not found] ` <2024111210165296529720@chinatelecom.cn>
1 sibling, 1 reply; 6+ messages in thread
From: Baolin Wang @ 2024-11-12 1:24 UTC (permalink / raw)
To: Qiang Liu, akpm; +Cc: linux-mm, linux-kernel
On 2024/11/3 04:16, Qiang Liu wrote:
> If the last cycle reads bogus compound_order() and blockpfn > end_pfn occurs,
if blockpfn > end_pfn occurs, we will reset the blockpfn, right?
/*
* Be careful to not go outside of the pageblock.
*/
if (unlikely(blockpfn > end_pfn))
blockpfn = end_pfn;
So how this can happen?
> it is possible that total_isolated will be less than nr_scanned. In this case,
> strict mode should return 0, but the “if (strict && blockpfn < end_pfn)”
> statement cannot recognize this situation
>
> Signed-off-by: Qiang Liu <liuq131@chinatelecom.cn>
> ---
> mm/compaction.c | 2 +-
> 1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/mm/compaction.c b/mm/compaction.c
> index a2b16b08cbbf..6009f5d1021a 100644
> --- a/mm/compaction.c
> +++ b/mm/compaction.c
> @@ -699,7 +699,7 @@ static unsigned long isolate_freepages_block(struct compact_control *cc,
> * pages requested were isolated. If there were any failures, 0 is
> * returned and CMA will fail.
> */
> - if (strict && blockpfn < end_pfn)
> + if (strict && (blockpfn < end_pfn || total_isolated != nr_scanned))
> total_isolated = 0;
>
> cc->total_free_scanned += nr_scanned;
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: [PATCH] mm/compaction: fix the total_isolated in strict mode
[not found] ` <2024111210165296529720@chinatelecom.cn>
@ 2024-11-12 9:47 ` Baolin Wang
2024-11-14 2:58 ` Baolin Wang
0 siblings, 1 reply; 6+ messages in thread
From: Baolin Wang @ 2024-11-12 9:47 UTC (permalink / raw)
To: liuq131, akpm; +Cc: linux-mm, linux-kernel
On 2024/11/12 10:16, liuq131@chinatelecom.cn wrote:
> "We assume that the block we are currently processing is distributed as follows:
> 0 1 2 511
> --------------------------------------------------
> | | | |
> ---------------------------------------------------
> Index 0 and 1 are both pages with an order of 0.
> Index 2 has a bogus order (let's assume the order is 9).
> When the for loop reaches index 2, it will enter the following code:
> /*
> * For compound pages such as THP and hugetlbfs, we can save
> * potentially a lot of iterations if we skip them at once.
> * The check is racy, but we can consider only valid values
> * and the only danger is skipping too much.
> */
> if (PageCompound(page)) {
> const unsigned int order = compound_order(page);
> if (blockpfn + (1UL << order) <= end_pfn) {
> blockpfn += (1UL << order) - 1;
> page += (1UL << order) - 1;
> nr_scanned += (1UL << order) - 1;
> }
> goto isolate_fail;
> }
>
> After exiting the for loop:
> blockpfn =basepfn+ 2+2^9 = basepfn+514
> endpfn = basepfn +512
> total_isolated = 2
> nr_scanned = 514
In your case, the 'blockpfn' will not be updated to 'basepfn+514',
because 'blockpfn + (1UL << order) > end_pfn', right? And remember the
'end_pfn' is the end of the pageblock.
So I'm still confused about your case. Is this from code inspection?
> /*
> * Be careful to not go outside of the pageblock.
> */
> if (unlikely(blockpfn > end_pfn))
> blockpfn = end_pfn;
>
> So this can happen
>
> /*
> * If strict isolation is requested by CMA then check that all the
> * pages requested were isolated. If there were any failures, 0 is
> * returned and CMA will fail.
> */
> if (strict && blockpfn < end_pfn)
> total_isolated = 0;
>
> If processed according to the old code, it will not enter the if statement to reset total_isolated, but the correct handling is to reset total_isolated to 0.
Please do not top-posting:
"
- Use interleaved ("inline") replies, which makes your response easier
to read. (i.e. avoid top-posting -- the practice of putting your answer
above the quoted text you are responding to.) For more details, see
:ref:`Documentation/process/submitting-patches.rst
<interleaved_replies>`.
"
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: [PATCH] mm/compaction: fix the total_isolated in strict mode
2024-11-12 9:47 ` Baolin Wang
@ 2024-11-14 2:58 ` Baolin Wang
0 siblings, 0 replies; 6+ messages in thread
From: Baolin Wang @ 2024-11-14 2:58 UTC (permalink / raw)
To: Qiang Liu, akpm; +Cc: linux-mm, linux-kernel
On 2024/11/14 10:10, Qiang Liu wrote:
> From: Baolin Wang <baolin.wang@linux.alibaba.com>
>
>
> On 2024/11/12 17:47, baolin.wang@linux.alibaba.com wrote:
>> On 2024/11/12 10:16, liuq131@chinatelecom.cn wrote:
>>> "We assume that the block we are currently processing is distributed
>>> as follows:
>>> 0 1 2 511
>>> --------------------------------------------------
>>> | |
>>> | |
>>> ---------------------------------------------------
>>> Index 0 and 1 are both pages with an order of 0.
>>> Index 2 has a bogus order (let's assume the order is 9).
>>> When the for loop reaches index 2, it will enter the following code:
>>> /*
>>> * For compound pages such as THP and hugetlbfs, we can save
>>> * potentially a lot of iterations if we skip them at once.
>>> * The check is racy, but we can consider only valid values
>>> * and the only danger is skipping too much.
>>> */
>>> if (PageCompound(page)) {
>>> const unsigned int order = compound_order(page);
>>> if (blockpfn + (1UL << order) <= end_pfn) {
>>> blockpfn += (1UL << order) - 1;
>>> page += (1UL << order) - 1;
>>> nr_scanned += (1UL << order) - 1;
>>> }
>>> goto isolate_fail;
>>> }
>>>
>>> After exiting the for loop:
>>> blockpfn =basepfn+ 2+2^9 = basepfn+514
>>> endpfn = basepfn +512
>>> total_isolated = 2
>>> nr_scanned = 514
>>
>> In your case, the 'blockpfn' will not be updated to 'basepfn+514',
>> because 'blockpfn + (1UL << order) > end_pfn', right? And remember the
>> 'end_pfn' is the end of the pageblock.
>>
>> So I'm still confused about your case. Is this from code inspection?
> You're right, the situation where blockpfn > end_pfn would not actually
> occur here.
> I encountered this issue in the 4.19 kernel, which did not have this check.
> I didn't carefully examine this scenario later. Sorry about that.
Never mind:)
> However, when blockpfn == end_pfn, I believe the patch is still applicable,
> but the git log needs to be updated. Is there still an opportunity to
> submit
> a revised version of the patch?
Of course yes, and please describe your issue clearly in the next
verion. However, IIUC when blockpfn == end_pfn in your case, the
'total_isolated' is still 0.
>>> /*
>>> * Be careful to not go outside of the pageblock.
>>> */
>>> if (unlikely(blockpfn > end_pfn))
>>> blockpfn = end_pfn;
>>> So this can happen
>>>
>>> /*
>>> * If strict isolation is requested by CMA then check that all the
>>> * pages requested were isolated. If there were any failures, 0 is
>>> * returned and CMA will fail.
>>> */
>>> if (strict && blockpfn < end_pfn)
>>> total_isolated = 0;
>>>
>>> If processed according to the old code, it will not enter the if
>>> statement to reset total_isolated, but the correct handling is to
>>> reset total_isolated to 0.
>>
>> Please do not top-posting:
>>
>> "
>> - Use interleaved ("inline") replies, which makes your response easier
>> to read. (i.e. avoid top-posting -- the practice of putting your
>> answer above the quoted text you are responding to.) For more details,
>> see
>> :ref:`Documentation/process/submitting-patches.rst
>> <interleaved_replies>`.
>> "
>>
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: [PATCH] mm/compaction: fix the total_isolated in strict mode
@ 2024-11-14 2:29 liuq131
0 siblings, 0 replies; 6+ messages in thread
From: liuq131 @ 2024-11-14 2:29 UTC (permalink / raw)
To: baolin.wang; +Cc: akpm, linux-kernel, linux-mm, liuq131
On 2024/11/12 17:47, baolin.wang@linux.alibaba.com wrote:
>On 2024/11/12 10:16, liuq131@chinatelecom.cn wrote:
>> "We assume that the block we are currently processing is distributed as follows:
>> 0 1 2 511
>> --------------------------------------------------
>> | | | |
>> ---------------------------------------------------
>> Index 0 and 1 are both pages with an order of 0.
>> Index 2 has a bogus order (let's assume the order is 9).
>> When the for loop reaches index 2, it will enter the following code:
>> /*
>> * For compound pages such as THP and hugetlbfs, we can save
>> * potentially a lot of iterations if we skip them at once.
>> * The check is racy, but we can consider only valid values
>> * and the only danger is skipping too much.
>> */
>> if (PageCompound(page)) {
>> const unsigned int order = compound_order(page);
>> if (blockpfn + (1UL << order) <= end_pfn) {
>> blockpfn += (1UL << order) - 1;
>> page += (1UL << order) - 1;
>> nr_scanned += (1UL << order) - 1;
>> }
>> goto isolate_fail;
>> }
>>
>> After exiting the for loop:
>> blockpfn =basepfn+ 2+2^9 = basepfn+514
>> endpfn = basepfn +512
>> total_isolated = 2
>> nr_scanned = 514
>
>In your case, the 'blockpfn' will not be updated to 'basepfn+514',
>because 'blockpfn + (1UL << order) > end_pfn', right? And remember the
>'end_pfn' is the end of the pageblock.
>
>So I'm still confused about your case. Is this from code inspection?
You're right, the situation where blockpfn > end_pfn would not actually occur here.
I encountered this issue in the 4.19 kernel, which did not have this check.
I didn't carefully examine this scenario later. Sorry about that.
However, when blockpfn == end_pfn, I believe the patch is still applicable,
but the git log needs to be updated. Is there still an opportunity to submit
a revised version of the patch?
>> /*
>> * Be careful to not go outside of the pageblock.
>> */
>> if (unlikely(blockpfn > end_pfn))
>> blockpfn = end_pfn;
>>
>> So this can happen
>>
>> /*
>> * If strict isolation is requested by CMA then check that all the
>> * pages requested were isolated. If there were any failures, 0 is
>> * returned and CMA will fail.
>> */
>> if (strict && blockpfn < end_pfn)
>> total_isolated = 0;
>>
>> If processed according to the old code, it will not enter the if statement to reset total_isolated, but the correct handling is to reset total_isolated to 0.
>
>Please do not top-posting:
>
>"
>- Use interleaved ("inline") replies, which makes your response easier
>to read. (i.e. avoid top-posting -- the practice of putting your answer
>above the quoted text you are responding to.) For more details, see
> :ref:`Documentation/process/submitting-patches.rst
><interleaved_replies>`.
>"
^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2024-11-14 2:58 UTC | newest]
Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2024-11-02 20:16 [PATCH] mm/compaction: fix the total_isolated in strict mode Qiang Liu
2024-11-12 0:22 ` Andrew Morton
2024-11-12 1:24 ` Baolin Wang
[not found] ` <2024111210165296529720@chinatelecom.cn>
2024-11-12 9:47 ` Baolin Wang
2024-11-14 2:58 ` Baolin Wang
2024-11-14 2:29 liuq131
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox