From: Laura Abbott <lauraa@codeaurora.org>
To: Minchan Kim <minchan@kernel.org>,
Ramakrishnan Muthukrishnan <vu3rdd@gmail.com>
Cc: linux-mm@kvack.org
Subject: Re: cma: alloc_contig_range test_pages_isolated .. failed
Date: Thu, 13 Mar 2014 18:37:37 -0700 [thread overview]
Message-ID: <53225D61.8090101@codeaurora.org> (raw)
In-Reply-To: <20140314001658.GG16062@bbox>
On 3/13/2014 5:16 PM, Minchan Kim wrote:
> On Thu, Mar 13, 2014 at 09:24:25AM +0530, Ramakrishnan Muthukrishnan wrote:
>> Hello,
>>
>> On Thu, Mar 13, 2014 at 4:59 AM, Minchan Kim <minchan@kernel.org> wrote:
>>>
>>> On Tue, Mar 11, 2014 at 07:32:34PM +0530, Ramakrishnan Muthukrishnan wrote:
>>>> Hello linux-mm hackers,
>>>>
>>>> We have a TI OMAP4 based system running 3.4 kernel. OMAP4 has got 2 M3
>>>> processors which is used for some media tasks.
>>>>
>>>> During bootup, the M3 firmware is loaded and it used CMA to allocate 3
>>>> regions for DMA, as seen by these logs:
>>>>
>>>> [ 0.000000] cma: dma_declare_contiguous(size a400000, base
>>>> 99000000, limit 00000000)
>>>> [ 0.000000] cma: CMA: reserved 168 MiB at 99000000
>>>> [ 0.000000] cma: dma_declare_contiguous(size 2000000, base
>>>> 00000000, limit 00000000)
>>>> [ 0.000000] cma: CMA: reserved 32 MiB at ad800000
>>>> [ 0.000000] cma: dma_contiguous_reserve(limit af800000)
>>>> [ 0.000000] cma: dma_contiguous_reserve: reserving 16 MiB for global area
>>>> [ 0.000000] cma: dma_declare_contiguous(size 1000000, base
>>>> 00000000, limit af800000)
>>>> [ 0.000000] cma: CMA: reserved 16 MiB at ac000000
>>>> [ 0.243652] cma: cma_init_reserved_areas()
>>>> [ 0.243682] cma: cma_create_area(base 00099000, count a800)
>>>> [ 0.253417] cma: cma_create_area: returned ed0ee400
>>>> [...]
>>>>
>>>> We observed that if we reboot a system without unmounting the file
>>>> systems (like in abrupt power off..etc), after the fresh reboot, the
>>>> file system checks are performed, the firmware load is delayed by ~4
>>>> seconds (compared to the one without fsck) and then we see the
>>>> following in the kernel bootup logs:
>>>>
>>>> [ 26.846313] alloc_contig_range test_pages_isolated(a2e00, a3400) failed
>>>> [ 26.853515] alloc_contig_range test_pages_isolated(a2e00, a3500) failed
>>>> [ 26.860809] alloc_contig_range test_pages_isolated(a3100, a3700) failed
>>>> [ 26.868133] alloc_contig_range test_pages_isolated(a3200, a3800) failed
>>>> [ 26.875213] rproc remoteproc0: dma_alloc_coherent failed: 6291456
>>>> [ 26.881744] rproc remoteproc0: Failed to process resources: -12
>>>> [ 26.902221] omap_hwmod: ipu: failed to hardreset
>>>> [ 26.909545] omap_hwmod: ipu: _wait_target_disable failed
>>>> [ 26.916748] rproc remoteproc0: rproc_boot() failed -12
>>>>
>>>> The M3 firmware load fails because of this. I have been looking at the
>>>> git logs to see if this is fixed in the later checkins, since this is
>>>> a bit old kernel. For various non-technical reasons which I have no
>>>> control of, we can't move to a newer kernel. But I could backport any
>>>> fixes done in newer kernel. Also I am totally new to memory management
>>>> in the kernel, so any help in debugging is highly appreciated.
>>>
>>> Could you try this one?
>>> https://lkml.org/lkml/2012/8/31/313
>>> I didn't reviewd that patch carefully but I guess you have similar problem.
>>> So, if it fixes your problem, we should review that patch carefully and
>>> merge if it doesn't have any problem and we couldn't find better solution.
>>
>> It didn't fix the problem, unfortunately. In fact my kernel already
>> had that patch applied (by a TI engineer):
>>
>> commit df9cf0bdf4a59e0fe6604f92f52028c259da69ad
>> Author: Guillaume Aubertin <g-aubertin@ti.com>
>> Date: Mon Sep 10 20:27:08 2012 +0800
>>
>> CMA: removing buffers from LRU when migrating
>>
>> based on the fix provided by Laura Abbott :
>> https://lkml.org/lkml/2012/8/31/313
>
> 3.4 was initial version for CMA and AFAIR, there were lots of problem and
> have fixed until now. I don't know how many patches TI backported to 3.4
> so it's really hard to see your problem.
>
> Anyway, patches I can suggest to you are following as
>
> [1] bb13ffeb9, mm: compaction: cache if a pageblock was scanned and no pages were isolated
> [2] 627260595, mm: compaction: fix bit ranges in {get,clear,set}_pageblock_skip()
>
> Totally, I forgot what they are but at least, Thierry had similar problem
> and it was fixed by that.
> https://lkml.org/lkml/2012/9/27/281
>
> Hopefully, It helps you, too.
>
> And please keep in mind. In 3.4, CMA has many problems so although we might
> fix poped up problem, you could encounter others in runtime, too unless TI
> enginner follows recent fixes.
>
>
Can you try picking up c060f943d0929f3e429c5d9522290584f6281d6e
(mm: use aligned zone start for pfn_to_bitidx calculation)
and 7c45512df987c5619db041b5c9b80d281e26d3db
(mm: fix pageblock bitmap allocation)
The first commit fixed a known failure mode that caused isolation
failures frequently and the second commit fixed a bug in the 1st
commit. From experience though if you are hitting a large number of
isolation failures that generally means the system is under memory/file
pressure and the CMA pages aren't even eligible to be isolated yet
(!PageLRU in isolate_migratepages_range)
You can also try this 'unique enhancement' (It sounds better than
performance dropping hack)
diff --git a/include/linux/mmzone.h b/include/linux/mmzone.h
index 9b61b9b..31b36e8 100644
--- a/include/linux/mmzone.h
+++ b/include/linux/mmzone.h
@@ -63,10 +63,20 @@ enum {
MIGRATE_TYPES
};
+static inline int get_pageblock_migratetype(struct page *page)
+{
+ return get_pageblock_flags_group(page, PB_migrate, PB_migrate_end);
+}
+
#ifdef CONFIG_CMA
+static inline bool is_cma_pageblock(struct page *)
+{
+ return get_pageblock_migratetype(page) == MIGRATE_CMA;
+}
# define is_migrate_cma(migratetype) unlikely((migratetype) ==
MIGRATE_CMA)
#else
# define is_migrate_cma(migratetype) false
+#define is_cma_pageblock(page) false
#endif
#define for_each_migratetype_order(order, type) \
@@ -75,11 +85,6 @@ enum {
extern int page_group_by_mobility_disabled;
-static inline int get_pageblock_migratetype(struct page *page)
-{
- return get_pageblock_flags_group(page, PB_migrate, PB_migrate_end);
-}
-
struct free_area {
struct list_head free_list[MIGRATE_TYPES];
unsigned long nr_free;
diff --git a/mm/filemap.c b/mm/filemap.c
index 7a13f6a..1fa8e58 100644
--- a/mm/filemap.c
+++ b/mm/filemap.c
@@ -788,12 +788,21 @@ struct page *find_or_create_page(struct
address_space *mapping,
{
struct page *page;
int err;
+ gfp_t gfp_notmask = 0;
+
repeat:
page = find_lock_page(mapping, index);
if (!page) {
- page = __page_cache_alloc(gfp_mask);
+retry:
+ page = __page_cache_alloc(gfp_mask & ~gfp_notmask);
if (!page)
return NULL;
+
+ if (is_cma_pageblock(page)) {
+ __free_page(page);
+ gfp_notmask |= __GFP_MOVABLE;
+ goto retry;
+ }
/*
* We want a regular kernel memory (not highmem or DMA etc)
* allocation for the radix tree nodes, but we need to
honour
>>
>> Thanks
>> Ramakrishnan
>>
>> --
>> To unsubscribe, send a message with 'unsubscribe linux-mm' in
>> the body to majordomo@kvack.org. For more info on Linux MM,
>> see: http://www.linux-mm.org/ .
>> Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
>
--
Qualcomm Innovation Center, Inc. is a member of Code Aurora Forum,
hosted by The Linux Foundation
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2014-03-14 1:37 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-03-11 14:02 Ramakrishnan Muthukrishnan
2014-03-12 23:29 ` Minchan Kim
2014-03-13 3:54 ` Ramakrishnan Muthukrishnan
2014-03-14 0:16 ` Minchan Kim
2014-03-14 1:37 ` Laura Abbott [this message]
2014-03-14 7:21 ` Ramakrishnan Muthukrishnan
2014-03-13 4:40 ` Heesub Shin
2014-03-13 13:43 ` Ramakrishnan Muthukrishnan
2014-03-14 0:41 ` Joonsoo Kim
2014-03-14 8:19 ` Jianguo Wu
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=53225D61.8090101@codeaurora.org \
--to=lauraa@codeaurora.org \
--cc=linux-mm@kvack.org \
--cc=minchan@kernel.org \
--cc=vu3rdd@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox