From: Michal Hocko <mhocko@kernel.org>
To: Ming Ling <ming.ling@spreadtrum.com>
Cc: akpm@linux-foundation.org, mgorman@techsingularity.net,
vbabka@suse.cz, hannes@cmpxchg.org,
baiyaowei@cmss.chinamobile.com, iamjoonsoo.kim@lge.com,
minchan@kernel.org, rientjes@google.com, hughd@google.com,
kirill.shutemov@linux.intel.com, riel@redhat.com,
mgorman@suse.de, aquini@redhat.com, corbet@lwn.net,
linux-mm@kvack.org, linux-kernel@vger.kernel.org,
orson.zhai@spreadtrum.com, geng.ren@spreadtrum.com,
chunyan.zhang@spreadtrum.com, zhizhou.tian@spreadtrum.com,
yuming.han@spreadtrum.com, xiajing@spreadst.com
Subject: Re: [PATCH v2] mm: exclude isolated non-lru pages from NR_ISOLATED_ANON or NR_ISOLATED_FILE.
Date: Fri, 14 Oct 2016 13:30:44 +0200 [thread overview]
Message-ID: <20161014113044.GB6063@dhcp22.suse.cz> (raw)
In-Reply-To: <20161014083219.GA20260@spreadtrum.com>
On Fri 14-10-16 16:32:19, Ming Ling wrote:
> On a??, 10ae?? 13, 2016 at 10:09:37a,?a?? +0200, Michal Hocko wrote:
> Hello,
> > On Thu 13-10-16 14:39:09, ming.ling wrote:
> > > From: Ming Ling <ming.ling@spreadtrum.com>
> > >
> > > Non-lru pages don't belong to any lru, so counting them to
> > > NR_ISOLATED_ANON or NR_ISOLATED_FILE doesn't make any sense.
> > > It may misguide functions such as pgdat_reclaimable_pages and
> > > too_many_isolated.
> >
> > That doesn't make much sense to me. I guess you wanted to say something
> > like
> > "
> > Accounting non-lru pages isolated for migration during pfn walk to
> > NR_ISOLATED_{ANON,FILE} doesn't make any sense and it can misguide
> > heuristics based on those counters such as pgdat_reclaimable_pages resp.
> > too_many_isolated. Note that __alloc_contig_migrate_range can isolate
> > a lot of pages at once.
> > "
> Yesi 1/4 ?your understanding is right, and your description is clearer than
> mine. Do your mind if i borrow it as a comment of this patch in next
> version?
sure, go ahead
> > > On mobile devices such as 512M ram android Phone, it may use
> > > a big zram swap. In some cases zram(zsmalloc) uses too many
> > > non-lru pages, such as:
> > > MemTotal: 468148 kB
> > > Normal free:5620kB
> > > Free swap:4736kB
> > > Total swap:409596kB
> > > ZRAM: 164616kB(zsmalloc non-lru pages)
> > > active_anon:60700kB
> > > inactive_anon:60744kB
> > > active_file:34420kB
> > > inactive_file:37532kB
> >
> > I assume those zsmalloc pages are migrateable and that is the problem?
> > Please state that explicitly so that even people not familiar with
> > zsmalloc understand the motivation.
>
> Yes, since Minchan Kim had committed a??mm: migrate: support non-lru
> movable page migrationa??, those zsmalloc pages are migrateable now.
> And i will state that explicitly in next version.
OK
> > > More non-lru pages which used by zram for swap, it influences
> > > pgdat_reclaimable_pages and too_many_isolated more.
> >
> > It would be good to mention what would be a visible effect of this.
> > "If the NR_ISOLATED_* is too large then the direct reclaim might get
> > throttled prematurely inducing longer allocation latencies without any
> > strong reason."
> >
> I will detail the effect of counting so many non-lru pages into
> NR_ISOLATED_{ANON,FILE} such as:
>
> 'In function shrink_inactive_list, if there are too many isolated
> pages,it will wait for a moment. So If we miscounting large number
> non-lru pages into NR_ISOLATED_{ANON,FILE}, direct reclaim might
> getthrottled prematurely inducing longer allocation latencies
> without any strong reason. Actually there is no need to take non-lru
> pages into account in shrink_inactive_list which just deals with
> lru pages.
Note that this is true also for the direct compaction.
> In function pgdat_reclaimable_pages, you had considered isolated
> pages in zone_reclaimable_pages. So miscounting non-lru pages into
> NR_ISOLATED_{ANON,FILE} also larger zone_reclaimable_pages and will
> lead to a more optimistic zone_reclaimable judgement.
Which shouldn't be such a big deal.
> '
> > > This patch excludes isolated non-lru pages from NR_ISOLATED_ANON
> > > or NR_ISOLATED_FILE to ensure their counts are right.
> >
> > But this patch doesn't do that. It just relies on __PageMovable. It is
> > true that all LRU pages should be movable (well except for
> > NR_UNEVICTABLE in certain configurations) but is it true that all
> > movable pages are on the LRU list?
> >
>
> I don't think so. In commit bda807d4 'mm: migrate: support non-lru
> movable page migration', Minchan Kim point out :
> 'For testing of non-lru movable page, VM supports __PageMovable function.
> However, it doesn't guarantee to identify non-lru movable page because
> page->mapping field is unified with other variables in struct page. As
> well, if driver releases the page after isolation by VM, page->mapping
> doesn't have stable value although it has PAGE_MAPPING_MOVABLE (Look at
> __ClearPageMovable). But __PageMovable is cheap to catch whether page
> is LRU or non-lru movable once the page has been isolated. Because LRU
> pages never can have PAGE_MAPPING_MOVABLE in page->mapping. It is also
> good for just peeking to test non-lru movable pages before more
> expensive checking with lock_page in pfn scanning to select victim.'.
>
> And he uses __PageMovable to judge whether a isolated page is a lru page
> such as:
> void putback_movable_pages(struct list_head *l)
> {
> ......
> /*
> * We isolated non-lru movable page so here we can use
> * __PageMovable because LRU page's mapping cannot have
> * PAGE_MAPPING_MOVABLE.
> */
> if (unlikely(__PageMovable(page))) {
> VM_BUG_ON_PAGE(!PageIsolated(page), page);
> lock_page(page);
> if (PageMovable(page))
> putback_movable_page(page);
> else
> __ClearPageIsolated(page);
> unlock_page(page);
> put_page(page);
> } else {
> putback_lru_page(page);
> }
> }
I am not familiar with this code enough to comment but to me it all
sounds quite subtle.
> > Why don't you simply mimic what shrink_inactive_list does? Aka count the
> > number of isolated pages and then account them when appropriate?
> >
> I think i am correcting clearly wrong part. So, there is no need to
> describe it too detailed. It's a misunderstanding, and i will add
> more comments as you suggest.
OK, so could you explain why you prefer to relyon __PageMovable rather
than do a trivial counting during the isolation?
--
Michal Hocko
SUSE Labs
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2016-10-14 11:30 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-10-13 6:39 ming.ling
2016-10-13 8:09 ` Michal Hocko
2016-10-14 8:32 ` Ming Ling
2016-10-14 11:30 ` Michal Hocko [this message]
2016-10-14 13:46 ` Minchan Kim
2016-10-14 13:53 ` Michal Hocko
2016-10-14 14:44 ` Minchan Kim
2016-10-14 15:03 ` Michal Hocko
2016-10-14 15:26 ` Minchan Kim
2016-10-15 7:10 ` Michal Hocko
2016-10-16 23:06 ` Minchan Kim
2016-10-17 8:42 ` Michal Hocko
2016-10-17 11:10 ` Michal Hocko
2016-10-18 6:29 ` Minchan Kim
2016-10-18 12:52 ` Michal Hocko
2016-10-19 2:32 ` Ming Ling
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20161014113044.GB6063@dhcp22.suse.cz \
--to=mhocko@kernel.org \
--cc=akpm@linux-foundation.org \
--cc=aquini@redhat.com \
--cc=baiyaowei@cmss.chinamobile.com \
--cc=chunyan.zhang@spreadtrum.com \
--cc=corbet@lwn.net \
--cc=geng.ren@spreadtrum.com \
--cc=hannes@cmpxchg.org \
--cc=hughd@google.com \
--cc=iamjoonsoo.kim@lge.com \
--cc=kirill.shutemov@linux.intel.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mgorman@suse.de \
--cc=mgorman@techsingularity.net \
--cc=minchan@kernel.org \
--cc=ming.ling@spreadtrum.com \
--cc=orson.zhai@spreadtrum.com \
--cc=riel@redhat.com \
--cc=rientjes@google.com \
--cc=vbabka@suse.cz \
--cc=xiajing@spreadst.com \
--cc=yuming.han@spreadtrum.com \
--cc=zhizhou.tian@spreadtrum.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox