From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-pa0-f54.google.com (mail-pa0-f54.google.com [209.85.220.54]) by kanga.kvack.org (Postfix) with ESMTP id 8BA566B0032 for ; Sat, 28 Feb 2015 01:02:29 -0500 (EST) Received: by padbj1 with SMTP id bj1so28183463pad.5 for ; Fri, 27 Feb 2015 22:02:29 -0800 (PST) Received: from cnbjrel01.sonyericsson.com (cnbjrel01.sonyericsson.com. [219.141.167.165]) by mx.google.com with ESMTPS id u7si7766147pbs.92.2015.02.27.22.02.24 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Fri, 27 Feb 2015 22:02:27 -0800 (PST) From: "Wang, Yalin" Date: Sat, 28 Feb 2015 14:01:46 +0800 Subject: [RFC V2] mm: change mm_advise_free to clear page dirty Message-ID: <35FD53F367049845BC99AC72306C23D10458D6173BE1@CNBJMBX05.corpusers.net> References: <1424765897-27377-1-git-send-email-minchan@kernel.org> <20150224154318.GA14939@dhcp22.suse.cz> <20150225000809.GA6468@blaptop> <35FD53F367049845BC99AC72306C23D10458D6173BDC@CNBJMBX05.corpusers.net> <20150227210233.GA29002@dhcp22.suse.cz> <35FD53F367049845BC99AC72306C23D10458D6173BE0@CNBJMBX05.corpusers.net> In-Reply-To: <35FD53F367049845BC99AC72306C23D10458D6173BE0@CNBJMBX05.corpusers.net> Content-Language: en-US Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 Sender: owner-linux-mm@kvack.org List-ID: To: 'Michal Hocko' Cc: 'Minchan Kim' , 'Andrew Morton' , "'linux-kernel@vger.kernel.org'" , "'linux-mm@kvack.org'" , 'Rik van Riel' , 'Johannes Weiner' , 'Mel Gorman' , 'Shaohua Li' This patch add ClearPageDirty() to clear AnonPage dirty flag, if not clear page dirty for this anon page, the page will never be treated as freeable. we also make sure the shared AnonPage is not freeable, we implement it by dirty all copyed AnonPage pte, so that make sure the Anonpage will not become freeable, unless all process which shared this page call madvise_free syscall. Another change is that we also handle file map page, we just clear pte young bit for file map, this is useful, it can make reclaim patch move file pages into inactive lru list aggressively. Signed-off-by: Yalin Wang --- mm/madvise.c | 26 +++++++++++++++----------- mm/memory.c | 12 ++++++++++-- 2 files changed, 25 insertions(+), 13 deletions(-) diff --git a/mm/madvise.c b/mm/madvise.c index 6d0fcb8..712756b 100644 --- a/mm/madvise.c +++ b/mm/madvise.c @@ -299,30 +299,38 @@ static int madvise_free_pte_range(pmd_t *pmd, unsigne= d long addr, page =3D vm_normal_page(vma, addr, ptent); if (!page) continue; + if (!PageAnon(page)) + goto set_pte; + if (!trylock_page(page)) + continue; =20 if (PageSwapCache(page)) { - if (!trylock_page(page)) - continue; - if (!try_to_free_swap(page)) { unlock_page(page); continue; } - - ClearPageDirty(page); - unlock_page(page); } =20 /* + * we clear page dirty flag for AnonPage, no matter if this + * page is in swapcahce or not, AnonPage not in swapcache also set + * dirty flag sometimes, this happened when an AnonPage is removed + * from swapcahce by try_to_free_swap() + */ + ClearPageDirty(page); + unlock_page(page); + /* * Some of architecture(ex, PPC) don't update TLB * with set_pte_at and tlb_remove_tlb_entry so for * the portability, remap the pte with old|clean * after pte clearing. */ +set_pte: ptent =3D ptep_get_and_clear_full(mm, addr, pte, tlb->fullmm); ptent =3D pte_mkold(ptent); - ptent =3D pte_mkclean(ptent); + if (PageAnon(page)) + ptent =3D pte_mkclean(ptent); set_pte_at(mm, addr, pte, ptent); tlb_remove_tlb_entry(tlb, pte, addr); } @@ -364,10 +372,6 @@ static int madvise_free_single_vma(struct vm_area_stru= ct *vma, if (vma->vm_flags & (VM_LOCKED|VM_HUGETLB|VM_PFNMAP)) return -EINVAL; =20 - /* MADV_FREE works for only anon vma at the moment */ - if (vma->vm_file) - return -EINVAL; - start =3D max(vma->vm_start, start_addr); if (start >=3D vma->vm_end) return -EINVAL; diff --git a/mm/memory.c b/mm/memory.c index 8068893..3d949b3 100644 --- a/mm/memory.c +++ b/mm/memory.c @@ -874,10 +874,18 @@ copy_one_pte(struct mm_struct *dst_mm, struct mm_stru= ct *src_mm, if (page) { get_page(page); page_dup_rmap(page); - if (PageAnon(page)) + if (PageAnon(page)) { + /* + * we dirty the copyed pte for anon page, + * this is useful for madvise_free_pte_range(), + * this can prevent shared anon page freed by madvise_free + * syscall + */ + pte =3D pte_mkdirty(pte); rss[MM_ANONPAGES]++; - else + } else { rss[MM_FILEPAGES]++; + } } =20 out_set_pte: --=20 2.2.2 -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: email@kvack.org