Re: [PATCH 0/1] mm, shmem: map few pages around fault address if they are in page cache

linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed

From: Andrew Morton <akpm@linux-foundation.org>
To: Ning Qu <quning@gmail.com>
Cc: Linus Torvalds <torvalds@linux-foundation.org>,
	Mel Gorman <mgorman@suse.de>, Rik van Riel <riel@redhat.com>,
	"Kirill A. Shutemov" <kirill.shutemov@linux.intel.com>,
	Hugh Dickins <hughd@google.com>, Andi Kleen <ak@linux.intel.com>,
	Matthew Wilcox <matthew.r.wilcox@intel.com>,
	Dave Hansen <dave.hansen@linux.intel.com>,
	Alexander Viro <viro@zeniv.linux.org.uk>,
	Dave Chinner <david@fromorbit.com>,
	linux-mm@kvack.org, linux-fsdevel@vger.kernel.org,
	linux-kernel@vger.kernel.org
Subject: Re: [PATCH 0/1] mm, shmem: map few pages around fault address if they are in page cache
Date: Mon, 3 Mar 2014 14:38:34 -0800	[thread overview]
Message-ID: <20140303143834.90ebe8ec5c6a369e54a599ec@linux-foundation.org> (raw)
In-Reply-To: <CACQD4-5SmUf+krLbef9Yg9HhJ-ipT2QKKq-NW=2C6G=XwXcMcQ@mail.gmail.com>

On Fri, 28 Feb 2014 22:27:04 -0800 Ning Qu <quning@gmail.com> wrote:

> On Fri, Feb 28, 2014 at 10:10 PM, Ning Qu <quning@gmail.com> wrote:
> > Yes, I am using the iozone -i 0 -i 1. Let me try the most simple test
> > as you mentioned.
> > Best wishes,
> > --
> > Ning Qu
> >
> >
> > On Fri, Feb 28, 2014 at 5:41 PM, Andrew Morton
> > <akpm@linux-foundation.org> wrote:
> >> On Fri, 28 Feb 2014 16:35:16 -0800 Ning Qu <quning@gmail.com> wrote:
> >>
> >>
> >> int main(int argc, char *argv[])
> >> {
> >>         char *p;
> >>         int fd;
> >>         unsigned long idx;
> >>         int sum = 0;
> >>
> >>         fd = open("foo", O_RDONLY);
> >>         if (fd < 0) {
> >>                 perror("open");
> >>                 exit(1);
> >>         }
> >>         p = mmap(NULL, 1 * G, PROT_READ, MAP_PRIVATE, fd, 0);
> >>         if (p == MAP_FAILED) {
> >>                 perror("mmap");
> >>                 exit(1);
> >>         }
> >>
> >>         for (idx = 0; idx < 1 * G; idx += 4096)
> >>                 sum += p[idx];
> >>         printf("%d\n", sum);
> >>         exit(0);
> >> }
> >>
> >> z:/home/akpm> /usr/bin/time ./a.out
> >> 0
> >> 0.05user 0.33system 0:00.38elapsed 99%CPU (0avgtext+0avgdata 4195856maxresident)k
> >> 0inputs+0outputs (0major+262264minor)pagefaults 0swaps
> >>
> >> z:/home/akpm> dc
> >> 16o
> >> 262264 4 * p
> >> 1001E0
> >>
> >> That's close!

OK, I'm repairing your top-posting here.  It makes it unnecessarily
hard to conduct a conversation - please just don't do it.

> Yes, the simple test does verify that the page fault number are
> correct with the patch. So my previous results are from those command
> lines, which also show some performance improvement with this change
> in tmpfs.
> 
> sequential access
> /usr/bin/time -a ./iozone -B s 8g -i 0 -i 1
> 
> random access
> /usr/bin/time -a ./iozone -B s 8g -i 0 -i 2

I don't understand your point here.

Running my simple test app with and without Kirill's
mm-introduce-vm_ops-map_pages and
mm-implement-map_pages-for-page-cache, minor faults are reduced 16x
when the file is cached, as expected:

0.02user 0.22system 0:00.24elapsed 97%CPU (0avgtext+0avgdata 4198080maxresident)k
0inputs+0outputs (0major+16433minor)pagefaults 0swaps


When the file is uncached, results are peculiar:

0.00user 2.84system 0:50.90elapsed 5%CPU (0avgtext+0avgdata 4198096maxresident)k
0inputs+0outputs (1major+49666minor)pagefaults 0swaps

That's approximately 3x more minor faults.  I thought it might be due
to the fact that userspace pagefaults and disk IO completions are both
working in the same order through the same pages, so the pagefaults
keep stumbling across not-yet-completed pages.  So I attempted to
complete the pages in reverse order:

--- a/fs/mpage.c~a
+++ a/fs/mpage.c
@@ -41,12 +41,16 @@
  * status of that page is hard.  See end_buffer_async_read() for the details.
  * There is no point in duplicating all that complexity.
  */
+#define bio_for_each_segment_all_reverse(bvl, bio, i)			\
+	for (i = 0, bvl = (bio)->bi_io_vec + (bio)->bi_vcnt - 1;	\
+	i < (bio)->bi_vcnt; i++, bvl--)
+
 static void mpage_end_io(struct bio *bio, int err)
 {
 	struct bio_vec *bv;
 	int i;
 
-	bio_for_each_segment_all(bv, bio, i) {
+	bio_for_each_segment_all_reverse(bv, bio, i) {
 		struct page *page = bv->bv_page;
 
 		if (bio_data_dir(bio) == READ) {

But that made no difference.  Maybe I got the wrong BIO completion
routine, but I don't think so (it's ext3).  Probably my theory is
wrong.

Anyway, could you please resend your patch with Hugh's fix and with a
more carefully written and more accurate changelog?

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

next prev parent reply	other threads:[~2014-03-03 22:38 UTC|newest]

Thread overview: 20+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-02-28 22:18 Ning Qu
2014-02-28 22:18 ` [PATCH 1/1] mm: implement ->map_pages for shmem/tmpfs Ning Qu
2014-03-01  1:20   ` Hugh Dickins
2014-03-01  6:36     ` Ning Qu
2014-03-03 11:07       ` Kirill A. Shutemov
2014-03-03 18:49         ` Ning Qu
2014-03-04 20:02     ` Hugh Dickins
2014-02-28 22:34 ` [PATCH 0/1] mm, shmem: map few pages around fault address if they are in page cache Andrew Morton
2014-02-28 22:35   ` Ning Qu
2014-03-01  0:35 ` Ning Qu
2014-03-01  1:41   ` Andrew Morton
2014-03-01  6:10     ` Ning Qu
2014-03-01  6:27       ` Ning Qu
2014-03-03 22:38         ` Andrew Morton [this message]
2014-03-03 23:07           ` Ning Qu
2014-03-03 23:29           ` Linus Torvalds
2014-03-03 23:37             ` Andrew Morton
2014-03-04  0:50               ` Kirill A. Shutemov
2014-03-05 22:20 ` Ning Qu
2014-03-13 20:46   ` Ning Qu

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20140303143834.90ebe8ec5c6a369e54a599ec@linux-foundation.org \
    --to=akpm@linux-foundation.org \
    --cc=ak@linux.intel.com \
    --cc=dave.hansen@linux.intel.com \
    --cc=david@fromorbit.com \
    --cc=hughd@google.com \
    --cc=kirill.shutemov@linux.intel.com \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=matthew.r.wilcox@intel.com \
    --cc=mgorman@suse.de \
    --cc=quning@gmail.com \
    --cc=riel@redhat.com \
    --cc=torvalds@linux-foundation.org \
    --cc=viro@zeniv.linux.org.uk \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox