From: Balbir Singh <balbir@linux.vnet.ibm.com>
To: Nick Piggin <npiggin@suse.de>
Cc: Balbir Singh <balbir@in.ibm.com>,
Shailabh Nagar <nagar1234@in.ibm.com>,
Linux Memory Management List <linux-mm@kvack.org>,
Hugh Dickins <hugh@veritas.com>
Subject: Re: [MAILER-DAEMON@watson.ibm.com: Returned mail: see transcript for details]
Date: Tue, 04 Sep 2007 07:31:14 +0100 [thread overview]
Message-ID: <46DCFBB2.3060200@linux.vnet.ibm.com> (raw)
In-Reply-To: <20070903201645.GA11502@wotan.suse.de>
Nick Piggin wrote:
> Hi,
>
> This mail to Shailabh bounced, but I noticed you're on the Signed-off-by
> trail too, so forwarding to you so you don't miss it. Thanks,
>
Thanks, Nick. I am cc'ing Shailabh's new email id.
> Hi,
>
> I can't convince myself that delay accounting for swapin is quite right
> at the moment (not having a test setup handy to run and check for myself).
> Maybe I'm not reading the swapin code very well...
>
> lookup_swap_cache, and read_swap_cache_async should be non-blocking
> operations for the most part.
>
> read_swap_cache_async might, when allocating the new page, go into reclaim
> and take a long time to come back. However is that any more a "swapin" delay
> than eg. when we sleep on mmap_sem when first taking the fault, or any other
> types of fault which require allocations? None of which we account for as
> swapin delay.
>
Yes, for us it is. We measure the end to end delay of swapping in/out a page.
> But the most obvious delay, where we actually lock the page waiting for
> the swap IO to finish, does not seem to be accounted at all!
>
Hmm.. Does lock_page() eventually call io_schedule() or io_schedule_timeout()?
I think it does -- via sync_page(). The way our accounting works is that we
account for block I/O in io_schedule*(). If we see the SWAPIN flag set, we
then account for that I/O as swap I/O.
> My proposed fix is to just move the swaping delay accounting to the
> point where the VM does actually wait, for the swapin.
>
> I have no idea what uses swapin delay accounting, but it would be good to
> see if this makes a positive (or at least not negative) impact on those
> users...
>
> Thanks,
> Nick
>
> --
> Index: linux-2.6/mm/memory.c
> ===================================================================
> --- linux-2.6.orig/mm/memory.c
> +++ linux-2.6/mm/memory.c
> @@ -2158,7 +2158,6 @@ static int do_swap_page(struct mm_struct
> migration_entry_wait(mm, pmd, address);
> goto out;
> }
> - delayacct_set_flag(DELAYACCT_PF_SWAPIN);
Let's start the accounting here.
> page = lookup_swap_cache(entry);
> if (!page) {
> grab_swap_token(); /* Contend for token _before_ read-in */
> @@ -2172,7 +2171,6 @@ static int do_swap_page(struct mm_struct
> page_table = pte_offset_map_lock(mm, pmd, address, &ptl);
> if (likely(pte_same(*page_table, orig_pte)))
> ret = VM_FAULT_OOM;
> - delayacct_clear_flag(DELAYACCT_PF_SWAPIN);
> goto unlock;
> }
>
> @@ -2181,9 +2179,10 @@ static int do_swap_page(struct mm_struct
> count_vm_event(PGMAJFAULT);
> }
>
> - delayacct_clear_flag(DELAYACCT_PF_SWAPIN);
> mark_page_accessed(page);
> + delayacct_set_flag(DELAYACCT_PF_SWAPIN);
> lock_page(page);
> + delayacct_clear_flag(DELAYACCT_PF_SWAPIN);
>
I agree that we should end it after lock_page().
> /*
> * Back out if somebody else already faulted in this pte.
>
>
> ----- End forwarded message -----
--
Warm Regards,
Balbir Singh
Linux Technology Center
IBM, ISTL
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next parent reply other threads:[~2007-09-04 6:31 UTC|newest]
Thread overview: 2+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <20070903201645.GA11502@wotan.suse.de>
2007-09-04 6:31 ` Balbir Singh [this message]
2007-09-04 17:37 ` Nick Piggin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=46DCFBB2.3060200@linux.vnet.ibm.com \
--to=balbir@linux.vnet.ibm.com \
--cc=balbir@in.ibm.com \
--cc=hugh@veritas.com \
--cc=linux-mm@kvack.org \
--cc=nagar1234@in.ibm.com \
--cc=npiggin@suse.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox