From: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>
To: Mel Gorman <mel@csn.ul.ie>
Cc: kosaki.motohiro@jp.fujitsu.com,
Larry Woodman <lwoodman@redhat.com>,
Andrew Morton <akpm@linux-foundation.org>,
riel@redhat.com, Ingo Molnar <mingo@elte.hu>,
Peter Zijlstra <peterz@infradead.org>,
LKML <linux-kernel@vger.kernel.org>,
linux-mm@kvack.org
Subject: Re: [PATCH 1/4] tracing, page-allocator: Add trace events for page allocation and page freeing
Date: Wed, 5 Aug 2009 18:13:09 +0900 (JST) [thread overview]
Message-ID: <20090805165302.5BC8.A69D9226@jp.fujitsu.com> (raw)
In-Reply-To: <1249409546-6343-2-git-send-email-mel@csn.ul.ie>
Hi
sorry for the delayed review.
> This patch adds trace events for the allocation and freeing of pages,
> including the freeing of pagevecs. Using the events, it will be known what
> struct page and pfns are being allocated and freed and what the call site
> was in many cases.
>
> The page alloc tracepoints be used as an indicator as to whether the workload
> was heavily dependant on the page allocator or not. You can make a guess based
> on vmstat but you can't get a per-process breakdown. Depending on the call
> path, the call_site for page allocation may be __get_free_pages() instead
> of a useful callsite. Instead of passing down a return address similar to
> slab debugging, the user should enable the stacktrace and seg-addr options
> to get a proper stack trace.
>
> The pagevec free tracepoint has a different usecase. It can be used to get
> a idea of how many pages are being dumped off the LRU and whether it is
> kswapd doing the work or a process doing direct reclaim.
>
> Signed-off-by: Mel Gorman <mel@csn.ul.ie>
> Acked-by: Rik van Riel <riel@redhat.com>
> ---
> include/trace/events/kmem.h | 86 +++++++++++++++++++++++++++++++++++++++++++
> mm/page_alloc.c | 6 ++-
> 2 files changed, 91 insertions(+), 1 deletions(-)
>
> diff --git a/include/trace/events/kmem.h b/include/trace/events/kmem.h
> index 1493c54..57bf13c 100644
> --- a/include/trace/events/kmem.h
> +++ b/include/trace/events/kmem.h
> @@ -225,6 +225,92 @@ TRACE_EVENT(kmem_cache_free,
>
> TP_printk("call_site=%lx ptr=%p", __entry->call_site, __entry->ptr)
> );
> +
> +TRACE_EVENT(mm_page_free_direct,
> +
> + TP_PROTO(unsigned long call_site, const void *page, unsigned int order),
> +
> + TP_ARGS(call_site, page, order),
> +
> + TP_STRUCT__entry(
> + __field( unsigned long, call_site )
> + __field( const void *, page )
Why void? Is there any benefit?
> + __field( unsigned int, order )
> + ),
> +
> + TP_fast_assign(
> + __entry->call_site = call_site;
> + __entry->page = page;
> + __entry->order = order;
> + ),
> +
> + TP_printk("call_site=%lx page=%p pfn=%lu order=%d",
> + __entry->call_site,
> + __entry->page,
> + page_to_pfn((struct page *)__entry->page),
> + __entry->order)
> +);
> +
> +TRACE_EVENT(mm_pagevec_free,
> +
> + TP_PROTO(unsigned long call_site, const void *page, int order, int cold),
> +
> + TP_ARGS(call_site, page, order, cold),
> +
> + TP_STRUCT__entry(
> + __field( unsigned long, call_site )
> + __field( const void *, page )
> + __field( int, order )
> + __field( int, cold )
> + ),
> +
> + TP_fast_assign(
> + __entry->call_site = call_site;
> + __entry->page = page;
> + __entry->order = order;
> + __entry->cold = cold;
> + ),
> +
> + TP_printk("call_site=%lx page=%p pfn=%lu order=%d cold=%d",
> + __entry->call_site,
> + __entry->page,
> + page_to_pfn((struct page *)__entry->page),
> + __entry->order,
> + __entry->cold)
> +);
> +
> +TRACE_EVENT(mm_page_alloc,
> +
> + TP_PROTO(unsigned long call_site, const void *page, unsigned int order,
> + gfp_t gfp_flags, int migratetype),
> +
> + TP_ARGS(call_site, page, order, gfp_flags, migratetype),
> +
> + TP_STRUCT__entry(
> + __field( unsigned long, call_site )
> + __field( const void *, page )
> + __field( unsigned int, order )
> + __field( gfp_t, gfp_flags )
> + __field( int, migratetype )
> + ),
> +
> + TP_fast_assign(
> + __entry->call_site = call_site;
> + __entry->page = page;
> + __entry->order = order;
> + __entry->gfp_flags = gfp_flags;
> + __entry->migratetype = migratetype;
> + ),
> +
> + TP_printk("call_site=%lx page=%p pfn=%lu order=%d migratetype=%d gfp_flags=%s",
> + __entry->call_site,
> + __entry->page,
> + page_to_pfn((struct page *)__entry->page),
> + __entry->order,
> + __entry->migratetype,
> + show_gfp_flags(__entry->gfp_flags))
> +);
> +
> #endif /* _TRACE_KMEM_H */
>
> /* This part must be outside protection */
> diff --git a/mm/page_alloc.c b/mm/page_alloc.c
> index d052abb..843bdec 100644
> --- a/mm/page_alloc.c
> +++ b/mm/page_alloc.c
> @@ -1905,6 +1905,7 @@ __alloc_pages_nodemask(gfp_t gfp_mask, unsigned int order,
> zonelist, high_zoneidx, nodemask,
> preferred_zone, migratetype);
>
> + trace_mm_page_alloc(_RET_IP_, page, order, gfp_mask, migratetype);
> return page;
> }
In almost case, __alloc_pages_nodemask() is called from alloc_pages_current().
Can you add call_site argument? (likes slab_alloc)
> EXPORT_SYMBOL(__alloc_pages_nodemask);
> @@ -1945,13 +1946,16 @@ void __pagevec_free(struct pagevec *pvec)
> {
> int i = pagevec_count(pvec);
>
> - while (--i >= 0)
> + while (--i >= 0) {
> + trace_mm_pagevec_free(_RET_IP_, pvec->pages[i], 0, pvec->cold);
> free_hot_cold_page(pvec->pages[i], pvec->cold);
> + }
> }
This _RET_IP_ assume pagevec_free() is inlined function. Then,
pagevec_free() sould also change always_inline?
Yeah, I agree this is theoretical issue. but it improve readability and
studying author's intention.
> void __free_pages(struct page *page, unsigned int order)
> {
> if (put_page_testzero(page)) {
> + trace_mm_page_free_direct(_RET_IP_, page, order);
> if (order == 0)
> free_hot_page(page);
> else
This patch covered free_pages() and __pagevec_free() case.
but it doesn't cover free_hot_page() direct call.
(Fortunately, there is no free_cold_page() caller)
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2009-08-05 9:13 UTC|newest]
Thread overview: 37+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-08-04 18:12 [PATCH 0/4] Add some trace events for the page allocator v3 Mel Gorman
2009-08-04 18:12 ` [PATCH 1/4] tracing, page-allocator: Add trace events for page allocation and page freeing Mel Gorman
2009-08-05 9:13 ` KOSAKI Motohiro [this message]
2009-08-05 9:40 ` Mel Gorman
2009-08-07 1:17 ` KOSAKI Motohiro
2009-08-07 17:31 ` Mel Gorman
2009-08-08 5:44 ` KOSAKI Motohiro
2009-08-04 18:12 ` [PATCH 2/4] tracing, mm: Add trace events for anti-fragmentation falling back to other migratetypes Mel Gorman
2009-08-05 9:26 ` KOSAKI Motohiro
2009-08-04 18:12 ` [PATCH 3/4] tracing, page-allocator: Add trace event for page traffic related to the buddy lists Mel Gorman
2009-08-05 9:24 ` KOSAKI Motohiro
2009-08-05 9:43 ` Mel Gorman
2009-08-07 1:03 ` KOSAKI Motohiro
2009-08-04 18:12 ` [PATCH 4/4] tracing, page-allocator: Add a postprocessing script for page-allocator-related ftrace events Mel Gorman
2009-08-04 18:22 ` Andrew Morton
2009-08-04 18:27 ` Rik van Riel
2009-08-04 19:13 ` Andrew Morton
2009-08-04 20:48 ` Mel Gorman
2009-08-05 7:41 ` Ingo Molnar
2009-08-05 9:07 ` Mel Gorman
2009-08-05 9:16 ` Ingo Molnar
2009-08-05 10:27 ` Johannes Weiner
2009-08-06 15:48 ` Mel Gorman
2009-08-05 14:53 ` Larry Woodman
2009-08-06 15:54 ` Mel Gorman
2009-08-04 19:57 ` Ingo Molnar
2009-08-04 20:18 ` Andrew Morton
2009-08-04 20:35 ` Ingo Molnar
2009-08-04 20:53 ` Andrew Morton
2009-08-05 7:53 ` Ingo Molnar
2009-08-05 13:04 ` Peter Zijlstra
2009-08-05 15:07 ` Valdis.Kletnieks
2009-08-05 14:53 ` Valdis.Kletnieks
2009-08-06 15:50 ` Mel Gorman
2009-08-05 3:07 ` KOSAKI Motohiro
-- strict thread matches above, loose matches on Subject: below --
2009-07-29 21:05 [RFC PATCH 0/4] Add some trace events for the page allocator v2 Mel Gorman
2009-07-29 21:05 ` [PATCH 1/4] tracing, page-allocator: Add trace events for page allocation and page freeing Mel Gorman
2009-07-30 0:55 ` Rik van Riel
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20090805165302.5BC8.A69D9226@jp.fujitsu.com \
--to=kosaki.motohiro@jp.fujitsu.com \
--cc=akpm@linux-foundation.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=lwoodman@redhat.com \
--cc=mel@csn.ul.ie \
--cc=mingo@elte.hu \
--cc=peterz@infradead.org \
--cc=riel@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox