linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
* mbind MPOL_INTERLEAVE existing pages
@ 2023-05-01 18:58 Mike Kravetz
  2023-05-02  7:45 ` Vlastimil Babka
  0 siblings, 1 reply; 4+ messages in thread
From: Mike Kravetz @ 2023-05-01 18:58 UTC (permalink / raw)
  To: linux-mm, linux-kernel; +Cc: Michal Hocko, Vlastimil Babka, Lorenzo Stoakes

I received a question from a customer that was trying to move pages via
the mbind system call.  In this specific case, the system had two nodes
and all pages in the range were already present on node 0.  They then
called mbind with mode MPOL_INTERLEAVE and the MPOL_MF_MOVE_ALL flag.  Their
expectation was that half the pages in the range would be moved to node 1
in an interleaved pattern.

In the above situation, no pages actually get moved.  This is because mbind
creates a list of pages to be moved via:

	ret = queue_pages_range(mm, start, end, nmask,
                          flags | MPOL_MF_INVERT, &pagelist);

No page will be added to the list as queue_folio_required is called for each
page to determine if it resides within the set of nodes.  And, all page are
within the set.

I have reread the mbind man page several times and agree that one might
expect MPOL_INTERLEAVE with MPOL_MF_MOVE_ALL to move pages and create an
interleaved pattern.  My question is should we:
- Change mbind so that pages are moved to an interleaved pattern?
- Update the documentation to be more explicit?

I can do either, but just wanted to get opinions before starting.
-- 
Mike Kravetz


^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2023-05-02 16:34 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2023-05-01 18:58 mbind MPOL_INTERLEAVE existing pages Mike Kravetz
2023-05-02  7:45 ` Vlastimil Babka
2023-05-02 13:12   ` Michal Hocko
2023-05-02 16:34     ` Mike Kravetz

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox