From: Michal Hocko <mhocko@suse.com>
To: David Rientjes <rientjes@google.com>
Cc: lsf-pc@lists.linux-foundation.org, linux-mm@kvack.org,
Dan Williams <dan.j.williams@intel.com>,
John Hubbard <jhubbard@nvidia.com>, Zi Yan <ziy@nvidia.com>,
Bharata B Rao <bharata@amd.com>,
Dave Jiang <dave.jiang@intel.com>,
"Aneesh Kumar K.V" <aneesh.kumar@linux.ibm.com>,
"Huang, Ying" <ying.huang@intel.com>,
Alistair Popple <apopple@nvidia.com>,
Christoph Lameter <cl@gentwo.org>,
Andrew Morton <akpm@linux-foundation.org>,
Linus Torvalds <torvalds@linux-foundation.org>,
Dave Hansen <dave.hansen@linux.intel.com>,
Mel Gorman <mgorman@suse.de>, Jon Grimm <jon.grimm@amd.com>,
Gregory Price <gourry.memverge@gmail.com>,
Wei Xu <weixugc@google.com>, Johannes Weiner <hannes@cmpxchg.org>,
SeongJae Park <sj@kernel.org>,
David Hildenbrand <david@redhat.com>,
Davidlohr Bueso <dave@stgolabs.net>
Subject: Re: [LSF/MM/BPF TOPIC] Locally attached memory tiering
Date: Tue, 7 May 2024 13:52:11 +0200
Message-ID: <ZjoV63ududhQNAKd@tiehlicka>
In-Reply-To: <e90dc785-c4e6-47e4-8eda-d35325c82ff9@google.com>

On Mon 06-05-24 20:37:19, David Rientjes wrote:
> Hi all,
>
> I think it would be very worthwhile to have a block set aside for
> discussion on locally attached memory tiering extensions at LSF/MM/BPF
> 2024.
>
> Primarily interested in discussing Linux enlightenment for CXL 1.1 and
> later type-3 memory expansion devices (CXL.mem). I think we could touch
> on CXL 2.0 and later memory pooling architectures if we have time and
> there is interest, but the primary focus here would be local attached.
>
> Based on the premise for a Memory Tiering Working Group[1], there is
> widespread interest in the foundational topics for generally useful Linux
> enlightenment:
>
> - Decoupling CPU balancing from memory balancing (or obsoleting CPU
> balancing entirely)
>
> + John Hubbard notes this would be useful for GPUs:
>
> a) GPUs have their own processors that are invisible to the kernel's
> NUMA "which tasks are active on which NUMA nodes" calculations,
> and
>
> b) Similar to where CXL is generally going, we have already built
> fully memory-coherent hardware, which include memory-only NUMA
> nodes.
>
> - In-kernel hot memory abstraction, informed by hardware hinting drivers
> (incl some architectures like Power10), usable as a NUMA Balancing
> backend for promotion and other areas of the kernel like transparent
> hugepage utilization
>
> - NUMA and memory tiering enlightenment for accelerators, such as for
> optimal use of GPU memory, extremely important for a cloud provider
> (hint hint :)
>
> - Asynchronous memory promotion independent of task_numa_fault() while
> considering the cost of page migration (due to identifying cold memory)
>
> - What the role of userspace plays in this decision-making and how we can
> extend the default policy and mechanisms in the kernel to allow for it
> if necessary
>
> Additional topics that you find interesting are also very helpful!
>
> I'm biased toward a generally useful solution that would leverage the
> kernel as the ultimate source of truth for page hotness that can be
> extended for multiple use cases, one of which is memory tiering support.
> But certainly if there are other approaches, we can discuss that as well.
>
> A few main goals from this discussion:
>
> - Ensure that proposals address, or can be extended to address, the
> emerging needs of the various use cases that users may have
>
> - Surface any constraints that stakeholders may find to be prohibitive
> for support in the core MM subsystem
>
> - Alignment and division of work for developers who are actively looking
> to contribute to this area

Do you think having 2 contiguous slots would be sufficient for these
topics?

> As I'm just one of many stakeholders for this discussion, I'd nominate
> Michal Hocko to moderate it if he's willing to do so.

Sure, I can help out with that.
--
Michal Hocko
SUSE Labs