From: Adam Manzanares <Adam.Manzanares@wdc.com>
To: "lsf-pc@lists.linux-foundation.org" <lsf-pc@lists.linux-foundation.org>
Cc: "linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
	"willy@infradead.org" <willy@infradead.org>,
	"linux-mm@kvack.org" <linux-mm@kvack.org>,
	"yang.shi@linux.alibaba.com" <yang.shi@linux.alibaba.com>,
	"dan.j.williams@intel.com" <dan.j.williams@intel.com>,
	"cl@linux.com" <cl@linux.com>,
	"jglisse@redhat.com" <jglisse@redhat.com>,
	"mhocko@suse.com" <mhocko@suse.com>,
	"dave.hansen@intel.com" <dave.hansen@intel.com>,
	"jack@suse.cz" <jack@suse.cz>
Subject: [LSF/MM TOPIC] Page Cache Flexibility for NVM
Date: Thu, 21 Feb 2019 23:11:51 +0000
Message-ID: <85ddda85755bf15ed2e56bce21f711ae8154d304.camel@wdc.com>

Hello,

I would like to attend the LSF/MM Summit 2019. I'm interested in
several of the MM topics mentioned below, as well as Zoned Block
Devices and any I/O determinism topics that come up in the storage
track.

I have been working on a caching layer, hmmap (heterogeneous memory
map) [1], for emerging NVM; it is close in spirit to the page cache.
The key difference is that both the backend device and the caching
layer of hmmap are pluggable. In addition, hmmap supports DAX and
write protection, which I believe are key features for emerging NVMs
that may have read/write asymmetry as well as write endurance
constraints. Lastly, we can leverage hardware, such as a DMA engine,
when moving pages between the cache and the backing device, while
also allowing direct access if the device is capable.

I am proposing that, as an alternative to exposing NVM as a NUMA
node, we expose it through the page cache (or a viable alternative),
and have userspace applications mmap the NVM and hand out memory with
their favorite userspace memory allocator.
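
As a rough illustration of that usage model, a userspace application
might do something like the following. The device path and sizes here
are made up for the example:

#include <fcntl.h>
#include <stdio.h>
#include <string.h>
#include <sys/mman.h>
#include <unistd.h>

int main(void)
{
	size_t len = 1UL << 30;			/* map 1 GiB of the device */
	int fd = open("/dev/hmmap0", O_RDWR);	/* hypothetical device node */

	if (fd < 0) {
		perror("open");
		return 1;
	}

	void *nvm = mmap(NULL, len, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
	if (nvm == MAP_FAILED) {
		perror("mmap");
		close(fd);
		return 1;
	}

	/*
	 * A real application would hand [nvm, nvm + len) to its favorite
	 * userspace allocator; here we just touch the first page.
	 */
	memset(nvm, 0, 4096);

	munmap(nvm, len);
	close(fd);
	return 0;
}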

This would isolate the NVM to applications that are well aware of the
performance implications of accessing it. I believe all of this could
also be accomplished with the NUMA node approach, but the two
approaches seem to be blurring together.

The main points I would like to discuss are:

* Is the page cache model a viable alternative to NVM as a NUMA node?
* Can we add more flexibility to the page cache?
* Should we force separation of NVM through an explicit mmap?

I believe this discussion could be merged with "NUMA, memory
hierarchy and device memory", "Use NVDIMM as NUMA node and NUMA
API", or "memory reclaim with NUMA balancing".

Here are some performance numbers for hmmap (still in development):

All numbers were collected on a 4 GiB hmmap device with a 128 MiB
cache. For the mmap tests I used cgroups to limit page cache usage to
128 MiB. All results are the average of 10 runs. W and R access the
entire device, with each thread confined to its own region of the
address space. RR reads the device randomly 8 bytes at a time and is
limited to 8 MiB of data accessed.
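
For reference, the RR pattern is roughly the following (a
single-threaded sketch of the access pattern, not the actual
benchmark source):

#include <stdint.h>
#include <stdlib.h>

/*
 * Sketch of RR: 8-byte reads at random offsets within the mapped
 * device, stopping once 8 MiB of data has been read.
 */
static uint64_t random_read(volatile uint64_t *map, size_t dev_size)
{
	size_t nwords = dev_size / sizeof(uint64_t);
	size_t nreads = (8UL << 20) / sizeof(uint64_t);	/* 8 MiB total */
	uint64_t sum = 0;

	for (size_t i = 0; i < nreads; i++)
		sum += map[(size_t)rand() % nwords];

	return sum;	/* keep the compiler from eliding the reads */
}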

hmmap brd vs. mmap of brd

                  hmmap                   mmap

Threads      W      R      RR       W      R      RR

1         7.21   5.39   5.04     6.80   5.63   5.23
2         5.19   3.87   3.74     4.66   3.33   3.20
4         3.65   2.95   3.07     3.53   2.26   2.18
8         4.52   3.43   3.59     4.30   1.98   1.88
16        5.00   3.85   3.98     4.92   2.00   1.99



Memory Backend Test (DAX capable)

                  hmmap                hmmap-dax          hmmap-wrprotect

Threads      W      R      RR       W      R      RR       W      R      RR

1         6.29   4.94   4.37     2.54   1.36   0.16     7.12   2.13   0.73
2         4.62   3.63   3.57     1.41   0.69   0.08     5.06   1.14   0.41
4         3.45   2.97   3.11     0.77   0.36   0.04     3.66   0.63   0.25
8         4.10   3.53   3.71     0.44   0.19   0.02     4.03   0.35   0.17
16        4.60   3.98   4.04     0.34   0.16   0.02     4.52   0.27   0.14


Thanks,
Adam




