From: Hugh Dickins <hugh@veritas.com>
To: Jes Sorensen <jes@sgi.com>
Cc: linux-mm@kvack.org, Linus Torvalds <torvalds@osdl.org>,
Carsten Otte <cotte@de.ibm.com>,
roe@sgi.com, Robin Holt <holt@sgi.com>,
Jack Steiner <steiner@sgi.com>
Subject: Re: VM_PFNMAP and do_no_pfn handler
Date: Mon, 20 Feb 2006 15:39:38 +0000 (GMT) [thread overview]
Message-ID: <Pine.LNX.4.61.0602201526260.12160@goblin.wat.veritas.com> (raw)
In-Reply-To: <yq0y806qfgd.fsf@jaguar.mkp.net>
On Mon, 20 Feb 2006, Jes Sorensen wrote:
>
> I am looking at implementing a do_no_pfn handler similar to
> do_no_page, but for pages which are not backed by a struct page. I'd
> like to use it for the mspec driver which maps uncached pages to
> userland. The reason we need the do_no_pfn handler is to get the first
> touch locality of the mapping on NUMA systems.
>
> I have it all working, however I have a question about the VM_PFNMAP
> flag. Right now mm/memory.c claims the following above
> vm_normal_page():
>
> * NOTE! Some mappings do not have "struct pages". A raw PFN mapping
> * will have each page table entry just pointing to a raw page frame
> * number, and as far as the VM layer is concerned, those do not have
> * pages associated with them - even if the PFN might point to memory
> * that otherwise is perfectly fine and has a "struct page".
> *
> * The way we recognize those mappings is through the rules set up
> * by "remap_pfn_range()": the vma will have the VM_PFNMAP bit set,
> * and the vm_pgoff will point to the first PFN mapped: thus every
> * page that is a raw mapping will always honor the rule
> *
> * pfn_of_page == vma->vm_pgoff + ((addr - vma->vm_start) >> PAGE_SHIFT)
>
> vm_normal_page() then does this:
>
> if (vma->vm_flags & VM_PFNMAP) {
> unsigned long off = (addr - vma->vm_start) >> PAGE_SHIFT;
> if (pfn == vma->vm_pgoff + off)
> return NULL;
> if (!is_cow_mapping(vma->vm_flags))
> return NULL;
> }
>
> Everywhere else it is stated that the VM_PFNMAP flag is only set for
> pages without a struct page backing it. In other words, are there any
> cases where the above requirement is really needed? Wouldn't it be
> sufficient to simply return NULL in vm_normal_page() if VM_PFNMAP is
> set?
>
> The problem I have is that it the uncached pages in the mspec driver
> aren't physically contiguous and the above rule doesn't match for
> us. Right now we are safe since the mspec driver doesn't allow cow
> mappings, but I fear that something could change in vm_normal_page()
> that would make the behavior change underneath us. Alternatively one
> could add yet another flag for this, but it seems somewhat overkill
> for something which is so similar in behavior?
>
> Any suggestions? (or rather, what obvious thing did I miss? ;-)
I believe you'll be safe for as long as your driver prohibits COW
mappings. You're not the only one to have VM_PFNMAP areas which
don't follow Linus' vm_pgoff rule: which is why he added the
!is_cow_mapping letout late in 2.6.15-rc. We cannot change that
lightly.
I think you're worrying too much, unless you anticipate wanting to
extend to COW mappings later. That would indeed need vm_normal_page
to be changed (and I know what change to make, but Linus hated it!).
Hugh
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2006-02-20 15:39 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2006-02-20 14:20 Jes Sorensen
2006-02-20 15:39 ` Hugh Dickins [this message]
2006-02-20 15:55 ` Jes Sorensen
2006-02-20 16:30 ` Hugh Dickins
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=Pine.LNX.4.61.0602201526260.12160@goblin.wat.veritas.com \
--to=hugh@veritas.com \
--cc=cotte@de.ibm.com \
--cc=holt@sgi.com \
--cc=jes@sgi.com \
--cc=linux-mm@kvack.org \
--cc=roe@sgi.com \
--cc=steiner@sgi.com \
--cc=torvalds@osdl.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox