linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
* [PATCH] Remove unnecessary smp_wmb from clear_user_highpage()
@ 2007-07-18 15:05 Mel Gorman
  2007-07-18 16:45 ` Hugh Dickins
  2007-07-19  1:57 ` Nick Piggin
  0 siblings, 2 replies; 12+ messages in thread
From: Mel Gorman @ 2007-07-18 15:05 UTC (permalink / raw)
  To: npiggin, hugh; +Cc: linux-mm

Hi,

At the nudging of Andrew, I was checking to see if the architecture-specific
implementations of alloc_zeroed_user_highpage() can be removed or not.
With the exception of barriers, the differences are negligible and the main
memory barrier is in clear_user_highpage(). However, it's unclear why it's
needed. Do you mind looking at the following patch and telling me if it's
wrong and if so, why?

Thanks a lot.

===

    This patch removes an unnecessary write barrier from clear_user_highpage().
    
    clear_user_highpage() is called from alloc_zeroed_user_highpage() on a
    number of architectures and from clear_huge_page(). However, these callers
    are already protected by the necessary memory barriers due to spinlocks
    in the fault path and the page should not be visible on other CPUs anyway
    making the barrier unnecessary. A hint of lack of necessity is that there
    does not appear to be a read barrier anywhere for this zeroed page.
    
    The sequence for the first use of alloc_zeroed_user_highpage()
    looks like;
    
    pte_unmap_unlock()
    alloc_zeroed_user_highpage()
    pte_offset_map_lock()
    
    The second is
    
    pte_unmap()	(usually nothing but sometimes a barrier()
    alloc_zeroed_user_highpage()
    pte_offset_map_lock()
    
    The two sequences with the use of locking should already have sufficient
    barriers.
    
    By removing this write barrier, IA64 could use the default implementation
    of alloc_zeroed_user_highpage() instead of a custom version which appears
    to do nothing but avoid calling smp_wmb(). Once that is done, there is
    little reason to have architecture-specific alloc_zeroed_user_highpage()
    helpers as it would be redundant.

diff --git a/include/linux/highmem.h b/include/linux/highmem.h
index 12c5e4e..ace5a32 100644
--- a/include/linux/highmem.h
+++ b/include/linux/highmem.h
@@ -68,8 +68,6 @@ static inline void clear_user_highpage(struct page *page, unsigned long vaddr)
 	void *addr = kmap_atomic(page, KM_USER0);
 	clear_user_page(addr, vaddr, page);
 	kunmap_atomic(addr, KM_USER0);
-	/* Make sure this page is cleared on other CPU's too before using it */
-	smp_wmb();
 }
 
 #ifndef __HAVE_ARCH_ALLOC_ZEROED_USER_HIGHPAGE
-- 
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 12+ messages in thread
* Re: [PATCH] Remove unnecessary smp_wmb from clear_user_highpage()
@ 2007-07-20 21:06 Oleg Nesterov
  2007-07-20 21:57 ` Linus Torvalds
  0 siblings, 1 reply; 12+ messages in thread
From: Oleg Nesterov @ 2007-07-20 21:06 UTC (permalink / raw)
  To: Linus Torvalds; +Cc: Hugh Dickins, Mel Gorman, Nick Piggin, linux-mm

(Off-topic)

Linus Torvalds wrote:
>
> A full lock/unlock *pair* should (as far as I know) always be equivalent 
> to a full memory barrier.

Is it so? I am not arguing, I am trying to understand.

> Because, by definition, no reads or writes 
> inside the locked region may escape outside it, and that in turn implies 
> that no access _outside_ the locked region may escape to the other side of 
> it.

This means that unlock + lock is a full barrier,

> However, neither a "lock" nor an "unlock" on *its*own* is a barrier at 
> all, at most they are semi-permeable barriers for some things, where 
> different architectures can be differently semi-permeable.

and this means that lock + unlock is not.

	A;
	lock();
	unlock();
	B;

If both A and B can leak into the critical section, they could be reordered
inside this section, so we can have

	lock();
	B;
	A;
	unlock();

Yes?

Oleg.

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 12+ messages in thread

end of thread, other threads:[~2007-07-23  2:02 UTC | newest]

Thread overview: 12+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2007-07-18 15:05 [PATCH] Remove unnecessary smp_wmb from clear_user_highpage() Mel Gorman
2007-07-18 16:45 ` Hugh Dickins
2007-07-19  2:17   ` Nick Piggin
2007-07-20 13:08     ` Mel Gorman
2007-07-23  2:02       ` Nick Piggin
2007-07-19  2:28   ` Linus Torvalds
2007-07-19  2:58     ` Nick Piggin
2007-07-19  2:36   ` Nick Piggin
2007-07-19 11:16   ` Mel Gorman
2007-07-19  1:57 ` Nick Piggin
2007-07-20 21:06 Oleg Nesterov
2007-07-20 21:57 ` Linus Torvalds

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox