From: Christoph Lameter <clameter@sgi.com>
To: Andrew Morton <akpm@osdl.org>
Cc: Nick Piggin <nickpiggin@yahoo.com.au>,
KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>,
linux-mm@kvack.org
Subject: Re: Page allocator: Single Zone optimizations
Date: Fri, 27 Oct 2006 18:00:42 -0700 (PDT) [thread overview]
Message-ID: <Pine.LNX.4.64.0610271225320.9346@schroedinger.engr.sgi.com> (raw)
In-Reply-To: <20061026150938.bdf9d812.akpm@osdl.org>
On Thu, 26 Oct 2006, Andrew Morton wrote:
> I really really want to drop all those patches[1] and rethink it all.
I think it would be good to do drop it. That may allow a consolidation of
the patches after the experience we have had so far with it but there is
also the danger that I may have to drop it completely for now since the
project is getting to be too much of an effort. Having to repeat the same
arguments is not that productive.
Note that the zone_table work is independent from the ZONE_DMA work.
So please keep the get-rid-of-zone_table.patch. Just drop the rest.
get-rid-of-zone_table-fix-3.patch is not really a fix for the zone_table
patch but addresses an issue created by the optional zone_dma patch.
> Like... would it make sense to eliminate the hard-coded concepts of DMA,
> DMA32, NORMAL and HIGHMEM and simply say "we support 1 to N zones" per
> node? Obviously we'd need to keep the DMA/NORMAL/HIGHMEM nomenclature in
> the interfaces so the rest of the kernel builds and works, but the core mm
> just shouldn't need to care: all it cares about is one or more zones.
Ok. Recap of some of the earlier discussion:
- DMA has no clearly defined boundaries. They vary according to arch
and many arches / platforms depend on particular ZONE_DMA semantics.
- DMA32 is only supported for x86_64 and has a particular role
to play there.
- Highmem requires special treatment with kmap that is different from
all others.
In order to have N zones (I think you would want that to cover
different restricted DMA areas?) one would need to have some sort of
standard definition and purposes for those N zones. They would need to be
able to be treated in the same way. For the ZONE_DMAxx zones you may be
able to get there. HIGHMEM is definitely much different.
Then you would probably want to support the dma_mask supported by the SCSI
subsystem and dma_alloc_coherent functions to allow arbitrary bitmasks. In
order to support that with zones we would need a large quantity of those
or a way to dynamically create zones. I am pretty sure this will not
simplify mm. There is a potential here for increasing fragmentation and
getting into complicated load balancing situations between the zones.
A number of architectures and platforms (I think we are up to 8
to 10 or so?) do not need what ZONE_DMA provides and can avoid having to
deal with this mess right now if we allow an opt out as provided by my
current patches in mm. No additional measures would be needed.
For those platforms that still need the abiltity to allocate from a
subse of memory it would be possible to provide a page allocator
function where one can specify an allowed memory range. That would
avoid the need for various DMA style zones.
But I cannot find any justification in my contexts to complete work on
this functionality because plainly all the hardware that I use does not
have problem laden DMA controllers and works just fine with a single
zone. This includes x86_64, i386 and ia64 boxes that I test my patches on.
I would have to find time to research this and test with such a device.
So far I have not found a way to justify taking time for that beyond the
initial RFC that I posted a while back.
> Or something like that. Something which makes the mm easier to understand,
> easier to maintain and faster. Rather than harder to understand, harder to
> maintain and faster.
The simplest approach is to allow configurations with a single zone. That
makes mm easier to understand, faster and maintainable. For that purpose
functionality provided by specialized zones like ZONE_HIGHMEM, ZONE_DMA32
and ZONE_DMA needs to be isolated and made configurable. I have done that
for HIGHEMEM and DMA32 and the code is in 2.6.19.
The point of the patches is to do the same thing for ZONE_DMA.
There are many other subsystems that add special DMA overhead like in the
slab allocators etc. On platforms that do not need ZONE_DMA we
currently just see empty counters, create dead slabs, have dead code etc.
This seems where I ran into trouble since it seems that you think it gets
too complicated to have the ability to compile a kernel without the
useless and problematic GFP_DMA. ZONE_DMA material.
I think just the opposite is happening. The patches separate out ZONE_DMA
functionality that is badly defined, not standardized, rarely used and has
caused lots of weird code in the kernel to be written. Ever seen the code
in some arches alloc_dma_coherent where they allocate a page and then
check if its in a certain range? If not more creative artistry follows.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2006-10-28 1:00 UTC|newest]
Thread overview: 83+ messages / expand[flat|nested] mbox.gz Atom feed top
2006-10-17 0:50 Christoph Lameter
2006-10-17 1:10 ` Andrew Morton
2006-10-17 1:13 ` Christoph Lameter
2006-10-17 1:27 ` KAMEZAWA Hiroyuki
2006-10-17 1:25 ` Christoph Lameter
2006-10-17 6:04 ` Nick Piggin
2006-10-17 17:54 ` Christoph Lameter
2006-10-18 11:15 ` Nick Piggin
2006-10-18 19:38 ` Andrew Morton
2006-10-23 23:08 ` Christoph Lameter
2006-10-24 1:07 ` Christoph Lameter
2006-10-26 22:09 ` Andrew Morton
2006-10-26 22:28 ` Christoph Lameter
2006-10-28 1:00 ` Christoph Lameter [this message]
2006-10-28 2:04 ` Andrew Morton
2006-10-28 2:12 ` Christoph Lameter
2006-10-28 2:24 ` Andrew Morton
2006-10-28 2:31 ` Christoph Lameter
2006-10-28 4:43 ` Andrew Morton
2006-10-28 7:47 ` KAMEZAWA Hiroyuki
2006-10-28 16:12 ` Andi Kleen
2006-10-29 0:48 ` Christoph Lameter
2006-10-29 1:04 ` Andrew Morton
2006-10-29 1:29 ` Christoph Lameter
2006-10-29 11:32 ` Nick Piggin
2006-10-30 16:41 ` Christoph Lameter
2006-11-01 18:26 ` Mel Gorman
2006-11-01 20:34 ` Andrew Morton
2006-11-01 21:00 ` Christoph Lameter
2006-11-01 21:46 ` Andrew Morton
2006-11-01 21:50 ` Christoph Lameter
2006-11-01 22:13 ` Mel Gorman
2006-11-01 23:29 ` Christoph Lameter
2006-11-02 0:22 ` Andrew Morton
2006-11-02 0:27 ` Christoph Lameter
2006-11-02 12:45 ` Mel Gorman
2006-11-01 22:10 ` Mel Gorman
2006-11-02 17:37 ` Andy Whitcroft
2006-11-02 18:08 ` Christoph Lameter
2006-11-02 20:58 ` Mel Gorman
2006-11-02 21:04 ` Christoph Lameter
2006-11-02 21:16 ` Mel Gorman
2006-11-02 21:52 ` Christoph Lameter
2006-11-02 22:37 ` Mel Gorman
2006-11-02 22:50 ` Christoph Lameter
2006-11-03 9:14 ` Mel Gorman
2006-11-03 13:17 ` Andy Whitcroft
2006-11-03 18:11 ` Christoph Lameter
2006-11-03 19:06 ` Mel Gorman
2006-11-03 19:44 ` Christoph Lameter
2006-11-03 21:11 ` Mel Gorman
2006-11-03 21:42 ` Christoph Lameter
2006-11-03 21:50 ` Andrew Morton
2006-11-03 21:53 ` Christoph Lameter
2006-11-03 22:12 ` Andrew Morton
2006-11-03 22:15 ` Christoph Lameter
2006-11-03 22:19 ` Andi Kleen
2006-11-04 0:37 ` Christoph Lameter
2006-11-04 1:32 ` Andi Kleen
2006-11-06 16:40 ` Christoph Lameter
2006-11-06 16:56 ` Andi Kleen
2006-11-06 17:00 ` Christoph Lameter
2006-11-06 17:07 ` Andi Kleen
2006-11-06 17:12 ` Hugh Dickins
2006-11-06 17:15 ` Christoph Lameter
2006-11-06 17:20 ` Andi Kleen
2006-11-06 17:26 ` Christoph Lameter
2006-11-07 16:30 ` Mel Gorman
2006-11-07 17:54 ` Christoph Lameter
2006-11-07 18:14 ` Mel Gorman
2006-11-08 0:29 ` KAMEZAWA Hiroyuki
2006-11-08 2:08 ` Christoph Lameter
2006-11-13 21:08 ` Mel Gorman
2006-11-03 12:48 ` Peter Zijlstra
2006-11-03 18:15 ` Christoph Lameter
2006-11-03 18:53 ` Peter Zijlstra
2006-11-03 19:23 ` Christoph Lameter
2006-11-02 18:52 ` Andrew Morton
2006-11-02 21:51 ` Mel Gorman
2006-11-02 22:03 ` Andy Whitcroft
2006-11-02 22:11 ` Andrew Morton
2006-11-01 18:13 ` Mel Gorman
2006-11-01 17:39 ` Mel Gorman
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=Pine.LNX.4.64.0610271225320.9346@schroedinger.engr.sgi.com \
--to=clameter@sgi.com \
--cc=akpm@osdl.org \
--cc=kamezawa.hiroyu@jp.fujitsu.com \
--cc=linux-mm@kvack.org \
--cc=nickpiggin@yahoo.com.au \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox