* [patch 1/3] mm: NUMA slab -- add alien cache drain statistics
@ 2005-11-29 8:50 Ravikiran G Thirumalai
2005-11-29 8:53 ` [patch 2/3] mm: NUMA slab -- node local memory for off slab slab descriptors Ravikiran G Thirumalai
` (3 more replies)
0 siblings, 4 replies; 10+ messages in thread
From: Ravikiran G Thirumalai @ 2005-11-29 8:50 UTC (permalink / raw)
To: Andrew Morton; +Cc: linux-mm, manfred, clameter, Alok Kataria
NUMA slab allocator frees remote objects to a local alien cache.
But if the local alien cache is full, the alien cache
is drained directly to the remote node.
This patch adds a statistics counter which is incremented everytime the
local alien cache is full and we have to drain it to the remote nodes list3.
This will be useful when we can dynamically tune the alien cache limit.
Currently, the alien cache limit is fixed at 12.
Signed-off-by: Alok N Kataria <alokk@calsoftinc.com>
Signed-off-by: Ravikiran Thirumalai <kiran@scalex86.org>
Signed-off-by: Shai Fultheim <shai@scalex86.org>
Index: linux-2.6.15-rc1/mm/slab.c
===================================================================
--- linux-2.6.15-rc1.orig/mm/slab.c 2005-11-17 15:37:26.000000000 -0800
+++ linux-2.6.15-rc1/mm/slab.c 2005-11-17 21:32:37.000000000 -0800
@@ -416,9 +416,11 @@
unsigned long max_freeable;
unsigned long node_allocs;
unsigned long node_frees;
+ unsigned long node_overflow;
atomic_t allochit;
atomic_t allocmiss;
atomic_t freehit;
+
atomic_t freemiss;
#endif
#if DEBUG
@@ -452,6 +454,7 @@
#define STATS_INC_ERR(x) ((x)->errors++)
#define STATS_INC_NODEALLOCS(x) ((x)->node_allocs++)
#define STATS_INC_NODEFREES(x) ((x)->node_frees++)
+#define STATS_INC_OVERFLOW(x) ((x)->node_overflow++)
#define STATS_SET_FREEABLE(x, i) \
do { if ((x)->max_freeable < i) \
(x)->max_freeable = i; \
@@ -471,6 +474,7 @@
#define STATS_INC_ERR(x) do { } while (0)
#define STATS_INC_NODEALLOCS(x) do { } while (0)
#define STATS_INC_NODEFREES(x) do { } while (0)
+#define STATS_INC_OVERFLOW(x) do { } while (0)
#define STATS_SET_FREEABLE(x, i) \
do { } while (0)
@@ -2765,9 +2769,11 @@
if (l3->alien && l3->alien[nodeid]) {
alien = l3->alien[nodeid];
spin_lock(&alien->lock);
- if (unlikely(alien->avail == alien->limit))
+ if (unlikely(alien->avail == alien->limit)) {
+ STATS_INC_OVERFLOW(cachep);
__drain_alien_cache(cachep,
alien, nodeid);
+ }
alien->entry[alien->avail++] = objp;
spin_unlock(&alien->lock);
} else {
@@ -3386,7 +3392,7 @@
seq_puts(m, " : slabdata <active_slabs> <num_slabs> <sharedavail>");
#if STATS
seq_puts(m, " : globalstat <listallocs> <maxobjs> <grown> <reaped>"
- " <error> <maxfreeable> <nodeallocs> <remotefrees>");
+ " <error> <maxfreeable> <nodeallocs> <remotefrees> <overflow>");
seq_puts(m, " : cpustat <allochit> <allocmiss> <freehit> <freemiss>");
#endif
seq_putc(m, '\n');
@@ -3492,11 +3498,13 @@
unsigned long max_freeable = cachep->max_freeable;
unsigned long node_allocs = cachep->node_allocs;
unsigned long node_frees = cachep->node_frees;
+ unsigned long overflows = cachep->node_overflow;
seq_printf(m, " : globalstat %7lu %6lu %5lu %4lu \
- %4lu %4lu %4lu %4lu",
+ %4lu %4lu %4lu %4lu %4lu",
allocs, high, grown, reaped, errors,
- max_freeable, node_allocs, node_frees);
+ max_freeable, node_allocs, node_frees,
+ overflows);
}
/* cpu stats */
{
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [patch 2/3] mm: NUMA slab -- node local memory for off slab slab descriptors
2005-11-29 8:50 [patch 1/3] mm: NUMA slab -- add alien cache drain statistics Ravikiran G Thirumalai
@ 2005-11-29 8:53 ` Ravikiran G Thirumalai
2005-11-29 17:48 ` Christoph Lameter
2005-11-29 8:54 ` [patch 3/3] mm: NUMA slab -- minor optimizations Ravikiran G Thirumalai
` (2 subsequent siblings)
3 siblings, 1 reply; 10+ messages in thread
From: Ravikiran G Thirumalai @ 2005-11-29 8:53 UTC (permalink / raw)
To: Andrew Morton; +Cc: linux-mm, manfred, clameter, Alok Kataria
Off slab slab management is currently not allocated from node local
memory. This patch fixes that.
Signed-off-by: Alok N Kataria <alokk@calsoftinc.com>
Signed-off-by: Ravikiran Thirumalai <kiran@scalex86.org>
Signed-off-by: Shai Fultheim <shai@scalex86.org>
Index: linux-2.6.15-rc1/mm/slab.c
===================================================================
--- linux-2.6.15-rc1.orig/mm/slab.c 2005-11-17 21:32:37.000000000 -0800
+++ linux-2.6.15-rc1/mm/slab.c 2005-11-17 21:32:43.000000000 -0800
@@ -2062,13 +2062,13 @@
/* Get the memory for a slab management obj. */
static struct slab* alloc_slabmgmt(kmem_cache_t *cachep, void *objp,
- int colour_off, gfp_t local_flags)
+ int colour_off, gfp_t local_flags, int nodeid)
{
struct slab *slabp;
if (OFF_SLAB(cachep)) {
/* Slab management obj is off-slab. */
- slabp = kmem_cache_alloc(cachep->slabp_cache, local_flags);
+ slabp = kmem_cache_alloc_node(cachep->slabp_cache, local_flags, nodeid);
if (!slabp)
return NULL;
} else {
@@ -2078,6 +2078,7 @@
slabp->inuse = 0;
slabp->colouroff = colour_off;
slabp->s_mem = objp+colour_off;
+ slabp->nodeid = nodeid;
return slabp;
}
@@ -2221,10 +2222,9 @@
goto failed;
/* Get slab management. */
- if (!(slabp = alloc_slabmgmt(cachep, objp, offset, local_flags)))
+ if (!(slabp = alloc_slabmgmt(cachep, objp, offset, local_flags, nodeid)))
goto opps1;
- slabp->nodeid = nodeid;
set_slab_attr(cachep, slabp, objp);
cache_init_objs(cachep, slabp, ctor_flags);
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [patch 3/3] mm: NUMA slab -- minor optimizations
2005-11-29 8:50 [patch 1/3] mm: NUMA slab -- add alien cache drain statistics Ravikiran G Thirumalai
2005-11-29 8:53 ` [patch 2/3] mm: NUMA slab -- node local memory for off slab slab descriptors Ravikiran G Thirumalai
@ 2005-11-29 8:54 ` Ravikiran G Thirumalai
2005-11-29 17:53 ` Christoph Lameter
2005-12-26 23:42 ` Manfred Spraul
2005-11-29 9:25 ` [patch 1/3] mm: NUMA slab -- add alien cache drain statistics Andrew Morton
2005-11-29 17:57 ` Christoph Lameter
3 siblings, 2 replies; 10+ messages in thread
From: Ravikiran G Thirumalai @ 2005-11-29 8:54 UTC (permalink / raw)
To: Andrew Morton; +Cc: linux-mm, manfred, clameter, Alok Kataria
Patch adds some minor optimizations:
1. Keeps on chip interrupts enabled for a bit longer while draining cpu
caches
2. Calls numa_node_id once in cache_reap
Signed-off-by: Alok N Kataria <alokk@calsoftinc.com>
Signed-off-by: Ravikiran Thirumalai <kiran@scalex86.org>
Signed-off-by: Shai Fultheim <shai@scalex86.org>
Index: linux-2.6.15-rc1/mm/slab.c
===================================================================
--- linux-2.6.15-rc1.orig/mm/slab.c 2005-11-17 21:32:43.000000000 -0800
+++ linux-2.6.15-rc1/mm/slab.c 2005-11-17 21:32:50.000000000 -0800
@@ -1914,18 +1914,18 @@
smp_call_function_all_cpus(do_drain, cachep);
check_irq_on();
- spin_lock_irq(&cachep->spinlock);
+ spin_lock(&cachep->spinlock);
for_each_online_node(node) {
l3 = cachep->nodelists[node];
if (l3) {
- spin_lock(&l3->list_lock);
+ spin_lock_irq(&l3->list_lock);
drain_array_locked(cachep, l3->shared, 1, node);
- spin_unlock(&l3->list_lock);
+ spin_unlock_irq(&l3->list_lock);
if (l3->alien)
drain_alien_cache(cachep, l3);
}
}
- spin_unlock_irq(&cachep->spinlock);
+ spin_unlock(&cachep->spinlock);
}
static int __node_shrink(kmem_cache_t *cachep, int node)
@@ -3304,7 +3304,7 @@
list_for_each(walk, &cache_chain) {
kmem_cache_t *searchp;
struct list_head* p;
- int tofree;
+ int tofree, nodeid;
struct slab *slabp;
searchp = list_entry(walk, kmem_cache_t, next);
@@ -3314,13 +3314,14 @@
check_irq_on();
- l3 = searchp->nodelists[numa_node_id()];
+ nodeid = numa_node_id();
+ l3 = searchp->nodelists[nodeid];
if (l3->alien)
drain_alien_cache(searchp, l3);
spin_lock_irq(&l3->list_lock);
drain_array_locked(searchp, ac_data(searchp), 0,
- numa_node_id());
+ nodeid);
if (time_after(l3->next_reap, jiffies))
goto next_unlock;
@@ -3329,7 +3330,7 @@
if (l3->shared)
drain_array_locked(searchp, l3->shared, 0,
- numa_node_id());
+ nodeid);
if (l3->free_touched) {
l3->free_touched = 0;
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [patch 1/3] mm: NUMA slab -- add alien cache drain statistics
2005-11-29 8:50 [patch 1/3] mm: NUMA slab -- add alien cache drain statistics Ravikiran G Thirumalai
2005-11-29 8:53 ` [patch 2/3] mm: NUMA slab -- node local memory for off slab slab descriptors Ravikiran G Thirumalai
2005-11-29 8:54 ` [patch 3/3] mm: NUMA slab -- minor optimizations Ravikiran G Thirumalai
@ 2005-11-29 9:25 ` Andrew Morton
2005-11-29 17:57 ` Christoph Lameter
3 siblings, 0 replies; 10+ messages in thread
From: Andrew Morton @ 2005-11-29 9:25 UTC (permalink / raw)
To: Ravikiran G Thirumalai; +Cc: linux-mm, manfred, clameter, alokk
Ravikiran G Thirumalai <kiran@scalex86.org> wrote:
>
> This patch adds a statistics counter which is incremented everytime the
> local alien cache is full and we have to drain it to the remote nodes list3.
>
argh. -mm is full. I'm currently carrying 90 patches against ./mm/* and
11 against just slab.c.
If you want to rediff and retest against
http://www.zip.com.au/~akpm/linux/patches/stuff/x.bz2, which is
-mm-of-the-minute then feel free, but beware that it's going to take some
time to sort through all this stuff.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [patch 2/3] mm: NUMA slab -- node local memory for off slab slab descriptors
2005-11-29 8:53 ` [patch 2/3] mm: NUMA slab -- node local memory for off slab slab descriptors Ravikiran G Thirumalai
@ 2005-11-29 17:48 ` Christoph Lameter
0 siblings, 0 replies; 10+ messages in thread
From: Christoph Lameter @ 2005-11-29 17:48 UTC (permalink / raw)
To: Ravikiran G Thirumalai; +Cc: Andrew Morton, linux-mm, manfred, Alok Kataria
On Tue, 29 Nov 2005, Ravikiran G Thirumalai wrote:
> Off slab slab management is currently not allocated from node local
> memory. This patch fixes that.
>
> Signed-off-by: Alok N Kataria <alokk@calsoftinc.com>
> Signed-off-by: Ravikiran Thirumalai <kiran@scalex86.org>
> Signed-off-by: Shai Fultheim <shai@scalex86.org>
Ack-by: Christoph Lameter <clameter@sgi.com>
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [patch 3/3] mm: NUMA slab -- minor optimizations
2005-11-29 8:54 ` [patch 3/3] mm: NUMA slab -- minor optimizations Ravikiran G Thirumalai
@ 2005-11-29 17:53 ` Christoph Lameter
2005-12-26 23:42 ` Manfred Spraul
1 sibling, 0 replies; 10+ messages in thread
From: Christoph Lameter @ 2005-11-29 17:53 UTC (permalink / raw)
To: Ravikiran G Thirumalai; +Cc: Andrew Morton, linux-mm, manfred, Alok Kataria
On Tue, 29 Nov 2005, Ravikiran G Thirumalai wrote:
> Patch adds some minor optimizations:
> 1. Keeps on chip interrupts enabled for a bit longer while draining cpu
> caches
> 2. Calls numa_node_id once in cache_reap
>
> Signed-off-by: Alok N Kataria <alokk@calsoftinc.com>
> Signed-off-by: Ravikiran Thirumalai <kiran@scalex86.org>
> Signed-off-by: Shai Fultheim <shai@scalex86.org>
Ack-by: Christoph Lameter <clameter@sgi.com>
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [patch 1/3] mm: NUMA slab -- add alien cache drain statistics
2005-11-29 8:50 [patch 1/3] mm: NUMA slab -- add alien cache drain statistics Ravikiran G Thirumalai
` (2 preceding siblings ...)
2005-11-29 9:25 ` [patch 1/3] mm: NUMA slab -- add alien cache drain statistics Andrew Morton
@ 2005-11-29 17:57 ` Christoph Lameter
2005-11-29 18:49 ` Ravikiran G Thirumalai
3 siblings, 1 reply; 10+ messages in thread
From: Christoph Lameter @ 2005-11-29 17:57 UTC (permalink / raw)
To: Ravikiran G Thirumalai; +Cc: Andrew Morton, linux-mm, manfred, Alok Kataria
On Tue, 29 Nov 2005, Ravikiran G Thirumalai wrote:
> NUMA slab allocator frees remote objects to a local alien cache.
> But if the local alien cache is full, the alien cache
> is drained directly to the remote node.
>
> This patch adds a statistics counter which is incremented everytime the
> local alien cache is full and we have to drain it to the remote nodes list3.
>
> This will be useful when we can dynamically tune the alien cache limit.
> Currently, the alien cache limit is fixed at 12.
It may be best to first enable the basic manual tuning. See
slabinfo_write.
How would you propose to determine the length?
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [patch 1/3] mm: NUMA slab -- add alien cache drain statistics
2005-11-29 17:57 ` Christoph Lameter
@ 2005-11-29 18:49 ` Ravikiran G Thirumalai
0 siblings, 0 replies; 10+ messages in thread
From: Ravikiran G Thirumalai @ 2005-11-29 18:49 UTC (permalink / raw)
To: Christoph Lameter; +Cc: Andrew Morton, linux-mm, manfred, Alok Kataria
On Tue, Nov 29, 2005 at 09:57:58AM -0800, Christoph Lameter wrote:
> On Tue, 29 Nov 2005, Ravikiran G Thirumalai wrote:
>
> >
> > This will be useful when we can dynamically tune the alien cache limit.
> > Currently, the alien cache limit is fixed at 12.
>
> It may be best to first enable the basic manual tuning. See
> slabinfo_write.
We already have a patch for that on our local tree. Will send it out soon
after some more tests
>
> How would you propose to determine the length?
>
All kmem caches won't experience remote frees. Depending on the work-load,
some caches might experience frequent remote frees. This statistic helps us
determine which cache is experiencing heavy remote free activity, and the
sysadmin may tune the alien cache limit dynamically (just like the array
cache limit) by writing to /proc/slabinfo. There cannot be one value good
enough for everyone so this should be a tunable.
Thanks,
Kiran
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [patch 3/3] mm: NUMA slab -- minor optimizations
2005-11-29 8:54 ` [patch 3/3] mm: NUMA slab -- minor optimizations Ravikiran G Thirumalai
2005-11-29 17:53 ` Christoph Lameter
@ 2005-12-26 23:42 ` Manfred Spraul
2005-12-27 20:35 ` Christoph Lameter
1 sibling, 1 reply; 10+ messages in thread
From: Manfred Spraul @ 2005-12-26 23:42 UTC (permalink / raw)
To: Ravikiran G Thirumalai; +Cc: Andrew Morton, linux-mm, clameter, Alok Kataria
Ravikiran G Thirumalai wrote:
>Patch adds some minor optimizations:
>1. Keeps on chip interrupts enabled for a bit longer while draining cpu
>caches
>2. Calls numa_node_id once in cache_reap
>
>Signed-off-by: Alok N Kataria <alokk@calsoftinc.com>
>Signed-off-by: Ravikiran Thirumalai <kiran@scalex86.org>
>Signed-off-by: Shai Fultheim <shai@scalex86.org>
>
>Index: linux-2.6.15-rc1/mm/slab.c
>===================================================================
>--- linux-2.6.15-rc1.orig/mm/slab.c 2005-11-17 21:32:43.000000000 -0800
>+++ linux-2.6.15-rc1/mm/slab.c 2005-11-17 21:32:50.000000000 -0800
>@@ -1914,18 +1914,18 @@
>
> smp_call_function_all_cpus(do_drain, cachep);
> check_irq_on();
>- spin_lock_irq(&cachep->spinlock);
>+ spin_lock(&cachep->spinlock);
>
>
Isn't that a bug? What prevents an interrupt from occuring after the
spin_lock() and then causing a deadlock on cachep->spinlock?
--
Manfred
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [patch 3/3] mm: NUMA slab -- minor optimizations
2005-12-26 23:42 ` Manfred Spraul
@ 2005-12-27 20:35 ` Christoph Lameter
0 siblings, 0 replies; 10+ messages in thread
From: Christoph Lameter @ 2005-12-27 20:35 UTC (permalink / raw)
To: Manfred Spraul
Cc: Ravikiran G Thirumalai, Andrew Morton, linux-mm, Alok Kataria
On Tue, 27 Dec 2005, Manfred Spraul wrote:
> Isn't that a bug? What prevents an interrupt from occuring after the
> spin_lock() and then causing a deadlock on cachep->spinlock?
Right. cache_grow() may be called when doing slab allocations in an
interrupt and it takes the lock in order to modify colour_next.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2005-12-27 20:35 UTC | newest]
Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2005-11-29 8:50 [patch 1/3] mm: NUMA slab -- add alien cache drain statistics Ravikiran G Thirumalai
2005-11-29 8:53 ` [patch 2/3] mm: NUMA slab -- node local memory for off slab slab descriptors Ravikiran G Thirumalai
2005-11-29 17:48 ` Christoph Lameter
2005-11-29 8:54 ` [patch 3/3] mm: NUMA slab -- minor optimizations Ravikiran G Thirumalai
2005-11-29 17:53 ` Christoph Lameter
2005-12-26 23:42 ` Manfred Spraul
2005-12-27 20:35 ` Christoph Lameter
2005-11-29 9:25 ` [patch 1/3] mm: NUMA slab -- add alien cache drain statistics Andrew Morton
2005-11-29 17:57 ` Christoph Lameter
2005-11-29 18:49 ` Ravikiran G Thirumalai
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox