From: Daisuke Nishimura <nishimura@mxp.nes.nec.co.jp>
To: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
Cc: nishimura@mxp.nes.nec.co.jp,
"linux-mm@kvack.org" <linux-mm@kvack.org>,
"balbir@linux.vnet.ibm.com" <balbir@linux.vnet.ibm.com>
Subject: Re: [RFC][PATCH 14/14]memcg: mem+swap accounting
Date: Mon, 1 Sep 2008 16:15:01 +0900
Message-ID: <20080901161501.2cba948e.nishimura@mxp.nes.nec.co.jp>
In-Reply-To: <20080822204455.922f87dc.kamezawa.hiroyu@jp.fujitsu.com>
Hi, Kamezawa-san.
I'm testing these patches on mmotm-2008-08-29-01-08
(with the trivial fixes I've reported and some debug code),
but swap_in_bytes sometimes becomes very large (it seems that
an over-uncharge is happening), and I see OOM
if I've set memswap_limit.
I'm digging into this now, but have you ever seen it as well?
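For illustration only, here is a minimal userspace sketch of how one uncharge too many could make a counter look huge. It assumes the swap counter is an unsigned value that is decremented without a floor check; the demo_* names are hypothetical and are not taken from the patch set.

```c
#include <limits.h>	/* ULONG_MAX, the value an over-uncharge wraps to */

/* Hypothetical stand-in for a per-cgroup swap counter (not the patch's type). */
struct demo_counter {
	unsigned long swaps;	/* number of swap entries currently charged */
};

static void demo_charge_swap(struct demo_counter *c)
{
	c->swaps += 1;
}

/* Unchecked uncharge: wraps around to ULONG_MAX when swaps is already 0. */
static void demo_uncharge_swap(struct demo_counter *c)
{
	c->swaps -= 1;
}

/* Checked variant: returns -1 and leaves the counter alone instead of wrapping. */
static int demo_uncharge_swap_checked(struct demo_counter *c)
{
	if (c->swaps == 0)
		return -1;
	c->swaps -= 1;
	return 0;
}
```

Starting from zero, one demo_charge_swap() followed by two demo_uncharge_swap() calls leaves swaps at ULONG_MAX, which would also exceed any configured limit. The checked variant refuses the second uncharge, which is one way to catch the imbalance while debugging.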
Thanks,
Daisuke Nishimura.
On Fri, 22 Aug 2008 20:44:55 +0900, KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com> wrote:
> Add Swap accounting feature to memory resource controller.
>
> Accounting is done in following logic.
>
> Swap-out:
> - When add_to_swap_cache() is called, swp_entry is marked as to be under
> page->page_cgroup->mem_cgroup.
> - When swap-cache is uncharged (fully unmapped), we don't uncharge it.
> - When swap-cache is deleted, we uncharge it from memory and charge it to
> swaps. This ops is done only when swap cache is already charged.
> res.pages -=1, res.swaps +=1.
>
> Swap-in:
> - When add_to_swapcache() is called, we do nothing.
> - When swap is mapped, we charge to memory and uncharge from swap
> res.pages +=1, res.swaps -=1.
>
> SwapCache-Deleting:
> - If the page doesn't have page_cgroup, nothing to do.
> - If the page is still charged as swap, just uncharge memory.
> (This can happen under shmem/tmpfs.)
> - If the page is not charged as swap, res.pages -= 1, res.swaps +=1.
>
> Swap-Freeing:
> - if swap entry is charged, res.swaps -= 1.
>
> Almost all operations are done against SwapCache, which is Locked.
>
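The accounting rules quoted above can be sketched as a small userspace model. This is only my reading of the description, not the patch's code: demo_res stands in for res.pages/res.swaps, the charged_as_swap bit stands in for the per-entry count flag, and all demo_* names are hypothetical.

```c
/* Hypothetical model of the two counters named in the description. */
struct demo_res {
	long pages;	/* pages charged to memory */
	long swaps;	/* swap entries charged to swap */
};

/* Hypothetical per-swap-entry state: is this entry charged as swap? */
struct demo_entry {
	unsigned int charged_as_swap:1;
};

/* SwapCache-Deleting: move the charge from memory to swap. */
static void demo_swapcache_delete(struct demo_res *res, struct demo_entry *e)
{
	res->pages -= 1;
	if (e->charged_as_swap)
		return;		/* still charged as swap: just uncharge memory */
	e->charged_as_swap = 1;
	res->swaps += 1;
}

/* Swap-in, when the page is mapped: charge memory, uncharge swap. */
static void demo_swapin_map(struct demo_res *res, struct demo_entry *e)
{
	res->pages += 1;
	if (e->charged_as_swap) {
		e->charged_as_swap = 0;
		res->swaps -= 1;
	}
}

/* Swap-Freeing: uncharge swap only if the entry is still charged. */
static void demo_swap_free(struct demo_res *res, struct demo_entry *e)
{
	if (e->charged_as_swap) {
		e->charged_as_swap = 0;
		res->swaps -= 1;
	}
}
```

In this model every decrement of swaps is guarded by the per-entry bit, so res.swaps can never be uncharged twice for the same entry. If the real code has a path that uncharges swap without clearing or checking that bit, the counters would drift apart from this model.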
> This patch uses an array to remember the owner of swp_entry. Considering x86-32,
> we should avoid using NORMAL memory and the vmalloc() area too much. This patch
> uses HIGHMEM to record information under kmap_atomic(KM_USER0). The information
> is recorded in 2 bytes per swap page.
> (the memory controller's id is defined as smaller than unsigned short)
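To make the "2 bytes per swap page" layout concrete, here is a small userspace sketch of the index arithmetic. It assumes 4 KiB pages and a compiler (such as GCC) that packs the two bit-fields into a single unsigned short; the DEMO_* and demo_* names are hypothetical and only mirror the ENTS_PER_PAGE calculation in the patch.

```c
#include <stddef.h>

#define DEMO_PAGE_SIZE 4096UL	/* assumption: 4 KiB pages */

/* Same shape as the patch's struct swap_cgroup: 15-bit id plus 1-bit flag. */
struct demo_swap_cgroup {
	unsigned short memcgrp_id:15;
	unsigned short count:1;
};

/* Entries that fit in one map page (2048 if the struct is 2 bytes). */
#define DEMO_ENTS_PER_PAGE (DEMO_PAGE_SIZE / sizeof(struct demo_swap_cgroup))

/* Which map page holds the entry for this swap offset. */
static unsigned long demo_array_index(unsigned long offset)
{
	return offset / DEMO_ENTS_PER_PAGE;
}

/* Position of the entry inside that map page (ENTS_PER_PAGE is a power of 2). */
static unsigned long demo_index_in_page(unsigned long offset)
{
	return offset & (DEMO_ENTS_PER_PAGE - 1);
}

/* Number of map-page slots needed for a swap device of max_pages pages. */
static unsigned long demo_map_pages(unsigned long max_pages)
{
	return max_pages / DEMO_ENTS_PER_PAGE + 1;
}
```

Under these assumptions, swap offset 5000 lands in map page 2 at position 904, and a 1 GiB swap device (262144 pages) needs 129 map-page slots, each filled lazily.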
>
> Changelog: (preview) -> (v2)
> - removed radix-tree. just use array.
> - removed linked-list.
> - use memcgroup_id rather than pointer.
> - added force_empty (temporal) support.
> This should be reworked in future. (But for now, this works well for us.)
>
> Signed-off-by: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
>
> ---
> include/linux/swap.h | 38 +++++
> init/Kconfig | 2
> mm/memcontrol.c | 364 ++++++++++++++++++++++++++++++++++++++++++++++++++-
> mm/migrate.c | 7
> mm/swap_state.c | 7
> mm/swapfile.c | 14 +
> 6 files changed, 422 insertions(+), 10 deletions(-)
>
> Index: mmtom-2.6.27-rc3+/mm/memcontrol.c
> ===================================================================
> --- mmtom-2.6.27-rc3+.orig/mm/memcontrol.c
> +++ mmtom-2.6.27-rc3+/mm/memcontrol.c
> @@ -34,6 +34,10 @@
> #include <linux/mm_inline.h>
> #include <linux/pagemap.h>
> #include <linux/page_cgroup.h>
> +#ifdef CONFIG_CGROUP_MEM_RES_CTLR_SWAP
> +#include <linux/swap.h>
> +#include <linux/swapops.h>
> +#endif
>
> #include <asm/uaccess.h>
>
> @@ -43,9 +47,28 @@ static struct kmem_cache *page_cgroup_ca
> #define NR_MEMCGRP_ID (32767)
>
> #ifdef CONFIG_CGROUP_MEM_RES_CTLR_SWAP
> +
> #define do_swap_account (1)
> +
> +static void
> +swap_cgroup_delete_account(struct mem_cgroup *mem, struct page *page);
> +
> +static struct mem_cgroup *lookup_mem_cgroup_from_swap(struct page *page);
> +static void swap_cgroup_clean_account(struct mem_cgroup *mem);
> #else
> #define do_swap_account (0)
> +
> +static void
> +swap_cgroup_delete_account(struct mem_cgroup *mem, struct page *page)
> +{
> +}
> +static struct mem_cgroup *lookup_mem_cgroup_from_swap(struct page *page)
> +{
> + return NULL;
> +}
> +static void swap_cgroup_clean_account(struct mem_cgroup *mem)
> +{
> +}
> #endif
>
>
> @@ -889,6 +912,9 @@ static int mem_cgroup_charge_common(stru
> __mem_cgroup_add_list(mz, pc);
> spin_unlock_irqrestore(&mz->lru_lock, flags);
>
> + /* We did swap-in, uncharge swap. */
> + if (do_swap_account && PageSwapCache(page))
> + swap_cgroup_delete_account(mem, page);
> return 0;
> out:
> css_put(&mem->css);
> @@ -899,6 +925,8 @@ err:
>
> int mem_cgroup_charge(struct page *page, struct mm_struct *mm, gfp_t gfp_mask)
> {
> + struct mem_cgroup *memcg = NULL;
> +
> if (mem_cgroup_subsys.disabled)
> return 0;
>
> @@ -935,13 +963,19 @@ int mem_cgroup_charge(struct page *page,
> }
> rcu_read_unlock();
> }
> + /* Swap-in ? */
> + if (do_swap_account && PageSwapCache(page))
> + memcg = lookup_mem_cgroup_from_swap(page);
> +
> return mem_cgroup_charge_common(page, mm, gfp_mask,
> - MEM_CGROUP_CHARGE_TYPE_MAPPED, NULL);
> + MEM_CGROUP_CHARGE_TYPE_MAPPED, memcg);
> }
>
> int mem_cgroup_cache_charge(struct page *page, struct mm_struct *mm,
> gfp_t gfp_mask)
> {
> + struct mem_cgroup *memcg = NULL;
> +
> if (mem_cgroup_subsys.disabled)
> return 0;
>
> @@ -971,9 +1005,11 @@ int mem_cgroup_cache_charge(struct page
>
> if (unlikely(!mm))
> mm = &init_mm;
> + if (do_swap_account && PageSwapCache(page))
> + memcg = lookup_mem_cgroup_from_swap(page);
>
> return mem_cgroup_charge_common(page, mm, gfp_mask,
> - MEM_CGROUP_CHARGE_TYPE_CACHE, NULL);
> + MEM_CGROUP_CHARGE_TYPE_CACHE, memcg);
> }
>
> /*
> @@ -998,9 +1034,11 @@ __mem_cgroup_uncharge_common(struct page
>
> VM_BUG_ON(pc->page != page);
>
> - if ((ctype == MEM_CGROUP_CHARGE_TYPE_MAPPED)
> - && ((PcgCache(pc) || page_mapped(page))))
> - goto out;
> + if ((ctype != MEM_CGROUP_CHARGE_TYPE_FORCE))
> + if (PageSwapCache(page) || page_mapped(page) ||
> + (page->mapping && !PageAnon(page)))
> + goto out;
> +
> mem = pc->mem_cgroup;
> SetPcgObsolete(pc);
> page_assign_page_cgroup(page, NULL);
> @@ -1577,6 +1615,8 @@ static void mem_cgroup_pre_destroy(struc
> {
> struct mem_cgroup *mem = mem_cgroup_from_cont(cont);
> mem_cgroup_force_empty(mem);
> + if (do_swap_account)
> + swap_cgroup_clean_account(mem);
> }
>
> static void mem_cgroup_destroy(struct cgroup_subsys *ss,
> @@ -1635,3 +1675,317 @@ struct cgroup_subsys mem_cgroup_subsys =
> .attach = mem_cgroup_move_task,
> .early_init = 0,
> };
> +
> +#ifdef CONFIG_CGROUP_MEM_RES_CTLR_SWAP
> +/*
> + * swap accounting infrastructure.
> + */
> +DEFINE_MUTEX(swap_cgroup_mutex);
> +spinlock_t swap_cgroup_lock[MAX_SWAPFILES];
> +struct page **swap_cgroup_map[MAX_SWAPFILES];
> +unsigned long swap_cgroup_pages[MAX_SWAPFILES];
> +
> +
> +/* This definition is based on NR_MEM_CGROUP==32768 */
> +struct swap_cgroup {
> + unsigned short memcgrp_id:15;
> + unsigned short count:1;
> +};
> +#define ENTS_PER_PAGE (PAGE_SIZE/sizeof(struct swap_cgroup))
> +
> +/*
> + * Called from get_swap_ent().
> + */
> +int swap_cgroup_prepare(swp_entry_t ent, gfp_t mask)
> +{
> + struct page *page;
> + unsigned long array_index = swp_offset(ent) / ENTS_PER_PAGE;
> + int type = swp_type(ent);
> + unsigned long flags;
> +
> + if (swap_cgroup_map[type][array_index])
> + return 0;
> + page = alloc_page(mask | __GFP_HIGHMEM | __GFP_ZERO);
> + if (!page)
> + return -ENOMEM;
> + spin_lock_irqsave(&swap_cgroup_lock[type], flags);
> + if (swap_cgroup_map[type][array_index] == NULL) {
> + swap_cgroup_map[type][array_index] = page;
> + page = NULL;
> + }
> + spin_unlock_irqrestore(&swap_cgroup_lock[type], flags);
> +
> + if (page)
> + __free_page(page);
> + return 0;
> +}
> +
> +/**
> + * swap_cgroup_record_info
> + * @page ..... a page which is in some mem_cgroup.
> + * @entry .... swp_entry of the page. (or old swp_entry of the page)
> + * @del ...... if 0 add entry, if 1 remove entry.
> + *
> + * At set new value:
> + * This is called from add_to_swap_cache() after added to swapper_space.
> + * Then...this is called under page_lock() and this page is on radix-tree
> + * We're safe to access page->page_cgroup->mem_cgroup.
> + * This function never fails. (may leak information...but it's not Oops.)
> + *
> + * At deletion:
> + * Returns count is set or not.
> + */
> +int swap_cgroup_record_info(struct page *page, swp_entry_t entry, bool del)
> +{
> + unsigned long flags;
> + int type = swp_type(entry);
> + unsigned long offset = swp_offset(entry);
> + unsigned long array_index = offset/ENTS_PER_PAGE;
> + unsigned long index = offset & (ENTS_PER_PAGE - 1);
> + struct page *mappage;
> + struct swap_cgroup *map;
> + struct page_cgroup *pc = NULL;
> + int ret = 0;
> +
> + if (!del) {
> + /*
> + * At swap-in, the page is added to swap cache before tied to
> + * mem_cgroup. This page will be finally charged at page fault.
> + * Ignore this at this point.
> + */
> + pc = page_get_page_cgroup(page);
> + if (!pc)
> + return ret;
> + }
> + if (!swap_cgroup_map[type])
> + return ret;
> + mappage = swap_cgroup_map[type][array_index];
> + if (!mappage)
> + return ret;
> +
> + local_irq_save(flags);
> + map = kmap_atomic(mappage, KM_USER0);
> + if (!del) {
> + map[index].memcgrp_id = pc->mem_cgroup->memcgrp_id;
> + map[index].count = 0;
> + } else {
> + if (map[index].count) {
> + ret = map[index].memcgrp_id;
> + map[index].count = 0;
> + }
> + map[index].memcgrp_id = 0;
> + }
> + kunmap_atomic(mappage, KM_USER0);
> + local_irq_restore(flags);
> + return ret;
> +}
> +
> +/*
> + * Returns the mem_cgroup pointer that the swp_entry is assigned to.
> + */
> +static struct mem_cgroup *swap_cgroup_lookup(swp_entry_t entry)
> +{
> + unsigned long flags;
> + int type = swp_type(entry);
> + unsigned long offset = swp_offset(entry);
> + unsigned long array_index = offset/ENTS_PER_PAGE;
> + unsigned long index = offset & (ENTS_PER_PAGE - 1);
> + struct page *mappage;
> + struct swap_cgroup *map;
> + unsigned short id;
> +
> + if (!swap_cgroup_map[type])
> + return NULL;
> + mappage = swap_cgroup_map[type][array_index];
> + if (!mappage)
> + return NULL;
> +
> + local_irq_save(flags);
> + map = kmap_atomic(mappage, KM_USER0);
> + id = map[index].memcgrp_id;
> + kunmap_atomic(mappage, KM_USER0);
> + local_irq_restore(flags);
> + return mem_cgroup_id_lookup(id);
> +}
> +
> +static struct mem_cgroup *lookup_mem_cgroup_from_swap(struct page *page)
> +{
> + swp_entry_t entry = { .val = page_private(page) };
> + return swap_cgroup_lookup(entry);
> +}
> +
> +/*
> + * set/clear accounting information of swap_cgroup.
> + *
> + * Called when set/clear accounting information.
> + * returns 1 at success.
> + */
> +static int swap_cgroup_account(struct mem_cgroup *memcg,
> + swp_entry_t entry, bool set)
> +{
> + unsigned long flags;
> + int type = swp_type(entry);
> + unsigned long offset = swp_offset(entry);
> + unsigned long array_index = offset/ENTS_PER_PAGE;
> + unsigned long index = offset & (ENTS_PER_PAGE - 1);
> + struct page *mappage;
> + struct swap_cgroup *map;
> + int ret = 0;
> +
> + if (!swap_cgroup_map[type])
> + return ret;
> + mappage = swap_cgroup_map[type][array_index];
> + if (!mappage)
> + return ret;
> +
> +
> + local_irq_save(flags);
> + map = kmap_atomic(mappage, KM_USER0);
> + if (map[index].memcgrp_id == memcg->memcgrp_id) {
> + if (set && map[index].count == 0) {
> + map[index].count = 1;
> + ret = 1;
> + } else if (!set && map[index].count == 1) {
> + map[index].count = 0;
> + ret = 1;
> + }
> + }
> + kunmap_atomic(mappage, KM_USER0);
> + local_irq_restore(flags);
> + return ret;
> +}
> +
> +void swap_cgroup_delete_account(struct mem_cgroup *mem, struct page *page)
> +{
> + swp_entry_t val = { .val = page_private(page) };
> + if (swap_cgroup_account(mem, val, false))
> + mem_counter_uncharge_swap(mem);
> +}
> +
> +/*
> + * Called from delete_from_swap_cache() then, page is Locked! and
> + * swp_entry is still in use.
> + */
> +void swap_cgroup_delete_swapcache(struct page *page, swp_entry_t entry)
> +{
> + struct page_cgroup *pc;
> +
> + pc = page_get_page_cgroup(page);
> + /* swap-in but not mapped. */
> + if (!pc)
> + return;
> +
> + if (swap_cgroup_account(pc->mem_cgroup, entry, true))
> + __mem_cgroup_uncharge_common(page,
> + MEM_CGROUP_CHARGE_TYPE_SWAPOUT);
> + else if (page->mapping && !PageAnon(page))
> + __mem_cgroup_uncharge_common(page,
> + MEM_CGROUP_CHARGE_TYPE_CACHE);
> + else
> + __mem_cgroup_uncharge_common(page,
> + MEM_CGROUP_CHARGE_TYPE_MAPPED);
> + return;
> +}
> +
> +void swap_cgroup_delete_swap(swp_entry_t entry)
> +{
> + int ret;
> + struct mem_cgroup *mem;
> +
> + ret = swap_cgroup_record_info(NULL, entry, true);
> + if (ret) {
> + mem = mem_cgroup_id_lookup(ret);
> + if (mem)
> + mem_counter_uncharge_swap(mem);
> + }
> +}
> +
> +
> +/*
> + * Forget all accounts under swap_cgroup of memcg.
> + * Called from destroying context.
> + */
> +static void swap_cgroup_clean_account(struct mem_cgroup *memcg)
> +{
> + int type;
> + unsigned long array_index, flags;
> + int index;
> + struct page *page;
> + struct swap_cgroup *map;
> +
> + if (!memcg->res.swaps)
> + return;
> + mutex_lock(&swap_cgroup_mutex);
> + for (type = 0; type < MAX_SWAPFILES; type++) {
> + if (swap_cgroup_pages[type] == 0)
> + continue;
> + for (array_index = 0;
> + array_index < swap_cgroup_pages[type];
> + array_index++) {
> + page = swap_cgroup_map[type][array_index];
> + if (!page)
> + continue;
> + local_irq_save(flags);
> + map = kmap_atomic(page, KM_USER0);
> + for (index = 0; index < ENTS_PER_PAGE; index++) {
> + if (map[index].memcgrp_id
> + == memcg->memcgrp_id) {
> + map[index].memcgrp_id = 0;
> + map[index].count = 0;
> + }
> + }
> + kunmap_atomic(page, KM_USER0);
> + local_irq_restore(flags);
> + }
> + mutex_unlock(&swap_cgroup_mutex);
> + yield();
> + mutex_lock(&swap_cgroup_mutex);
> + }
> + mutex_unlock(&swap_cgroup_mutex);
> +}
> +
> +/*
> + * called from swapon().
> + */
> +int swap_cgroup_swapon(int type, unsigned long max_pages)
> +{
> + void *array;
> + int array_size;
> +
> + VM_BUG_ON(swap_cgroup_map[type]);
> +
> + array_size = ((max_pages/ENTS_PER_PAGE) + 1) * sizeof(void *);
> +
> + array = vmalloc(array_size);
> + if (!array) {
> + printk("swap %d will not be accounted\n", type);
> + return -ENOMEM;
> + }
> + memset(array, 0, array_size);
> + mutex_lock(&swap_cgroup_mutex);
> + swap_cgroup_pages[type] = (max_pages/ENTS_PER_PAGE + 1);
> + swap_cgroup_map[type] = array;
> + mutex_unlock(&swap_cgroup_mutex);
> + spin_lock_init(&swap_cgroup_lock[type]);
> + return 0;
> +}
> +
> +/*
> + * called from swapoff().
> + */
> +void swap_cgroup_swapoff(int type)
> +{
> + int i;
> + for (i = 0; i < swap_cgroup_pages[type]; i++) {
> + struct page *page = swap_cgroup_map[type][i];
> + if (page)
> + __free_page(page);
> + }
> + mutex_lock(&swap_cgroup_mutex);
> + vfree(swap_cgroup_map[type]);
> + swap_cgroup_map[type] = NULL;
> + mutex_unlock(&swap_cgroup_mutex);
> + swap_cgroup_pages[type] = 0;
> +}
> +
> +#endif
> Index: mmtom-2.6.27-rc3+/include/linux/swap.h
> ===================================================================
> --- mmtom-2.6.27-rc3+.orig/include/linux/swap.h
> +++ mmtom-2.6.27-rc3+/include/linux/swap.h
> @@ -335,6 +335,44 @@ static inline void disable_swap_token(vo
> put_swap_token(swap_token_mm);
> }
>
> +#ifdef CONFIG_CGROUP_MEM_RES_CTLR_SWAP
> +extern int swap_cgroup_swapon(int type, unsigned long max_pages);
> +extern void swap_cgroup_swapoff(int type);
> +extern void swap_cgroup_delete_swap(swp_entry_t entry);
> +extern int swap_cgroup_prepare(swp_entry_t ent, gfp_t mask);
> +extern int swap_cgroup_record_info(struct page *, swp_entry_t ent, bool del);
> +extern void swap_cgroup_delete_swapcache(struct page *page, swp_entry_t entry);
> +
> +#else
> +static inline int swap_cgroup_swapon(int type, unsigned long max_pages)
> +{
> + return 0;
> +}
> +static inline void swap_cgroup_swapoff(int type)
> +{
> + return;
> +}
> +static inline void swap_cgroup_delete_swap(swp_entry_t entry)
> +{
> + return;
> +}
> +static inline int swap_cgroup_prepare(swp_entry_t ent, gfp_t mask)
> +{
> + return 0;
> +}
> +static inline int
> + swap_cgroup_record_info(struct page *page, swp_entry_t ent, bool del)
> +{
> + return 0;
> +}
> +static inline
> +void swap_cgroup_delete_swapcache(struct page *page, swp_entry_t entry)
> +{
> + return;
> +}
> +#endif
> +
> +
> #else /* CONFIG_SWAP */
>
> #define total_swap_pages 0
> Index: mmtom-2.6.27-rc3+/mm/swapfile.c
> ===================================================================
> --- mmtom-2.6.27-rc3+.orig/mm/swapfile.c
> +++ mmtom-2.6.27-rc3+/mm/swapfile.c
> @@ -270,8 +270,9 @@ out:
> return NULL;
> }
>
> -static int swap_entry_free(struct swap_info_struct *p, unsigned long offset)
> +static int swap_entry_free(struct swap_info_struct *p, swp_entry_t entry)
> {
> + unsigned long offset = swp_offset(entry);
> int count = p->swap_map[offset];
>
> if (count < SWAP_MAP_MAX) {
> @@ -286,6 +287,7 @@ static int swap_entry_free(struct swap_i
> swap_list.next = p - swap_info;
> nr_swap_pages++;
> p->inuse_pages--;
> + swap_cgroup_delete_swap(entry);
> }
> }
> return count;
> @@ -301,7 +303,7 @@ void swap_free(swp_entry_t entry)
>
> p = swap_info_get(entry);
> if (p) {
> - swap_entry_free(p, swp_offset(entry));
> + swap_entry_free(p, entry);
> spin_unlock(&swap_lock);
> }
> }
> @@ -420,7 +422,7 @@ void free_swap_and_cache(swp_entry_t ent
>
> p = swap_info_get(entry);
> if (p) {
> - if (swap_entry_free(p, swp_offset(entry)) == 1) {
> + if (swap_entry_free(p, entry) == 1) {
> page = find_get_page(&swapper_space, entry.val);
> if (page && !trylock_page(page)) {
> page_cache_release(page);
> @@ -1343,6 +1345,7 @@ asmlinkage long sys_swapoff(const char _
> spin_unlock(&swap_lock);
> mutex_unlock(&swapon_mutex);
> vfree(swap_map);
> + swap_cgroup_swapoff(type);
> inode = mapping->host;
> if (S_ISBLK(inode->i_mode)) {
> struct block_device *bdev = I_BDEV(inode);
> @@ -1669,6 +1672,11 @@ asmlinkage long sys_swapon(const char __
> 1 /* header page */;
> if (error)
> goto bad_swap;
> +
> + if (swap_cgroup_swapon(type, maxpages)) {
> + printk("We don't enable swap accounting because of "
> + "memory shortage\n");
> + }
> }
>
> if (nr_good_pages) {
> Index: mmtom-2.6.27-rc3+/mm/swap_state.c
> ===================================================================
> --- mmtom-2.6.27-rc3+.orig/mm/swap_state.c
> +++ mmtom-2.6.27-rc3+/mm/swap_state.c
> @@ -76,6 +76,9 @@ int add_to_swap_cache(struct page *page,
> BUG_ON(PageSwapCache(page));
> BUG_ON(PagePrivate(page));
> BUG_ON(!PageSwapBacked(page));
> + error = swap_cgroup_prepare(entry, gfp_mask);
> + if (error)
> + return error;
> error = radix_tree_preload(gfp_mask);
> if (!error) {
> page_cache_get(page);
> @@ -89,6 +92,7 @@ int add_to_swap_cache(struct page *page,
> total_swapcache_pages++;
> __inc_zone_page_state(page, NR_FILE_PAGES);
> INC_CACHE_INFO(add_total);
> + swap_cgroup_record_info(page, entry, false);
> }
> spin_unlock_irq(&swapper_space.tree_lock);
> radix_tree_preload_end();
> @@ -108,6 +112,8 @@ int add_to_swap_cache(struct page *page,
> */
> void __delete_from_swap_cache(struct page *page)
> {
> + swp_entry_t entry = { .val = page_private(page) };
> +
> BUG_ON(!PageLocked(page));
> BUG_ON(!PageSwapCache(page));
> BUG_ON(PageWriteback(page));
> @@ -117,6 +123,7 @@ void __delete_from_swap_cache(struct pag
> set_page_private(page, 0);
> ClearPageSwapCache(page);
> total_swapcache_pages--;
> + swap_cgroup_delete_swapcache(page, entry);
> __dec_zone_page_state(page, NR_FILE_PAGES);
> INC_CACHE_INFO(del_total);
> }
> Index: mmtom-2.6.27-rc3+/init/Kconfig
> ===================================================================
> --- mmtom-2.6.27-rc3+.orig/init/Kconfig
> +++ mmtom-2.6.27-rc3+/init/Kconfig
> @@ -416,7 +416,7 @@ config CGROUP_MEM_RES_CTLR
> could in turn add some fork/exit overhead.
>
> config CGROUP_MEM_RES_CTLR_SWAP
> - bool "Memory Resource Controller Swap Extension (Broken)"
> + bool "Memory Resource Controller Swap Extension (EXPERIMENTAL)"
> depends on CGROUP_MEM_RES_CTLR && SWAP && EXPERIMENTAL
> help
> Add swap management feature to memory resource controller. By this,
> Index: mmtom-2.6.27-rc3+/mm/migrate.c
> ===================================================================
> --- mmtom-2.6.27-rc3+.orig/mm/migrate.c
> +++ mmtom-2.6.27-rc3+/mm/migrate.c
> @@ -339,6 +339,8 @@ static int migrate_page_move_mapping(str
> */
> static void migrate_page_copy(struct page *newpage, struct page *page)
> {
> + int was_swapcache = 0;
> +
> copy_highpage(newpage, page);
>
> if (PageError(page))
> @@ -372,14 +374,17 @@ static void migrate_page_copy(struct pag
> mlock_migrate_page(newpage, page);
>
> #ifdef CONFIG_SWAP
> + was_swapcache = PageSwapCache(page);
> ClearPageSwapCache(page);
> #endif
> ClearPagePrivate(page);
> set_page_private(page, 0);
> /* page->mapping contains a flag for PageAnon() */
> if (PageAnon(page)) {
> - /* This page is uncharged at try_to_unmap(). */
> + /* This page is uncharged at try_to_unmap() if not SwapCache. */
> page->mapping = NULL;
> + if (was_swapcache)
> + mem_cgroup_uncharge_page(page);
> } else {
> /* Obsolete file cache should be uncharged */
> page->mapping = NULL;
>