OpenVZ Forum: Devel » [PATCH v5 00/18] slab accounting for memcg

Home » Mailing lists » Devel » [PATCH v5 00/18] slab accounting for memcg

Show: Today's Messages :: Show Polls :: Message Navigator
E-mail to friend

[PATCH v5 14/18] memcg/sl[au]b: shrink dead caches [message #48529 is a reply to message #48518]

Fri, 19 October 2012 14:20

Glauber Costa
Messages: 916
Registered: October 2011

Senior Member

In the slub allocator, when the last object of a page goes away, we
don't necessarily free it - there is not necessarily a test for empty
page in any slab_free path.

This means that when we destroy a memcg cache that happened to be empty,
those caches may take a lot of time to go away: removing the memcg
reference won't destroy them - because there are pending references, and
the empty pages will stay there, until a shrinker is called upon for any
reason.

This patch marks all memcg caches as dead. kmem_cache_shrink is called
for the ones who are not yet dead - this will force internal cache
reorganization, and then all references to empty pages will be removed.

An unlikely branch is used to make sure this case does not affect
performance in the usual slab_free path.

The slab allocator has a time based reaper that would eventually get rid
of the objects, but we can also call it explicitly, since dead caches
are not a likely event.

[ v2: also call verify_dead for the slab ]
[ v3: use delayed_work to avoid calling verify_dead at every free]

Signed-off-by: Glauber Costa <glommer@parallels.com>
CC: Christoph Lameter <cl@linux.com>
CC: Pekka Enberg <penberg@cs.helsinki.fi>
CC: Michal Hocko <mhocko@suse.cz>
CC: Kamezawa Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
CC: Johannes Weiner <hannes@cmpxchg.org>
CC: Suleiman Souhlal <suleiman@google.com>
CC: Tejun Heo <tj@kernel.org>
---
include/linux/slab.h | 2 +-
mm/memcontrol.c | 47 +++++++++++++++++++++++++++++++++++++++++------
2 files changed, 42 insertions(+), 7 deletions(-)

diff --git a/include/linux/slab.h b/include/linux/slab.h
index bb698dc..4a3a749 100644
--- a/include/linux/slab.h
+++ b/include/linux/slab.h
@@ -212,7 +212,7 @@ struct memcg_cache_params {
struct kmem_cache *root_cache;
bool dead;
atomic_t nr_pages;
- struct work_struct destroy;
+ struct delayed_work destroy;
};
};
};
diff --git a/mm/memcontrol.c b/mm/memcontrol.c
index f5089b3..c7732fa 100644
--- a/mm/memcontrol.c
+++ b/mm/memcontrol.c
@@ -2956,14 +2956,37 @@ static void kmem_cache_destroy_work_func(struct work_struct *w)
{
struct kmem_cache *cachep;
struct memcg_cache_params *p;
+ struct delayed_work *dw = to_delayed_work(w);

- p = container_of(w, struct memcg_cache_params, destroy);
+ p = container_of(dw, struct memcg_cache_params, destroy);

VM_BUG_ON(p->is_root_cache);
cachep = p->root_cache;
cachep = cachep->memcg_params->memcg_caches[memcg_css_id(p->memcg)];

- if (!atomic_read(&cachep->memcg_params->nr_pages))
+ /*
+ * If we get down to 0 after shrink, we could delete right away.
+ * However, memcg_release_pages() already puts us back in the workqueue
+ * in that case. If we proceed deleting, we'll get a dangling
+ * reference, and removing the object from the workqueue in that case
+ * is unnecessary complication. We are not a fast path.
+ *
+ * Note that this case is fundamentally different from racing with
+ * shrink_slab(): if memcg_cgroup_destroy_cache() is called in
+ * kmem_cache_shrink, not only we would be reinserting a dead cache
+ * into the queue, but doing so from inside the worker racing to
+ * destroy it.
+ *
+ * So if we aren't down to zero, we'll just schedule a worker and try
+ * again
+ */
+ if (atomic_read(&cachep->memcg_params->nr_pages) != 0) {
+ kmem_cache_shrink(cachep);
+ if (atomic_read(&cachep->memcg_params->nr_pages) == 0)
+ return;
+ /* Once per minute should be good enough. */
+ schedule_delayed_work(&cachep->memcg_params->destroy, 60 * HZ);
+ } else
kmem_cache_destroy(cachep);
}

@@ -2973,10 +2996,22 @@ void mem_cgroup_destroy_cache(struct kmem_cache *cachep)
return;

/*
+ * We can get to a memory-pressure situation while the delayed work is
+ * still pending to run. The vmscan shrinkers can then release all
+ * cache memory and get us to destruction. If this is the case, we'll
+ * be executed twice, which is a bug (the second time will execute over
+ * bogus data).
+ *
+ * Since we can't possibly know who got us here, just refrain from
+ * running if there is already work pending
+ */
+ if (delayed_work_pending(&cachep->memcg_params->destroy))
+ return;
+ /*
* We have to defer the actual destroying to a workqueue, because
* we might currently be in a context that cannot sleep.
*/
- schedule_work(&cachep->memcg_params->destroy);
+ schedule_delayed_work(&cachep->memcg_params->destroy, 0);
}

/*
@@ -3142,9 +3177,9 @@ static void mem_cgroup_destroy_all_caches(struct mem_cgroup *memcg)
list_for_each_entry(cachep, &memcg->memcg_slab_caches, list) {

cachep->memcg_params->dead = true;
- INIT_WORK(&cachep->memcg_params->destroy,
- kmem_cache_destroy_work_func);
- schedule_work(&cachep->memcg_params->destroy);
+ INIT_DELAYED_WORK(&cachep->memcg_params->destroy,
+ kmem_cache_destroy_work_func);
+ schedule_delayed_work(&cachep->memcg_params->destroy, 0);
}
mutex_unlock(&memcg->slab_caches_mutex);
}
--
1.7.11.7

Report message to a moderator

[Message index]

		[PATCH v5 00/18] slab accounting for memcg By: Glauber Costa on Fri, 19 October 2012 14:20
		[PATCH v5 10/18] sl[au]b: always get the cache from its page in kfree By: Glauber Costa on Fri, 19 October 2012 14:20
		Re: [PATCH v5 10/18] sl[au]b: always get the cache from its page in kfree By: Christoph Lameter on Fri, 19 October 2012 19:44
		Re: [PATCH v5 10/18] sl[au]b: always get the cache from its page in kfree By: Glauber Costa on Mon, 22 October 2012 10:13
		[PATCH v5 04/18] slab: don't preemptively remove element from list in cache destroy By: Glauber Costa on Fri, 19 October 2012 14:20
		Re: [PATCH v5 04/18] slab: don't preemptively remove element from list in cache destroy By: Christoph Lameter on Fri, 19 October 2012 19:34
		Re: [PATCH v5 04/18] slab: don't preemptively remove element from list in cache destroy By: Glauber Costa on Mon, 22 October 2012 08:40
		Re: [PATCH v5 04/18] slab: don't preemptively remove element from list in cache destroy By: Pekka Enberg on Wed, 24 October 2012 06:54
		Re: [PATCH v5 04/18] slab: don't preemptively remove element from list in cache destroy By: Glauber Costa on Wed, 24 October 2012 08:21
		[PATCH v5 09/18] memcg: skip memcg kmem allocations in specified code regions By: Glauber Costa on Fri, 19 October 2012 14:20
		[PATCH v5 02/18] move print_slabinfo_header to slab_common.c By: Glauber Costa on Fri, 19 October 2012 14:20
		[PATCH v5 11/18] sl[au]b: Allocate objects from memcg cache By: Glauber Costa on Fri, 19 October 2012 14:20
		Re: [PATCH v5 11/18] sl[au]b: Allocate objects from memcg cache By: Christoph Lameter on Fri, 19 October 2012 19:46
		Re: [PATCH v5 11/18] sl[au]b: Allocate objects from memcg cache By: JoonSoo Kim on Mon, 29 October 2012 15:14
		Re: [PATCH v5 11/18] sl[au]b: Allocate objects from memcg cache By: Glauber Costa on Mon, 29 October 2012 15:19
		[PATCH v5 17/18] slub: slub-specific propagation changes. By: Glauber Costa on Fri, 19 October 2012 14:20
		[PATCH v5 06/18] consider a memcg parameter in kmem_create_cache By: Glauber Costa on Fri, 19 October 2012 14:20
		Re: [PATCH v5 06/18] consider a memcg parameter in kmem_create_cache By: JoonSoo Kim on Tue, 23 October 2012 17:50
		Re: [PATCH v5 06/18] consider a memcg parameter in kmem_create_cache By: Glauber Costa on Wed, 24 October 2012 08:42
		Re: [PATCH v5 06/18] consider a memcg parameter in kmem_create_cache By: Glauber Costa on Thu, 25 October 2012 13:42
		[PATCH v5 05/18] slab/slub: struct memcg_params By: Glauber Costa on Fri, 19 October 2012 14:20
		Re: [PATCH v5 05/18] slab/slub: struct memcg_params By: JoonSoo Kim on Tue, 23 October 2012 17:25
		Re: [PATCH v5 05/18] slab/slub: struct memcg_params By: Glauber Costa on Wed, 24 October 2012 08:42
		[PATCH v5 18/18] Add slab-specific documentation about the kmem controller By: Glauber Costa on Fri, 19 October 2012 14:20
		[PATCH v5 13/18] memcg/sl[au]b Track all the memcg children of a kmem_cache. By: Glauber Costa on Fri, 19 October 2012 14:20
		Re: [PATCH v5 13/18] memcg/sl[au]b Track all the memcg children of a kmem_cache. By: JoonSoo Kim on Mon, 29 October 2012 15:26
		Re: [PATCH v5 13/18] memcg/sl[au]b Track all the memcg children of a kmem_cache. By: Glauber Costa on Tue, 30 October 2012 11:31
		[PATCH v5 14/18] memcg/sl[au]b: shrink dead caches By: Glauber Costa on Fri, 19 October 2012 14:20
		Re: [PATCH v5 14/18] memcg/sl[au]b: shrink dead caches By: Christoph Lameter on Fri, 19 October 2012 19:47
		Re: [PATCH v5 14/18] memcg/sl[au]b: shrink dead caches By: Glauber Costa on Mon, 22 October 2012 07:37
		[PATCH v5 03/18] sl[au]b: process slabinfo_show in common code By: Glauber Costa on Fri, 19 October 2012 14:20
		[PATCH v5 01/18] move slabinfo processing to slab_common.c By: Glauber Costa on Fri, 19 October 2012 14:20
		Re: [PATCH v5 01/18] move slabinfo processing to slab_common.c By: Pekka Enberg on Wed, 24 October 2012 06:43
		[PATCH v5 15/18] Aggregate memcg cache values in slabinfo By: Glauber Costa on Fri, 19 October 2012 14:20
		Re: [PATCH v5 15/18] Aggregate memcg cache values in slabinfo By: Christoph Lameter on Fri, 19 October 2012 19:50
		Re: [PATCH v5 15/18] Aggregate memcg cache values in slabinfo By: Glauber Costa on Mon, 22 October 2012 15:11
		[PATCH v5 08/18] memcg: infrastructure to match an allocation to the right cache By: Glauber Costa on Fri, 19 October 2012 14:20
		Re: [PATCH v5 08/18] memcg: infrastructure to match an allocation to the right cache By: JoonSoo Kim on Wed, 24 October 2012 18:10
		Re: [PATCH v5 08/18] memcg: infrastructure to match an allocation to the right cache By: Glauber Costa on Thu, 25 October 2012 11:05
		Re: [PATCH v5 08/18] memcg: infrastructure to match an allocation to the right cache By: Tejun Heo on Thu, 25 October 2012 18:06
		Re: [PATCH v5 08/18] memcg: infrastructure to match an allocation to the right cache By: Tejun Heo on Thu, 25 October 2012 18:08
		[PATCH v5 07/18] Allocate memory for memcg caches whenever a new memcg appears By: Glauber Costa on Fri, 19 October 2012 14:20
		[PATCH v5 12/18] memcg: destroy memcg caches By: Glauber Costa on Fri, 19 October 2012 14:20
		[PATCH v5 16/18] slab: propagate tunables values By: Glauber Costa on Fri, 19 October 2012 14:20
		Re: [PATCH v5 16/18] slab: propagate tunables values By: Christoph Lameter on Fri, 19 October 2012 19:51
		Re: [PATCH v5 16/18] slab: propagate tunables values By: Glauber Costa on Mon, 22 October 2012 07:48
		Re: [PATCH v5 16/18] slab: propagate tunables values By: Christoph Lameter on Tue, 23 October 2012 20:44

Previous Topic:	[PATCH v3] SUNRPC: set desired file system root before connecting local transports
Next Topic:	[PATCH v5] slab: Ignore internal flags in cache creation

Goto Forum:

-=] Back to Top [=-

[ Syndicate this forum (XML) ] [

] [

]

Current Time: Tue Jul 14 15:07:21 GMT 2026

Total time taken to generate the page: 0.16025 seconds