From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-20.1 required=3.0 tests=BAYES_00,DKIM_INVALID, DKIM_SIGNED,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_CR_TRAILER,INCLUDES_PATCH, MAILING_LIST_MULTI,MENTIONS_GIT_HOSTING,NICE_REPLY_A,SPF_HELO_NONE,SPF_PASS, USER_AGENT_SANE_1 autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 1452EC2B9F7 for ; Fri, 28 May 2021 15:19:21 +0000 (UTC) Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.kernel.org (Postfix) with ESMTP id 88054613AB for ; Fri, 28 May 2021 15:19:20 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 88054613AB Authentication-Results: mail.kernel.org; dmarc=none (p=none dis=none) header.from=suse.cz Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=owner-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix) id 2457A6B006E; Fri, 28 May 2021 11:19:20 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id 1F5E76B0070; Fri, 28 May 2021 11:19:20 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id F15326B0071; Fri, 28 May 2021 11:19:19 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from forelay.hostedemail.com (smtprelay0127.hostedemail.com [216.40.44.127]) by kanga.kvack.org (Postfix) with ESMTP id B258A6B006E for ; Fri, 28 May 2021 11:19:19 -0400 (EDT) Received: from smtpin14.hostedemail.com (10.5.19.251.rfc1918.com [10.5.19.251]) by forelay02.hostedemail.com (Postfix) with ESMTP id 4336B40E1 for ; Fri, 28 May 2021 15:19:19 +0000 (UTC) X-FDA: 78190998438.14.088075D Received: from smtp-out1.suse.de (smtp-out1.suse.de [195.135.220.28]) by imf04.hostedemail.com (Postfix) with ESMTP id E1A71542 for ; Fri, 28 May 2021 15:19:13 +0000 (UTC) Received: from imap.suse.de (imap-alt.suse-dmz.suse.de [192.168.254.47]) (using TLSv1.2 with cipher ECDHE-ECDSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp-out1.suse.de (Postfix) with ESMTPS id 297D4218B3; Fri, 28 May 2021 15:19:17 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.cz; s=susede2_rsa; t=1622215157; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=6chLmJiVPfngVtoOuiXkJhibKoKI1Z0mcSpxFoLAEO0=; b=0I2NkE/E2tbZCBqQMU2AaqFOT9qgYDZzA4Rd7KE1e5g6HXPTaxkUumQe6OJ1fYLevSyZVz U3dDidhvDu6gJksIlh1OnGFidc9eA7LmdL8T/tEP/zLXEAuBrakksowKsUqAplbK92pIYD /MKDpbElkhtubczmsMObQ3+CH5+dq5g= DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=suse.cz; s=susede2_ed25519; t=1622215157; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=6chLmJiVPfngVtoOuiXkJhibKoKI1Z0mcSpxFoLAEO0=; b=RaH83uhsppZPP6az8xJA4xoDDBJ3YX9ZyWC6C+H2OmE25e6WxCeWa3jL9xJwA3g58J1Grj vM0rR1JocaBjNVAg== Received: from imap3-int (imap-alt.suse-dmz.suse.de [192.168.254.47]) by imap.suse.de (Postfix) with ESMTP id 8E55E11906; Fri, 28 May 2021 15:19:16 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.cz; s=susede2_rsa; t=1622215156; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=6chLmJiVPfngVtoOuiXkJhibKoKI1Z0mcSpxFoLAEO0=; b=vMi8WxIufMXntdFavFta+efUU1589XCbzj3X/1BQSaIf52uMUiUWk8iwc3gVsTGmntafoo TsdtD5Aux6W1iQBnMQR/NwtfPgvVRQZFYRZxEqSbUDaMD6dRZprAPJ6PWJjh3iCjX2U2Lh SEpT07aL260zBJlDjhh65+OFOOTXb/M= DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=suse.cz; s=susede2_ed25519; t=1622215156; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=6chLmJiVPfngVtoOuiXkJhibKoKI1Z0mcSpxFoLAEO0=; b=ufdPVlK9iEILtLMAbFsxuReiP3w06diFZ30cQrskibGruW8RbafJjY1jZ4+MYxPPV3/l8m yCJ4zKJV9g8CKCDQ== Received: from director2.suse.de ([192.168.254.72]) by imap3-int with ESMTPSA id mb4zIvQJsWBcDQAALh3uQQ (envelope-from ); Fri, 28 May 2021 15:19:16 +0000 To: Charan Teja Reddy , akpm@linux-foundation.org, mcgrof@kernel.org, keescook@chromium.org, yzaikin@google.com, nigupta@nvidia.com, bhe@redhat.com, mateusznosek0@gmail.com, sh_def@163.com, iamjoonsoo.kim@lge.com, vinmenon@codeaurora.org Cc: linux-kernel@vger.kernel.org, linux-mm@kvack.org, linux-fsdevel@vger.kernel.org, Linux API References: <1621345058-26676-1-git-send-email-charante@codeaurora.org> From: Vlastimil Babka Subject: Re: [PATCH V2] mm: compaction: support triggering of proactive compaction by user Message-ID: Date: Fri, 28 May 2021 17:19:16 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.10.2 MIME-Version: 1.0 In-Reply-To: <1621345058-26676-1-git-send-email-charante@codeaurora.org> Content-Type: text/plain; charset=utf-8 Content-Language: en-US X-Spamd-Result: default: False [0.00 / 100.00]; ARC_NA(0.00)[]; RCVD_VIA_SMTP_AUTH(0.00)[]; FROM_HAS_DN(0.00)[]; TO_DN_SOME(0.00)[]; TO_MATCH_ENVRCPT_ALL(0.00)[]; FREEMAIL_ENVRCPT(0.00)[163.com,gmail.com]; MIME_GOOD(-0.10)[text/plain]; DKIM_SIGNED(0.00)[suse.cz:s=susede2_rsa,suse.cz:s=susede2_ed25519]; RCPT_COUNT_TWELVE(0.00)[15]; FREEMAIL_TO(0.00)[codeaurora.org,linux-foundation.org,kernel.org,chromium.org,google.com,nvidia.com,redhat.com,gmail.com,163.com,lge.com]; RCVD_NO_TLS_LAST(0.10)[]; FROM_EQ_ENVFROM(0.00)[]; MIME_TRACE(0.00)[0:+]; RCVD_COUNT_TWO(0.00)[2]; MID_RHS_MATCH_FROM(0.00)[] Authentication-Results: imf04.hostedemail.com; dkim=pass header.d=suse.cz header.s=susede2_rsa header.b="0I2NkE/E"; dkim=pass header.d=suse.cz header.s=susede2_ed25519 header.b=RaH83uhs; dkim=pass header.d=suse.cz header.s=susede2_rsa header.b=vMi8WxIu; dkim=pass header.d=suse.cz header.s=susede2_ed25519 header.b=ufdPVlK9; dmarc=none; spf=pass (imf04.hostedemail.com: domain of vbabka@suse.cz designates 195.135.220.28 as permitted sender) smtp.mailfrom=vbabka@suse.cz X-Rspamd-Server: rspam05 X-Rspamd-Queue-Id: E1A71542 X-Stat-Signature: po4wht1fbob5zemk7safpfyznp5ooxop X-HE-Tag: 1622215153-466527 Content-Transfer-Encoding: quoted-printable X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: +CC linux-api On 5/18/21 3:37 PM, Charan Teja Reddy wrote: > The proactive compaction[1] gets triggered for every 500msec and run > compaction on the node for COMPACTION_HPAGE_ORDER (usually order-9) > pages based on the value set to sysctl.compaction_proactiveness. > Triggering the compaction for every 500msec in search of > COMPACTION_HPAGE_ORDER pages is not needed for all applications, > especially on the embedded system usecases which may have few MB's of > RAM. Enabling the proactive compaction in its state will endup in > running almost always on such systems. >=20 > Other side, proactive compaction can still be very much useful for > getting a set of higher order pages in some controllable > manner(controlled by using the sysctl.compaction_proactiveness). Thus o= n > systems where enabling the proactive compaction always may proove not > required, can trigger the same from user space on write to its sysctl > interface. As an example, say app launcher decide to launch the memory > heavy application which can be launched fast if it gets more higher > order pages thus launcher can prepare the system in advance by > triggering the proactive compaction from userspace. >=20 > This triggering of proactive compaction is done on a write to > sysctl.compaction_proactiveness by user. >=20 > [1]https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/c= ommit?id=3Dfacdaa917c4d5a376d09d25865f5a863f906234a >=20 > Signed-off-by: Charan Teja Reddy Cancelling all current sleeps immediately when the controlling variable c= hanges doesn't sound wrong to me. A question below: > --- > changes in V2:=20 > - remove /proc interface trigger for proactive compaction > - Intention is same that add a way to trigger proactive compaction = by user. >=20 > changes in V1: > - https://lore.kernel.org/lkml/1619098678-8501-1-git-send-email-ch= arante@codeaurora.org/ >=20 > include/linux/compaction.h | 2 ++ > include/linux/mmzone.h | 1 + > kernel/sysctl.c | 2 +- > mm/compaction.c | 35 ++++++++++++++++++++++++++++++++--- > 4 files changed, 36 insertions(+), 4 deletions(-) >=20 > diff --git a/include/linux/compaction.h b/include/linux/compaction.h > index 4221888..04d5d9f 100644 > --- a/include/linux/compaction.h > +++ b/include/linux/compaction.h > @@ -84,6 +84,8 @@ static inline unsigned long compact_gap(unsigned int = order) > extern unsigned int sysctl_compaction_proactiveness; > extern int sysctl_compaction_handler(struct ctl_table *table, int writ= e, > void *buffer, size_t *length, loff_t *ppos); > +extern int compaction_proactiveness_sysctl_handler(struct ctl_table *t= able, > + int write, void *buffer, size_t *length, loff_t *ppos); > extern int sysctl_extfrag_threshold; > extern int sysctl_compact_unevictable_allowed; > =20 > diff --git a/include/linux/mmzone.h b/include/linux/mmzone.h > index 0d53eba..9455809 100644 > --- a/include/linux/mmzone.h > +++ b/include/linux/mmzone.h > @@ -815,6 +815,7 @@ typedef struct pglist_data { > enum zone_type kcompactd_highest_zoneidx; > wait_queue_head_t kcompactd_wait; > struct task_struct *kcompactd; > + bool proactive_compact_trigger; > #endif > /* > * This is a per-node reserve of pages that are not available > diff --git a/kernel/sysctl.c b/kernel/sysctl.c > index 14edf84..bed2fad 100644 > --- a/kernel/sysctl.c > +++ b/kernel/sysctl.c > @@ -2840,7 +2840,7 @@ static struct ctl_table vm_table[] =3D { > .data =3D &sysctl_compaction_proactiveness, > .maxlen =3D sizeof(sysctl_compaction_proactiveness), > .mode =3D 0644, > - .proc_handler =3D proc_dointvec_minmax, > + .proc_handler =3D compaction_proactiveness_sysctl_handler, > .extra1 =3D SYSCTL_ZERO, > .extra2 =3D &one_hundred, > }, > diff --git a/mm/compaction.c b/mm/compaction.c > index 84fde27..9056693 100644 > --- a/mm/compaction.c > +++ b/mm/compaction.c > @@ -2708,6 +2708,30 @@ static void compact_nodes(void) > */ > unsigned int __read_mostly sysctl_compaction_proactiveness =3D 20; > =20 > +int compaction_proactiveness_sysctl_handler(struct ctl_table *table, i= nt write, > + void *buffer, size_t *length, loff_t *ppos) > +{ > + int rc, nid; > + > + rc =3D proc_dointvec_minmax(table, write, buffer, length, ppos); > + if (rc) > + return rc; > + > + if (write && sysctl_compaction_proactiveness) { > + for_each_online_node(nid) { > + pg_data_t *pgdat =3D NODE_DATA(nid); > + > + if (pgdat->proactive_compact_trigger) > + continue; > + > + pgdat->proactive_compact_trigger =3D true; > + wake_up_interruptible(&pgdat->kcompactd_wait); > + } > + } > + > + return 0; > +} > + > /* > * This is the entry point for compacting all nodes via > * /proc/sys/vm/compact_memory > @@ -2752,7 +2776,8 @@ void compaction_unregister_node(struct node *node= ) > =20 > static inline bool kcompactd_work_requested(pg_data_t *pgdat) > { > - return pgdat->kcompactd_max_order > 0 || kthread_should_stop(); > + return pgdat->kcompactd_max_order > 0 || kthread_should_stop() || > + pgdat->proactive_compact_trigger; > } > =20 > static bool kcompactd_node_suitable(pg_data_t *pgdat) > @@ -2905,7 +2930,8 @@ static int kcompactd(void *p) > trace_mm_compaction_kcompactd_sleep(pgdat->node_id); > if (wait_event_freezable_timeout(pgdat->kcompactd_wait, > kcompactd_work_requested(pgdat), > - msecs_to_jiffies(HPAGE_FRAG_CHECK_INTERVAL_MSEC))) { > + msecs_to_jiffies(HPAGE_FRAG_CHECK_INTERVAL_MSEC)) && > + !pgdat->proactive_compact_trigger) { > =20 > psi_memstall_enter(&pflags); > kcompactd_do_work(pgdat); > @@ -2919,7 +2945,7 @@ static int kcompactd(void *p) > =20 > if (proactive_defer) { > proactive_defer--; > - continue; > + goto loop; I don't understand this part. If we kick kcompactd from the sysctl handle= r because we are changing proactiveness, shouldn't we also discard any accu= mulated defer score? > } > prev_score =3D fragmentation_score_node(pgdat); > proactive_compact_node(pgdat); > @@ -2931,6 +2957,9 @@ static int kcompactd(void *p) > proactive_defer =3D score < prev_score ? > 0 : 1 << COMPACT_MAX_DEFER_SHIFT; > } > +loop: > + if (pgdat->proactive_compact_trigger) > + pgdat->proactive_compact_trigger =3D false; > } > =20 > return 0; >=20