From mboxrd@z Thu Jan 1 00:00:00 1970
From: Chih-En Lin
To: Andrew Morton, Qi Zheng, David Hildenbrand, Matthew Wilcox,
	Christophe Leroy, John Hubbard, Nadav Amit
Cc: linux-kernel@vger.kernel.org, linux-mm@kvack.org, Steven Rostedt,
	Masami Hiramatsu, Peter Zijlstra, Ingo Molnar,
	Arnaldo Carvalho de Melo, Mark Rutland, Alexander Shishkin,
	Jiri Olsa, Namhyung Kim, Yang Shi, Peter Xu, Zach O'Keefe,
	"Liam R . Howlett", Alex Sierra, Xianting Tian, Colin Cross,
	Suren Baghdasaryan, Barry Song, Pasha Tatashin, Suleiman Souhlal,
	Brian Geffon, Yu Zhao, Tong Tiangen, Liu Shixin, Li kunyu,
	Anshuman Khandual, Vlastimil Babka, Hugh Dickins, Minchan Kim,
	Miaohe Lin, Gautam Menghani, Catalin Marinas, Mark Brown,
	Will Deacon, "Eric W . Biederman", Thomas Gleixner,
	Sebastian Andrzej Siewior, Andy Lutomirski, Fenghua Yu,
	Barret Rhoden, Davidlohr Bueso, "Jason A . Donenfeld",
	Dinglan Peng, Pedro Fonseca, Jim Huang, Huichun Feng, Chih-En Lin
Subject: [PATCH v3 01/14] mm: Allow user to control COW PTE via prctl
Date: Tue, 20 Dec 2022 15:27:30 +0800
Message-Id: <20221220072743.3039060-2-shiyn.lin@gmail.com>
X-Mailer: git-send-email 2.37.3
In-Reply-To: <20221220072743.3039060-1-shiyn.lin@gmail.com>
References: <20221220072743.3039060-1-shiyn.lin@gmail.com>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit

Add a new prctl, PR_SET_COW_PTE, to allow the user to enable COW PTE.

Since there is a time gap between using the prctl to enable COW PTE and
doing the fork, we use two states (MMF_COW_PTE_READY and MMF_COW_PTE) to
distinguish a task that wants to do COW PTE from one that is already
doing it.

The MMF_COW_PTE_READY flag marks the task to do COW PTE at the next
fork(). During fork(), if MMF_COW_PTE_READY is set, fork() will clear
the flag and set the MMF_COW_PTE flag. After that, fork() might share
PTEs instead of duplicating them.

Signed-off-by: Chih-En Lin
---
 include/linux/sched/coredump.h | 12 +++++++++++-
 include/uapi/linux/prctl.h     |  6 ++++++
 kernel/sys.c                   | 11 +++++++++++
 3 files changed, 28 insertions(+), 1 deletion(-)

diff --git a/include/linux/sched/coredump.h b/include/linux/sched/coredump.h
index 8270ad7ae14c2..570d599ebc851 100644
--- a/include/linux/sched/coredump.h
+++ b/include/linux/sched/coredump.h
@@ -83,7 +83,17 @@ static inline int get_dumpable(struct mm_struct *mm)
 #define MMF_HAS_PINNED		27	/* FOLL_PIN has run, never cleared */
 #define MMF_DISABLE_THP_MASK	(1 << MMF_DISABLE_THP)
 
+/*
+ * MMF_COW_PTE_READY: Mark the task to do COW PTE at the next fork().
+ * During fork(), if MMF_COW_PTE_READY is set, fork() will clear the
+ * flag and set the MMF_COW_PTE flag. After that, fork() might share
+ * PTEs rather than duplicate them.
+ */
+#define MMF_COW_PTE_READY	29 /* Share PTE tables at the next fork() */
+#define MMF_COW_PTE		30 /* PTE tables are shared between processes */
+#define MMF_COW_PTE_MASK	(1 << MMF_COW_PTE)
+
 #define MMF_INIT_MASK		(MMF_DUMPABLE_MASK | MMF_DUMP_FILTER_MASK |\
-				 MMF_DISABLE_THP_MASK)
+				 MMF_DISABLE_THP_MASK | MMF_COW_PTE_MASK)
 
 #endif /* _LINUX_SCHED_COREDUMP_H */
diff --git a/include/uapi/linux/prctl.h b/include/uapi/linux/prctl.h
index a5e06dcbba136..664a3c0230192 100644
--- a/include/uapi/linux/prctl.h
+++ b/include/uapi/linux/prctl.h
@@ -284,4 +284,10 @@ struct prctl_mm_map {
 #define PR_SET_VMA		0x53564d41
 # define PR_SET_VMA_ANON_NAME		0
 
+/*
+ * Set the prepare flag, MMF_COW_PTE_READY, so that page tables are shared
+ * (copy-on-write) at the next fork.
+ */
+#define PR_SET_COW_PTE			65
+
 #endif /* _LINUX_PRCTL_H */
diff --git a/kernel/sys.c b/kernel/sys.c
index 5fd54bf0e8867..d1062ea33981e 100644
--- a/kernel/sys.c
+++ b/kernel/sys.c
@@ -2348,6 +2348,14 @@ static int prctl_set_vma(unsigned long opt, unsigned long start,
 }
 #endif /* CONFIG_ANON_VMA_NAME */
 
+static int prctl_set_cow_pte(struct mm_struct *mm)
+{
+	if (test_bit(MMF_COW_PTE, &mm->flags))
+		return -EINVAL;
+	set_bit(MMF_COW_PTE_READY, &mm->flags);
+	return 0;
+}
+
 SYSCALL_DEFINE5(prctl, int, option, unsigned long, arg2, unsigned long, arg3,
 		unsigned long, arg4, unsigned long, arg5)
 {
@@ -2626,6 +2634,9 @@ SYSCALL_DEFINE5(prctl, int, option, unsigned long, arg2, unsigned long, arg3,
 	case PR_SET_VMA:
 		error = prctl_set_vma(arg2, arg3, arg4, arg5);
 		break;
+	case PR_SET_COW_PTE:
+		error = prctl_set_cow_pte(me->mm);
+		break;
 	default:
 		error = -EINVAL;
 		break;
-- 
2.37.3
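
For readers who want to try the interface, here is a minimal userspace sketch of how the prctl in this patch might be called before fork(). It is an illustration under assumptions, not part of the patch: the option value 65 is copied from the prctl.h hunk and is only meaningful on a kernel carrying this series (on other kernels the number may be unassigned or mean something else), and request_cow_pte() and fork_and_wait() are hypothetical helper names.

```c
#include <errno.h>
#include <sys/prctl.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Assumption: value copied from this patch; installed headers may lack it. */
#ifndef PR_SET_COW_PTE
#define PR_SET_COW_PTE 65
#endif

/*
 * Hypothetical helper: ask the kernel to set MMF_COW_PTE_READY so that
 * the next fork() may share PTE tables.
 * Returns 0 on success or a negative errno value on failure.
 */
static int request_cow_pte(void)
{
	if (prctl(PR_SET_COW_PTE, 0, 0, 0, 0) == -1)
		return -errno;
	return 0;
}

/*
 * Hypothetical helper: fork once and wait for the child.
 * Returns the child's exit status, or -1 on error.
 */
static int fork_and_wait(void)
{
	int status;
	pid_t pid = fork();

	if (pid < 0)
		return -1;
	if (pid == 0)
		_exit(0);	/* child: PTE tables might be shared with the parent */
	if (waitpid(pid, &status, 0) < 0)
		return -1;
	return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

A caller would invoke request_cow_pte() once, check the result, and then fork. Going by prctl_set_cow_pte() in the patch, a request made after MMF_COW_PTE is already set on the mm fails with EINVAL, so the sketch reports that as -EINVAL rather than retrying.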