From: Qi Zheng
To: akpm@linux-foundation.org, david@fromorbit.com, tkhai@ya.ru,
	vbabka@suse.cz, roman.gushchin@linux.dev, djwong@kernel.org,
	brauner@kernel.org, paulmck@kernel.org, tytso@mit.edu,
	steven.price@arm.com, cel@kernel.org, senozhatsky@chromium.org,
	yujie.liu@intel.com, gregkh@linuxfoundation.org,
	muchun.song@linux.dev
Cc: linux-kernel@vger.kernel.org, linux-mm@kvack.org, x86@kernel.org,
	kvm@vger.kernel.org, xen-devel@lists.xenproject.org,
	linux-erofs@lists.ozlabs.org,
	linux-f2fs-devel@lists.sourceforge.net, cluster-devel@redhat.com,
	linux-nfs@vger.kernel.org, linux-mtd@lists.infradead.org,
	rcu@vger.kernel.org, netdev@vger.kernel.org,
	dri-devel@lists.freedesktop.org, linux-arm-msm@vger.kernel.org,
	dm-devel@redhat.com, linux-raid@vger.kernel.org,
	linux-bcache@vger.kernel.org,
	virtualization@lists.linux-foundation.org,
	linux-fsdevel@vger.kernel.org, linux-ext4@vger.kernel.org,
	linux-xfs@vger.kernel.org, linux-btrfs@vger.kernel.org,
	Qi Zheng, Muchun Song
Subject: [PATCH v3 42/49] fs: super: dynamically allocate the s_shrink
Date: Thu, 27 Jul 2023 16:04:55 +0800
Message-Id: <20230727080502.77895-43-zhengqi.arch@bytedance.com>
In-Reply-To: <20230727080502.77895-1-zhengqi.arch@bytedance.com>
References: <20230727080502.77895-1-zhengqi.arch@bytedance.com>

In preparation for implementing lockless slab shrink, use new APIs to
dynamically allocate the s_shrink, so that it can be freed asynchronously
using kfree_rcu(). Then it doesn't need to wait for RCU read-side critical
section when releasing the struct super_block.

Signed-off-by: Qi Zheng
Reviewed-by: Muchun Song
---
 fs/btrfs/super.c   |  2 +-
 fs/kernfs/mount.c  |  2 +-
 fs/proc/root.c     |  2 +-
 fs/super.c         | 36 ++++++++++++++++++++----------------
 include/linux/fs.h |  2 +-
 5 files changed, 24 insertions(+), 20 deletions(-)

diff --git a/fs/btrfs/super.c b/fs/btrfs/super.c
index cffdd6f7f8e8..4c9c878b0da4 100644
--- a/fs/btrfs/super.c
+++ b/fs/btrfs/super.c
@@ -1519,7 +1519,7 @@ static struct dentry *btrfs_mount_root(struct file_system_type *fs_type,
 			error = -EBUSY;
 	} else {
 		snprintf(s->s_id, sizeof(s->s_id), "%pg", bdev);
-		shrinker_debugfs_rename(&s->s_shrink, "sb-%s:%s", fs_type->name,
+		shrinker_debugfs_rename(s->s_shrink, "sb-%s:%s", fs_type->name,
 					s->s_id);
 		btrfs_sb(s)->bdev_holder = fs_type;
 		error = btrfs_fill_super(s, fs_devices, data);
diff --git a/fs/kernfs/mount.c b/fs/kernfs/mount.c
index d49606accb07..2657ff1181f1 100644
--- a/fs/kernfs/mount.c
+++ b/fs/kernfs/mount.c
@@ -256,7 +256,7 @@ static int kernfs_fill_super(struct super_block *sb, struct kernfs_fs_context *k
 	sb->s_time_gran = 1;
 
 	/* sysfs dentries and inodes don't require IO to create */
-	sb->s_shrink.seeks = 0;
+	sb->s_shrink->seeks = 0;
 
 	/* get root inode, initialize and unlock it */
 	down_read(&kf_root->kernfs_rwsem);
diff --git a/fs/proc/root.c b/fs/proc/root.c
index a86e65a608da..22b78b28b477 100644
--- a/fs/proc/root.c
+++ b/fs/proc/root.c
@@ -188,7 +188,7 @@ static int proc_fill_super(struct super_block *s, struct fs_context *fc)
 	s->s_stack_depth = FILESYSTEM_MAX_STACK_DEPTH;
 
 	/* procfs dentries and inodes don't require IO to create */
-	s->s_shrink.seeks = 0;
+	s->s_shrink->seeks = 0;
 
 	pde_get(&proc_root);
 	root_inode = proc_get_inode(s, &proc_root);
diff --git a/fs/super.c b/fs/super.c
index da68584815e4..68b3877af941 100644
--- a/fs/super.c
+++ b/fs/super.c
@@ -67,7 +67,7 @@ static unsigned long super_cache_scan(struct shrinker *shrink,
 	long	dentries;
 	long	inodes;
 
-	sb = container_of(shrink, struct super_block, s_shrink);
+	sb = shrink->private_data;
 
 	/*
 	 * Deadlock avoidance. We may hold various FS locks, and we don't want
@@ -120,7 +120,7 @@ static unsigned long super_cache_count(struct shrinker *shrink,
 	struct super_block *sb;
 	long	total_objects = 0;
 
-	sb = container_of(shrink, struct super_block, s_shrink);
+	sb = shrink->private_data;
 
 	/*
 	 * We don't call trylock_super() here as it is a scalability bottleneck,
@@ -182,7 +182,7 @@ static void destroy_unused_super(struct super_block *s)
 	security_sb_free(s);
 	put_user_ns(s->s_user_ns);
 	kfree(s->s_subtype);
-	free_prealloced_shrinker(&s->s_shrink);
+	shrinker_free(s->s_shrink);
 	/* no delays needed */
 	destroy_super_work(&s->destroy_work);
 }
@@ -259,16 +259,20 @@ static struct super_block *alloc_super(struct file_system_type *type, int flags,
 	s->s_time_min = TIME64_MIN;
 	s->s_time_max = TIME64_MAX;
 
-	s->s_shrink.seeks = DEFAULT_SEEKS;
-	s->s_shrink.scan_objects = super_cache_scan;
-	s->s_shrink.count_objects = super_cache_count;
-	s->s_shrink.batch = 1024;
-	s->s_shrink.flags = SHRINKER_NUMA_AWARE | SHRINKER_MEMCG_AWARE;
-	if (prealloc_shrinker(&s->s_shrink, "sb-%s", type->name))
+	s->s_shrink = shrinker_alloc(SHRINKER_NUMA_AWARE | SHRINKER_MEMCG_AWARE,
+				     "sb-%s", type->name);
+	if (!s->s_shrink)
 		goto fail;
-	if (list_lru_init_memcg(&s->s_dentry_lru, &s->s_shrink))
+
+	s->s_shrink->seeks = DEFAULT_SEEKS;
+	s->s_shrink->scan_objects = super_cache_scan;
+	s->s_shrink->count_objects = super_cache_count;
+	s->s_shrink->batch = 1024;
+	s->s_shrink->private_data = s;
+
+	if (list_lru_init_memcg(&s->s_dentry_lru, s->s_shrink))
 		goto fail;
-	if (list_lru_init_memcg(&s->s_inode_lru, &s->s_shrink))
+	if (list_lru_init_memcg(&s->s_inode_lru, s->s_shrink))
 		goto fail;
 	return s;
 
@@ -326,7 +330,7 @@ void deactivate_locked_super(struct super_block *s)
 {
 	struct file_system_type *fs = s->s_type;
 	if (atomic_dec_and_test(&s->s_active)) {
-		unregister_shrinker(&s->s_shrink);
+		shrinker_free(s->s_shrink);
 		fs->kill_sb(s);
 
 		/*
@@ -599,7 +603,7 @@ struct super_block *sget_fc(struct fs_context *fc,
 	hlist_add_head(&s->s_instances, &s->s_type->fs_supers);
 	spin_unlock(&sb_lock);
 	get_filesystem(s->s_type);
-	register_shrinker_prepared(&s->s_shrink);
+	shrinker_register(s->s_shrink);
 	return s;
 
 share_extant_sb:
@@ -678,7 +682,7 @@ struct super_block *sget(struct file_system_type *type,
 	hlist_add_head(&s->s_instances, &type->fs_supers);
 	spin_unlock(&sb_lock);
 	get_filesystem(type);
-	register_shrinker_prepared(&s->s_shrink);
+	shrinker_register(s->s_shrink);
 	return s;
 }
 EXPORT_SYMBOL(sget);
@@ -1312,7 +1316,7 @@ int get_tree_bdev(struct fs_context *fc,
 		down_write(&s->s_umount);
 	} else {
 		snprintf(s->s_id, sizeof(s->s_id), "%pg", bdev);
-		shrinker_debugfs_rename(&s->s_shrink, "sb-%s:%s",
+		shrinker_debugfs_rename(s->s_shrink, "sb-%s:%s",
 					fc->fs_type->name, s->s_id);
 		sb_set_blocksize(s, block_size(bdev));
 		error = fill_super(s, fc);
@@ -1385,7 +1389,7 @@ struct dentry *mount_bdev(struct file_system_type *fs_type,
 		down_write(&s->s_umount);
 	} else {
 		snprintf(s->s_id, sizeof(s->s_id), "%pg", bdev);
-		shrinker_debugfs_rename(&s->s_shrink, "sb-%s:%s",
+		shrinker_debugfs_rename(s->s_shrink, "sb-%s:%s",
 					fs_type->name, s->s_id);
 		sb_set_blocksize(s, block_size(bdev));
 		error = fill_super(s, data, flags & SB_SILENT ? 1 : 0);
diff --git a/include/linux/fs.h b/include/linux/fs.h
index 891cf662b26f..500238213fd9 100644
--- a/include/linux/fs.h
+++ b/include/linux/fs.h
@@ -1232,7 +1232,7 @@ struct super_block {
 
 	const struct dentry_operations *s_d_op; /* default d_op for dentries */
 
-	struct shrinker s_shrink;	/* per-sb shrinker handle */
+	struct shrinker *s_shrink;	/* per-sb shrinker handle */
 
 	/* Number of inodes with nlink == 0 but still referenced */
 	atomic_long_t s_remove_count;
-- 
2.30.2
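
For readers less familiar with the pattern, the move from container_of() to
shrink->private_data in super_cache_scan() and super_cache_count() can be
illustrated with a small userspace sketch. This is not kernel code: every
name in it (mock_shrinker, sb_embedded, sb_dynamic, mock_container_of, the
owner_*() and sb_dynamic_*() helpers) is a hypothetical stand-in, and plain
calloc()/free() replace whatever shrinker_alloc()/shrinker_free() do
internally. It only shows why the owner lookup has to change once the
shrinker is no longer embedded in its owner.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical userspace stand-ins; these are not the kernel definitions. */
struct mock_shrinker {
	int seeks;
	long batch;
	void *private_data;	/* back-pointer to the owning object */
};

/* Old layout: the shrinker is embedded in its owner. */
struct sb_embedded {
	int id;
	struct mock_shrinker s_shrink;
};

/* New layout: the owner only holds a pointer to the shrinker. */
struct sb_dynamic {
	int id;
	struct mock_shrinker *s_shrink;
};

#define mock_container_of(ptr, type, member) \
	((type *)((char *)(ptr) - offsetof(type, member)))

/* Old lookup: pointer arithmetic, valid only for an embedded member. */
static struct sb_embedded *owner_embedded(struct mock_shrinker *shrink)
{
	return mock_container_of(shrink, struct sb_embedded, s_shrink);
}

/* New lookup: follow the back-pointer stored at allocation time. */
static struct sb_dynamic *owner_dynamic(struct mock_shrinker *shrink)
{
	assert(shrink->private_data != NULL);
	return shrink->private_data;
}

/* Allocate the shrinker separately and wire up the back-pointer. */
static int sb_dynamic_init(struct sb_dynamic *sb, int id)
{
	sb->id = id;
	sb->s_shrink = calloc(1, sizeof(*sb->s_shrink));
	if (!sb->s_shrink)
		return -1;
	sb->s_shrink->batch = 1024;
	sb->s_shrink->private_data = sb;
	return 0;
}

static void sb_dynamic_destroy(struct sb_dynamic *sb)
{
	/*
	 * The commit message says the kernel object can be freed
	 * asynchronously with kfree_rcu(); this sketch frees immediately.
	 */
	free(sb->s_shrink);
	sb->s_shrink = NULL;
}
```

Because the separately allocated shrinker has its own lifetime, it can
outlive the owner's storage, which is presumably what makes the deferred
free described in the commit message possible.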