From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-14.0 required=3.0 tests=BAYES_00,INCLUDES_CR_TRAILER, INCLUDES_PATCH,MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 2F32EC433DB for ; Tue, 5 Jan 2021 00:55:11 +0000 (UTC) Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.kernel.org (Postfix) with ESMTP id ADC4E227C3 for ; Tue, 5 Jan 2021 00:55:10 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org ADC4E227C3 Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=kernel.org Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=owner-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix) id B93D68D0040; Mon, 4 Jan 2021 19:55:09 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id B45248D0036; Mon, 4 Jan 2021 19:55:09 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id A32AE8D0040; Mon, 4 Jan 2021 19:55:09 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from forelay.hostedemail.com (smtprelay0004.hostedemail.com [216.40.44.4]) by kanga.kvack.org (Postfix) with ESMTP id 835848D0036 for ; Mon, 4 Jan 2021 19:55:09 -0500 (EST) Received: from smtpin26.hostedemail.com (10.5.19.251.rfc1918.com [10.5.19.251]) by forelay05.hostedemail.com (Postfix) with ESMTP id 3FBEB181AC9B6 for ; Tue, 5 Jan 2021 00:55:09 +0000 (UTC) X-FDA: 77669902338.26.cub63_260013b274d4 Received: from filter.hostedemail.com (10.5.16.251.rfc1918.com [10.5.16.251]) by smtpin26.hostedemail.com (Postfix) with ESMTP id 1FADB1804B65C for ; Tue, 5 Jan 2021 00:55:09 +0000 (UTC) X-HE-Tag: cub63_260013b274d4 X-Filterd-Recvd-Size: 7643 Received: from mail-il1-f175.google.com (mail-il1-f175.google.com [209.85.166.175]) by imf26.hostedemail.com (Postfix) with ESMTP for ; Tue, 5 Jan 2021 00:55:08 +0000 (UTC) Received: by mail-il1-f175.google.com with SMTP id w17so27067040ilj.8 for ; Mon, 04 Jan 2021 16:55:08 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=n/CGDoYBE+dXOpGqRxCS2ivcb9jRqslobwDq2y3UTJw=; b=g0ixFLzwTCafWEb/mR5cTDCa77NeW/qMv3eI95bEDFTPoCaBuQ1xNh2LSG/5KXRAG0 uHapckSwsGoT146718DChVuqijf1H9Fjnq3KARb8U5zZZ37B9Dmh6XWXJjjl0P3QJLXq MY7d0SfwRvQKjuCs9gI4Z/KxS/6H9aS7/EyAEOta+hyUkicwYiYJT0ZUsn3oJ5ioe8Ho wRXI7nR61LS/mFICcoRoci2O80Y9BxeAvUh/QFdTZFKTdCndNZUVmzGc0MljT6Oi5aXC GXQC+66MRLOuJyx6TK+o86DjvfMB1jHn6PB8sn9t/1dtSzRoQwafBY6hinVVwy7qpcEk r3SA== X-Gm-Message-State: AOAM531uEMc9inEt9xxt9o9wvGD3PxnkvTL2Xs6Yr/cbJl+xxYCFl05M yzekZgk7c9dZXQuVogtXKn4= X-Google-Smtp-Source: ABdhPJwCDc4No6Ih+nP5WcZFjxbRaVJtlmxjrpcZkiRm5I2kMAJU5BQk98PIzmmTEyCpgZZWOZVu6w== X-Received: by 2002:a92:6f12:: with SMTP id k18mr73850762ilc.66.1609808107998; Mon, 04 Jan 2021 16:55:07 -0800 (PST) Received: from google.com (110.41.72.34.bc.googleusercontent.com. [34.72.41.110]) by smtp.gmail.com with ESMTPSA id 15sm33356984ilx.84.2021.01.04.16.55.07 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 04 Jan 2021 16:55:07 -0800 (PST) Date: Tue, 5 Jan 2021 00:55:06 +0000 From: Dennis Zhou To: Nathan Chancellor Cc: Tejun Heo , Christoph Lameter , linux-mm@kvack.org, linux-kernel@vger.kernel.org, clang-built-linux@googlegroups.com, kbuild-all@lists.01.org, kernel test robot Subject: Re: [PATCH] percpu: fix clang modpost warning in pcpu_build_alloc_info() Message-ID: References: <20201231212852.3175381-1-dennis@kernel.org> <20210104234651.GA3548546@ubuntu-m3-large-x86> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20210104234651.GA3548546@ubuntu-m3-large-x86> X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: On Mon, Jan 04, 2021 at 04:46:51PM -0700, Nathan Chancellor wrote: > On Thu, Dec 31, 2020 at 09:28:52PM +0000, Dennis Zhou wrote: > > This is an unusual situation so I thought it best to explain it in a > > separate patch. > > > > "percpu: reduce the number of cpu distance comparisons" introduces a > > dependency on cpumask helper functions in __init code. This code > > references a struct cpumask annotated __initdata. When the function is > > inlined (gcc), everything is fine, but clang decides not to inline these > > function calls. This causes modpost to warn about an __initdata access > > by a function not annotated with __init [1]. > > > > Ways I thought about fixing it: > > 1. figure out why clang thinks this inlining is too costly. > > 2. create a wrapper function annotated __init (this). > > 3. annotate cpumask with __refdata. > > > > Ultimately it comes down to if it's worth saving the cpumask memory and > > allowing it to be freed. IIUC, __refdata won't be freed, so option 3 is > > just a little wasteful. 1 is out of my depth, leaving 2. I don't feel > > great about this behavior being dependent on inlining semantics, but > > cpumask helpers are small and probably should be inlined. > > > > modpost complaint: > > WARNING: modpost: vmlinux.o(.text+0x735425): Section mismatch in reference from the function cpumask_clear_cpu() to the variable .init.data:pcpu_build_alloc_info.mask > > The function cpumask_clear_cpu() references > > the variable __initdata pcpu_build_alloc_info.mask. > > This is often because cpumask_clear_cpu lacks a __initdata > > annotation or the annotation of pcpu_build_alloc_info.mask is wrong. > > > > clang output: > > mm/percpu.c:2724:5: remark: cpumask_clear_cpu not inlined into pcpu_build_alloc_info because too costly to inline (cost=725, threshold=325) [-Rpass-missed=inline] > > > > [1] https://lore.kernel.org/linux-mm/202012220454.9F6Bkz9q-lkp@intel.com/ > > > > Reported-by: kernel test robot > > Signed-off-by: Dennis Zhou > > --- > > This is on top of percpu#for-5.12. > > > > mm/percpu.c | 16 ++++++++++++++-- > > 1 file changed, 14 insertions(+), 2 deletions(-) > > > > diff --git a/mm/percpu.c b/mm/percpu.c > > index 80f8f885a990..357977c4cb00 100644 > > --- a/mm/percpu.c > > +++ b/mm/percpu.c > > @@ -2642,6 +2642,18 @@ early_param("percpu_alloc", percpu_alloc_setup); > > > > /* pcpu_build_alloc_info() is used by both embed and page first chunk */ > > #if defined(BUILD_EMBED_FIRST_CHUNK) || defined(BUILD_PAGE_FIRST_CHUNK) > > + > > +/* > > + * This wrapper is to avoid a warning where cpumask_clear_cpu() is not inlined > > + * when compiling with clang causing modpost to warn about accessing __initdata > > + * from a non __init function. By doing this, we allow the struct cpumask to be > > + * freed instead of it taking space by annotating with __refdata. > > + */ > > +static void __init pcpu_cpumask_clear_cpu(int cpu, struct cpumask *mask) > > +{ > > + cpumask_clear_cpu(cpu, mask); > > +} > > + > > /** > > * pcpu_build_alloc_info - build alloc_info considering distances between CPUs > > * @reserved_size: the size of reserved percpu area in bytes > > @@ -2713,7 +2725,7 @@ static struct pcpu_alloc_info * __init pcpu_build_alloc_info( > > cpu = cpumask_first(&mask); > > group_map[cpu] = group; > > group_cnt[group]++; > > - cpumask_clear_cpu(cpu, &mask); > > + pcpu_cpumask_clear_cpu(cpu, &mask); > > > > for_each_cpu(tcpu, &mask) { > > if (!cpu_distance_fn || > > @@ -2721,7 +2733,7 @@ static struct pcpu_alloc_info * __init pcpu_build_alloc_info( > > cpu_distance_fn(tcpu, cpu) == LOCAL_DISTANCE)) { > > group_map[tcpu] = group; > > group_cnt[group]++; > > - cpumask_clear_cpu(tcpu, &mask); > > + pcpu_cpumask_clear_cpu(tcpu, &mask); > > } > > } > > } > > -- > > 2.29.2.729.g45daf8777d-goog > > Hi Nathan, > > Hi Dennis, > > I did a bisect of the problematic config against defconfig and it points > out that CONFIG_GCOV_PROFILE_ALL is in the bad config but not the good > config, which makes some sense as that will mess with clang's inlining > heuristics. It does not appear to be the single config that makes a > difference but it gives some clarity. > Ah, thanks. To me it's kind of a corner case that I don't have a lot of insight into. __init code is pretty limited and this warning is really at the compilers whim. However, in this case only clang throws this warning. > I do not personally have any strong opinions around the patch but is it > really that much wasted memory to just annotate mask with __refdata? It's really not much memory, 1 bit per max # of cpus. The reported config is on the extreme side compiling with 8k NR_CPUS, so 1kb. I'm just not in love with the idea of adding a patch to improve readability and it cost idle memory to resolve a compile time warning. If no one else chimes in in the next few days, I'll probably just apply it and go from there. If another issue comes up I'll drop this and tag it as __refdata. Thanks, Dennis