linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
From: Linus Torvalds <torvalds@linux-foundation.org>
To: Andrew Morton <akpm@linux-foundation.org>
Cc: Yafang Shao <laoar.shao@gmail.com>,
	ebiederm@xmission.com,  alexei.starovoitov@gmail.com,
	rostedt@goodmis.org, linux-mm@kvack.org,
	 linux-fsdevel@vger.kernel.org,
	linux-trace-kernel@vger.kernel.org,  audit@vger.kernel.org,
	linux-security-module@vger.kernel.org,  selinux@vger.kernel.org,
	bpf@vger.kernel.org, netdev@vger.kernel.org,
	 dri-devel@lists.freedesktop.org
Subject: Re: [PATCH v2 05/10] mm/util: Fix possible race condition in kstrdup()
Date: Thu, 13 Jun 2024 15:17:53 -0700	[thread overview]
Message-ID: <CAHk-=wgqrwFXK-CO8-V4fwUh5ymnUZ=wJnFyufV1dM9rC1t3Lg@mail.gmail.com> (raw)
In-Reply-To: <20240613141435.fad09579c934dbb79a3086cc@linux-foundation.org>

On Thu, 13 Jun 2024 at 14:14, Andrew Morton <akpm@linux-foundation.org> wrote:
>
> The concept sounds a little strange.  If some code takes a copy of a
> string while some other code is altering it, yes, the result will be a
> mess.  This is why get_task_comm() exists, and why it uses locking.

The thing is, get_task_comm() is terminally broken.

Nobody sane uses it, and sometimes it's literally _because_ it uses locking.

Let's look at the numbers:

 - 39 uses of get_task_comm()

 - 2 uses of __get_task_comm() because the locking doesn't work

 - 447 uses of raw "current->comm"

 - 112 uses of raw 'ta*sk->comm' (and possibly

IOW, we need to just accept the fact that nobody actually wants to use
"get_task_comm()". It's a broken interface. It's inconvenient, and the
locking makes it worse.

Now, I'm not convinced that kstrdup() is what anybody should use
should, but of the 600 "raw" uses of ->comm, four of them do seem to
be kstrdup.

Not great, I think they could be removed, but they are examples of
people doing this. And I think it *would* be good to have the
guarantee that yes, the kstrdup() result is always a proper string,
even if it's used for unstable sources. Who knows what other unstable
sources exist?

I do suspect that most of the raw uses of 'xyz->comm' is for
printouts. And I think we would be better with a '%pTSK' vsnprintf()
format thing for that.

Sadly, I don't think coccinelle can do the kinds of transforms that
involve printf format strings.

And no, a printk() string still couldn't use the locking version.

               Linus


  reply	other threads:[~2024-06-13 22:18 UTC|newest]

Thread overview: 19+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-06-13  2:30 [PATCH v2 00/10] Improve the copy of task comm Yafang Shao
2024-06-13  2:30 ` [PATCH v2 01/10] fs/exec: Drop task_lock() inside __get_task_comm() Yafang Shao
2024-06-13  2:30 ` [PATCH v2 02/10] auditsc: Replace memcpy() with __get_task_comm() Yafang Shao
2024-06-13  2:30 ` [PATCH v2 03/10] security: " Yafang Shao
2024-06-13  2:30 ` [PATCH v2 04/10] bpftool: Ensure task comm is always NUL-terminated Yafang Shao
2024-06-13  2:30 ` [PATCH v2 05/10] mm/util: Fix possible race condition in kstrdup() Yafang Shao
2024-06-13 21:14   ` Andrew Morton
2024-06-13 22:17     ` Linus Torvalds [this message]
2024-06-14  2:41       ` Yafang Shao
2024-06-14  2:33     ` Yafang Shao
2024-06-13  2:30 ` [PATCH v2 06/10] mm/kmemleak: Replace strncpy() with __get_task_comm() Yafang Shao
2024-06-13  8:37   ` Catalin Marinas
2024-06-13 12:10     ` Yafang Shao
2024-06-14 10:57       ` Catalin Marinas
2024-06-14 11:45         ` Yafang Shao
2024-06-13  2:30 ` [PATCH v2 07/10] tsacct: " Yafang Shao
2024-06-13  2:30 ` [PATCH v2 08/10] tracing: " Yafang Shao
2024-06-13  2:30 ` [PATCH v2 09/10] net: Replace strcpy() " Yafang Shao
2024-06-13  2:30 ` [PATCH v2 10/10] drm: " Yafang Shao

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='CAHk-=wgqrwFXK-CO8-V4fwUh5ymnUZ=wJnFyufV1dM9rC1t3Lg@mail.gmail.com' \
    --to=torvalds@linux-foundation.org \
    --cc=akpm@linux-foundation.org \
    --cc=alexei.starovoitov@gmail.com \
    --cc=audit@vger.kernel.org \
    --cc=bpf@vger.kernel.org \
    --cc=dri-devel@lists.freedesktop.org \
    --cc=ebiederm@xmission.com \
    --cc=laoar.shao@gmail.com \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=linux-security-module@vger.kernel.org \
    --cc=linux-trace-kernel@vger.kernel.org \
    --cc=netdev@vger.kernel.org \
    --cc=rostedt@goodmis.org \
    --cc=selinux@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox