From: Petr Mladek <pmladek@suse.com>
To: Steven Rostedt <rostedt@goodmis.org>
Cc: Peter Zijlstra <peterz@infradead.org>,
"ksummit-discuss@lists.linuxfoundation.org"
<ksummit-discuss@lists.linuxfoundation.org>
Subject: Re: [Ksummit-discuss] [TECH TOPIC] printk redesign
Date: Wed, 21 Jun 2017 14:23:04 +0200 [thread overview]
Message-ID: <20170621122304.GC1538@pathway.suse.cz> (raw)
In-Reply-To: <20170620192858.142a43ff@gandalf.local.home>
On Tue 2017-06-20 19:28:58, Steven Rostedt wrote:
> I've thought about this a little too.
>
> I would like printk to have per-cpu buffers. Then we don't even need to
> store the CPU number, that would be explicit by which buffer the data
> is stored in.
>
> The one thing that is needed, is the consumer. In ftrace, it's whatever
> reads the buffer, which is usually user space, but can be the kernel
> (see sysctl-z). But there's only one consumer at a time.
>
> I was thinking about a new design for printk. Similar to ftrace, but
> different.
>
> 1) have per cpu buffers, that are lockless. Writes happen immediately,
> but the output happens later.
My problems with per-CPU buffers is that:
+ I am not sure how big per-CPU buffers we could afford.
Any non-balanced usage increases the chance of loosing
messages.
+ The information is scattered and extra tools are needed
to locate the messages and sort them.
+ It suggests that the solution should be lockless. But
lockless code is very complex in principle. The ring
buffer used by ftrace is a good example and it is
still limited to one reader.
> 2) have two types of console interfaces. A normal and a critical.
>
> 3) have a thread that is woken whenever there is data in any of the
> buffers, and reads the buffers, again lockless. But to do this in a
> reasonable manner, unless you break the printks up in sub buffers like
> ftrace, if the consumer isn't fast enough, newer messages are dropped.
>
> 4) If a critical print is needed (and here's why we have two console
> interfaces), the normal console interface gets turned off, and the
> buffers stop being output through them. What ever called the critical
> print, will take over, and flush out all the contents of the current
> buffers. Then anything printed during the critical section will go out
> immediately (no buffering). The printk thread, will stop having access
> to the buffers, and shutdown till the critical section is complete.
IMHO, this is something that we are already trying to implement by
the printk_kthread.
To be honest, I am not sure if I have a good top view at the moment.
Especially I am not sure about all the existing problems and requirements.
I always hear the the printk code is too complex. Then people complain
about various limitations. Solution of the limitations usually make
the code even more complex.
IMHO, the two main fighting tasks are:
1. store messages as fast as possible
2. show the messages as reliably as possible
IMHO, we are relatively good in the storing part. The biggest
problems are on the showing side, especially when it comes to
slow and messy consoles.
I tried to look at it also in more details. The problems that come
to my mind are:
1. hard lockups in NMI
2. hard lockups caused by recursive calls, e.g. warnings
triggered from printk() code
3. soft lockups caused by console handling
4. Lost messages when the is a flood of them
5. Lost messages when the system hangs
6. mixed part of continues lines or related lines, e.g. backtraces,
WARN()
7. Unreliable time stamps and sorting of messages.
8. console code is a big mess and I am afraid that I am still not
aware of many hidden traps there.
Let me to look closer at the problems:
Ad 1. hard lockups in NMI
It is almost solved by the printk_safe buffer. One drawback
is that the messages are temporary stored separately and
the buffer is rather small.
Lock-less ring buffer would help. The question is if is
worth the cost. It still does not solve pushing to consoles
that might have their own locks.
Ad 2. hard lockups by recursive calls
The recursion printk() -> some_func() -> printk()
is mostly solved by printk_safe. It has the same drawbacks
as the NMI solution.
The recursion some_func() -> printk() -> some_func() -> printk()
is partly solved by printk_deferred(). It avoids the recursion
from the console handling code. I actually do not know about
better solution. Note that the deadlock usually happens in
some_func() and _not_ in printk(). I do not see how printk()
itself could detect and prevent this. We could try to detect
this problems earlier using lockdep.
Ad 3. soft lockups caused by console handling
We basically need some offloading for the console handling.
The current problem is how to detect critical situation
and switch to the sync mode.
Ad 4. Lost messages when the is a flood of them
Separate buffers or reshuffling (dropping) less important
messages would help.
Ad 5. Lost messages when the system hangs
We already have troubles and the console offloading
makes it worse.
We should reduce the negative effects of offloading.
We should make sure that someone is always handling
console and reduce sleeps with console_lock. Also
everyone should try to handle some messages when
the console_lock is available to handle sudden death.
It was never perfect. The patchset from Peter Zijlstra
(early printk) looks like an interesting fallback to me.
We should make more consoles lock less.
We could also implement storing log into persistent memory.
Ad 6. Mixed parts of continues lines and related lines.
We need to be careful here. The cont buffer handling
made the printk code much more complex, introduced
many regressions. We always have to consider
the complexity and the gain.
There are some proposals for an API that would allow
to enter/exit a buffered mode. One question is if
we could afford to disable preemption (use per-CPU
buffers). Another question is the complexity
and extra memory needs.
IMPORTANT: Any buffering is dangerous for the reliability of
the output. By other words, buffering delays output and
we might never see such messages.
7. Unreliable time stamps and sorting messages
The current extra buffers (cont, printk_safe, printk_safe_nmi)
makes this worse. The timestamp is added later.
We could surely improve this. But it is always with the cost
of complexity. Also it might bring new problems when interacting
with the timer code.
Did I miss some important problems?
Did I miss some possible solutions?
I have to admit that I did not have time to think about the last
proposals from Sergey about printk_kthread. So, some of the above
summary might be a bit out of date.
Anyway, I wanted to move the discussion from implementation
back to gathering requirements and the problems with the current
implementation. At least I am not able to judge other implementation
proposals without it. Also I wanted to summarize the current
know-how. I hope that it would help to move forward and avoid
discussion cycles.
I hope that I did not kill the brainstorming effect with this.
Best Regards,
Petr
next prev parent reply other threads:[~2017-06-21 12:23 UTC|newest]
Thread overview: 57+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-06-19 5:21 Sergey Senozhatsky
2017-06-19 6:22 ` Hannes Reinecke
2017-06-19 14:39 ` Steven Rostedt
2017-06-19 15:20 ` Andrew Lunn
2017-06-19 15:54 ` Hannes Reinecke
2017-06-19 16:17 ` Andrew Lunn
2017-06-19 16:23 ` Mark Brown
2017-06-20 15:58 ` Sergey Senozhatsky
2017-06-20 16:44 ` Luck, Tony
2017-06-20 17:11 ` Sergey Senozhatsky
2017-06-20 17:27 ` Mark Brown
2017-06-20 23:28 ` Steven Rostedt
2017-06-21 7:17 ` Hannes Reinecke
2017-06-21 11:12 ` Sergey Senozhatsky
2017-06-22 14:06 ` Steven Rostedt
2017-06-23 5:43 ` Sergey Senozhatsky
2017-06-23 13:09 ` Steven Rostedt
2017-06-21 12:23 ` Petr Mladek [this message]
2017-06-21 14:18 ` Andrew Lunn
2017-06-23 8:46 ` Petr Mladek
2017-06-21 16:09 ` Andrew Lunn
2017-06-23 8:49 ` Petr Mladek
2017-07-19 7:35 ` David Woodhouse
2017-07-20 7:53 ` Sergey Senozhatsky
2017-06-20 16:09 ` Sergey Senozhatsky
2017-06-19 16:26 ` Steven Rostedt
2017-06-19 16:35 ` Andrew Lunn
2017-06-24 11:14 ` Mauro Carvalho Chehab
2017-06-24 14:06 ` Andrew Lunn
2017-06-24 22:42 ` Steven Rostedt
2017-06-24 23:21 ` Andrew Lunn
2017-06-24 23:26 ` Linus Torvalds
2017-06-24 23:40 ` Steven Rostedt
2017-06-26 11:16 ` Sergey Senozhatsky
2017-06-24 23:48 ` Al Viro
2017-06-25 1:29 ` Andrew Lunn
2017-06-25 2:41 ` Linus Torvalds
2017-06-26 8:46 ` Jiri Kosina
2017-07-19 7:59 ` David Woodhouse
2017-06-20 15:56 ` Sergey Senozhatsky
2017-06-20 18:45 ` Daniel Vetter
2017-06-21 9:29 ` Petr Mladek
2017-06-21 10:15 ` Sergey Senozhatsky
2017-06-22 13:42 ` Daniel Vetter
2017-06-22 13:48 ` Daniel Vetter
2017-06-23 9:07 ` Bartlomiej Zolnierkiewicz
2017-06-27 13:06 ` Sergey Senozhatsky
2017-06-23 5:20 ` Sergey Senozhatsky
2017-06-19 23:46 ` Josh Triplett
2017-06-20 8:24 ` Arnd Bergmann
2017-06-20 14:36 ` Steven Rostedt
2017-06-20 15:26 ` Sergey Senozhatsky
2017-06-22 16:35 ` David Howells
2017-07-19 6:24 ` Sergey Senozhatsky
2017-07-19 6:25 ` Sergey Senozhatsky
2017-07-19 7:26 ` Daniel Vetter
2017-07-20 5:19 ` Sergey Senozhatsky
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20170621122304.GC1538@pathway.suse.cz \
--to=pmladek@suse.com \
--cc=ksummit-discuss@lists.linuxfoundation.org \
--cc=peterz@infradead.org \
--cc=rostedt@goodmis.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox