From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from smtp1.linuxfoundation.org (smtp1.linux-foundation.org [172.17.192.35]) by mail.linuxfoundation.org (Postfix) with ESMTPS id E9D5D29C6 for ; Mon, 8 Jul 2019 18:01:56 +0000 (UTC) Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp1.linuxfoundation.org (Postfix) with ESMTPS id 7C81D827 for ; Mon, 8 Jul 2019 18:01:56 +0000 (UTC) Date: Mon, 8 Jul 2019 14:01:55 -0400 From: Sasha Levin To: Mark Brown Message-ID: <20190708180155.GP10104@sasha-vm> References: <20190703013557.GQ11506@sasha-vm> <20190705164142.GC20625@sirena.org.uk> <20190705201231.GI10104@sasha-vm> <20190706003214.GE20625@sirena.org.uk> <20190708110208.GN10104@sasha-vm> <20190708123733.GC8576@sirena.org.uk> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii; format=flowed Content-Disposition: inline In-Reply-To: <20190708123733.GC8576@sirena.org.uk> Cc: ksummit-discuss@lists.linuxfoundation.org Subject: Re: [Ksummit-discuss] [MAINTAINERS SUMMIT] stable kernel process automation and improvement List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , On Mon, Jul 08, 2019 at 01:37:33PM +0100, Mark Brown wrote: >On Mon, Jul 08, 2019 at 07:02:08AM -0400, Sasha Levin wrote: >> On Sat, Jul 06, 2019 at 01:32:14AM +0100, Mark Brown wrote: > >> > I'm not saying leave it alone, it's more a question of how >> > aggressive we are about picking up things we think might be >> > relevant fixes but haven't had some sort of domain specific >> > analysis of. Testing is a good way to mitigate the potential >> > risks here. > >> I agree, and for various subsystems and drivers where the maintainers >> volunteer their domain specific expertise to send backports to stable, I >> have "blacklisted" it from AUTOSEL since indeed it's a much better >> option. > >Hrm, it's definitely getting a bunch of stuff for my subsystems >where I do tag things for stable... You still need to explicitly ask me to blacklist it, but I'm more than happy to if you feel the AUTOSEL process doesn't add value. Some maintainers choose to keep AUTOSEL but just respond with "NAK" on patches they don't want in. >> > > This came up in the last MS, and the agreement there was that we expect >> > > stable kernel users to test their workloads before throwing it into >> > > production. > >> > That's kind of the problem - if people are doing testing and end >> > up finding problems coming back in the stable kernel that's the >> > sort of thing that encourages them to not just take stable en >> > masse as we say they should. Part of the deal with stable is >> > that it is conservative, people can trust it to be a low risk >> > update. That's not happening now as far as I'm aware but it does >> > worry me that it might happen. > >> Right, and the rate at which AUTOSEL commits are reverted is lower than >> commits that are actually tagged for stable. If AUTOSEL commits on their >> own were being reverted left and right I'd agree we need to tone it >> down, but I don't see it happening now. > >I'm not sure how many people will actually report problems they >experience upstream rather than just fixing things locally and >just moving on. The more code is the more likely it is that one >of the users will report things. > >> > > If we were to start avoiding driver updates, it would act as an >> > > incentive for people not to upgrade their kernel. > >> > I'm not sure I follow the logic here? > >> The way I see it, the lower your "effective delta" is between to >> kernels, the easier it is to move forward. For example, if I have a >> product that runs on 4.19 and uses all our core kernel code + 10 >> drivers, and I know that those drivers had most of the fixes backported >> to my LTS tree, I'd feel much more confident going to 5.4 knowning that >> I already have most of the patches that come with 5.4. > >I see, that's definitely a new one to me. The concerns people >usually have about upgrading are more around the core kernel >changing performance characteristics or something in a way that >disrupts important workloads. I'm not quite sure I follow the >logic there TBH, it seems to be discounting new development >rather too much - even if the drivers have been very static >there's all the integration with the rest of the kernel to think >about. My thinking is that we will need to address new core kernel developments either way, which is why I haven't mentioned them here. The variable cost here is how much effort will go into validating my hardware devices and the code that runs them. >> For me it's a matter of how one would budget a move from a kernel X LTS >> to kernel Y LTS, and I think that as that budget requirement grows it's >> actually harder to actually do it (and convince management), acting as a >> negative incentive to stay with whatever works now. > >If the drivers are static enough to only be getting bug fixes >surely the rest of the kernel is a massively more substantial >concern? They're not too static, and sadly them being less tested means I'm more worried about drivers than core kernel code. Sure, the core kernel is also a concern but as I've mentioned above, you will pay the price for re-testing core kernel stuff anyway. -- Thanks, Sasha