From mboxrd@z Thu Jan 1 00:00:00 1970 Date: Wed, 16 Oct 2002 18:39:07 +0530 From: Maneesh Soni Subject: Re: 2.5.42-mm2 hangs system Message-ID: <20021016183907.B29405@in.ibm.com> Reply-To: maneesh@in.ibm.com References: <20021013160451.GA25494@hswn.dk> <3DA9CA28.155BA5CB@digeo.com> <20021013223332.GA870@hswn.dk> Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20021013223332.GA870@hswn.dk>; from henrik@hswn.dk on Sun, Oct 13, 2002 at 10:34:40PM +0000 Sender: owner-linux-mm@kvack.org Return-Path: To: =?iso-8859-1?Q?Henrik_St=F8rner?= Cc: linux-mm@kvack.org, akpm@digeo.com, Dipankar Sarma List-ID: On Sun, Oct 13, 2002 at 10:34:40PM +0000, Henrik Storner wrote: > On Sun, Oct 13, 2002 at 12:31:52PM -0700, Andrew Morton wrote: > > Henrik Storner wrote: > > > > > > I gave 2.5.42-mm2 a test run yesterday, and it hung the box solid > > > while doing a kernel compile. The compile stopped dead in the middle > > > of a file, and there was no response when trying to access another > > > console (no X running). Alt-sysrq worked, so it wasn't completely dead > > > - sync/umount/reboot worked. > > > > > > Nothing in the logs - no oops or other kernel messages. > > > > > > Rebooted and repeated the experiment with the same result, > > > so it appears to be reproducible. > > > > > > Stock 2.5.42 has worked OK for a day now, including kernel > > > compiles - the system has performed flawlessly for a > > > couple of years as my normal workstation. > > > > > > PII processor, 384 MB RAM, SCSI disk (ncr53c8xx driver), > > > Intel eepro/100 network adapter. Kernel config at > > > http://www.hswn.dk/config-2.5.42-mm2 > > > > Very odd. > > > > If you have time, could you please enable "load all symbols" > > in the kernel hacking menu and capture a sysrq-T trace? > > Thanks. > > Did so - built it again from a fresh kernel tree, just to be sure. > Compiler is gcc 3.2 from Red Hat 8, by the way. > > Bug is still there. sysrq-T scrolls off the screen too fast for me to > read, but the last screenful has several processes like this (could > see sh, make, sh, gcc): > > Call Trace: Hello Henrik, I tired recreating the hang, but it didnot occur. I could guess from the call trace that you are using reiserfs and nfs but I not very clear how are you recreating it. I created a resierfs partition and exported it. Then tried to compile a kernel over it. I used the config file from the site you mentioned. It will be nice if you can list the exact recreation steps mentioning the filesystems you are using. As the hang looks like a loop in d_lookup can you try recreating it *without* dcache_rcu.patch. You can backout this patch http://www.zipworld.com.au/~akpm/linux/patches/2.5/2.5.42/2.5.42-mm2/broken-out/dcache_rcu.patch Thanks Maneesh -- Maneesh Soni IBM Linux Technology Center, IBM India Software Lab, Bangalore. Phone: +91-80-5044999 email: maneesh@in.ibm.com http://lse.sourceforge.net/ -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/