* 2.5.33-mm4 filemap_copy_from_user: Unexpected page fault
@ 2002-09-06 15:48 Steven Cole
2002-09-06 16:51 ` Andrew Morton
0 siblings, 1 reply; 5+ messages in thread
From: Steven Cole @ 2002-09-06 15:48 UTC (permalink / raw)
To: Andrew Morton; +Cc: linux-mm
With 2.5.33-mm4, I tried running dbench on an ext2 partition and was
able to run up to dbench 80 successfully. However, at dbench 96, I got
four messages like this:
filemap_copy_from_user: Unexpected page fault
Shortly after this, the box hung again, responsive to pings but little
else. I did sysrq-p and typed in the results, which are slightly
different than before using an ext3 partition. Sysrq-e and sysrq-i had
no effect, so I had to sysrq-b. The following fsck on the ext2 disk was
not fun. It may be worth noting that this hang occurred at 96 clients
with dbench on ext2 and at 8 clients on ext3 (data=ordered).
Here is the output of ksymoops on the sysrq-p result:
Steven
[steven@spc5 linux-2.5.33-mm4]$ ksymoops -K -L -O -v vmlinux -m System.map <mm4-sysrq-p.txt
ksymoops 2.4.4 on i686 2.5.33. Options used
-v vmlinux (specified)
-K (specified)
-L (specified)
-O (specified)
-m System.map (specified)
Pid: 1945, comm: pdflush
EIP: 068:[<c0159bf4>] CPU: 1 EFLAGS: 00000202 Not tainted
Using defaults from ksymoops -t elf32-i386 -a i386
EAX: 00000001 EBX: c90f3730 ECX: c90f3738 EDX: eb32bf88
ESI: c90f3730 EDI: 000065c2 EBP: 00000001 DS: 0068 ES: 0068
CR0: 8005003b CR2: 4212db0c CR3: 2a378000 CR4: 00000690
Call Trace: [<c0159e1e>] [<c013b8aa>] [<c013b4cb>] [<c013b570>] [<c013b57b>]
[<c013b830>] [<c0107284>] [<c0107289>]
Warning (Oops_read): Code line not seen, dumping what data is available
>>EIP; c0159bf4 <sync_sb_inodes+84/260> <=====
Trace; c0159e1e <writeback_inodes+4e/80>
Trace; c013b8aa <background_writeout+7a/c0>
Trace; c013b4cb <__pdflush+12b/1d0>
Trace; c013b570 <pdflush+0/10>
Trace; c013b57b <pdflush+b/10>
Trace; c013b830 <background_writeout+0/c0>
Trace; c0107284 <kernel_thread_helper+0/c>
Trace; c0107289 <kernel_thread_helper+5/c>
1 warning issued. Results may not be reliable.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: 2.5.33-mm4 filemap_copy_from_user: Unexpected page fault
2002-09-06 15:48 2.5.33-mm4 filemap_copy_from_user: Unexpected page fault Steven Cole
@ 2002-09-06 16:51 ` Andrew Morton
2002-09-06 17:03 ` Steven Cole
0 siblings, 1 reply; 5+ messages in thread
From: Andrew Morton @ 2002-09-06 16:51 UTC (permalink / raw)
To: Steven Cole; +Cc: linux-mm
Steven Cole wrote:
>
> With 2.5.33-mm4, I tried running dbench on an ext2 partition and was
> able to run up to dbench 80 successfully. However, at dbench 96, I got
> four messages like this:
>
> filemap_copy_from_user: Unexpected page fault
Yep. This means that the page we faulted in by-hand in generic_file_write()
wasn't resident during the subsequent copy_from_user().
That fault-in by-hand is there to prevent a deadlock. That printk
meand that it isnt working all the time. We have (always had) a
problem.
> Shortly after this, the box hung again,
> ...
> >>EIP; c0159bf4 <sync_sb_inodes+84/260> <=====
> Trace; c0159e1e <writeback_inodes+4e/80>
> Trace; c013b8aa <background_writeout+7a/c0>
> Trace; c013b4cb <__pdflush+12b/1d0>
Hum. Thanks for that.
I've been dbenching and compiling all night. And yet, it
seems that the dirty inode search in sync_sb_inodes() can trivially
lock up.
Does this fix?
fs-writeback.c | 4 ++--
1 files changed, 2 insertions(+), 2 deletions(-)
--- 2.5.33/fs/fs-writeback.c~scole Fri Sep 6 09:50:25 2002
+++ 2.5.33-akpm/fs/fs-writeback.c Fri Sep 6 09:50:47 2002
@@ -242,16 +242,16 @@ sync_sb_inodes(struct super_block *sb, s
if (wbc->nonblocking && bdi_write_congested(bdi)) {
wbc->encountered_congestion = 1;
+ list_move(&inode->i_list, &sb->s_dirty);
if (sb != blockdev_superblock)
break; /* Skip the entire fs */
- list_move(&inode->i_list, &sb->s_dirty);
continue;
}
if (wbc->bdi && bdi != wbc->bdi) {
+ list_move(&inode->i_list, &sb->s_dirty);
if (sb != blockdev_superblock)
break; /* inappropriate superblock */
- list_move(&inode->i_list, &sb->s_dirty);
continue; /* not this blockdev */
}
.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: 2.5.33-mm4 filemap_copy_from_user: Unexpected page fault
2002-09-06 16:51 ` Andrew Morton
@ 2002-09-06 17:03 ` Steven Cole
2002-09-06 17:36 ` Andrew Morton
0 siblings, 1 reply; 5+ messages in thread
From: Steven Cole @ 2002-09-06 17:03 UTC (permalink / raw)
To: Andrew Morton; +Cc: linux-mm
On Fri, 2002-09-06 at 10:51, Andrew Morton wrote:
> Steven Cole wrote:
> >
> > With 2.5.33-mm4, I tried running dbench on an ext2 partition and was
> > able to run up to dbench 80 successfully. However, at dbench 96, I got
> > four messages like this:
> >
> > filemap_copy_from_user: Unexpected page fault
>
> Yep. This means that the page we faulted in by-hand in generic_file_write()
> wasn't resident during the subsequent copy_from_user().
>
> That fault-in by-hand is there to prevent a deadlock. That printk
> meand that it isnt working all the time. We have (always had) a
> problem.
>
> > Shortly after this, the box hung again,
> > ...
> > >>EIP; c0159bf4 <sync_sb_inodes+84/260> <=====
> > Trace; c0159e1e <writeback_inodes+4e/80>
> > Trace; c013b8aa <background_writeout+7a/c0>
> > Trace; c013b4cb <__pdflush+12b/1d0>
>
> Hum. Thanks for that.
>
> I've been dbenching and compiling all night. And yet, it
> seems that the dirty inode search in sync_sb_inodes() can trivially
> lock up.
>
> Does this fix?
>
> fs-writeback.c | 4 ++--
> 1 files changed, 2 insertions(+), 2 deletions(-)
[patch snipped]
Unfortunately no. With that patch, it got up to dbench 10 on ext3 and
then it hung again.
I haven't reboot the box yet in case the output from sysrq-p would be
useful. It is a pain to type in though.
I had vmstat -n 1 1200 running in another terminal and here are the last
several lines of that output before the freeze:
0 0 0 0 636688 25356 19192 0 0 0 0 1077 2983 19 44 37
10 0 0 0 589504 25856 65452 0 0 24 0 1022 41 2 37 61
0 10 4 0 531940 26644 122060 0 0 12 22684 1194 431 6 51 42
0 10 3 0 527316 26736 126592 0 0 4 12944 1189 246 2 7 91
0 10 3 0 487488 27372 165676 0 0 12 15652 1238 569 7 41 52
0 10 1 0 487476 27372 165676 0 0 0 4868 1167 84 0 3 97
3 7 0 0 466636 27692 185728 0 0 20 12340 1188 202 6 24 70
0 10 2 0 462580 27772 189280 0 0 20 13240 1302 282 1 7 91
0 10 3 0 457832 27848 191244 0 0 8 9680 1350 334 2 5 93
0 10 3 0 457448 27864 191564 0 0 4 10064 1406 277 0 2 99
0 10 3 0 450812 27940 197804 0 0 12 12552 1382 351 2 6 92
4 6 3 0 442804 28132 205204 0 0 12 21928 1231 721 7 17 75
0 10 3 0 441248 28432 206376 0 0 12 18552 1195 5658 17 34 49
0 10 2 0 440500 28468 206712 0 0 4 2288 1153 394 1 3 96
9 3 5 0 438728 28580 207632 0 0 24 15100 1293 319 4 13 83
Steven
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: 2.5.33-mm4 filemap_copy_from_user: Unexpected page fault
2002-09-06 17:03 ` Steven Cole
@ 2002-09-06 17:36 ` Andrew Morton
2002-09-06 17:54 ` Steven Cole
0 siblings, 1 reply; 5+ messages in thread
From: Andrew Morton @ 2002-09-06 17:36 UTC (permalink / raw)
To: Steven Cole; +Cc: linux-mm
Steven Cole wrote:
>
> ...
> > Does this fix?
> ...
> Unfortunately no.
Well, isn't this fun? umm. You're _sure_ you ran the right kernel
and such?
Could you send your /proc/mounts, and tell me which of those partitions
you're running the test on?
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: 2.5.33-mm4 filemap_copy_from_user: Unexpected page fault
2002-09-06 17:36 ` Andrew Morton
@ 2002-09-06 17:54 ` Steven Cole
0 siblings, 0 replies; 5+ messages in thread
From: Steven Cole @ 2002-09-06 17:54 UTC (permalink / raw)
To: Andrew Morton; +Cc: linux-mm
On Fri, 2002-09-06 at 11:36, Andrew Morton wrote:
> Steven Cole wrote:
> >
> > ...
> > > Does this fix?
> > ...
> > Unfortunately no.
>
> Well, isn't this fun? umm. You're _sure_ you ran the right kernel
> and such?
Oops, forgot to include linux-mm on the earlier reply.
Yes I am sure I ran the right kernel.
>
> Could you send your /proc/mounts, and tell me which of those partitions
> you're running the test on?
>
Here that is again, for the linux-mm list:
[steven@spc5 linux-2.5.33-mm4]$ cat /proc/mounts
rootfs / rootfs rw 0 0
/dev/root / ext3 rw 0 0
/proc /proc proc rw 0 0
none /dev/pts devpts rw 0 0
/dev/sda5 /home ext3 rw 0 0
none /dev/shm tmpfs rw 0 0
/dev/sdb2 /share ext2 rw 0 0
/dev/sda3 /usr ext3 rw 0 0
Test were run on /home (ext3) with single exception of one earlier test
on /share (ext2).
Steven
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2002-09-06 17:54 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2002-09-06 15:48 2.5.33-mm4 filemap_copy_from_user: Unexpected page fault Steven Cole
2002-09-06 16:51 ` Andrew Morton
2002-09-06 17:03 ` Steven Cole
2002-09-06 17:36 ` Andrew Morton
2002-09-06 17:54 ` Steven Cole
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox