* http://ds9a.nl/cacheinfo project - please comment & improve
@ 2001-05-27 20:20 bert hubert
  2001-05-30 19:54 ` Marcelo Tosatti
  0 siblings, 1 reply; 8+ messages in thread
From: bert hubert @ 2001-05-27 20:20 UTC (permalink / raw)
  To: linux-mm

Hello mm people!

I've written a module plus a tiny userspace program to query the page
cache. In short:

$ cinfo /lib/libc.so.6
/lib/libc.so.6: 182 of 272 (66.91%) pages in the cache, of which 0 (0.00%)
are dirty

Now, I'm a complete and utter beginner when it comes to kernelcoding. Also,
this is very much a 'release early, release often'-release. In other words,
it sucks & I know.

So I would like to ask you to look at it and send comments/patches to me.
I'm especially interested in architectural decisions - I currently export
data over a filesystem (cinfofs), which may or may not be right.

The tarball (http://ds9a.nl/cacheinfo/cinfo-0.1.tar.gz) contains 2 manpages
which very lightly document how it works.

Thanks for your time!

Regards,

bert hubert

-- 
http://www.PowerDNS.com      Versatile DNS Services  
Trilab                       The Technology People   
'SYN! .. SYN|ACK! .. ACK!' - the mating call of the internet
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/


* Re: http://ds9a.nl/cacheinfo project - please comment & improve
  2001-05-27 20:20 http://ds9a.nl/cacheinfo project - please comment & improve bert hubert
@ 2001-05-30 19:54 ` Marcelo Tosatti
  2001-05-30 21:48   ` bert hubert
  0 siblings, 1 reply; 8+ messages in thread
From: Marcelo Tosatti @ 2001-05-30 19:54 UTC (permalink / raw)
  To: bert hubert; +Cc: linux-mm


On Sun, 27 May 2001, bert hubert wrote:

> Hello mm people!
> 
> I've written a module plus a tiny userspace program to query the page
> cache. In short:
> 
> $ cinfo /lib/libc.so.6
> /lib/libc.so.6: 182 of 272 (66.91%) pages in the cache, of which 0 (0.00%)
> are dirty
> 
> Now, I'm a complete and utter beginner when it comes to kernelcoding. Also,
> this is very much a 'release early, release often'-release. In other words,
> it sucks & I know.
> 
> So I would like to ask you to look at it and send comments/patches to me.
> I'm especially interested in architectural decisions - I currently export
> data over a filesystem (cinfofs), which may or may not be right.
> 
> The tarball (http://ds9a.nl/cacheinfo/cinfo-0.1.tar.gz) contains 2 manpages
> which very lightly document how it works.

Hi Bert, 

You're using the "address_space->dirty_pages" list to calculate the number
of dirty pages.

It's interesting to note that pages on this list may not really be dirty,
since we don't mark them clean when writing them out (we only do that at
fdatasync/fsync time).

So I suggest you check the PG_dirty bit (with the PageDirty macro) on the
pages of that list to know whether they are really dirty.



* Re: http://ds9a.nl/cacheinfo project - please comment & improve
  2001-05-30 21:48   ` bert hubert
@ 2001-05-30 20:27     ` Marcelo Tosatti
  2001-05-31 17:17     ` Ingo Oeser
  1 sibling, 0 replies; 8+ messages in thread
From: Marcelo Tosatti @ 2001-05-30 20:27 UTC (permalink / raw)
  To: bert hubert; +Cc: linux-mm


On Wed, 30 May 2001, bert hubert wrote:

> On Wed, May 30, 2001 at 04:54:12PM -0300, Marcelo Tosatti wrote:
> 
> > You're using the "address_space->dirty_pages" list to calculate the number
> > of dirty pages.
> 
> I was wondering about that. In limited testing I've never seen a non-0
> content of the dirty list. I ran:
> 
> dd if=/dev/zero of=test count=100000 &
> while true ; do ./cinfo test; done
> 
> And saw no dirty pages. 

Oops.

You will see no dirty pages here anyway --- data written through
write() is committed to the buffer cache directly.

You can loop over each page on the clean_pages list and check its
"page->buffers" pointer.

If there are dirty buffer_heads there, you can count the page as dirty.


* Re: http://ds9a.nl/cacheinfo project - please comment & improve
  2001-05-30 19:54 ` Marcelo Tosatti
@ 2001-05-30 21:48   ` bert hubert
  2001-05-30 20:27     ` Marcelo Tosatti
  2001-05-31 17:17     ` Ingo Oeser
  0 siblings, 2 replies; 8+ messages in thread
From: bert hubert @ 2001-05-30 21:48 UTC (permalink / raw)
  To: Marcelo Tosatti; +Cc: linux-mm

On Wed, May 30, 2001 at 04:54:12PM -0300, Marcelo Tosatti wrote:

> You're using the "address_space->dirty_pages" list to calculate the number
> of dirty pages.

I was wondering about that. In limited testing I've never seen a non-0
content of the dirty list. I ran:

dd if=/dev/zero of=test count=100000 &
while true ; do ./cinfo test; done

And saw no dirty pages. 

> So I suggest you check the PG_dirty bit (with the PageDirty macro) on the
> pages of that list to know whether they are really dirty.

Ok - will do. I plan to release a slightly improved version shortly that
addresses this issue. Thanks!

Oh, if anybody has ideas on statistics that should be exported, please let
me know. On the agenda is a bitmap that describes which pages are actually
in the cache.

Regards,

bert

-- 
http://www.PowerDNS.com      Versatile DNS Services  
Trilab                       The Technology People   
'SYN! .. SYN|ACK! .. ACK!' - the mating call of the internet

* Re: http://ds9a.nl/cacheinfo project - please comment & improve
  2001-05-30 21:48   ` bert hubert
  2001-05-30 20:27     ` Marcelo Tosatti
@ 2001-05-31 17:17     ` Ingo Oeser
  2001-05-31 21:53       ` bert hubert
  1 sibling, 1 reply; 8+ messages in thread
From: Ingo Oeser @ 2001-05-31 17:17 UTC (permalink / raw)
  To: bert hubert; +Cc: Marcelo Tosatti, linux-mm

On Wed, May 30, 2001 at 11:48:06PM +0200, bert hubert wrote:
> Oh, if anybody has ideas on statistics that should be exported, please let
> me know. On the agenda is a bitmap that describes which pages are actually
> in the cache.

You mean sth. like the mincore() syscall?

Regards

Ingo Oeser
-- 
To the systems programmer,
users and applications serve only to provide a test load.

* Re: http://ds9a.nl/cacheinfo project - please comment & improve
  2001-05-31 17:17     ` Ingo Oeser
@ 2001-05-31 21:53       ` bert hubert
  2001-06-03 17:04         ` bert hubert
  0 siblings, 1 reply; 8+ messages in thread
From: bert hubert @ 2001-05-31 21:53 UTC (permalink / raw)
  Cc: linux-mm

On Thu, May 31, 2001 at 07:17:30PM +0200, Ingo Oeser wrote:
> On Wed, May 30, 2001 at 11:48:06PM +0200, bert hubert wrote:
> > Oh, if anybody has ideas on statistics that should be exported, please let
> > me know. On the agenda is a bitmap that describes which pages are actually
> > in the cache.
> 
> You mean sth. like the mincore() syscall?

If you first mmap() the file that would probably work. In dire need of a
manpage though - I'll whip one up and send it to Andries. Probably explains
its relative lack of popularity - I'd never heard of mincore() although it's
been around since BSD4.4 it appears.
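
A rough sketch of the mmap()-then-mincore() approach, for anyone who wants
to try it. This is not code from the cinfo tarball: the helper names
count_resident_pages() and print_residency() are made up for illustration,
and the output format only imitates the cinfo line shown earlier in the
thread. It assumes a glibc system where mincore() is declared in
<sys/mman.h>, and it reports residency only (mincore() says nothing about
dirty pages).

```c
#define _DEFAULT_SOURCE /* asks glibc to declare mincore() */
#include <assert.h>
#include <fcntl.h>
#include <stdio.h>
#include <stdlib.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>

/* Hypothetical helper: count how many pages of the file behind fd are
 * resident. Returns the count, or -1 on error. *total_pages receives the
 * number of pages the file spans. */
long count_resident_pages(int fd, size_t *total_pages)
{
    struct stat st;
    long page_size = sysconf(_SC_PAGESIZE);
    assert(page_size > 0);

    if (fstat(fd, &st) < 0)
        return -1;
    *total_pages = ((size_t)st.st_size + (size_t)page_size - 1) / (size_t)page_size;
    if (st.st_size == 0)
        return 0; /* mmap() rejects zero-length mappings */

    void *map = mmap(NULL, (size_t)st.st_size, PROT_READ, MAP_SHARED, fd, 0);
    if (map == MAP_FAILED)
        return -1;

    /* One byte per page; only the least significant bit is meaningful. */
    unsigned char *vec = malloc(*total_pages);
    long resident = -1;
    if (vec != NULL && mincore(map, (size_t)st.st_size, vec) == 0) {
        resident = 0;
        for (size_t i = 0; i < *total_pages; i++)
            resident += vec[i] & 1;
    }
    free(vec);
    munmap(map, (size_t)st.st_size);
    return resident;
}

/* Hypothetical helper: print a summary line in the style of cinfo. */
int print_residency(const char *path)
{
    size_t total = 0;
    int fd = open(path, O_RDONLY);
    if (fd < 0)
        return -1;
    long resident = count_resident_pages(fd, &total);
    close(fd);
    if (resident < 0)
        return -1;
    printf("%s: %ld of %zu (%.2f%%) pages in the cache\n", path, resident,
           total, total ? 100.0 * (double)resident / (double)total : 0.0);
    return 0;
}
```

Calling print_residency("/lib/libc.so.6") should print one summary line. The
numbers are only a snapshot: unless the pages are locked, they can change
right after the call returns.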

Pretty sad that it wastes 7 bits per byte though, but standards conformance
is also useful.
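
If the wasted bits matter, the byte vector can be packed into a real bitmap
in userspace afterwards. A minimal sketch, assuming only that the least
significant bit of each byte carries the residency flag. The names
bitmap_bytes(), pack_residency() and page_is_resident() are invented for
this example and are not part of any existing interface.

```c
#include <assert.h>
#include <limits.h>
#include <stddef.h>
#include <string.h>

/* Bytes needed to hold one bit per page. */
size_t bitmap_bytes(size_t npages)
{
    return (npages + CHAR_BIT - 1) / CHAR_BIT;
}

/* Pack a mincore()-style byte vector (one byte per page, LSB = resident)
 * into a bitmap with one bit per page. bitmap must have room for
 * bitmap_bytes(npages) bytes. */
void pack_residency(const unsigned char *vec, size_t npages,
                    unsigned char *bitmap)
{
    assert(vec != NULL && bitmap != NULL);
    memset(bitmap, 0, bitmap_bytes(npages));
    for (size_t i = 0; i < npages; i++)
        if (vec[i] & 1) /* ignore the seven undefined upper bits */
            bitmap[i / CHAR_BIT] |= (unsigned char)(1u << (i % CHAR_BIT));
}

/* Return 1 if the given page is marked resident in the bitmap, else 0. */
int page_is_resident(const unsigned char *bitmap, size_t page)
{
    return (bitmap[page / CHAR_BIT] >> (page % CHAR_BIT)) & 1;
}
```

Masking with 1 rather than testing for non-zero is deliberate, since the
upper bits of each byte are not defined to be zero.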

Regards,

bert

-- 
http://www.PowerDNS.com      Versatile DNS Services  
Trilab                       The Technology People   
'SYN! .. SYN|ACK! .. ACK!' - the mating call of the internet

* Re: http://ds9a.nl/cacheinfo project - please comment & improve
  2001-05-31 21:53       ` bert hubert
@ 2001-06-03 17:04         ` bert hubert
  2001-06-04 12:20           ` Hugh Dickins
  0 siblings, 1 reply; 8+ messages in thread
From: bert hubert @ 2001-06-03 17:04 UTC (permalink / raw)
  To: linux-mm, aeb

[-- Attachment #1: Type: text/plain, Size: 861 bytes --]

On Thu, May 31, 2001 at 11:53:27PM +0200, bert hubert wrote:
> > > Oh, if anybody has ideas on statistics that should be exported, please let
> > > me know. On the agenda is a bitmap that describes which pages are actually
> > > in the cache.
> > 
> > You mean sth. like the mincore() syscall?
> 
> If you first mmap() the file that would probably work. In dire need of a
> manpage though - I'll whip one up and send it to Andries. Probably explains
> its relative lack of popularity - I'd never heard of mincore() although it's
> been around since BSD4.4 it appears.

As promised, a manpage. I already sent it to Andries but the people over
here may also have comments.

Regards,

bert

-- 
http://www.PowerDNS.com      Versatile DNS Services  
Trilab                       The Technology People   
'SYN! .. SYN|ACK! .. ACK!' - the mating call of the internet

[-- Attachment #2: mincore.2 --]
[-- Type: text/plain, Size: 3228 bytes --]

.\" Hey Emacs! This file is -*- nroff -*- source.
.\"
.\" Copyright (C) 2001 Andries Brouwer (aeb@cwi.nl)
.\"
.\" Permission is granted to make and distribute verbatim copies of this
.\" manual provided the copyright notice and this permission notice are
.\" preserved on all copies.
.\"
.\" Permission is granted to copy and distribute modified versions of this
.\" manual under the conditions for verbatim copying, provided that the
.\" entire resulting derived work is distributed under the terms of a
.\" permission notice identical to this one
.\" 
.\" Since the Linux kernel and libraries are constantly changing, this
.\" manual page may be incorrect or out-of-date.  The author(s) assume no
.\" responsibility for errors or omissions, or for damages resulting from
.\" the use of the information contained herein.  The author(s) may not
.\" have taken the same level of care in the production of this manual,
.\" which is licensed free of charge, as they might when working
.\" professionally.
.\" 
.\" Formatted or processed versions of this manual, if unaccompanied by
.\" the source, must acknowledge the copyright and authors of this work.
.\"
.\" Created Sun Jun 3 17:23:32 2001 by bert hubert <ahu@ds9a.nl>
.\"
.TH MINCORE 2 "3 June 2001" "Linux 2.4.5" "Linux Programmer's Manual"
.SH NAME
mincore \- get information on whether pages are in core
.SH SYNOPSIS
.B #include <unistd.h>
.br
.B #include <sys/mman.h>
.sp
.BI "int mincore(void *" start ", size_t " length ", unsigned char * " vec );
.SH DESCRIPTION
The
.B mincore
function requests a vector describing which pages of a file are in core and
can be read without disk access. The kernel will supply data for
.I length
bytes following the 
.I start
address. On return, the kernel will have filled
.I vec
with bytes, of which the least significant bit indicates if a page is 
core resident.

For
.B mincore
to return successfully, 
.I start
must lie on a page boundary. It is the caller's responsibility to round up to the nearest page. The
.I length
parameter need not be a multiple of the page size. The vector
.I vec
must be large enough to contain (length+PAGE_SIZE-1)/PAGE_SIZE bytes.

.SH "RETURN VALUE"
On success,
.B mincore
returns zero.
On error, \-1 is returned, and
.I errno
is set appropriately.
.SH ERRORS
.TP
.B EAGAIN
kernel is temporarily out of resources
.TP
.B EINVAL
.I start
is not a multiple of PAGE_CACHE_SIZE (PAGE_SIZE) or 
.I length
has a non-positive value
.TP
.B EFAULT
.I vec
points to an illegal address
.TP
.B ENOMEM
.I start
to
.I start
+
.I length
contained unmapped memory, or memory not part of a file.

.SH "BUGS"
.B mincore
should return a bit vector and not a byte vector. As of Linux 2.4.5, it is not
possible to gain information on the core residency of pages which are not backed by a file. 
In other words, calling 
.B mincore
on a region returned by an anonymous
.B mmap(2)
does not work and sets errno to ENOMEM. Unless pages are locked in memory, the contents of
.I vec
may be stale by the time they reach userspace.

.SH "CONFORMING TO"
.B mincore
does not appear to be part of POSIX or the Single Unix Specification. 
.SH HISTORY
The mincore() function first appeared in 4.4BSD.

.SH "SEE ALSO"
.BR getpagesize (2),
.BR mmap (2)



* Re: http://ds9a.nl/cacheinfo project - please comment & improve
  2001-06-03 17:04         ` bert hubert
@ 2001-06-04 12:20           ` Hugh Dickins
  0 siblings, 0 replies; 8+ messages in thread
From: Hugh Dickins @ 2001-06-04 12:20 UTC (permalink / raw)
  To: bert hubert; +Cc: linux-mm, aeb

On Sun, 3 Jun 2001, bert hubert wrote:
> 
> As promised, a manpage. I already sent it to Andries but the people over
> here may also have comments.

I think cut out the reference to PAGE_CACHE_SIZE.  mincore()'s current
kernel implementation may indeed involve PAGE_CACHE_SIZE, but I believe
that if PAGE_CACHE_SIZE is ever changed away from PAGE_SIZE, that will
be a kernel implementation detail, which should not affect its interfaces
with user-space.  You're right to SEE ALSO getpagesize: really even the
references to PAGE_SIZE should defer to getpagesize e.g. the mmap man
page (on my RH7 anyway) says "offset should ordinarily be a multiple of
the page size returned by getpagesize(2)"; but it may get tiresome to use
that circumlocution ("the artist formerly known as Prince") everywhere.
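
To make the point concrete, here is a small sketch of how a caller could
size the mincore() vector and check alignment from the page size queried at
run time, instead of a compile-time PAGE_SIZE. It uses
sysconf(_SC_PAGESIZE), the POSIX way of asking the same question as
getpagesize(2). The function names mincore_vec_len() and is_page_aligned()
are hypothetical, chosen only for this example.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <unistd.h>

/* Page size as reported by the running system. */
static size_t runtime_page_size(void)
{
    long ps = sysconf(_SC_PAGESIZE);
    assert(ps > 0);
    return (size_t)ps;
}

/* Number of bytes the vec argument needs for a range of length bytes:
 * one byte per page, with a partial last page counted as a whole page. */
size_t mincore_vec_len(size_t length)
{
    size_t ps = runtime_page_size();
    return length / ps + (length % ps != 0);
}

/* Return 1 if addr lies on a page boundary, else 0. */
int is_page_aligned(const void *addr)
{
    return ((uintptr_t)addr % runtime_page_size()) == 0;
}
```

Written this way, nothing in the caller depends on PAGE_SIZE or
PAGE_CACHE_SIZE, so a kernel-side change to either would not leak into
user-space code.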

Hugh
