* ABI compatibility for /proc/zoneinfo
@ 2020-08-10 21:24 Sonny Rao
2020-08-10 21:46 ` Yu Zhao
0 siblings, 1 reply; 7+ messages in thread
From: Sonny Rao @ 2020-08-10 21:24 UTC (permalink / raw)
To: linux-mm; +Cc: Yu Zhao, bhe, David Rientjes, Andrew Morton
[-- Attachment #1: Type: text/plain, Size: 437 bytes --]
We (Chrome OS) noticed recently one of our tests started failing on
upstream kernels while parsing /proc/zoneinfo
I think this patch is the cause:
26e7deadaae17 mm/vmstat.c: do not show lowmem reserve protection
information of empty zone
Maybe our parser was being overly strict by looking for the protection
line, and it's not hard to fix but raised the question of whether there's
any ABI compatibility guarantees about these files?
[-- Attachment #2: Type: text/html, Size: 992 bytes --]
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: ABI compatibility for /proc/zoneinfo
2020-08-10 21:24 ABI compatibility for /proc/zoneinfo Sonny Rao
@ 2020-08-10 21:46 ` Yu Zhao
2020-08-11 2:37 ` Baoquan He
0 siblings, 1 reply; 7+ messages in thread
From: Yu Zhao @ 2020-08-10 21:46 UTC (permalink / raw)
To: Sonny Rao; +Cc: linux-mm, bhe, David Rientjes, Andrew Morton
On Mon, Aug 10, 2020 at 02:24:03PM -0700, Sonny Rao wrote:
> We (Chrome OS) noticed recently one of our tests started failing on
> upstream kernels while parsing /proc/zoneinfo
> I think this patch is the cause:
>
> 26e7deadaae17 mm/vmstat.c: do not show lowmem reserve protection
> information of empty zone
>
> Maybe our parser was being overly strict by looking for the protection
> line, and it's not hard to fix but raised the question of whether there's
> any ABI compatibility guarantees about these files?
According to Documentation/admin-guide/sysctl/vm.rst, "Each zone has
an array of protection pages". I'm not sure if this is the guarantee,
but the doc should reflect the actual format.
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: ABI compatibility for /proc/zoneinfo
2020-08-10 21:46 ` Yu Zhao
@ 2020-08-11 2:37 ` Baoquan He
2020-08-11 3:18 ` Sonny Rao
0 siblings, 1 reply; 7+ messages in thread
From: Baoquan He @ 2020-08-11 2:37 UTC (permalink / raw)
To: Yu Zhao; +Cc: Sonny Rao, linux-mm, David Rientjes, Andrew Morton
On 08/10/20 at 03:46pm, Yu Zhao wrote:
> On Mon, Aug 10, 2020 at 02:24:03PM -0700, Sonny Rao wrote:
> > We (Chrome OS) noticed recently one of our tests started failing on
> > upstream kernels while parsing /proc/zoneinfo
> > I think this patch is the cause:
> >
> > 26e7deadaae17 mm/vmstat.c: do not show lowmem reserve protection
> > information of empty zone
> >
> > Maybe our parser was being overly strict by looking for the protection
> > line, and it's not hard to fix but raised the question of whether there's
> > any ABI compatibility guarantees about these files?
>
> According to Documentation/admin-guide/sysctl/vm.rst, "Each zone has
> an array of protection pages". I'm not sure if this is the guarantee,
> but the doc should reflect the actual format.
The current code will list all zones in one memory node, even though
that node only has one existing zone. E.g in below node 1,
it only has NORMAL zone, but we will list zone DMA, DMA32, MOVABLE,
DEVICE which are all empty zone, namely doesn't exist. So, each zone
has an array of protection pages, it should not include the nonexistent
zone. I thought nobody would check the protection line of an empty zone,
seems I was wrong.
(Note that below /proc/zoneinfo is from an old system, protection line
is still existing for empty zone)
Node 1, zone DMA
pages free 0
min 0
low 0
high 0
spanned 0
present 0
managed 0
protection: (0, 0, 61854, 61854, 61854)
Node 1, zone DMA32
pages free 0
min 0
low 0
high 0
spanned 0
present 0
managed 0
protection: (0, 0, 61854, 61854, 61854)
Node 1, zone Normal
per-node stats
nr_inactive_anon 259
nr_active_anon 11926
...
nr_written 0
nr_kernel_misc_reclaimable 0
pages free 16206452
pages free 16206452
min 11280
low 27114
high 42948
spanned 16777216
present 16777216
managed 15834637
protection: (0, 0, 0, 0, 0)
...
Node 1, zone Movable
pages free 0
min 0
low 0
high 0
spanned 0
present 0
managed 0
protection: (0, 0, 0, 0, 0)
Node 1, zone Device
pages free 0
min 0
low 0
high 0
spanned 0
present 0
managed 0
protection: (0, 0, 0, 0, 0)
>
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: ABI compatibility for /proc/zoneinfo
2020-08-11 2:37 ` Baoquan He
@ 2020-08-11 3:18 ` Sonny Rao
2020-08-11 3:58 ` Baoquan He
2020-08-11 4:01 ` Andrew Morton
0 siblings, 2 replies; 7+ messages in thread
From: Sonny Rao @ 2020-08-11 3:18 UTC (permalink / raw)
To: Baoquan He; +Cc: Yu Zhao, linux-mm, David Rientjes, Andrew Morton
On Mon, Aug 10, 2020 at 7:37 PM Baoquan He <bhe@redhat.com> wrote:
>
> On 08/10/20 at 03:46pm, Yu Zhao wrote:
> > On Mon, Aug 10, 2020 at 02:24:03PM -0700, Sonny Rao wrote:
> > > We (Chrome OS) noticed recently one of our tests started failing on
> > > upstream kernels while parsing /proc/zoneinfo
> > > I think this patch is the cause:
> > >
> > > 26e7deadaae17 mm/vmstat.c: do not show lowmem reserve protection
> > > information of empty zone
> > >
> > > Maybe our parser was being overly strict by looking for the protection
> > > line, and it's not hard to fix but raised the question of whether there's
> > > any ABI compatibility guarantees about these files?
> >
> > According to Documentation/admin-guide/sysctl/vm.rst, "Each zone has
> > an array of protection pages". I'm not sure if this is the guarantee,
> > but the doc should reflect the actual format.
>
> The current code will list all zones in one memory node, even though
> that node only has one existing zone. E.g in below node 1,
> it only has NORMAL zone, but we will list zone DMA, DMA32, MOVABLE,
> DEVICE which are all empty zone, namely doesn't exist. So, each zone
> has an array of protection pages, it should not include the nonexistent
> zone. I thought nobody would check the protection line of an empty zone,
> seems I was wrong.
This particular parser was written as a state machine and that line
was a convenient thing to look for to mark the end of each zone.
AFAICT, there's no explicit documentation on the layout of that file
though.
>
>
> (Note that below /proc/zoneinfo is from an old system, protection line
> is still existing for empty zone)
> Node 1, zone DMA
> pages free 0
> min 0
> low 0
> high 0
> spanned 0
> present 0
> managed 0
> protection: (0, 0, 61854, 61854, 61854)
> Node 1, zone DMA32
> pages free 0
> min 0
> low 0
> high 0
> spanned 0
> present 0
> managed 0
> protection: (0, 0, 61854, 61854, 61854)
> Node 1, zone Normal
> per-node stats
> nr_inactive_anon 259
> nr_active_anon 11926
> ...
> nr_written 0
> nr_kernel_misc_reclaimable 0
> pages free 16206452
> pages free 16206452
> min 11280
> low 27114
> high 42948
> spanned 16777216
> present 16777216
> managed 15834637
> protection: (0, 0, 0, 0, 0)
> ...
>
> Node 1, zone Movable
> pages free 0
> min 0
> low 0
> high 0
> spanned 0
> present 0
> managed 0
> protection: (0, 0, 0, 0, 0)
> Node 1, zone Device
> pages free 0
> min 0
> low 0
> high 0
> spanned 0
> present 0
> managed 0
> protection: (0, 0, 0, 0, 0)
> >
>
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: ABI compatibility for /proc/zoneinfo
2020-08-11 3:18 ` Sonny Rao
@ 2020-08-11 3:58 ` Baoquan He
2020-08-11 4:01 ` Andrew Morton
1 sibling, 0 replies; 7+ messages in thread
From: Baoquan He @ 2020-08-11 3:58 UTC (permalink / raw)
To: Sonny Rao; +Cc: Yu Zhao, linux-mm, David Rientjes, Andrew Morton
On 08/10/20 at 08:18pm, Sonny Rao wrote:
> On Mon, Aug 10, 2020 at 7:37 PM Baoquan He <bhe@redhat.com> wrote:
> >
> > On 08/10/20 at 03:46pm, Yu Zhao wrote:
> > > On Mon, Aug 10, 2020 at 02:24:03PM -0700, Sonny Rao wrote:
> > > > We (Chrome OS) noticed recently one of our tests started failing on
> > > > upstream kernels while parsing /proc/zoneinfo
> > > > I think this patch is the cause:
> > > >
> > > > 26e7deadaae17 mm/vmstat.c: do not show lowmem reserve protection
> > > > information of empty zone
> > > >
> > > > Maybe our parser was being overly strict by looking for the protection
> > > > line, and it's not hard to fix but raised the question of whether there's
> > > > any ABI compatibility guarantees about these files?
> > >
> > > According to Documentation/admin-guide/sysctl/vm.rst, "Each zone has
> > > an array of protection pages". I'm not sure if this is the guarantee,
> > > but the doc should reflect the actual format.
> >
> > The current code will list all zones in one memory node, even though
> > that node only has one existing zone. E.g in below node 1,
> > it only has NORMAL zone, but we will list zone DMA, DMA32, MOVABLE,
> > DEVICE which are all empty zone, namely doesn't exist. So, each zone
> > has an array of protection pages, it should not include the nonexistent
> > zone. I thought nobody would check the protection line of an empty zone,
> > seems I was wrong.
>
>
> This particular parser was written as a state machine and that line
> was a convenient thing to look for to mark the end of each zone.
> AFAICT, there's no explicit documentation on the layout of that file
> though.
Ah, I see. While the protection line can only mark the end of empty zone.
For populated zone, it obvisouly can't. One wild idea is not showing
those empty zones at all. On NUMA system with multiple memory nodes,
the nodes other than node0 won't have DMA, DMA32 zone usually, it doesn't
make any sense to list them in /proc/zoneinfo.
>
> >
> >
> > (Note that below /proc/zoneinfo is from an old system, protection line
> > is still existing for empty zone)
> > Node 1, zone DMA
> > pages free 0
> > min 0
> > low 0
> > high 0
> > spanned 0
> > present 0
> > managed 0
> > protection: (0, 0, 61854, 61854, 61854)
> > Node 1, zone DMA32
> > pages free 0
> > min 0
> > low 0
> > high 0
> > spanned 0
> > present 0
> > managed 0
> > protection: (0, 0, 61854, 61854, 61854)
> > Node 1, zone Normal
> > per-node stats
> > nr_inactive_anon 259
> > nr_active_anon 11926
> > ...
> > nr_written 0
> > nr_kernel_misc_reclaimable 0
> > pages free 16206452
> > pages free 16206452
> > min 11280
> > low 27114
> > high 42948
> > spanned 16777216
> > present 16777216
> > managed 15834637
> > protection: (0, 0, 0, 0, 0)
> > ...
> >
> > Node 1, zone Movable
> > pages free 0
> > min 0
> > low 0
> > high 0
> > spanned 0
> > present 0
> > managed 0
> > protection: (0, 0, 0, 0, 0)
> > Node 1, zone Device
> > pages free 0
> > min 0
> > low 0
> > high 0
> > spanned 0
> > present 0
> > managed 0
> > protection: (0, 0, 0, 0, 0)
> > >
> >
>
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: ABI compatibility for /proc/zoneinfo
2020-08-11 3:18 ` Sonny Rao
2020-08-11 3:58 ` Baoquan He
@ 2020-08-11 4:01 ` Andrew Morton
2020-08-11 4:07 ` Baoquan He
1 sibling, 1 reply; 7+ messages in thread
From: Andrew Morton @ 2020-08-11 4:01 UTC (permalink / raw)
To: Sonny Rao; +Cc: Baoquan He, Yu Zhao, linux-mm, David Rientjes
On Mon, 10 Aug 2020 20:18:11 -0700 Sonny Rao <sonnyrao@chromium.org> wrote:
> On Mon, Aug 10, 2020 at 7:37 PM Baoquan He <bhe@redhat.com> wrote:
> >
> > On 08/10/20 at 03:46pm, Yu Zhao wrote:
> > > On Mon, Aug 10, 2020 at 02:24:03PM -0700, Sonny Rao wrote:
> > > > We (Chrome OS) noticed recently one of our tests started failing on
> > > > upstream kernels while parsing /proc/zoneinfo
> > > > I think this patch is the cause:
> > > >
> > > > 26e7deadaae17 mm/vmstat.c: do not show lowmem reserve protection
> > > > information of empty zone
> > > >
> > > > Maybe our parser was being overly strict by looking for the protection
> > > > line, and it's not hard to fix but raised the question of whether there's
> > > > any ABI compatibility guarantees about these files?
> > >
> > > According to Documentation/admin-guide/sysctl/vm.rst, "Each zone has
> > > an array of protection pages". I'm not sure if this is the guarantee,
> > > but the doc should reflect the actual format.
> >
> > The current code will list all zones in one memory node, even though
> > that node only has one existing zone. E.g in below node 1,
> > it only has NORMAL zone, but we will list zone DMA, DMA32, MOVABLE,
> > DEVICE which are all empty zone, namely doesn't exist. So, each zone
> > has an array of protection pages, it should not include the nonexistent
> > zone. I thought nobody would check the protection line of an empty zone,
> > seems I was wrong.
>
>
> This particular parser was written as a state machine and that line
> was a convenient thing to look for to mark the end of each zone.
> AFAICT, there's no explicit documentation on the layout of that file
> though.
This only affects 5.8 and later, yes?
26e7deadaae17 didn't gain us much at all and we shouldn't break
userspace unless it's super important. So I think it's best to revert,
and to backport the revert to 5.8.x.
Could someone (Baoquan He?) please send a patch with a suitable
changelog, cc:stable, reported-by:, etc?
Thanks.
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: ABI compatibility for /proc/zoneinfo
2020-08-11 4:01 ` Andrew Morton
@ 2020-08-11 4:07 ` Baoquan He
0 siblings, 0 replies; 7+ messages in thread
From: Baoquan He @ 2020-08-11 4:07 UTC (permalink / raw)
To: Andrew Morton; +Cc: Sonny Rao, Yu Zhao, linux-mm, David Rientjes
On 08/10/20 at 09:01pm, Andrew Morton wrote:
> On Mon, 10 Aug 2020 20:18:11 -0700 Sonny Rao <sonnyrao@chromium.org> wrote:
>
> > On Mon, Aug 10, 2020 at 7:37 PM Baoquan He <bhe@redhat.com> wrote:
> > >
> > > On 08/10/20 at 03:46pm, Yu Zhao wrote:
> > > > On Mon, Aug 10, 2020 at 02:24:03PM -0700, Sonny Rao wrote:
> > > > > We (Chrome OS) noticed recently one of our tests started failing on
> > > > > upstream kernels while parsing /proc/zoneinfo
> > > > > I think this patch is the cause:
> > > > >
> > > > > 26e7deadaae17 mm/vmstat.c: do not show lowmem reserve protection
> > > > > information of empty zone
> > > > >
> > > > > Maybe our parser was being overly strict by looking for the protection
> > > > > line, and it's not hard to fix but raised the question of whether there's
> > > > > any ABI compatibility guarantees about these files?
> > > >
> > > > According to Documentation/admin-guide/sysctl/vm.rst, "Each zone has
> > > > an array of protection pages". I'm not sure if this is the guarantee,
> > > > but the doc should reflect the actual format.
> > >
> > > The current code will list all zones in one memory node, even though
> > > that node only has one existing zone. E.g in below node 1,
> > > it only has NORMAL zone, but we will list zone DMA, DMA32, MOVABLE,
> > > DEVICE which are all empty zone, namely doesn't exist. So, each zone
> > > has an array of protection pages, it should not include the nonexistent
> > > zone. I thought nobody would check the protection line of an empty zone,
> > > seems I was wrong.
> >
> >
> > This particular parser was written as a state machine and that line
> > was a convenient thing to look for to mark the end of each zone.
> > AFAICT, there's no explicit documentation on the layout of that file
> > though.
>
> This only affects 5.8 and later, yes?
>
> 26e7deadaae17 didn't gain us much at all and we shouldn't break
> userspace unless it's super important. So I think it's best to revert,
> and to backport the revert to 5.8.x.
>
> Could someone (Baoquan He?) please send a patch with a suitable
> changelog, cc:stable, reported-by:, etc?
OK, I will post a patch to revert this.
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2020-08-11 4:07 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2020-08-10 21:24 ABI compatibility for /proc/zoneinfo Sonny Rao
2020-08-10 21:46 ` Yu Zhao
2020-08-11 2:37 ` Baoquan He
2020-08-11 3:18 ` Sonny Rao
2020-08-11 3:58 ` Baoquan He
2020-08-11 4:01 ` Andrew Morton
2020-08-11 4:07 ` Baoquan He
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox