From: Vlastimil Babka <vbabka@suse.cz>
To: Pintu Agarwal <pintu.ping@gmail.com>,
Christian Brauner <christian.brauner@ubuntu.com>
Cc: Cyrill Gorcunov <gorcunov@gmail.com>,
Pintu Kumar <quic_pintu@quicinc.com>,
open list <linux-kernel@vger.kernel.org>,
Andrew Morton <akpm@linux-foundation.org>,
linux-mm <linux-mm@kvack.org>,
ebiederm@xmission.com, sfr@canb.auug.org.au, legion@kernel.org,
sashal@kernel.org, chris.hyser@oracle.com, ccross@google.com,
pcc@google.com, dave@stgolabs.net, caoxiaofeng@yulong.com,
david@redhat.com, Linux API <linux-api@vger.kernel.org>
Subject: Re: [PATCH] sysinfo: include availram field in sysinfo struct
Date: Fri, 7 Jan 2022 17:58:24 +0100 [thread overview]
Message-ID: <91d662f1-baf6-1114-f237-a66ebc164009@suse.cz> (raw)
In-Reply-To: <CAOuPNLiJZu_HJQ+Hf5BJOgmT+v7DT96VLkiXrfx0MJQrkD3rSw@mail.gmail.com>
CC linux-api
On 1/7/22 14:44, Pintu Agarwal wrote:
> On Fri, 7 Jan 2022 at 17:35, Christian Brauner
> <christian.brauner@ubuntu.com> wrote:
>>
>> On Thu, Jan 06, 2022 at 08:27:47PM +0300, Cyrill Gorcunov wrote:
>> > On Thu, Jan 06, 2022 at 10:19:55PM +0530, Pintu Agarwal wrote:
>> > > > > diff --git a/include/uapi/linux/sysinfo.h b/include/uapi/linux/sysinfo.h
>> > > > > index 435d5c2..6e77e90 100644
>> > > > > --- a/include/uapi/linux/sysinfo.h
>> > > > > +++ b/include/uapi/linux/sysinfo.h
>> > > > > @@ -12,6 +12,7 @@ struct sysinfo {
>> > > > > __kernel_ulong_t freeram; /* Available memory size */
>> > > > > __kernel_ulong_t sharedram; /* Amount of shared memory */
>> > > > > __kernel_ulong_t bufferram; /* Memory used by buffers */
>> > > > > + __kernel_ulong_t availram; /* Memory available for allocation */
>> > > > > __kernel_ulong_t totalswap; /* Total swap space size */
>> > > > > __kernel_ulong_t freeswap; /* swap space still available */
>> > > > > __u16 procs; /* Number of current processes */
>> > > >
>> > > > Hi! Sorry, but I don't understand -- the sysinfo structure seems to
>> > > > be part of user API, no? Don't we break it up here?
>> > >
>> > > Yes, the corresponding user space header /usr/include/linux/sysinfo.h
>> > > also needs to be updated.
>> > > When we generate the kernel header it will be updated automatically.
>> >
>> > Wait. The userspace may pass old structure here, and in result we
>> > return incorrect layout which won't match old one, no? Old binary
>> > code has no clue about this header update.
>>
>> Yes, that won't work as done.
>>
>> If we want to do this it needs to be done at the end of the struct right
>> before the padding field and the newly added field substracted from the
>> padding. (Not the preferred way to do it these days for new structs.)
>>
>> A new kernel can then pass in the struct with the newly added field and
>> an old kernel can just fill the struct in as usual. New kernel will
>> update the field with the correct value.
>>
>> But there's a catch depending on the type of value.
>> The problem with these types of extensions is that you'll often need
>> indicators to and from the kernel whether the extension is supported.
>>
>> Consider an extension where 0 is a valid value meaning "this resource is
>> completely used". Since the kernel and userspace always agree on the
>> size of the struct the kernel will zero the whole struct. So if in your
>> newly added field 0 is a valid value you can't differentiate between 0
>> as a valid value indicating that your resource isn't available and 0 as
>> the kernel not supporting your extension.
>>
>> Other APIs solve this and similar problems by having a request mask and
>> a return mask. Userspace fills in what values it wants reported in the
>> request mask and kernel sets the supported flags in the return mask.
>> This way you can differentiate between the two (see statx).
>>
>> If the 0 example is not a concern or acceptable for userspace it's
>> probably fine. But you need to document that having 0 returned can mean
>> both things.
>>
>> Or, you select a value different from 0 (-1?) that you can use to
>> indicate to userspace that the resource is used up so 0 can just mean
>> "kernel doesn't support this extension".
>
> Thanks all for your inputs.
> As Eric suggested in other thread (pasting here for reference):
> {
>> Before the padding and you should reduce the size of the padding by the
>> size of your new field.
>
>>> Also, I could not understand what this is for ?
>>> Do we need to update this since sture is changed ?
>
>> In general padding like that is so new fields can be added. The
>> comment about libc5 makes me a wonder a bit, but I expect libc5 just
>> added the padding in it's copy of the structure that it exported to
>> userspace many many years ago so that new fields could be added.
>
>> Eric
> }
>
> I made the changes like below and this seems to work even with older user space.
> I mean earlier, when I ran "free" command it was giving "stack
> smashing..." error,
> but now the "free" command (which comes as part of busybox) works fine
> even without recompiling with the updated header.
>
> These are the header changes for quick look:
> {{{
> diff --git a/include/uapi/linux/sysinfo.h b/include/uapi/linux/sysinfo.h
> index 6e77e90..fe84c6a 100644
> --- a/include/uapi/linux/sysinfo.h
> +++ b/include/uapi/linux/sysinfo.h
> @@ -12,7 +12,6 @@ struct sysinfo {
> __kernel_ulong_t freeram; /* Available memory size */
> __kernel_ulong_t sharedram; /* Amount of shared memory */
> __kernel_ulong_t bufferram; /* Memory used by buffers */
> - __kernel_ulong_t availram; /* Memory available for allocation */
> __kernel_ulong_t totalswap; /* Total swap space size */
> __kernel_ulong_t freeswap; /* swap space still available */
> __u16 procs; /* Number of current processes */
> @@ -20,7 +19,8 @@ struct sysinfo {
> __kernel_ulong_t totalhigh; /* Total high memory size */
> __kernel_ulong_t freehigh; /* Available high memory size */
> __u32 mem_unit; /* Memory unit size in bytes */
> - char _f[20-2*sizeof(__kernel_ulong_t)-sizeof(__u32)]; /*
> Padding: libc5 uses this.. */
> + __kernel_ulong_t availram; /* Memory available for allocation */
> + char _f[20-3*sizeof(__kernel_ulong_t)-sizeof(__u32)]; /*
> Padding: libc5 uses this.. */
> };
> }}}
>
> If this is fine, I will push the new patch set.
Please CC linux-api@vger.kernel.org on the new posting.
> Thanks,
> Pintu
>
next prev parent reply other threads:[~2022-01-07 16:58 UTC|newest]
Thread overview: 21+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-01-06 15:34 Pintu Kumar
2022-01-06 16:11 ` Cyrill Gorcunov
2022-01-06 16:49 ` Pintu Agarwal
2022-01-06 17:27 ` Cyrill Gorcunov
2022-01-07 12:04 ` Christian Brauner
2022-01-07 13:44 ` Pintu Agarwal
2022-01-07 16:58 ` Vlastimil Babka [this message]
2022-01-07 17:47 ` Pintu Agarwal
2022-01-07 22:18 ` David Laight
2022-01-07 19:51 ` Cyrill Gorcunov
2022-01-06 17:41 ` David Laight
2022-01-06 17:59 ` Pintu Agarwal
2022-01-06 19:20 ` Eric W. Biederman
2022-01-07 18:07 ` [PATCH v2] " Pintu Kumar
2022-01-07 21:01 ` Cyrill Gorcunov
2022-01-08 16:24 ` Pintu Agarwal
2022-01-10 8:11 ` Cyrill Gorcunov
2022-01-07 22:22 ` David Laight
2022-01-08 16:53 ` Pintu Agarwal
2022-01-08 22:35 ` David Laight
2022-01-10 14:55 ` Pintu Agarwal
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=91d662f1-baf6-1114-f237-a66ebc164009@suse.cz \
--to=vbabka@suse.cz \
--cc=akpm@linux-foundation.org \
--cc=caoxiaofeng@yulong.com \
--cc=ccross@google.com \
--cc=chris.hyser@oracle.com \
--cc=christian.brauner@ubuntu.com \
--cc=dave@stgolabs.net \
--cc=david@redhat.com \
--cc=ebiederm@xmission.com \
--cc=gorcunov@gmail.com \
--cc=legion@kernel.org \
--cc=linux-api@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=pcc@google.com \
--cc=pintu.ping@gmail.com \
--cc=quic_pintu@quicinc.com \
--cc=sashal@kernel.org \
--cc=sfr@canb.auug.org.au \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox