* [PATCH 1/2] admin-guide/memory-hotplug.rst: remove locking internal part from admin-guide
@ 2018-12-05 2:34 Wei Yang
2018-12-05 2:34 ` [PATCH 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks Wei Yang
` (3 more replies)
0 siblings, 4 replies; 17+ messages in thread
From: Wei Yang @ 2018-12-05 2:34 UTC (permalink / raw)
To: david, mhocko, osalvador; +Cc: akpm, linux-doc, linux-mm, Wei Yang
Locking Internal section exists in core-api documentation, which is more
suitable for this.
This patch removes the duplication part here.
Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
---
Documentation/admin-guide/mm/memory-hotplug.rst | 40 -------------------------
1 file changed, 40 deletions(-)
diff --git a/Documentation/admin-guide/mm/memory-hotplug.rst b/Documentation/admin-guide/mm/memory-hotplug.rst
index 5c4432c96c4b..241f4ce1e387 100644
--- a/Documentation/admin-guide/mm/memory-hotplug.rst
+++ b/Documentation/admin-guide/mm/memory-hotplug.rst
@@ -392,46 +392,6 @@ Need more implementation yet....
- Notification completion of remove works by OS to firmware.
- Guard from remove if not yet.
-
-Locking Internals
-=================
-
-When adding/removing memory that uses memory block devices (i.e. ordinary RAM),
-the device_hotplug_lock should be held to:
-
-- synchronize against online/offline requests (e.g. via sysfs). This way, memory
- block devices can only be accessed (.online/.state attributes) by user
- space once memory has been fully added. And when removing memory, we
- know nobody is in critical sections.
-- synchronize against CPU hotplug and similar (e.g. relevant for ACPI and PPC)
-
-Especially, there is a possible lock inversion that is avoided using
-device_hotplug_lock when adding memory and user space tries to online that
-memory faster than expected:
-
-- device_online() will first take the device_lock(), followed by
- mem_hotplug_lock
-- add_memory_resource() will first take the mem_hotplug_lock, followed by
- the device_lock() (while creating the devices, during bus_add_device()).
-
-As the device is visible to user space before taking the device_lock(), this
-can result in a lock inversion.
-
-onlining/offlining of memory should be done via device_online()/
-device_offline() - to make sure it is properly synchronized to actions
-via sysfs. Holding device_hotplug_lock is advised (to e.g. protect online_type)
-
-When adding/removing/onlining/offlining memory or adding/removing
-heterogeneous/device memory, we should always hold the mem_hotplug_lock in
-write mode to serialise memory hotplug (e.g. access to global/zone
-variables).
-
-In addition, mem_hotplug_lock (in contrast to device_hotplug_lock) in read
-mode allows for a quite efficient get_online_mems/put_online_mems
-implementation, so code accessing memory can protect from that memory
-vanishing.
-
-
Future Work
===========
--
2.15.1
^ permalink raw reply [flat|nested] 17+ messages in thread
* [PATCH 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks
2018-12-05 2:34 [PATCH 1/2] admin-guide/memory-hotplug.rst: remove locking internal part from admin-guide Wei Yang
@ 2018-12-05 2:34 ` Wei Yang
2018-12-05 8:08 ` David Hildenbrand
` (2 more replies)
2018-12-05 8:03 ` [PATCH 1/2] admin-guide/memory-hotplug.rst: remove locking internal part from admin-guide David Hildenbrand
` (2 subsequent siblings)
3 siblings, 3 replies; 17+ messages in thread
From: Wei Yang @ 2018-12-05 2:34 UTC (permalink / raw)
To: david, mhocko, osalvador; +Cc: akpm, linux-doc, linux-mm, Wei Yang
Currently locking for memory hotplug is a little complicated.
Generally speaking, we leverage the two global lock:
* device_hotplug_lock
* mem_hotplug_lock
to serialise the process.
While for the long term, we are willing to have more fine-grained lock
to provide higher scalability.
This patch divides Locking Internal section based on these two global
locks to help readers to understand it. Also it adds some new finding to
enrich it.
[David: words arrangement]
Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
---
Documentation/core-api/memory-hotplug.rst | 27 ++++++++++++++++++++++++---
1 file changed, 24 insertions(+), 3 deletions(-)
diff --git a/Documentation/core-api/memory-hotplug.rst b/Documentation/core-api/memory-hotplug.rst
index de7467e48067..95662b283328 100644
--- a/Documentation/core-api/memory-hotplug.rst
+++ b/Documentation/core-api/memory-hotplug.rst
@@ -89,6 +89,20 @@ NOTIFY_STOP stops further processing of the notification queue.
Locking Internals
=================
+There are three locks involved in memory-hotplug, two global lock and one local
+lock:
+
+- device_hotplug_lock
+- mem_hotplug_lock
+- device_lock
+
+Currently, they are twisted together for all kinds of reasons. The following
+part is divided into device_hotplug_lock and mem_hotplug_lock parts
+respectively to describe those tricky situations.
+
+device_hotplug_lock
+---------------------
+
When adding/removing memory that uses memory block devices (i.e. ordinary RAM),
the device_hotplug_lock should be held to:
@@ -111,13 +125,20 @@ As the device is visible to user space before taking the device_lock(), this
can result in a lock inversion.
onlining/offlining of memory should be done via device_online()/
-device_offline() - to make sure it is properly synchronized to actions
-via sysfs. Holding device_hotplug_lock is advised (to e.g. protect online_type)
+device_offline() - to make sure it is properly synchronized to actions via
+sysfs. Even mem_hotplug_lock is used to protect the process, because of the
+lock inversion described above, holding device_hotplug_lock is still advised
+(to e.g. protect online_type)
+
+mem_hotplug_lock
+---------------------
When adding/removing/onlining/offlining memory or adding/removing
heterogeneous/device memory, we should always hold the mem_hotplug_lock in
write mode to serialise memory hotplug (e.g. access to global/zone
-variables).
+variables). Currently, we take advantage of this to serialise sparsemem's
+mem_section handling in sparse_add_one_section() and
+sparse_remove_one_section().
In addition, mem_hotplug_lock (in contrast to device_hotplug_lock) in read
mode allows for a quite efficient get_online_mems/put_online_mems
--
2.15.1
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH 1/2] admin-guide/memory-hotplug.rst: remove locking internal part from admin-guide
2018-12-05 2:34 [PATCH 1/2] admin-guide/memory-hotplug.rst: remove locking internal part from admin-guide Wei Yang
2018-12-05 2:34 ` [PATCH 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks Wei Yang
@ 2018-12-05 8:03 ` David Hildenbrand
2018-12-05 8:30 ` Mike Rapoport
2018-12-05 9:20 ` Wei Yang
2018-12-05 12:11 ` Michal Hocko
2018-12-06 0:26 ` [PATCH v2 " Wei Yang
3 siblings, 2 replies; 17+ messages in thread
From: David Hildenbrand @ 2018-12-05 8:03 UTC (permalink / raw)
To: Wei Yang, mhocko, osalvador
Cc: akpm, linux-doc, linux-mm, Jonathan Corbet, Mike Rapoport
On 05.12.18 03:34, Wei Yang wrote:
> Locking Internal section exists in core-api documentation, which is more
> suitable for this.
>
> This patch removes the duplication part here.
>
> Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
> ---
> Documentation/admin-guide/mm/memory-hotplug.rst | 40 -------------------------
> 1 file changed, 40 deletions(-)
>
> diff --git a/Documentation/admin-guide/mm/memory-hotplug.rst b/Documentation/admin-guide/mm/memory-hotplug.rst
> index 5c4432c96c4b..241f4ce1e387 100644
> --- a/Documentation/admin-guide/mm/memory-hotplug.rst
> +++ b/Documentation/admin-guide/mm/memory-hotplug.rst
> @@ -392,46 +392,6 @@ Need more implementation yet....
> - Notification completion of remove works by OS to firmware.
> - Guard from remove if not yet.
>
> -
> -Locking Internals
> -=================
> -
> -When adding/removing memory that uses memory block devices (i.e. ordinary RAM),
> -the device_hotplug_lock should be held to:
> -
> -- synchronize against online/offline requests (e.g. via sysfs). This way, memory
> - block devices can only be accessed (.online/.state attributes) by user
> - space once memory has been fully added. And when removing memory, we
> - know nobody is in critical sections.
> -- synchronize against CPU hotplug and similar (e.g. relevant for ACPI and PPC)
> -
> -Especially, there is a possible lock inversion that is avoided using
> -device_hotplug_lock when adding memory and user space tries to online that
> -memory faster than expected:
> -
> -- device_online() will first take the device_lock(), followed by
> - mem_hotplug_lock
> -- add_memory_resource() will first take the mem_hotplug_lock, followed by
> - the device_lock() (while creating the devices, during bus_add_device()).
> -
> -As the device is visible to user space before taking the device_lock(), this
> -can result in a lock inversion.
> -
> -onlining/offlining of memory should be done via device_online()/
> -device_offline() - to make sure it is properly synchronized to actions
> -via sysfs. Holding device_hotplug_lock is advised (to e.g. protect online_type)
> -
> -When adding/removing/onlining/offlining memory or adding/removing
> -heterogeneous/device memory, we should always hold the mem_hotplug_lock in
> -write mode to serialise memory hotplug (e.g. access to global/zone
> -variables).
> -
> -In addition, mem_hotplug_lock (in contrast to device_hotplug_lock) in read
> -mode allows for a quite efficient get_online_mems/put_online_mems
> -implementation, so code accessing memory can protect from that memory
> -vanishing.
> -
> -
> Future Work
> ===========
>
>
I reported this yesterday to Jonathan and Mike
https://lkml.org/lkml/2018/12/3/340
Anyhow
Reviewed-by: David Hildenbrand <david@redhat.com>
--
Thanks,
David / dhildenb
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks
2018-12-05 2:34 ` [PATCH 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks Wei Yang
@ 2018-12-05 8:08 ` David Hildenbrand
2018-12-05 9:23 ` Wei Yang
2018-12-05 8:40 ` Mike Rapoport
2018-12-05 12:13 ` Michal Hocko
2 siblings, 1 reply; 17+ messages in thread
From: David Hildenbrand @ 2018-12-05 8:08 UTC (permalink / raw)
To: Wei Yang, mhocko, osalvador; +Cc: akpm, linux-doc, linux-mm
On 05.12.18 03:34, Wei Yang wrote:
> Currently locking for memory hotplug is a little complicated.
>
> Generally speaking, we leverage the two global lock:
>
> * device_hotplug_lock
> * mem_hotplug_lock
>
> to serialise the process.
>
> While for the long term, we are willing to have more fine-grained lock
> to provide higher scalability.
>
> This patch divides Locking Internal section based on these two global
> locks to help readers to understand it. Also it adds some new finding to
> enrich it.
>
> [David: words arrangement]
>
> Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
> ---
> Documentation/core-api/memory-hotplug.rst | 27 ++++++++++++++++++++++++---
> 1 file changed, 24 insertions(+), 3 deletions(-)
>
> diff --git a/Documentation/core-api/memory-hotplug.rst b/Documentation/core-api/memory-hotplug.rst
> index de7467e48067..95662b283328 100644
> --- a/Documentation/core-api/memory-hotplug.rst
> +++ b/Documentation/core-api/memory-hotplug.rst
> @@ -89,6 +89,20 @@ NOTIFY_STOP stops further processing of the notification queue.
> Locking Internals
> =================
>
> +There are three locks involved in memory-hotplug, two global lock and one local
> +lock:
> +
> +- device_hotplug_lock
> +- mem_hotplug_lock
> +- device_lock
> +
Do we really only ever use these three and not anything else when
adding/removing/onlining/offlining memory?
(I am thinking e.g. about pgdat_resize_lock)
If so, you should phrase that maybe more generally Or add more details :)
"In addition to fine grained locks like pgdat_resize_lock, there are
three locks involved ..."
> +Currently, they are twisted together for all kinds of reasons. The following
> +part is divided into device_hotplug_lock and mem_hotplug_lock parts
> +respectively to describe those tricky situations.
> +
> +device_hotplug_lock
> +---------------------
> +
> When adding/removing memory that uses memory block devices (i.e. ordinary RAM),
> the device_hotplug_lock should be held to:
>
> @@ -111,13 +125,20 @@ As the device is visible to user space before taking the device_lock(), this
> can result in a lock inversion.
>
> onlining/offlining of memory should be done via device_online()/
> -device_offline() - to make sure it is properly synchronized to actions
> -via sysfs. Holding device_hotplug_lock is advised (to e.g. protect online_type)
> +device_offline() - to make sure it is properly synchronized to actions via
> +sysfs. Even mem_hotplug_lock is used to protect the process, because of the
> +lock inversion described above, holding device_hotplug_lock is still advised
> +(to e.g. protect online_type)
> +
> +mem_hotplug_lock
> +---------------------
>
> When adding/removing/onlining/offlining memory or adding/removing
> heterogeneous/device memory, we should always hold the mem_hotplug_lock in
> write mode to serialise memory hotplug (e.g. access to global/zone
> -variables).
> +variables). Currently, we take advantage of this to serialise sparsemem's
> +mem_section handling in sparse_add_one_section() and
> +sparse_remove_one_section().
>
> In addition, mem_hotplug_lock (in contrast to device_hotplug_lock) in read
> mode allows for a quite efficient get_online_mems/put_online_mems
>
--
Thanks,
David / dhildenb
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH 1/2] admin-guide/memory-hotplug.rst: remove locking internal part from admin-guide
2018-12-05 8:03 ` [PATCH 1/2] admin-guide/memory-hotplug.rst: remove locking internal part from admin-guide David Hildenbrand
@ 2018-12-05 8:30 ` Mike Rapoport
2018-12-05 9:20 ` Wei Yang
2018-12-05 9:20 ` Wei Yang
1 sibling, 1 reply; 17+ messages in thread
From: Mike Rapoport @ 2018-12-05 8:30 UTC (permalink / raw)
To: David Hildenbrand
Cc: Wei Yang, mhocko, osalvador, akpm, linux-doc, linux-mm,
Jonathan Corbet, Mike Rapoport
On Wed, Dec 05, 2018 at 09:03:24AM +0100, David Hildenbrand wrote:
> On 05.12.18 03:34, Wei Yang wrote:
> > Locking Internal section exists in core-api documentation, which is more
> > suitable for this.
> >
> > This patch removes the duplication part here.
> >
> > Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
> > ---
> > Documentation/admin-guide/mm/memory-hotplug.rst | 40 -------------------------
> > 1 file changed, 40 deletions(-)
> >
> > diff --git a/Documentation/admin-guide/mm/memory-hotplug.rst b/Documentation/admin-guide/mm/memory-hotplug.rst
> > index 5c4432c96c4b..241f4ce1e387 100644
> > --- a/Documentation/admin-guide/mm/memory-hotplug.rst
> > +++ b/Documentation/admin-guide/mm/memory-hotplug.rst
> > @@ -392,46 +392,6 @@ Need more implementation yet....
> > - Notification completion of remove works by OS to firmware.
> > - Guard from remove if not yet.
[ ... ]
> > Future Work
> > ===========
> >
> >
>
> I reported this yesterday to Jonathan and Mike
>
> https://lkml.org/lkml/2018/12/3/340
Somehow I've missed it...
> Anyhow
>
> Reviewed-by: David Hildenbrand <david@redhat.com>
Acked-by: Mike Rapoport <rppt@linux.ibm.com>
>
> --
>
> Thanks,
>
> David / dhildenb
>
--
Sincerely yours,
Mike.
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks
2018-12-05 2:34 ` [PATCH 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks Wei Yang
2018-12-05 8:08 ` David Hildenbrand
@ 2018-12-05 8:40 ` Mike Rapoport
2018-12-05 9:24 ` Wei Yang
2018-12-05 12:13 ` Michal Hocko
2 siblings, 1 reply; 17+ messages in thread
From: Mike Rapoport @ 2018-12-05 8:40 UTC (permalink / raw)
To: Wei Yang; +Cc: david, mhocko, osalvador, akpm, linux-doc, linux-mm
On Wed, Dec 05, 2018 at 10:34:26AM +0800, Wei Yang wrote:
> Currently locking for memory hotplug is a little complicated.
>
> Generally speaking, we leverage the two global lock:
>
> * device_hotplug_lock
> * mem_hotplug_lock
>
> to serialise the process.
>
> While for the long term, we are willing to have more fine-grained lock
> to provide higher scalability.
>
> This patch divides Locking Internal section based on these two global
> locks to help readers to understand it. Also it adds some new finding to
> enrich it.
>
> [David: words arrangement]
>
> Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
> ---
> Documentation/core-api/memory-hotplug.rst | 27 ++++++++++++++++++++++++---
> 1 file changed, 24 insertions(+), 3 deletions(-)
>
> diff --git a/Documentation/core-api/memory-hotplug.rst b/Documentation/core-api/memory-hotplug.rst
> index de7467e48067..95662b283328 100644
> --- a/Documentation/core-api/memory-hotplug.rst
> +++ b/Documentation/core-api/memory-hotplug.rst
> @@ -89,6 +89,20 @@ NOTIFY_STOP stops further processing of the notification queue.
> Locking Internals
> =================
>
> +There are three locks involved in memory-hotplug, two global lock and one local
typo: ^locks
> +lock:
> +
> +- device_hotplug_lock
> +- mem_hotplug_lock
> +- device_lock
> +
> +Currently, they are twisted together for all kinds of reasons. The following
> +part is divided into device_hotplug_lock and mem_hotplug_lock parts
> +respectively to describe those tricky situations.
> +
> +device_hotplug_lock
> +---------------------
> +
> When adding/removing memory that uses memory block devices (i.e. ordinary RAM),
> the device_hotplug_lock should be held to:
>
> @@ -111,13 +125,20 @@ As the device is visible to user space before taking the device_lock(), this
> can result in a lock inversion.
>
> onlining/offlining of memory should be done via device_online()/
> -device_offline() - to make sure it is properly synchronized to actions
> -via sysfs. Holding device_hotplug_lock is advised (to e.g. protect online_type)
> +device_offline() - to make sure it is properly synchronized to actions via
> +sysfs. Even mem_hotplug_lock is used to protect the process, because of the
I think it should be "Even if mem_hotplug_lock ..."
> +lock inversion described above, holding device_hotplug_lock is still advised
> +(to e.g. protect online_type)
> +
> +mem_hotplug_lock
> +---------------------
>
> When adding/removing/onlining/offlining memory or adding/removing
> heterogeneous/device memory, we should always hold the mem_hotplug_lock in
> write mode to serialise memory hotplug (e.g. access to global/zone
> -variables).
> +variables). Currently, we take advantage of this to serialise sparsemem's
> +mem_section handling in sparse_add_one_section() and
> +sparse_remove_one_section().
>
> In addition, mem_hotplug_lock (in contrast to device_hotplug_lock) in read
> mode allows for a quite efficient get_online_mems/put_online_mems
> --
> 2.15.1
>
--
Sincerely yours,
Mike.
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH 1/2] admin-guide/memory-hotplug.rst: remove locking internal part from admin-guide
2018-12-05 8:03 ` [PATCH 1/2] admin-guide/memory-hotplug.rst: remove locking internal part from admin-guide David Hildenbrand
2018-12-05 8:30 ` Mike Rapoport
@ 2018-12-05 9:20 ` Wei Yang
1 sibling, 0 replies; 17+ messages in thread
From: Wei Yang @ 2018-12-05 9:20 UTC (permalink / raw)
To: David Hildenbrand
Cc: Wei Yang, mhocko, osalvador, akpm, linux-doc, linux-mm,
Jonathan Corbet, Mike Rapoport
On Wed, Dec 05, 2018 at 09:03:24AM +0100, David Hildenbrand wrote:
>On 05.12.18 03:34, Wei Yang wrote:
>> Locking Internal section exists in core-api documentation, which is more
>> suitable for this.
>>
>> This patch removes the duplication part here.
>>
>> Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
>> ---
>> Documentation/admin-guide/mm/memory-hotplug.rst | 40 -------------------------
>> 1 file changed, 40 deletions(-)
>>
>> diff --git a/Documentation/admin-guide/mm/memory-hotplug.rst b/Documentation/admin-guide/mm/memory-hotplug.rst
>> index 5c4432c96c4b..241f4ce1e387 100644
>> --- a/Documentation/admin-guide/mm/memory-hotplug.rst
>> +++ b/Documentation/admin-guide/mm/memory-hotplug.rst
>> @@ -392,46 +392,6 @@ Need more implementation yet....
>> - Notification completion of remove works by OS to firmware.
>> - Guard from remove if not yet.
>>
>> -
>> -Locking Internals
>> -=================
>> -
>> -When adding/removing memory that uses memory block devices (i.e. ordinary RAM),
>> -the device_hotplug_lock should be held to:
>> -
>> -- synchronize against online/offline requests (e.g. via sysfs). This way, memory
>> - block devices can only be accessed (.online/.state attributes) by user
>> - space once memory has been fully added. And when removing memory, we
>> - know nobody is in critical sections.
>> -- synchronize against CPU hotplug and similar (e.g. relevant for ACPI and PPC)
>> -
>> -Especially, there is a possible lock inversion that is avoided using
>> -device_hotplug_lock when adding memory and user space tries to online that
>> -memory faster than expected:
>> -
>> -- device_online() will first take the device_lock(), followed by
>> - mem_hotplug_lock
>> -- add_memory_resource() will first take the mem_hotplug_lock, followed by
>> - the device_lock() (while creating the devices, during bus_add_device()).
>> -
>> -As the device is visible to user space before taking the device_lock(), this
>> -can result in a lock inversion.
>> -
>> -onlining/offlining of memory should be done via device_online()/
>> -device_offline() - to make sure it is properly synchronized to actions
>> -via sysfs. Holding device_hotplug_lock is advised (to e.g. protect online_type)
>> -
>> -When adding/removing/onlining/offlining memory or adding/removing
>> -heterogeneous/device memory, we should always hold the mem_hotplug_lock in
>> -write mode to serialise memory hotplug (e.g. access to global/zone
>> -variables).
>> -
>> -In addition, mem_hotplug_lock (in contrast to device_hotplug_lock) in read
>> -mode allows for a quite efficient get_online_mems/put_online_mems
>> -implementation, so code accessing memory can protect from that memory
>> -vanishing.
>> -
>> -
>> Future Work
>> ===========
>>
>>
>
>I reported this yesterday to Jonathan and Mike
>
>https://lkml.org/lkml/2018/12/3/340
>
Ah, Thanks :-)
>
>Anyhow
>
>Reviewed-by: David Hildenbrand <david@redhat.com>
>
>--
>
>Thanks,
>
>David / dhildenb
--
Wei Yang
Help you, Help me
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH 1/2] admin-guide/memory-hotplug.rst: remove locking internal part from admin-guide
2018-12-05 8:30 ` Mike Rapoport
@ 2018-12-05 9:20 ` Wei Yang
0 siblings, 0 replies; 17+ messages in thread
From: Wei Yang @ 2018-12-05 9:20 UTC (permalink / raw)
To: Mike Rapoport
Cc: David Hildenbrand, Wei Yang, mhocko, osalvador, akpm, linux-doc,
linux-mm, Jonathan Corbet, Mike Rapoport
On Wed, Dec 05, 2018 at 10:30:13AM +0200, Mike Rapoport wrote:
>On Wed, Dec 05, 2018 at 09:03:24AM +0100, David Hildenbrand wrote:
>> On 05.12.18 03:34, Wei Yang wrote:
>> > Locking Internal section exists in core-api documentation, which is more
>> > suitable for this.
>> >
>> > This patch removes the duplication part here.
>> >
>> > Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
>> > ---
>> > Documentation/admin-guide/mm/memory-hotplug.rst | 40 -------------------------
>> > 1 file changed, 40 deletions(-)
>> >
>> > diff --git a/Documentation/admin-guide/mm/memory-hotplug.rst b/Documentation/admin-guide/mm/memory-hotplug.rst
>> > index 5c4432c96c4b..241f4ce1e387 100644
>> > --- a/Documentation/admin-guide/mm/memory-hotplug.rst
>> > +++ b/Documentation/admin-guide/mm/memory-hotplug.rst
>> > @@ -392,46 +392,6 @@ Need more implementation yet....
>> > - Notification completion of remove works by OS to firmware.
>> > - Guard from remove if not yet.
>
>[ ... ]
>
>> > Future Work
>> > ===========
>> >
>> >
>>
>> I reported this yesterday to Jonathan and Mike
>>
>> https://lkml.org/lkml/2018/12/3/340
>
>Somehow I've missed it...
>
>> Anyhow
>>
>> Reviewed-by: David Hildenbrand <david@redhat.com>
>
>Acked-by: Mike Rapoport <rppt@linux.ibm.com>
>
Thanks :-)
>>
>> --
>>
>> Thanks,
>>
>> David / dhildenb
>>
>
>--
>Sincerely yours,
>Mike.
--
Wei Yang
Help you, Help me
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks
2018-12-05 8:08 ` David Hildenbrand
@ 2018-12-05 9:23 ` Wei Yang
0 siblings, 0 replies; 17+ messages in thread
From: Wei Yang @ 2018-12-05 9:23 UTC (permalink / raw)
To: David Hildenbrand; +Cc: Wei Yang, mhocko, osalvador, akpm, linux-doc, linux-mm
On Wed, Dec 05, 2018 at 09:08:47AM +0100, David Hildenbrand wrote:
>On 05.12.18 03:34, Wei Yang wrote:
>> Currently locking for memory hotplug is a little complicated.
>>
>> Generally speaking, we leverage the two global lock:
>>
>> * device_hotplug_lock
>> * mem_hotplug_lock
>>
>> to serialise the process.
>>
>> While for the long term, we are willing to have more fine-grained lock
>> to provide higher scalability.
>>
>> This patch divides Locking Internal section based on these two global
>> locks to help readers to understand it. Also it adds some new finding to
>> enrich it.
>>
>> [David: words arrangement]
>>
>> Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
>> ---
>> Documentation/core-api/memory-hotplug.rst | 27 ++++++++++++++++++++++++---
>> 1 file changed, 24 insertions(+), 3 deletions(-)
>>
>> diff --git a/Documentation/core-api/memory-hotplug.rst b/Documentation/core-api/memory-hotplug.rst
>> index de7467e48067..95662b283328 100644
>> --- a/Documentation/core-api/memory-hotplug.rst
>> +++ b/Documentation/core-api/memory-hotplug.rst
>> @@ -89,6 +89,20 @@ NOTIFY_STOP stops further processing of the notification queue.
>> Locking Internals
>> =================
>>
>> +There are three locks involved in memory-hotplug, two global lock and one local
>> +lock:
>> +
>> +- device_hotplug_lock
>> +- mem_hotplug_lock
>> +- device_lock
>> +
>
>Do we really only ever use these three and not anything else when
>adding/removing/onlining/offlining memory?
>
>(I am thinking e.g. about pgdat_resize_lock)
Yes there are more than those three, pgdat_resize_lock is one of them.
>
>If so, you should phrase that maybe more generally Or add more details :)
Yep, while I don't get a whole picture about the pgdat_resize_lock. The
usage of this lock scatter in many places.
>
>"In addition to fine grained locks like pgdat_resize_lock, there are
>three locks involved ..."
>
Sounds better :-)
--
Wei Yang
Help you, Help me
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks
2018-12-05 8:40 ` Mike Rapoport
@ 2018-12-05 9:24 ` Wei Yang
0 siblings, 0 replies; 17+ messages in thread
From: Wei Yang @ 2018-12-05 9:24 UTC (permalink / raw)
To: Mike Rapoport
Cc: Wei Yang, david, mhocko, osalvador, akpm, linux-doc, linux-mm
On Wed, Dec 05, 2018 at 10:40:45AM +0200, Mike Rapoport wrote:
>On Wed, Dec 05, 2018 at 10:34:26AM +0800, Wei Yang wrote:
>> Currently locking for memory hotplug is a little complicated.
>>
>> Generally speaking, we leverage the two global lock:
>>
>> * device_hotplug_lock
>> * mem_hotplug_lock
>>
>> to serialise the process.
>>
>> While for the long term, we are willing to have more fine-grained lock
>> to provide higher scalability.
>>
>> This patch divides Locking Internal section based on these two global
>> locks to help readers to understand it. Also it adds some new finding to
>> enrich it.
>>
>> [David: words arrangement]
>>
>> Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
>> ---
>> Documentation/core-api/memory-hotplug.rst | 27 ++++++++++++++++++++++++---
>> 1 file changed, 24 insertions(+), 3 deletions(-)
>>
>> diff --git a/Documentation/core-api/memory-hotplug.rst b/Documentation/core-api/memory-hotplug.rst
>> index de7467e48067..95662b283328 100644
>> --- a/Documentation/core-api/memory-hotplug.rst
>> +++ b/Documentation/core-api/memory-hotplug.rst
>> @@ -89,6 +89,20 @@ NOTIFY_STOP stops further processing of the notification queue.
>> Locking Internals
>> =================
>>
>> +There are three locks involved in memory-hotplug, two global lock and one local
>
>typo: ^locks
>
Thanks :-)
>> +lock:
>> +
>> +- device_hotplug_lock
>> +- mem_hotplug_lock
>> +- device_lock
>> +
>> +Currently, they are twisted together for all kinds of reasons. The following
>> +part is divided into device_hotplug_lock and mem_hotplug_lock parts
>> +respectively to describe those tricky situations.
>> +
>> +device_hotplug_lock
>> +---------------------
>> +
>> When adding/removing memory that uses memory block devices (i.e. ordinary RAM),
>> the device_hotplug_lock should be held to:
>>
>> @@ -111,13 +125,20 @@ As the device is visible to user space before taking the device_lock(), this
>> can result in a lock inversion.
>>
>> onlining/offlining of memory should be done via device_online()/
>> -device_offline() - to make sure it is properly synchronized to actions
>> -via sysfs. Holding device_hotplug_lock is advised (to e.g. protect online_type)
>> +device_offline() - to make sure it is properly synchronized to actions via
>> +sysfs. Even mem_hotplug_lock is used to protect the process, because of the
>
>I think it should be "Even if mem_hotplug_lock ..."
>
Ah, my poor English, will fix it in next version. :-)
>> +lock inversion described above, holding device_hotplug_lock is still advised
>> +(to e.g. protect online_type)
>> +
>> +mem_hotplug_lock
>> +---------------------
>>
>> When adding/removing/onlining/offlining memory or adding/removing
>> heterogeneous/device memory, we should always hold the mem_hotplug_lock in
>> write mode to serialise memory hotplug (e.g. access to global/zone
>> -variables).
>> +variables). Currently, we take advantage of this to serialise sparsemem's
>> +mem_section handling in sparse_add_one_section() and
>> +sparse_remove_one_section().
>>
>> In addition, mem_hotplug_lock (in contrast to device_hotplug_lock) in read
>> mode allows for a quite efficient get_online_mems/put_online_mems
>> --
>> 2.15.1
>>
>
>--
>Sincerely yours,
>Mike.
--
Wei Yang
Help you, Help me
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH 1/2] admin-guide/memory-hotplug.rst: remove locking internal part from admin-guide
2018-12-05 2:34 [PATCH 1/2] admin-guide/memory-hotplug.rst: remove locking internal part from admin-guide Wei Yang
2018-12-05 2:34 ` [PATCH 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks Wei Yang
2018-12-05 8:03 ` [PATCH 1/2] admin-guide/memory-hotplug.rst: remove locking internal part from admin-guide David Hildenbrand
@ 2018-12-05 12:11 ` Michal Hocko
2018-12-06 0:26 ` [PATCH v2 " Wei Yang
3 siblings, 0 replies; 17+ messages in thread
From: Michal Hocko @ 2018-12-05 12:11 UTC (permalink / raw)
To: Wei Yang; +Cc: david, osalvador, akpm, linux-doc, linux-mm
On Wed 05-12-18 10:34:25, Wei Yang wrote:
> Locking Internal section exists in core-api documentation, which is more
> suitable for this.
>
> This patch removes the duplication part here.
>
> Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
Yes this doesn't really make any sense in an admin guide. It is a pure
implementation detail nobody should be relying on.
--
Michal Hocko
SUSE Labs
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks
2018-12-05 2:34 ` [PATCH 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks Wei Yang
2018-12-05 8:08 ` David Hildenbrand
2018-12-05 8:40 ` Mike Rapoport
@ 2018-12-05 12:13 ` Michal Hocko
2018-12-05 12:20 ` Wei Yang
2 siblings, 1 reply; 17+ messages in thread
From: Michal Hocko @ 2018-12-05 12:13 UTC (permalink / raw)
To: Wei Yang; +Cc: david, osalvador, akpm, linux-doc, linux-mm
On Wed 05-12-18 10:34:26, Wei Yang wrote:
> Currently locking for memory hotplug is a little complicated.
>
> Generally speaking, we leverage the two global lock:
>
> * device_hotplug_lock
> * mem_hotplug_lock
>
> to serialise the process.
>
> While for the long term, we are willing to have more fine-grained lock
> to provide higher scalability.
>
> This patch divides Locking Internal section based on these two global
> locks to help readers to understand it. Also it adds some new finding to
> enrich it.
>
> [David: words arrangement]
>
> Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
For a love of mine I cannot find the locking description by Oscar. Maybe
it never existed and I just made it up ;) But if it is not imaginary
then my recollection is that it was much more comprehensive. If not then
even this is a good start.
> ---
> Documentation/core-api/memory-hotplug.rst | 27 ++++++++++++++++++++++++---
> 1 file changed, 24 insertions(+), 3 deletions(-)
>
> diff --git a/Documentation/core-api/memory-hotplug.rst b/Documentation/core-api/memory-hotplug.rst
> index de7467e48067..95662b283328 100644
> --- a/Documentation/core-api/memory-hotplug.rst
> +++ b/Documentation/core-api/memory-hotplug.rst
> @@ -89,6 +89,20 @@ NOTIFY_STOP stops further processing of the notification queue.
> Locking Internals
> =================
>
> +There are three locks involved in memory-hotplug, two global lock and one local
> +lock:
> +
> +- device_hotplug_lock
> +- mem_hotplug_lock
> +- device_lock
> +
> +Currently, they are twisted together for all kinds of reasons. The following
> +part is divided into device_hotplug_lock and mem_hotplug_lock parts
> +respectively to describe those tricky situations.
> +
> +device_hotplug_lock
> +---------------------
> +
> When adding/removing memory that uses memory block devices (i.e. ordinary RAM),
> the device_hotplug_lock should be held to:
>
> @@ -111,13 +125,20 @@ As the device is visible to user space before taking the device_lock(), this
> can result in a lock inversion.
>
> onlining/offlining of memory should be done via device_online()/
> -device_offline() - to make sure it is properly synchronized to actions
> -via sysfs. Holding device_hotplug_lock is advised (to e.g. protect online_type)
> +device_offline() - to make sure it is properly synchronized to actions via
> +sysfs. Even mem_hotplug_lock is used to protect the process, because of the
> +lock inversion described above, holding device_hotplug_lock is still advised
> +(to e.g. protect online_type)
> +
> +mem_hotplug_lock
> +---------------------
>
> When adding/removing/onlining/offlining memory or adding/removing
> heterogeneous/device memory, we should always hold the mem_hotplug_lock in
> write mode to serialise memory hotplug (e.g. access to global/zone
> -variables).
> +variables). Currently, we take advantage of this to serialise sparsemem's
> +mem_section handling in sparse_add_one_section() and
> +sparse_remove_one_section().
>
> In addition, mem_hotplug_lock (in contrast to device_hotplug_lock) in read
> mode allows for a quite efficient get_online_mems/put_online_mems
> --
> 2.15.1
>
--
Michal Hocko
SUSE Labs
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks
2018-12-05 12:13 ` Michal Hocko
@ 2018-12-05 12:20 ` Wei Yang
0 siblings, 0 replies; 17+ messages in thread
From: Wei Yang @ 2018-12-05 12:20 UTC (permalink / raw)
To: Michal Hocko; +Cc: Wei Yang, david, osalvador, akpm, linux-doc, linux-mm
On Wed, Dec 05, 2018 at 01:13:10PM +0100, Michal Hocko wrote:
>On Wed 05-12-18 10:34:26, Wei Yang wrote:
>> Currently locking for memory hotplug is a little complicated.
>>
>> Generally speaking, we leverage the two global lock:
>>
>> * device_hotplug_lock
>> * mem_hotplug_lock
>>
>> to serialise the process.
>>
>> While for the long term, we are willing to have more fine-grained lock
>> to provide higher scalability.
>>
>> This patch divides Locking Internal section based on these two global
>> locks to help readers to understand it. Also it adds some new finding to
>> enrich it.
>>
>> [David: words arrangement]
>>
>> Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
>
>For a love of mine I cannot find the locking description by Oscar. Maybe
>it never existed and I just made it up ;) But if it is not imaginary
>then my recollection is that it was much more comprehensive. If not then
>even this is a good start.
Thanks.
If Oscar has already has some work on it, this could be a complement to
his work :-)
>
>> ---
>> Documentation/core-api/memory-hotplug.rst | 27 ++++++++++++++++++++++++---
>> 1 file changed, 24 insertions(+), 3 deletions(-)
>>
>> diff --git a/Documentation/core-api/memory-hotplug.rst b/Documentation/core-api/memory-hotplug.rst
>> index de7467e48067..95662b283328 100644
>> --- a/Documentation/core-api/memory-hotplug.rst
>> +++ b/Documentation/core-api/memory-hotplug.rst
>> @@ -89,6 +89,20 @@ NOTIFY_STOP stops further processing of the notification queue.
>> Locking Internals
>> =================
>>
>> +There are three locks involved in memory-hotplug, two global lock and one local
>> +lock:
>> +
>> +- device_hotplug_lock
>> +- mem_hotplug_lock
>> +- device_lock
>> +
>> +Currently, they are twisted together for all kinds of reasons. The following
>> +part is divided into device_hotplug_lock and mem_hotplug_lock parts
>> +respectively to describe those tricky situations.
>> +
>> +device_hotplug_lock
>> +---------------------
>> +
>> When adding/removing memory that uses memory block devices (i.e. ordinary RAM),
>> the device_hotplug_lock should be held to:
>>
>> @@ -111,13 +125,20 @@ As the device is visible to user space before taking the device_lock(), this
>> can result in a lock inversion.
>>
>> onlining/offlining of memory should be done via device_online()/
>> -device_offline() - to make sure it is properly synchronized to actions
>> -via sysfs. Holding device_hotplug_lock is advised (to e.g. protect online_type)
>> +device_offline() - to make sure it is properly synchronized to actions via
>> +sysfs. Even mem_hotplug_lock is used to protect the process, because of the
>> +lock inversion described above, holding device_hotplug_lock is still advised
>> +(to e.g. protect online_type)
>> +
>> +mem_hotplug_lock
>> +---------------------
>>
>> When adding/removing/onlining/offlining memory or adding/removing
>> heterogeneous/device memory, we should always hold the mem_hotplug_lock in
>> write mode to serialise memory hotplug (e.g. access to global/zone
>> -variables).
>> +variables). Currently, we take advantage of this to serialise sparsemem's
>> +mem_section handling in sparse_add_one_section() and
>> +sparse_remove_one_section().
>>
>> In addition, mem_hotplug_lock (in contrast to device_hotplug_lock) in read
>> mode allows for a quite efficient get_online_mems/put_online_mems
>> --
>> 2.15.1
>>
>
>--
>Michal Hocko
>SUSE Labs
--
Wei Yang
Help you, Help me
^ permalink raw reply [flat|nested] 17+ messages in thread
* [PATCH v2 1/2] admin-guide/memory-hotplug.rst: remove locking internal part from admin-guide
2018-12-05 2:34 [PATCH 1/2] admin-guide/memory-hotplug.rst: remove locking internal part from admin-guide Wei Yang
` (2 preceding siblings ...)
2018-12-05 12:11 ` Michal Hocko
@ 2018-12-06 0:26 ` Wei Yang
2018-12-06 0:26 ` [PATCH v2 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks Wei Yang
3 siblings, 1 reply; 17+ messages in thread
From: Wei Yang @ 2018-12-06 0:26 UTC (permalink / raw)
To: rppt, david, mhocko, osalvador; +Cc: akpm, linux-doc, linux-mm, Wei Yang
Locking Internal section exists in core-api documentation, which is more
suitable for this.
This patch removes the duplication part here.
Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
Reviewed-by: David Hildenbrand <david@redhat.com>
Acked-by: Mike Rapoport <rppt@linux.ibm.com>
---
Documentation/admin-guide/mm/memory-hotplug.rst | 40 -------------------------
1 file changed, 40 deletions(-)
diff --git a/Documentation/admin-guide/mm/memory-hotplug.rst b/Documentation/admin-guide/mm/memory-hotplug.rst
index 5c4432c96c4b..241f4ce1e387 100644
--- a/Documentation/admin-guide/mm/memory-hotplug.rst
+++ b/Documentation/admin-guide/mm/memory-hotplug.rst
@@ -392,46 +392,6 @@ Need more implementation yet....
- Notification completion of remove works by OS to firmware.
- Guard from remove if not yet.
-
-Locking Internals
-=================
-
-When adding/removing memory that uses memory block devices (i.e. ordinary RAM),
-the device_hotplug_lock should be held to:
-
-- synchronize against online/offline requests (e.g. via sysfs). This way, memory
- block devices can only be accessed (.online/.state attributes) by user
- space once memory has been fully added. And when removing memory, we
- know nobody is in critical sections.
-- synchronize against CPU hotplug and similar (e.g. relevant for ACPI and PPC)
-
-Especially, there is a possible lock inversion that is avoided using
-device_hotplug_lock when adding memory and user space tries to online that
-memory faster than expected:
-
-- device_online() will first take the device_lock(), followed by
- mem_hotplug_lock
-- add_memory_resource() will first take the mem_hotplug_lock, followed by
- the device_lock() (while creating the devices, during bus_add_device()).
-
-As the device is visible to user space before taking the device_lock(), this
-can result in a lock inversion.
-
-onlining/offlining of memory should be done via device_online()/
-device_offline() - to make sure it is properly synchronized to actions
-via sysfs. Holding device_hotplug_lock is advised (to e.g. protect online_type)
-
-When adding/removing/onlining/offlining memory or adding/removing
-heterogeneous/device memory, we should always hold the mem_hotplug_lock in
-write mode to serialise memory hotplug (e.g. access to global/zone
-variables).
-
-In addition, mem_hotplug_lock (in contrast to device_hotplug_lock) in read
-mode allows for a quite efficient get_online_mems/put_online_mems
-implementation, so code accessing memory can protect from that memory
-vanishing.
-
-
Future Work
===========
--
2.15.1
^ permalink raw reply [flat|nested] 17+ messages in thread
* [PATCH v2 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks
2018-12-06 0:26 ` [PATCH v2 " Wei Yang
@ 2018-12-06 0:26 ` Wei Yang
2018-12-06 7:32 ` Mike Rapoport
2018-12-06 8:22 ` David Hildenbrand
0 siblings, 2 replies; 17+ messages in thread
From: Wei Yang @ 2018-12-06 0:26 UTC (permalink / raw)
To: rppt, david, mhocko, osalvador; +Cc: akpm, linux-doc, linux-mm, Wei Yang
Currently locking for memory hotplug is a little complicated.
Generally speaking, we leverage the two global lock:
* device_hotplug_lock
* mem_hotplug_lock
to serialise the process.
While for the long term, we are willing to have more fine-grained lock
to provide higher scalability.
This patch divides Locking Internal section based on these two global
locks to help readers to understand it. Also it adds some new finding to
enrich it.
[David: words arrangement]
Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
---
v2: adjustment based on David and Mike comment
---
Documentation/core-api/memory-hotplug.rst | 27 ++++++++++++++++++++++++---
1 file changed, 24 insertions(+), 3 deletions(-)
diff --git a/Documentation/core-api/memory-hotplug.rst b/Documentation/core-api/memory-hotplug.rst
index de7467e48067..51d477ad4b80 100644
--- a/Documentation/core-api/memory-hotplug.rst
+++ b/Documentation/core-api/memory-hotplug.rst
@@ -89,6 +89,20 @@ NOTIFY_STOP stops further processing of the notification queue.
Locking Internals
=================
+In addition to fine grained locks like pgdat_resize_lock, there are three locks
+involved
+
+- device_hotplug_lock
+- mem_hotplug_lock
+- device_lock
+
+Currently, they are twisted together for all kinds of reasons. The following
+part is divided into device_hotplug_lock and mem_hotplug_lock parts
+respectively to describe those tricky situations.
+
+device_hotplug_lock
+---------------------
+
When adding/removing memory that uses memory block devices (i.e. ordinary RAM),
the device_hotplug_lock should be held to:
@@ -111,13 +125,20 @@ As the device is visible to user space before taking the device_lock(), this
can result in a lock inversion.
onlining/offlining of memory should be done via device_online()/
-device_offline() - to make sure it is properly synchronized to actions
-via sysfs. Holding device_hotplug_lock is advised (to e.g. protect online_type)
+device_offline() - to make sure it is properly synchronized to actions via
+sysfs. Even if mem_hotplug_lock is used to protect the process, because of the
+lock inversion described above, holding device_hotplug_lock is still advised
+(to e.g. protect online_type)
+
+mem_hotplug_lock
+---------------------
When adding/removing/onlining/offlining memory or adding/removing
heterogeneous/device memory, we should always hold the mem_hotplug_lock in
write mode to serialise memory hotplug (e.g. access to global/zone
-variables).
+variables). Currently, we take advantage of this to serialise sparsemem's
+mem_section handling in sparse_add_one_section() and
+sparse_remove_one_section().
In addition, mem_hotplug_lock (in contrast to device_hotplug_lock) in read
mode allows for a quite efficient get_online_mems/put_online_mems
--
2.15.1
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH v2 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks
2018-12-06 0:26 ` [PATCH v2 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks Wei Yang
@ 2018-12-06 7:32 ` Mike Rapoport
2018-12-06 8:22 ` David Hildenbrand
1 sibling, 0 replies; 17+ messages in thread
From: Mike Rapoport @ 2018-12-06 7:32 UTC (permalink / raw)
To: Wei Yang; +Cc: david, mhocko, osalvador, akpm, linux-doc, linux-mm
On Thu, Dec 06, 2018 at 08:26:22AM +0800, Wei Yang wrote:
> Currently locking for memory hotplug is a little complicated.
>
> Generally speaking, we leverage the two global lock:
>
> * device_hotplug_lock
> * mem_hotplug_lock
>
> to serialise the process.
>
> While for the long term, we are willing to have more fine-grained lock
> to provide higher scalability.
>
> This patch divides Locking Internal section based on these two global
> locks to help readers to understand it. Also it adds some new finding to
> enrich it.
>
> [David: words arrangement]
>
> Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
Reviewd-by: Mike Rapoport <rppt@linux.ibm.com>
> ---
> v2: adjustment based on David and Mike comment
> ---
> Documentation/core-api/memory-hotplug.rst | 27 ++++++++++++++++++++++++---
> 1 file changed, 24 insertions(+), 3 deletions(-)
>
> diff --git a/Documentation/core-api/memory-hotplug.rst b/Documentation/core-api/memory-hotplug.rst
> index de7467e48067..51d477ad4b80 100644
> --- a/Documentation/core-api/memory-hotplug.rst
> +++ b/Documentation/core-api/memory-hotplug.rst
> @@ -89,6 +89,20 @@ NOTIFY_STOP stops further processing of the notification queue.
> Locking Internals
> =================
>
> +In addition to fine grained locks like pgdat_resize_lock, there are three locks
> +involved
> +
> +- device_hotplug_lock
> +- mem_hotplug_lock
> +- device_lock
> +
> +Currently, they are twisted together for all kinds of reasons. The following
> +part is divided into device_hotplug_lock and mem_hotplug_lock parts
> +respectively to describe those tricky situations.
> +
> +device_hotplug_lock
> +---------------------
> +
> When adding/removing memory that uses memory block devices (i.e. ordinary RAM),
> the device_hotplug_lock should be held to:
>
> @@ -111,13 +125,20 @@ As the device is visible to user space before taking the device_lock(), this
> can result in a lock inversion.
>
> onlining/offlining of memory should be done via device_online()/
> -device_offline() - to make sure it is properly synchronized to actions
> -via sysfs. Holding device_hotplug_lock is advised (to e.g. protect online_type)
> +device_offline() - to make sure it is properly synchronized to actions via
> +sysfs. Even if mem_hotplug_lock is used to protect the process, because of the
> +lock inversion described above, holding device_hotplug_lock is still advised
> +(to e.g. protect online_type)
> +
> +mem_hotplug_lock
> +---------------------
>
> When adding/removing/onlining/offlining memory or adding/removing
> heterogeneous/device memory, we should always hold the mem_hotplug_lock in
> write mode to serialise memory hotplug (e.g. access to global/zone
> -variables).
> +variables). Currently, we take advantage of this to serialise sparsemem's
> +mem_section handling in sparse_add_one_section() and
> +sparse_remove_one_section().
>
> In addition, mem_hotplug_lock (in contrast to device_hotplug_lock) in read
> mode allows for a quite efficient get_online_mems/put_online_mems
> --
> 2.15.1
>
--
Sincerely yours,
Mike.
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH v2 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks
2018-12-06 0:26 ` [PATCH v2 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks Wei Yang
2018-12-06 7:32 ` Mike Rapoport
@ 2018-12-06 8:22 ` David Hildenbrand
1 sibling, 0 replies; 17+ messages in thread
From: David Hildenbrand @ 2018-12-06 8:22 UTC (permalink / raw)
To: Wei Yang, rppt, mhocko, osalvador; +Cc: akpm, linux-doc, linux-mm
On 06.12.18 01:26, Wei Yang wrote:
> Currently locking for memory hotplug is a little complicated.
>
> Generally speaking, we leverage the two global lock:
>
> * device_hotplug_lock
> * mem_hotplug_lock
>
> to serialise the process.
>
> While for the long term, we are willing to have more fine-grained lock
> to provide higher scalability.
>
> This patch divides Locking Internal section based on these two global
> locks to help readers to understand it. Also it adds some new finding to
> enrich it.
>
> [David: words arrangement]
>
> Signed-off-by: Wei Yang <richard.weiyang@gmail.com>
> ---
> v2: adjustment based on David and Mike comment
> ---
> Documentation/core-api/memory-hotplug.rst | 27 ++++++++++++++++++++++++---
> 1 file changed, 24 insertions(+), 3 deletions(-)
>
> diff --git a/Documentation/core-api/memory-hotplug.rst b/Documentation/core-api/memory-hotplug.rst
> index de7467e48067..51d477ad4b80 100644
> --- a/Documentation/core-api/memory-hotplug.rst
> +++ b/Documentation/core-api/memory-hotplug.rst
> @@ -89,6 +89,20 @@ NOTIFY_STOP stops further processing of the notification queue.
> Locking Internals
> =================
>
> +In addition to fine grained locks like pgdat_resize_lock, there are three locks
> +involved
> +
> +- device_hotplug_lock
> +- mem_hotplug_lock
> +- device_lock
> +
> +Currently, they are twisted together for all kinds of reasons. The following
> +part is divided into device_hotplug_lock and mem_hotplug_lock parts
> +respectively to describe those tricky situations.
> +
> +device_hotplug_lock
> +---------------------
> +
> When adding/removing memory that uses memory block devices (i.e. ordinary RAM),
> the device_hotplug_lock should be held to:
>
> @@ -111,13 +125,20 @@ As the device is visible to user space before taking the device_lock(), this
> can result in a lock inversion.
>
> onlining/offlining of memory should be done via device_online()/
> -device_offline() - to make sure it is properly synchronized to actions
> -via sysfs. Holding device_hotplug_lock is advised (to e.g. protect online_type)
> +device_offline() - to make sure it is properly synchronized to actions via
> +sysfs. Even if mem_hotplug_lock is used to protect the process, because of the
> +lock inversion described above, holding device_hotplug_lock is still advised
> +(to e.g. protect online_type)
> +
> +mem_hotplug_lock
> +---------------------
>
> When adding/removing/onlining/offlining memory or adding/removing
> heterogeneous/device memory, we should always hold the mem_hotplug_lock in
> write mode to serialise memory hotplug (e.g. access to global/zone
> -variables).
> +variables). Currently, we take advantage of this to serialise sparsemem's
> +mem_section handling in sparse_add_one_section() and
> +sparse_remove_one_section().
>
> In addition, mem_hotplug_lock (in contrast to device_hotplug_lock) in read
> mode allows for a quite efficient get_online_mems/put_online_mems
>
Reviewed-by: David Hildenbrand <david@redhat.com>
--
Thanks,
David / dhildenb
^ permalink raw reply [flat|nested] 17+ messages in thread
end of thread, other threads:[~2018-12-06 8:22 UTC | newest]
Thread overview: 17+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-12-05 2:34 [PATCH 1/2] admin-guide/memory-hotplug.rst: remove locking internal part from admin-guide Wei Yang
2018-12-05 2:34 ` [PATCH 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks Wei Yang
2018-12-05 8:08 ` David Hildenbrand
2018-12-05 9:23 ` Wei Yang
2018-12-05 8:40 ` Mike Rapoport
2018-12-05 9:24 ` Wei Yang
2018-12-05 12:13 ` Michal Hocko
2018-12-05 12:20 ` Wei Yang
2018-12-05 8:03 ` [PATCH 1/2] admin-guide/memory-hotplug.rst: remove locking internal part from admin-guide David Hildenbrand
2018-12-05 8:30 ` Mike Rapoport
2018-12-05 9:20 ` Wei Yang
2018-12-05 9:20 ` Wei Yang
2018-12-05 12:11 ` Michal Hocko
2018-12-06 0:26 ` [PATCH v2 " Wei Yang
2018-12-06 0:26 ` [PATCH v2 2/2] core-api/memory-hotplug.rst: divide Locking Internal section by different locks Wei Yang
2018-12-06 7:32 ` Mike Rapoport
2018-12-06 8:22 ` David Hildenbrand
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox