From: Mike Kravetz <mike.kravetz@oracle.com>
To: David Hildenbrand <david@redhat.com>,
linux-mm@kvack.org, linux-kernel@vger.kernel.org
Cc: Michal Hocko <mhocko@suse.com>,
Oscar Salvador <osalvador@suse.de>, Zi Yan <ziy@nvidia.com>,
Muchun Song <songmuchun@bytedance.com>,
Naoya Horiguchi <naoya.horiguchi@linux.dev>,
David Rientjes <rientjes@google.com>,
"Aneesh Kumar K . V" <aneesh.kumar@linux.ibm.com>,
Andrew Morton <akpm@linux-foundation.org>
Subject: Re: [PATCH v2 2/4] hugetlb: add HPageCma flag and code to free non-gigantic pages in CMA
Date: Fri, 1 Oct 2021 10:50:01 -0700 [thread overview]
Message-ID: <bd1ab280-3287-eaa6-a56f-557d3ddc5941@oracle.com> (raw)
In-Reply-To: <24dc6cfd-8b2e-4ed6-d64a-9bf9372b1b4c@oracle.com>
On 9/29/21 4:21 PM, Mike Kravetz wrote:
> On 9/29/21 12:42 PM, Mike Kravetz wrote:
>> On 9/24/21 2:36 AM, David Hildenbrand wrote:
>>> On 23.09.21 19:53, Mike Kravetz wrote:
>>>> When huge page demotion is fully implemented, gigantic pages can be
>>>> demoted to a smaller huge page size. For example, on x86 a 1G page
>>>> can be demoted to 512 2M pages. However, gigantic pages can potentially
>>>> be allocated from CMA. If a gigantic page which was allocated from CMA
>>>> is demoted, the corresponding demoted pages needs to be returned to CMA.
>>>>
>>>> In order to track hugetlb pages that need to be returned to CMA, add the
>>>> hugetlb specific flag HPageCma. Flag is set when a huge page is
>>>> allocated from CMA and transferred to any demoted pages. Non-gigantic
>>>> huge page freeing code checks for the flag and takes appropriate action.
>>>
>>> Do we really need that flag or couldn't we simply always try cma_release() and fallback to out ordinary freeing-path?
>>>
>>> IIRC, cma knows exactly if something was allocated via a CMA are and can be free via it. No need for additional tracking usually.
>>>
>>
>> Yes, I think this is possible.
>> Initially, I thought the check for whether pages were part of CMA
>> involved a mutex or some type of locking. But, it really is
>> lightweight. So, should not be in issue calling in every case.
>
> When modifying the code, I did come across one issue. Sorry I did not
> immediately remember this.
>
> Gigantic pages are allocated as a 'set of pages' and turned into a compound
> page by the hugetlb code. They must be restored to a 'set of pages' before
> calling cma_release. You can not pass a compound page to cma_release.
> Non-gigantic page are allocated from the buddy directly as compound pages.
> They are returned to buddy as a compound page.
>
> So, the issue comes about when freeing a non-gigantic page. We would
> need to convert to a 'set of pages' before calling cma_release just to
> see if cma_release succeeds. Then, if it fails convert back to a
> compound page to call __free_pages. Conversion is somewhat expensive as
> we must modify every tail page struct.
>
> Some possible solutions:
> - Create a cma_pages_valid() routine that checks whether the pages
> belong to a cma region. Only convert to 'set of pages' if cma_pages_valid
> and we know subsequent cma_release will succeed.
> - Always convert non-gigantic pages to a 'set of pages' before freeing.
> Alternatively, don't allocate compound pages from buddy and just use
> the hugetlb gigantic page prep and destroy routines for all hugetlb
> page sizes.
> - Use some kind of flag as in proposed patch.
>
> Having hugetlb just allocate a set of pages from buddy is interesting.
> This would make the allocate/free code paths for gigantic and
> non-gigantic pages align more closely. It may in overall code simplification,
> not just for demote.
Taking this approach actually got a bit ugly in alloc_and_dissolve_huge_page
which is used in migration. Instead, I took the approach of adding a
'cma_pages_valid()' interface to check at page freeing time. Sending
out v3 shortly.
--
Mike Kravetz
next prev parent reply other threads:[~2021-10-01 17:50 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-09-23 17:53 [PATCH v2 0/4] hugetlb: add demote/split page functionality Mike Kravetz
2021-09-23 17:53 ` [PATCH v2 1/4] hugetlb: add demote hugetlb page sysfs interfaces Mike Kravetz
2021-09-23 21:24 ` Andrew Morton
2021-09-23 22:08 ` Mike Kravetz
2021-09-24 7:08 ` Aneesh Kumar K.V
2021-09-29 18:22 ` Mike Kravetz
2021-09-24 9:28 ` David Hildenbrand
2021-09-29 19:34 ` Mike Kravetz
2021-09-23 17:53 ` [PATCH v2 2/4] hugetlb: add HPageCma flag and code to free non-gigantic pages in CMA Mike Kravetz
2021-09-24 9:36 ` David Hildenbrand
2021-09-29 19:42 ` Mike Kravetz
2021-09-29 23:21 ` Mike Kravetz
2021-10-01 17:50 ` Mike Kravetz [this message]
2021-09-23 17:53 ` [PATCH v2 3/4] hugetlb: add demote bool to gigantic page routines Mike Kravetz
2021-09-23 17:53 ` [PATCH v2 4/4] hugetlb: add hugetlb demote page support Mike Kravetz
2021-09-24 9:44 ` David Hildenbrand
2021-09-29 19:54 ` Mike Kravetz
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=bd1ab280-3287-eaa6-a56f-557d3ddc5941@oracle.com \
--to=mike.kravetz@oracle.com \
--cc=akpm@linux-foundation.org \
--cc=aneesh.kumar@linux.ibm.com \
--cc=david@redhat.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mhocko@suse.com \
--cc=naoya.horiguchi@linux.dev \
--cc=osalvador@suse.de \
--cc=rientjes@google.com \
--cc=songmuchun@bytedance.com \
--cc=ziy@nvidia.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox