From: Tvrtko Ursulin <tursulin@ursulin.net>
To: "Loïc Molinari" <loic.molinari@collabora.com>,
"Maarten Lankhorst" <maarten.lankhorst@linux.intel.com>,
"Maxime Ripard" <mripard@kernel.org>,
"Thomas Zimmermann" <tzimmermann@suse.de>,
"David Airlie" <airlied@gmail.com>,
"Simona Vetter" <simona@ffwll.ch>,
"Jani Nikula" <jani.nikula@linux.intel.com>,
"Joonas Lahtinen" <joonas.lahtinen@linux.intel.com>,
"Rodrigo Vivi" <rodrigo.vivi@intel.com>,
"Boris Brezillon" <boris.brezillon@collabora.com>,
"Rob Herring" <robh@kernel.org>,
"Steven Price" <steven.price@arm.com>,
"Liviu Dudau" <liviu.dudau@arm.com>,
"Melissa Wen" <mwen@igalia.com>,
"Maíra Canal" <mcanal@igalia.com>,
"Hugh Dickins" <hughd@google.com>,
"Baolin Wang" <baolin.wang@linux.alibaba.com>,
"Andrew Morton" <akpm@linux-foundation.org>,
"Al Viro" <viro@zeniv.linux.org.uk>,
"Mikołaj Wasiak" <mikolaj.wasiak@intel.com>,
"Christian Brauner" <brauner@kernel.org>,
"Nitin Gote" <nitin.r.gote@intel.com>,
"Andi Shyti" <andi.shyti@linux.intel.com>,
"Jonathan Corbet" <corbet@lwn.net>,
"Christopher Healy" <healych@amazon.com>,
"Matthew Wilcox" <willy@infradead.org>,
"Bagas Sanjaya" <bagasdotme@gmail.com>
Cc: linux-kernel@vger.kernel.org, dri-devel@lists.freedesktop.org,
intel-gfx@lists.freedesktop.org, linux-mm@kvack.org,
linux-doc@vger.kernel.org, kernel@collabora.com
Subject: Re: [PATCH v9 05/11] drm/i915: Use huge tmpfs mountpoint helpers
Date: Thu, 20 Nov 2025 09:31:53 +0000 [thread overview]
Message-ID: <fee6476e-3168-4f4f-ae2f-3ef65fe209b0@ursulin.net> (raw)
In-Reply-To: <20251114170303.2800-6-loic.molinari@collabora.com>
On 14/11/2025 17:02, Loïc Molinari wrote:
> Make use of the new drm_gem_huge_mnt_create() and
> drm_gem_get_huge_mnt() helpers to avoid code duplication. Now that
> it's just a few lines long, the single function in i915_gemfs.c is
> moved into v3d_gem_shmem.c.
>
> v3:
> - use huge tmpfs mountpoint in drm_device
> - move i915_gemfs.c into i915_gem_shmem.c
>
> v4:
> - clean up mountpoint creation error handling
>
> v5:
> - use drm_gem_has_huge_mnt() helper
>
> v7:
> - include <drm/drm_print.h> in i915_gem_shmem.c
>
> v8:
> - keep logging notice message with CONFIG_TRANSPARENT_HUGEPAGE=n
> - don't access huge_mnt field with CONFIG_TRANSPARENT_HUGEPAGE=n
>
> v9:
> - replace drm_gem_has_huge_mnt() by drm_gem_get_huge_mnt()
> - remove useless ternary op test in selftests/huge_pages.c
>
> Signed-off-by: Loïc Molinari <loic.molinari@collabora.com>
> ---
> drivers/gpu/drm/i915/Makefile | 3 +-
> drivers/gpu/drm/i915/gem/i915_gem_shmem.c | 48 +++++++++----
> drivers/gpu/drm/i915/gem/i915_gemfs.c | 71 -------------------
> drivers/gpu/drm/i915/gem/i915_gemfs.h | 14 ----
> .../gpu/drm/i915/gem/selftests/huge_pages.c | 16 +++--
> drivers/gpu/drm/i915/i915_drv.h | 5 --
> 6 files changed, 47 insertions(+), 110 deletions(-)
> delete mode 100644 drivers/gpu/drm/i915/gem/i915_gemfs.c
> delete mode 100644 drivers/gpu/drm/i915/gem/i915_gemfs.h
>
> diff --git a/drivers/gpu/drm/i915/Makefile b/drivers/gpu/drm/i915/Makefile
> index 84ec79b64960..b5a8c0a6b747 100644
> --- a/drivers/gpu/drm/i915/Makefile
> +++ b/drivers/gpu/drm/i915/Makefile
> @@ -169,8 +169,7 @@ gem-y += \
> gem/i915_gem_ttm_move.o \
> gem/i915_gem_ttm_pm.o \
> gem/i915_gem_userptr.o \
> - gem/i915_gem_wait.o \
> - gem/i915_gemfs.o
> + gem/i915_gem_wait.o
> i915-y += \
> $(gem-y) \
> i915_active.o \
> diff --git a/drivers/gpu/drm/i915/gem/i915_gem_shmem.c b/drivers/gpu/drm/i915/gem/i915_gem_shmem.c
> index 26dda55a07ff..15c2c6fde2ac 100644
> --- a/drivers/gpu/drm/i915/gem/i915_gem_shmem.c
> +++ b/drivers/gpu/drm/i915/gem/i915_gem_shmem.c
> @@ -9,14 +9,16 @@
> #include <linux/uio.h>
>
> #include <drm/drm_cache.h>
> +#include <drm/drm_gem.h>
> +#include <drm/drm_print.h>
>
> #include "gem/i915_gem_region.h"
> #include "i915_drv.h"
> #include "i915_gem_object.h"
> #include "i915_gem_tiling.h"
> -#include "i915_gemfs.h"
> #include "i915_scatterlist.h"
> #include "i915_trace.h"
> +#include "i915_utils.h"
>
> /*
> * Move folios to appropriate lru and release the batch, decrementing the
> @@ -497,6 +499,7 @@ static int __create_shmem(struct drm_i915_private *i915,
> resource_size_t size)
> {
> unsigned long flags = VM_NORESERVE;
> + struct vfsmount *huge_mnt;
> struct file *filp;
>
> drm_gem_private_object_init(&i915->drm, obj, size);
> @@ -515,9 +518,9 @@ static int __create_shmem(struct drm_i915_private *i915,
> if (BITS_PER_LONG == 64 && size > MAX_LFS_FILESIZE)
> return -E2BIG;
>
> - if (i915->mm.gemfs)
> - filp = shmem_file_setup_with_mnt(i915->mm.gemfs, "i915", size,
> - flags);
> + huge_mnt = drm_gem_get_huge_mnt(&i915->drm);
> + if (huge_mnt)
> + filp = shmem_file_setup_with_mnt(huge_mnt, "i915", size, flags);
> else
> filp = shmem_file_setup("i915", size, flags);
> if (IS_ERR(filp))
> @@ -644,21 +647,40 @@ i915_gem_object_create_shmem_from_data(struct drm_i915_private *i915,
>
> static int init_shmem(struct intel_memory_region *mem)
> {
> - i915_gemfs_init(mem->i915);
> - intel_memory_region_set_name(mem, "system");
> + struct drm_i915_private *i915 = mem->i915;
>
> - return 0; /* We have fallback to the kernel mnt if gemfs init failed. */
> -}
> + /*
> + * By creating our own shmemfs mountpoint, we can pass in
> + * mount flags that better match our usecase.
> + *
> + * One example, although it is probably better with a per-file
> + * control, is selecting huge page allocations ("huge=within_size").
> + * However, we only do so on platforms which benefit from it, or to
> + * offset the overhead of iommu lookups, where with latter it is a net
> + * win even on platforms which would otherwise see some performance
> + * regressions such a slow reads issue on Broadwell and Skylake.
> + */
>
> -static int release_shmem(struct intel_memory_region *mem)
> -{
> - i915_gemfs_fini(mem->i915);
> - return 0;
> + if (GRAPHICS_VER(i915) < 11 && !i915_vtd_active(i915))
> + goto no_thp;
> +
> + drm_gem_huge_mnt_create(&i915->drm, "within_size");
> + if (drm_gem_get_huge_mnt(&i915->drm))
> + drm_info(&i915->drm, "Using Transparent Hugepages\n");
> + else
> + drm_notice(&i915->drm,
> + "Transparent Hugepage support is recommended for optimal performance%s\n",
> + GRAPHICS_VER(i915) >= 11 ? " on this platform!" :
> + " when IOMMU is enabled!");
> +
> + no_thp:
> + intel_memory_region_set_name(mem, "system");
> +
> + return 0; /* We have fallback to the kernel mnt if huge mnt failed. */
> }
>
> static const struct intel_memory_region_ops shmem_region_ops = {
> .init = init_shmem,
> - .release = release_shmem,
> .init_object = shmem_object_init,
> };
>
> diff --git a/drivers/gpu/drm/i915/gem/i915_gemfs.c b/drivers/gpu/drm/i915/gem/i915_gemfs.c
> deleted file mode 100644
> index 1f1290214031..000000000000
> --- a/drivers/gpu/drm/i915/gem/i915_gemfs.c
> +++ /dev/null
> @@ -1,71 +0,0 @@
> -// SPDX-License-Identifier: MIT
> -/*
> - * Copyright © 2017 Intel Corporation
> - */
> -
> -#include <linux/fs.h>
> -#include <linux/mount.h>
> -#include <linux/fs_context.h>
> -
> -#include <drm/drm_print.h>
> -
> -#include "i915_drv.h"
> -#include "i915_gemfs.h"
> -#include "i915_utils.h"
> -
> -void i915_gemfs_init(struct drm_i915_private *i915)
> -{
> - struct file_system_type *type;
> - struct fs_context *fc;
> - struct vfsmount *gemfs;
> - int ret;
> -
> - /*
> - * By creating our own shmemfs mountpoint, we can pass in
> - * mount flags that better match our usecase.
> - *
> - * One example, although it is probably better with a per-file
> - * control, is selecting huge page allocations ("huge=within_size").
> - * However, we only do so on platforms which benefit from it, or to
> - * offset the overhead of iommu lookups, where with latter it is a net
> - * win even on platforms which would otherwise see some performance
> - * regressions such a slow reads issue on Broadwell and Skylake.
> - */
> -
> - if (GRAPHICS_VER(i915) < 11 && !i915_vtd_active(i915))
> - return;
> -
> - if (!IS_ENABLED(CONFIG_TRANSPARENT_HUGEPAGE))
> - goto err;
> -
> - type = get_fs_type("tmpfs");
> - if (!type)
> - goto err;
> -
> - fc = fs_context_for_mount(type, SB_KERNMOUNT);
> - if (IS_ERR(fc))
> - goto err;
> - ret = vfs_parse_fs_string(fc, "source", "tmpfs");
> - if (!ret)
> - ret = vfs_parse_fs_string(fc, "huge", "within_size");
> - if (!ret)
> - gemfs = fc_mount_longterm(fc);
> - put_fs_context(fc);
> - if (ret)
> - goto err;
> -
> - i915->mm.gemfs = gemfs;
> - drm_info(&i915->drm, "Using Transparent Hugepages\n");
> - return;
> -
> -err:
> - drm_notice(&i915->drm,
> - "Transparent Hugepage support is recommended for optimal performance%s\n",
> - GRAPHICS_VER(i915) >= 11 ? " on this platform!" :
> - " when IOMMU is enabled!");
> -}
> -
> -void i915_gemfs_fini(struct drm_i915_private *i915)
> -{
> - kern_unmount(i915->mm.gemfs);
> -}
> diff --git a/drivers/gpu/drm/i915/gem/i915_gemfs.h b/drivers/gpu/drm/i915/gem/i915_gemfs.h
> deleted file mode 100644
> index 16d4333c9a4e..000000000000
> --- a/drivers/gpu/drm/i915/gem/i915_gemfs.h
> +++ /dev/null
> @@ -1,14 +0,0 @@
> -/* SPDX-License-Identifier: MIT */
> -/*
> - * Copyright © 2017 Intel Corporation
> - */
> -
> -#ifndef __I915_GEMFS_H__
> -#define __I915_GEMFS_H__
> -
> -struct drm_i915_private;
> -
> -void i915_gemfs_init(struct drm_i915_private *i915);
> -void i915_gemfs_fini(struct drm_i915_private *i915);
> -
> -#endif
> diff --git a/drivers/gpu/drm/i915/gem/selftests/huge_pages.c b/drivers/gpu/drm/i915/gem/selftests/huge_pages.c
> index bd08605a1611..28aef75630a2 100644
> --- a/drivers/gpu/drm/i915/gem/selftests/huge_pages.c
> +++ b/drivers/gpu/drm/i915/gem/selftests/huge_pages.c
> @@ -1316,7 +1316,7 @@ typedef struct drm_i915_gem_object *
>
> static inline bool igt_can_allocate_thp(struct drm_i915_private *i915)
> {
> - return i915->mm.gemfs && has_transparent_hugepage();
> + return !!drm_gem_get_huge_mnt(&i915->drm);
> }
>
> static struct drm_i915_gem_object *
> @@ -1761,7 +1761,9 @@ static int igt_tmpfs_fallback(void *arg)
> struct drm_i915_private *i915 = arg;
> struct i915_address_space *vm;
> struct i915_gem_context *ctx;
> - struct vfsmount *gemfs = i915->mm.gemfs;
> +#ifdef CONFIG_TRANSPARENT_HUGEPAGE
> + struct vfsmount *huge_mnt = i915->drm.huge_mnt;
> +#endif
> struct drm_i915_gem_object *obj;
> struct i915_vma *vma;
> struct file *file;
> @@ -1782,10 +1784,12 @@ static int igt_tmpfs_fallback(void *arg)
> /*
> * Make sure that we don't burst into a ball of flames upon falling back
> * to tmpfs, which we rely on if on the off-chance we encounter a failure
> - * when setting up gemfs.
> + * when setting up a huge mountpoint.
> */
>
> - i915->mm.gemfs = NULL;
> +#ifdef CONFIG_TRANSPARENT_HUGEPAGE
> + i915->drm.huge_mnt = NULL;
> +#endif
>
> obj = i915_gem_object_create_shmem(i915, PAGE_SIZE);
> if (IS_ERR(obj)) {
> @@ -1819,7 +1823,9 @@ static int igt_tmpfs_fallback(void *arg)
> out_put:
> i915_gem_object_put(obj);
> out_restore:
> - i915->mm.gemfs = gemfs;
> +#ifdef CONFIG_TRANSPARENT_HUGEPAGE
> + i915->drm.huge_mnt = huge_mnt;
> +#endif
Apart from this layering violation in the selftest, this version looks
good to me. I am just wondering if we could somehow improve this aspect.
I was thinking a self-test builds only special version of
i915_gem_object_create_shmem. Call chain is deep but there are flags
passed on:
i915_gem_object_create_shmem
i915_gem_object_create_region
__i915_gem_object_create_region
err = mem->ops->init_object(
So we could add a new helper like:
selftests_create_shmem
i915_gem_object_create_region(...flags =
I915_BO_ALLOC_SELFTESTS_NOTHP...)
And in __create_shmem we just make it:
...
huge_mnt = drm_gem_get_huge_mnt(&i915->drm) &&
if (IS_ENABLED(..SELFTESTS..) &&
(flags & I915_BO_ALLOC_SELFTESTS_NOTHP))
huge_mnt = NULL;
...
It would avoid the ifdef and needing to play games with the DRM internals.
How does that sound to you?
Regards,
Tvrtko
>
> i915_vm_put(vm);
> out:
> diff --git a/drivers/gpu/drm/i915/i915_drv.h b/drivers/gpu/drm/i915/i915_drv.h
> index 95f9ddf22ce4..93a5af3de334 100644
> --- a/drivers/gpu/drm/i915/i915_drv.h
> +++ b/drivers/gpu/drm/i915/i915_drv.h
> @@ -141,11 +141,6 @@ struct i915_gem_mm {
> */
> atomic_t free_count;
>
> - /**
> - * tmpfs instance used for shmem backed objects
> - */
> - struct vfsmount *gemfs;
> -
> struct intel_memory_region *regions[INTEL_REGION_UNKNOWN];
>
> struct notifier_block oom_notifier;
next prev parent reply other threads:[~2025-11-20 9:32 UTC|newest]
Thread overview: 22+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-11-14 17:02 [PATCH v9 00/11] drm: Reduce page tables overhead with THP Loïc Molinari
2025-11-14 17:02 ` [PATCH v9 01/11] drm/shmem-helper: Simplify page offset calculation in fault handler Loïc Molinari
2025-11-14 17:02 ` [PATCH v9 02/11] drm/shmem-helper: Map huge pages " Loïc Molinari
2025-11-14 17:02 ` [PATCH v9 03/11] drm/gem: Introduce drm_gem_get_unmapped_area() fop Loïc Molinari
2025-11-14 17:02 ` [PATCH v9 04/11] drm/gem: Add huge tmpfs mountpoint helpers Loïc Molinari
2025-11-14 17:02 ` [PATCH v9 05/11] drm/i915: Use " Loïc Molinari
2025-11-20 9:31 ` Tvrtko Ursulin [this message]
2025-11-28 18:41 ` Loïc Molinari
2025-12-02 11:01 ` Tvrtko Ursulin
2025-12-05 8:37 ` Loïc Molinari
2025-11-14 17:02 ` [PATCH v9 06/11] drm/v3d: " Loïc Molinari
2025-11-14 17:11 ` Boris Brezillon
2025-11-14 17:21 ` Loïc Molinari
2025-11-20 9:39 ` Tvrtko Ursulin
2025-11-28 18:19 ` Loïc Molinari
2025-11-14 17:02 ` [PATCH v9 07/11] drm/gem: Get rid of *_with_mnt helpers Loïc Molinari
2025-11-14 17:15 ` Boris Brezillon
2025-11-14 17:02 ` [PATCH v9 08/11] drm/panthor: Introduce huge tmpfs mountpoint option Loïc Molinari
2025-11-14 17:03 ` [PATCH v9 09/11] drm/panthor: Improve IOMMU map/unmap debugging logs Loïc Molinari
2025-11-26 10:10 ` Boris Brezillon
2025-11-14 17:03 ` [PATCH v9 10/11] drm/panfrost: Introduce huge tmpfs mountpoint option Loïc Molinari
2025-11-14 17:03 ` [PATCH v9 11/11] Documentation/gpu/drm-mm: Add THP paragraph to GEM mapping section Loïc Molinari
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=fee6476e-3168-4f4f-ae2f-3ef65fe209b0@ursulin.net \
--to=tursulin@ursulin.net \
--cc=airlied@gmail.com \
--cc=akpm@linux-foundation.org \
--cc=andi.shyti@linux.intel.com \
--cc=bagasdotme@gmail.com \
--cc=baolin.wang@linux.alibaba.com \
--cc=boris.brezillon@collabora.com \
--cc=brauner@kernel.org \
--cc=corbet@lwn.net \
--cc=dri-devel@lists.freedesktop.org \
--cc=healych@amazon.com \
--cc=hughd@google.com \
--cc=intel-gfx@lists.freedesktop.org \
--cc=jani.nikula@linux.intel.com \
--cc=joonas.lahtinen@linux.intel.com \
--cc=kernel@collabora.com \
--cc=linux-doc@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=liviu.dudau@arm.com \
--cc=loic.molinari@collabora.com \
--cc=maarten.lankhorst@linux.intel.com \
--cc=mcanal@igalia.com \
--cc=mikolaj.wasiak@intel.com \
--cc=mripard@kernel.org \
--cc=mwen@igalia.com \
--cc=nitin.r.gote@intel.com \
--cc=robh@kernel.org \
--cc=rodrigo.vivi@intel.com \
--cc=simona@ffwll.ch \
--cc=steven.price@arm.com \
--cc=tzimmermann@suse.de \
--cc=viro@zeniv.linux.org.uk \
--cc=willy@infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox