linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
From: Shiju Jose <shiju.jose@huawei.com>
To: Borislav Petkov <bp@alien8.de>
Cc: "rafael@kernel.org" <rafael@kernel.org>,
	"akpm@linux-foundation.org" <akpm@linux-foundation.org>,
	"rppt@kernel.org" <rppt@kernel.org>,
	"dferguson@amperecomputing.com" <dferguson@amperecomputing.com>,
	"linux-edac@vger.kernel.org" <linux-edac@vger.kernel.org>,
	"linux-acpi@vger.kernel.org" <linux-acpi@vger.kernel.org>,
	"linux-mm@kvack.org" <linux-mm@kvack.org>,
	"linux-doc@vger.kernel.org" <linux-doc@vger.kernel.org>,
	"tony.luck@intel.com" <tony.luck@intel.com>,
	"lenb@kernel.org" <lenb@kernel.org>,
	"leo.duran@amd.com" <leo.duran@amd.com>,
	"Yazen.Ghannam@amd.com" <Yazen.Ghannam@amd.com>,
	"mchehab@kernel.org" <mchehab@kernel.org>,
	Jonathan Cameron <jonathan.cameron@huawei.com>,
	Linuxarm <linuxarm@huawei.com>,
	"rientjes@google.com" <rientjes@google.com>,
	"jiaqiyan@google.com" <jiaqiyan@google.com>,
	"Jon.Grimm@amd.com" <Jon.Grimm@amd.com>,
	"dave.hansen@linux.intel.com" <dave.hansen@linux.intel.com>,
	"naoya.horiguchi@nec.com" <naoya.horiguchi@nec.com>,
	"james.morse@arm.com" <james.morse@arm.com>,
	"jthoughton@google.com" <jthoughton@google.com>,
	"somasundaram.a@hpe.com" <somasundaram.a@hpe.com>,
	"erdemaktas@google.com" <erdemaktas@google.com>,
	"pgonda@google.com" <pgonda@google.com>,
	"duenwen@google.com" <duenwen@google.com>,
	"gthelen@google.com" <gthelen@google.com>,
	"wschwartz@amperecomputing.com" <wschwartz@amperecomputing.com>,
	"wbs@os.amperecomputing.com" <wbs@os.amperecomputing.com>,
	"nifan.cxl@gmail.com" <nifan.cxl@gmail.com>,
	tanxiaofei <tanxiaofei@huawei.com>,
	"Zengtao (B)" <prime.zeng@hisilicon.com>,
	"Roberto Sassu" <roberto.sassu@huawei.com>,
	"kangkang.shen@futurewei.com" <kangkang.shen@futurewei.com>,
	wanghuiqiang <wanghuiqiang@huawei.com>
Subject: RE: [PATCH v13 1/2] ACPI:RAS2: Add driver for the ACPI RAS2 feature table
Date: Mon, 5 Jan 2026 18:13:01 +0000	[thread overview]
Message-ID: <95978df0ee6c4254b20cc773c8a4df29@huawei.com> (raw)
In-Reply-To: <20251231131512.GBaVUh4NSWqvr2xhbM@fat_crate.local>

>-----Original Message-----
>From: Borislav Petkov <bp@alien8.de>
>Sent: 31 December 2025 13:15
>To: Shiju Jose <shiju.jose@huawei.com>
>Cc: rafael@kernel.org; akpm@linux-foundation.org; rppt@kernel.org;
>dferguson@amperecomputing.com; linux-edac@vger.kernel.org; linux-
>acpi@vger.kernel.org; linux-mm@kvack.org; linux-doc@vger.kernel.org;
>tony.luck@intel.com; lenb@kernel.org; leo.duran@amd.com;
>Yazen.Ghannam@amd.com; mchehab@kernel.org; Jonathan Cameron
><jonathan.cameron@huawei.com>; Linuxarm <linuxarm@huawei.com>;
>rientjes@google.com; jiaqiyan@google.com; Jon.Grimm@amd.com;
>dave.hansen@linux.intel.com; naoya.horiguchi@nec.com;
>james.morse@arm.com; jthoughton@google.com; somasundaram.a@hpe.com;
>erdemaktas@google.com; pgonda@google.com; duenwen@google.com;
>gthelen@google.com; wschwartz@amperecomputing.com;
>wbs@os.amperecomputing.com; nifan.cxl@gmail.com; tanxiaofei
><tanxiaofei@huawei.com>; Zengtao (B) <prime.zeng@hisilicon.com>; Roberto
>Sassu <roberto.sassu@huawei.com>; kangkang.shen@futurewei.com;
>wanghuiqiang <wanghuiqiang@huawei.com>
>Subject: Re: [PATCH v13 1/2] ACPI:RAS2: Add driver for the ACPI RAS2 feature
>table
>
>On Tue, Nov 25, 2025 at 01:28:19PM +0000, Shiju Jose wrote:
>> I will change to depends. I followed the existing CONFIG ACPI_CPPC_LIB.
>
>Read the "Note:" under
>
>"- reverse dependencies: "select" <symbol> ["if" <expr>]"
>
>here pls: Documentation/kbuild/kconfig-language.rst
>
>Now, some of the Kconfig symbols:
>
>diff --git a/drivers/acpi/Kconfig b/drivers/acpi/Kconfig index
>2322b0470d07..7f846c22fc30 100644
>--- a/drivers/acpi/Kconfig
>+++ b/drivers/acpi/Kconfig
>@@ -295,7 +295,7 @@ config ACPI_CPPC_LIB
>
> config ACPI_RAS2
>        bool "ACPI RAS2 driver"
>-       depends on AUXILIARY_BUS
>+       select AUXILIARY_BUS
>        depends on MAILBOX
>        depends on PCC
>        help
>diff --git a/drivers/ras/Kconfig b/drivers/ras/Kconfig index
>dfc3a899280e..a1e6aed8bcc8 100644
>--- a/drivers/ras/Kconfig
>+++ b/drivers/ras/Kconfig
>@@ -51,7 +51,7 @@ config MEM_ACPI_RAS2
>        depends on ACPI_RAS2
>        depends on EDAC
>        depends on EDAC_SCRUB
>-       depends on NUMA_KEEP_MEMINFO
>+       select NUMA_KEEP_MEMINFO
>        help
>          The driver binds to the auxiliary device added by the ACPI RAS2
>          feature table parser. The driver uses a PCC channel subspace to
>
>are made to be selectable only and so you should select them because they're
>non-visible. Just remember that blindly selecting things is evil.

Hi Borislav,
 
Thanks for correcting. Modified. Will post these changes in V15 after your feedback
for v14.
>
>
>> >> +			sspcc->last_cmd, sspcc->pcc_id);
>> >> +		status &= ~PCC_STATUS_ERROR;
>> >> +		writew_relaxed(status, &gen_comm_base->status);
>> >> +		return -EIO;
>> >> +	}
>> >> +
>> >> +	cap_status = readw_relaxed(&gen_comm_base->set_caps_status);
>> >
>> >Is that register read always successful or you need to handle errors here too?
>>
>> Return value of 'set capability status'  is decoded and return error
>> code on error case in the below function call  'return
>decode_cap_error(cap_status)'
>
>Yah, this is not a common coding pattern. What you do is something like this:
>
>diff --git a/drivers/acpi/ras2.c b/drivers/acpi/ras2.c index
>627895fee143..4caef7f2c4ea 100644
>--- a/drivers/acpi/ras2.c
>+++ b/drivers/acpi/ras2.c
>@@ -85,7 +85,6 @@ static int decode_cap_error(u32 cap_status)  static int
>check_pcc_chan(struct ras2_sspcc *sspcc)  {
>        struct acpi_ras2_shmem __iomem *gen_comm_base = sspcc->comm_addr;
>-       u32 cap_status;
>        u16 status;
>        int rc;
>
>@@ -114,9 +113,11 @@ static int check_pcc_chan(struct ras2_sspcc *sspcc)
>                return -EIO;
>        }
>
>-       cap_status = readw_relaxed(&gen_comm_base->set_caps_status);
>+       rc =
>+ decode_cap_error(readw_relaxed(&gen_comm_base->set_caps_status));
>+
>        writew_relaxed(0x0, &gen_comm_base->set_caps_status);
>-       return decode_cap_error(cap_status);
>+
>+       return rc;
> }

Modified. 
>
>
>> >> +	 */
>> >> +	if (cmd == PCC_CMD_EXEC_RAS2 || sspcc->pcc_mrtt) {
>> >> +		rc = check_pcc_chan(sspcc);
>> >> +		if (sspcc->pcc_mrtt)
>> >> +			sspcc->last_cmd_cmpl_time = ktime_get();
>> >> +	}
>> >> +
>> >> +	if (pcc_channel->mbox->txdone_irq)
>> >> +		mbox_chan_txdone(pcc_channel, rc);
>> >> +	else
>> >> +		mbox_client_txdone(pcc_channel, rc);
>> >> +
>> >> +	return rc < 0 ? rc : 0;
>> >
>> >So you mean simply
>> >
>> >	return rc;
>> >
>> >no? rc can be 0 too so what's the point of the ternary expression?
>>
>> This was added to handle the case rc = check_pcc_chan(sspcc); is not
>> called and last rc is returned from mbox_send_message() call because
>> mbox_send_message() return non-negative value for success and negative
>value for failure as per the documentation.
>> https://elixir.bootlin.com/linux/v6.18-rc7/source/drivers/mailbox/mail
>> box.c#L241
>
>Why do you keep pointing to some indexing service? What's wrong with simply
>pasting the code snippet you mean so that I can find it myself too?
>
>Anyway, what's wrong with:
>
>        /* Ring doorbell */
>        rc = mbox_send_message(pcc_channel, &cmd);
>        if (rc < 0) {
>                dev_warn(ras2_ctx->dev,
>                         "Error sending PCC mbox message cmd: 0x%x, rc:%d\n", cmd, rc);
>                return rc;
>        }
>
>Also, cmds in hex please.

Modified.
>
>> >And what's the logic here? You'd capture rc above from
>> >check_pcc_chan() and even if it is != 0, you'd pass it into the mbox*
>> >functions? I guess that weirdness deserves a comment...
>>
>> Both mbox_chan_txdone() and  mbox_client_txdone() required the status
>> of the last transmission as second argument.
>
>Yah, comment please!

Added comment.
>
>s> >
>> >> +{
>> >> +	struct acpi_ras2_pcc_desc *pcc_desc_list;
>> >> +	struct ras2_mem_ctx *ras2_ctx;
>> >> +	u16 i, count;
>> >> +
>> >> +	if (ras2_tab->header.length < sizeof(*ras2_tab)) {
>> >> +		pr_warn(FW_WARN "ACPI RAS2 table present but broken (too
>> >short, size=%u)\n",
>> >> +			ras2_tab->header.length);
>> >> +		return;
>> >> +	}
>> >> +
>> >> +	if (!ras2_tab->num_pcc_descs) {
>> >> +		pr_warn(FW_WARN "No PCC descs in ACPI RAS2 table\n");
>> >> +		return;
>> >> +	}
>> >
>> >You need to sanity-check the number of descs so that the below
>> >allocation doesn't go nuts.
>> Sorry, can you give more information?
>> I am wondering the above check  'if (!ras2_tab->num_pcc_descs)' { } is not
>enough?
>
>You've done what I wanted:
>
>        if (!ras2_tab->num_pcc_descs || ras2_tab->num_pcc_descs >
>RAS2_MAX_NUM_PCC_DESCS) {
>                pr_warn(FW_WARN "No/Invalid number of PCC descs(%d) in ACPI RAS2
>table\n",
>                        ras2_tab->num_pcc_descs);
>                return -EINVAL;
>        }
>
>The RAS2_MAX_NUM_PCC_DESCS thing.

Ok.
>
>> >Also, what's the point of that pctx_list array at all? So that you
>> >can do uninit on the ->adev in case you encounter a failure?
>> Local variable ras2_ctx  is updated when calling add_aux_device() in
>> each iteration as
>> add_aux_device()  allocates memory for struct ras2_mem_ctx  for the
>> corresponding PCC descriptor in the RAS2 table.
>> Thus storing pointer to each ras2_ctx  in pctx_list[] to uninit all the previously
>added auxiliary devices
>> using auxiliary_device_uninit(->adev); when encounter a failure in a later
>iteration.
>
>Looks weird. Lemme look at your new submission and see whether I can make it
>better.
Sure.
>
>
>> >> +		return;
>> >> +	}
>> >> +
>> >> +	acpi_ras2_parse(ras2_tab);
>> >
>> >This function does some table sanity checking and warns. What it
>> >should do is fail the driver load if the table is broken.
>>
>> Sure.
>> If acpi_ras2_parse() and thus acpi_ras2_init() return error, can you
>> guide how to handle this error in acpi_init(void) where  acpi_ras2_init() is
>called?
>> Something similar to this below,
>> diff --git a/drivers/acpi/bus.c b/drivers/acpi/bus.c index
>> b02ceb2837c6..8b4fc572a05b 100644
>> --- a/drivers/acpi/bus.c
>> +++ b/drivers/acpi/bus.c
>> @@ -1475,7 +1475,12 @@ static int __init acpi_init(void)
>>         acpi_debugger_init();
>>         acpi_setup_sb_notify_handler();
>>         acpi_viot_init();
>> -       acpi_ras2_init();
>> +       result = acpi_ras2_init();
>> +       if (result) {
>> +               kobject_put(acpi_kobj);
>> +               disable_acpi();
>
>No, you certainly won't disable ACPI if that RAS2 thing parsing fails. What you
>should do is not allow the RAS2 memory driver to load.
>
>Lemme look at your new version.

Sure. Thanks.
>
>Thx.
>
>--
>Regards/Gruss,
>    Boris.
>
>https://people.kernel.org/tglx/notes-about-netiquette
>

Thanks,
Shiju


  reply	other threads:[~2026-01-05 18:13 UTC|newest]

Thread overview: 14+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-11-21 18:28 [PATCH v13 0/2] ACPI: Add support for " shiju.jose
2025-11-21 18:28 ` [PATCH v13 1/2] ACPI:RAS2: Add driver for the " shiju.jose
2025-11-22  5:18   ` Randy Dunlap
2025-11-25  7:36   ` Borislav Petkov
2025-11-25 13:28     ` Shiju Jose
2025-12-31 13:15       ` Borislav Petkov
2026-01-05 18:13         ` Shiju Jose [this message]
2025-12-08 16:29     ` Jonathan Cameron
2025-12-31 16:07       ` Borislav Petkov
2025-11-21 18:28 ` [PATCH v13 2/2] ras: mem: Add ACPI RAS2 memory driver shiju.jose
2025-11-22  5:18   ` Randy Dunlap
2025-11-24  9:29     ` Shiju Jose
2025-11-22  5:22   ` Randy Dunlap
2025-11-24 10:00     ` Shiju Jose

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=95978df0ee6c4254b20cc773c8a4df29@huawei.com \
    --to=shiju.jose@huawei.com \
    --cc=Jon.Grimm@amd.com \
    --cc=Yazen.Ghannam@amd.com \
    --cc=akpm@linux-foundation.org \
    --cc=bp@alien8.de \
    --cc=dave.hansen@linux.intel.com \
    --cc=dferguson@amperecomputing.com \
    --cc=duenwen@google.com \
    --cc=erdemaktas@google.com \
    --cc=gthelen@google.com \
    --cc=james.morse@arm.com \
    --cc=jiaqiyan@google.com \
    --cc=jonathan.cameron@huawei.com \
    --cc=jthoughton@google.com \
    --cc=kangkang.shen@futurewei.com \
    --cc=lenb@kernel.org \
    --cc=leo.duran@amd.com \
    --cc=linux-acpi@vger.kernel.org \
    --cc=linux-doc@vger.kernel.org \
    --cc=linux-edac@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=linuxarm@huawei.com \
    --cc=mchehab@kernel.org \
    --cc=naoya.horiguchi@nec.com \
    --cc=nifan.cxl@gmail.com \
    --cc=pgonda@google.com \
    --cc=prime.zeng@hisilicon.com \
    --cc=rafael@kernel.org \
    --cc=rientjes@google.com \
    --cc=roberto.sassu@huawei.com \
    --cc=rppt@kernel.org \
    --cc=somasundaram.a@hpe.com \
    --cc=tanxiaofei@huawei.com \
    --cc=tony.luck@intel.com \
    --cc=wanghuiqiang@huawei.com \
    --cc=wbs@os.amperecomputing.com \
    --cc=wschwartz@amperecomputing.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox