From: Shiju Jose <shiju.jose@huawei.com>
To: Borislav Petkov <bp@alien8.de>
Cc: "rafael@kernel.org" <rafael@kernel.org>,
"akpm@linux-foundation.org" <akpm@linux-foundation.org>,
"rppt@kernel.org" <rppt@kernel.org>,
"dferguson@amperecomputing.com" <dferguson@amperecomputing.com>,
"linux-edac@vger.kernel.org" <linux-edac@vger.kernel.org>,
"linux-acpi@vger.kernel.org" <linux-acpi@vger.kernel.org>,
"linux-mm@kvack.org" <linux-mm@kvack.org>,
"linux-doc@vger.kernel.org" <linux-doc@vger.kernel.org>,
"tony.luck@intel.com" <tony.luck@intel.com>,
"lenb@kernel.org" <lenb@kernel.org>,
"leo.duran@amd.com" <leo.duran@amd.com>,
"Yazen.Ghannam@amd.com" <Yazen.Ghannam@amd.com>,
"mchehab@kernel.org" <mchehab@kernel.org>,
Jonathan Cameron <jonathan.cameron@huawei.com>,
Linuxarm <linuxarm@huawei.com>,
"rientjes@google.com" <rientjes@google.com>,
"jiaqiyan@google.com" <jiaqiyan@google.com>,
"Jon.Grimm@amd.com" <Jon.Grimm@amd.com>,
"dave.hansen@linux.intel.com" <dave.hansen@linux.intel.com>,
"naoya.horiguchi@nec.com" <naoya.horiguchi@nec.com>,
"james.morse@arm.com" <james.morse@arm.com>,
"jthoughton@google.com" <jthoughton@google.com>,
"somasundaram.a@hpe.com" <somasundaram.a@hpe.com>,
"erdemaktas@google.com" <erdemaktas@google.com>,
"pgonda@google.com" <pgonda@google.com>,
"duenwen@google.com" <duenwen@google.com>,
"gthelen@google.com" <gthelen@google.com>,
"wschwartz@amperecomputing.com" <wschwartz@amperecomputing.com>,
"wbs@os.amperecomputing.com" <wbs@os.amperecomputing.com>,
"nifan.cxl@gmail.com" <nifan.cxl@gmail.com>,
tanxiaofei <tanxiaofei@huawei.com>,
"Zengtao (B)" <prime.zeng@hisilicon.com>,
"Roberto Sassu" <roberto.sassu@huawei.com>,
"kangkang.shen@futurewei.com" <kangkang.shen@futurewei.com>,
wanghuiqiang <wanghuiqiang@huawei.com>
Subject: RE: [PATCH v13 1/2] ACPI:RAS2: Add driver for the ACPI RAS2 feature table
Date: Mon, 5 Jan 2026 18:13:01 +0000 [thread overview]
Message-ID: <95978df0ee6c4254b20cc773c8a4df29@huawei.com> (raw)
In-Reply-To: <20251231131512.GBaVUh4NSWqvr2xhbM@fat_crate.local>
>-----Original Message-----
>From: Borislav Petkov <bp@alien8.de>
>Sent: 31 December 2025 13:15
>To: Shiju Jose <shiju.jose@huawei.com>
>Cc: rafael@kernel.org; akpm@linux-foundation.org; rppt@kernel.org;
>dferguson@amperecomputing.com; linux-edac@vger.kernel.org; linux-
>acpi@vger.kernel.org; linux-mm@kvack.org; linux-doc@vger.kernel.org;
>tony.luck@intel.com; lenb@kernel.org; leo.duran@amd.com;
>Yazen.Ghannam@amd.com; mchehab@kernel.org; Jonathan Cameron
><jonathan.cameron@huawei.com>; Linuxarm <linuxarm@huawei.com>;
>rientjes@google.com; jiaqiyan@google.com; Jon.Grimm@amd.com;
>dave.hansen@linux.intel.com; naoya.horiguchi@nec.com;
>james.morse@arm.com; jthoughton@google.com; somasundaram.a@hpe.com;
>erdemaktas@google.com; pgonda@google.com; duenwen@google.com;
>gthelen@google.com; wschwartz@amperecomputing.com;
>wbs@os.amperecomputing.com; nifan.cxl@gmail.com; tanxiaofei
><tanxiaofei@huawei.com>; Zengtao (B) <prime.zeng@hisilicon.com>; Roberto
>Sassu <roberto.sassu@huawei.com>; kangkang.shen@futurewei.com;
>wanghuiqiang <wanghuiqiang@huawei.com>
>Subject: Re: [PATCH v13 1/2] ACPI:RAS2: Add driver for the ACPI RAS2 feature
>table
>
>On Tue, Nov 25, 2025 at 01:28:19PM +0000, Shiju Jose wrote:
>> I will change to depends. I followed the existing CONFIG ACPI_CPPC_LIB.
>
>Read the "Note:" under
>
>"- reverse dependencies: "select" <symbol> ["if" <expr>]"
>
>here pls: Documentation/kbuild/kconfig-language.rst
>
>Now, some of the Kconfig symbols:
>
>diff --git a/drivers/acpi/Kconfig b/drivers/acpi/Kconfig index
>2322b0470d07..7f846c22fc30 100644
>--- a/drivers/acpi/Kconfig
>+++ b/drivers/acpi/Kconfig
>@@ -295,7 +295,7 @@ config ACPI_CPPC_LIB
>
> config ACPI_RAS2
> bool "ACPI RAS2 driver"
>- depends on AUXILIARY_BUS
>+ select AUXILIARY_BUS
> depends on MAILBOX
> depends on PCC
> help
>diff --git a/drivers/ras/Kconfig b/drivers/ras/Kconfig index
>dfc3a899280e..a1e6aed8bcc8 100644
>--- a/drivers/ras/Kconfig
>+++ b/drivers/ras/Kconfig
>@@ -51,7 +51,7 @@ config MEM_ACPI_RAS2
> depends on ACPI_RAS2
> depends on EDAC
> depends on EDAC_SCRUB
>- depends on NUMA_KEEP_MEMINFO
>+ select NUMA_KEEP_MEMINFO
> help
> The driver binds to the auxiliary device added by the ACPI RAS2
> feature table parser. The driver uses a PCC channel subspace to
>
>are made to be selectable only and so you should select them because they're
>non-visible. Just remember that blindly selecting things is evil.
Hi Borislav,
Thanks for correcting. Modified. Will post these changes in V15 after your feedback
for v14.
>
>
>> >> + sspcc->last_cmd, sspcc->pcc_id);
>> >> + status &= ~PCC_STATUS_ERROR;
>> >> + writew_relaxed(status, &gen_comm_base->status);
>> >> + return -EIO;
>> >> + }
>> >> +
>> >> + cap_status = readw_relaxed(&gen_comm_base->set_caps_status);
>> >
>> >Is that register read always successful or you need to handle errors here too?
>>
>> Return value of 'set capability status' is decoded and return error
>> code on error case in the below function call 'return
>decode_cap_error(cap_status)'
>
>Yah, this is not a common coding pattern. What you do is something like this:
>
>diff --git a/drivers/acpi/ras2.c b/drivers/acpi/ras2.c index
>627895fee143..4caef7f2c4ea 100644
>--- a/drivers/acpi/ras2.c
>+++ b/drivers/acpi/ras2.c
>@@ -85,7 +85,6 @@ static int decode_cap_error(u32 cap_status) static int
>check_pcc_chan(struct ras2_sspcc *sspcc) {
> struct acpi_ras2_shmem __iomem *gen_comm_base = sspcc->comm_addr;
>- u32 cap_status;
> u16 status;
> int rc;
>
>@@ -114,9 +113,11 @@ static int check_pcc_chan(struct ras2_sspcc *sspcc)
> return -EIO;
> }
>
>- cap_status = readw_relaxed(&gen_comm_base->set_caps_status);
>+ rc =
>+ decode_cap_error(readw_relaxed(&gen_comm_base->set_caps_status));
>+
> writew_relaxed(0x0, &gen_comm_base->set_caps_status);
>- return decode_cap_error(cap_status);
>+
>+ return rc;
> }
Modified.
>
>
>> >> + */
>> >> + if (cmd == PCC_CMD_EXEC_RAS2 || sspcc->pcc_mrtt) {
>> >> + rc = check_pcc_chan(sspcc);
>> >> + if (sspcc->pcc_mrtt)
>> >> + sspcc->last_cmd_cmpl_time = ktime_get();
>> >> + }
>> >> +
>> >> + if (pcc_channel->mbox->txdone_irq)
>> >> + mbox_chan_txdone(pcc_channel, rc);
>> >> + else
>> >> + mbox_client_txdone(pcc_channel, rc);
>> >> +
>> >> + return rc < 0 ? rc : 0;
>> >
>> >So you mean simply
>> >
>> > return rc;
>> >
>> >no? rc can be 0 too so what's the point of the ternary expression?
>>
>> This was added to handle the case rc = check_pcc_chan(sspcc); is not
>> called and last rc is returned from mbox_send_message() call because
>> mbox_send_message() return non-negative value for success and negative
>value for failure as per the documentation.
>> https://elixir.bootlin.com/linux/v6.18-rc7/source/drivers/mailbox/mail
>> box.c#L241
>
>Why do you keep pointing to some indexing service? What's wrong with simply
>pasting the code snippet you mean so that I can find it myself too?
>
>Anyway, what's wrong with:
>
> /* Ring doorbell */
> rc = mbox_send_message(pcc_channel, &cmd);
> if (rc < 0) {
> dev_warn(ras2_ctx->dev,
> "Error sending PCC mbox message cmd: 0x%x, rc:%d\n", cmd, rc);
> return rc;
> }
>
>Also, cmds in hex please.
Modified.
>
>> >And what's the logic here? You'd capture rc above from
>> >check_pcc_chan() and even if it is != 0, you'd pass it into the mbox*
>> >functions? I guess that weirdness deserves a comment...
>>
>> Both mbox_chan_txdone() and mbox_client_txdone() required the status
>> of the last transmission as second argument.
>
>Yah, comment please!
Added comment.
>
>s> >
>> >> +{
>> >> + struct acpi_ras2_pcc_desc *pcc_desc_list;
>> >> + struct ras2_mem_ctx *ras2_ctx;
>> >> + u16 i, count;
>> >> +
>> >> + if (ras2_tab->header.length < sizeof(*ras2_tab)) {
>> >> + pr_warn(FW_WARN "ACPI RAS2 table present but broken (too
>> >short, size=%u)\n",
>> >> + ras2_tab->header.length);
>> >> + return;
>> >> + }
>> >> +
>> >> + if (!ras2_tab->num_pcc_descs) {
>> >> + pr_warn(FW_WARN "No PCC descs in ACPI RAS2 table\n");
>> >> + return;
>> >> + }
>> >
>> >You need to sanity-check the number of descs so that the below
>> >allocation doesn't go nuts.
>> Sorry, can you give more information?
>> I am wondering the above check 'if (!ras2_tab->num_pcc_descs)' { } is not
>enough?
>
>You've done what I wanted:
>
> if (!ras2_tab->num_pcc_descs || ras2_tab->num_pcc_descs >
>RAS2_MAX_NUM_PCC_DESCS) {
> pr_warn(FW_WARN "No/Invalid number of PCC descs(%d) in ACPI RAS2
>table\n",
> ras2_tab->num_pcc_descs);
> return -EINVAL;
> }
>
>The RAS2_MAX_NUM_PCC_DESCS thing.
Ok.
>
>> >Also, what's the point of that pctx_list array at all? So that you
>> >can do uninit on the ->adev in case you encounter a failure?
>> Local variable ras2_ctx is updated when calling add_aux_device() in
>> each iteration as
>> add_aux_device() allocates memory for struct ras2_mem_ctx for the
>> corresponding PCC descriptor in the RAS2 table.
>> Thus storing pointer to each ras2_ctx in pctx_list[] to uninit all the previously
>added auxiliary devices
>> using auxiliary_device_uninit(->adev); when encounter a failure in a later
>iteration.
>
>Looks weird. Lemme look at your new submission and see whether I can make it
>better.
Sure.
>
>
>> >> + return;
>> >> + }
>> >> +
>> >> + acpi_ras2_parse(ras2_tab);
>> >
>> >This function does some table sanity checking and warns. What it
>> >should do is fail the driver load if the table is broken.
>>
>> Sure.
>> If acpi_ras2_parse() and thus acpi_ras2_init() return error, can you
>> guide how to handle this error in acpi_init(void) where acpi_ras2_init() is
>called?
>> Something similar to this below,
>> diff --git a/drivers/acpi/bus.c b/drivers/acpi/bus.c index
>> b02ceb2837c6..8b4fc572a05b 100644
>> --- a/drivers/acpi/bus.c
>> +++ b/drivers/acpi/bus.c
>> @@ -1475,7 +1475,12 @@ static int __init acpi_init(void)
>> acpi_debugger_init();
>> acpi_setup_sb_notify_handler();
>> acpi_viot_init();
>> - acpi_ras2_init();
>> + result = acpi_ras2_init();
>> + if (result) {
>> + kobject_put(acpi_kobj);
>> + disable_acpi();
>
>No, you certainly won't disable ACPI if that RAS2 thing parsing fails. What you
>should do is not allow the RAS2 memory driver to load.
>
>Lemme look at your new version.
Sure. Thanks.
>
>Thx.
>
>--
>Regards/Gruss,
> Boris.
>
>https://people.kernel.org/tglx/notes-about-netiquette
>
Thanks,
Shiju
next prev parent reply other threads:[~2026-01-05 18:13 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-11-21 18:28 [PATCH v13 0/2] ACPI: Add support for " shiju.jose
2025-11-21 18:28 ` [PATCH v13 1/2] ACPI:RAS2: Add driver for the " shiju.jose
2025-11-22 5:18 ` Randy Dunlap
2025-11-25 7:36 ` Borislav Petkov
2025-11-25 13:28 ` Shiju Jose
2025-12-31 13:15 ` Borislav Petkov
2026-01-05 18:13 ` Shiju Jose [this message]
2025-12-08 16:29 ` Jonathan Cameron
2025-12-31 16:07 ` Borislav Petkov
2025-11-21 18:28 ` [PATCH v13 2/2] ras: mem: Add ACPI RAS2 memory driver shiju.jose
2025-11-22 5:18 ` Randy Dunlap
2025-11-24 9:29 ` Shiju Jose
2025-11-22 5:22 ` Randy Dunlap
2025-11-24 10:00 ` Shiju Jose
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=95978df0ee6c4254b20cc773c8a4df29@huawei.com \
--to=shiju.jose@huawei.com \
--cc=Jon.Grimm@amd.com \
--cc=Yazen.Ghannam@amd.com \
--cc=akpm@linux-foundation.org \
--cc=bp@alien8.de \
--cc=dave.hansen@linux.intel.com \
--cc=dferguson@amperecomputing.com \
--cc=duenwen@google.com \
--cc=erdemaktas@google.com \
--cc=gthelen@google.com \
--cc=james.morse@arm.com \
--cc=jiaqiyan@google.com \
--cc=jonathan.cameron@huawei.com \
--cc=jthoughton@google.com \
--cc=kangkang.shen@futurewei.com \
--cc=lenb@kernel.org \
--cc=leo.duran@amd.com \
--cc=linux-acpi@vger.kernel.org \
--cc=linux-doc@vger.kernel.org \
--cc=linux-edac@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=linuxarm@huawei.com \
--cc=mchehab@kernel.org \
--cc=naoya.horiguchi@nec.com \
--cc=nifan.cxl@gmail.com \
--cc=pgonda@google.com \
--cc=prime.zeng@hisilicon.com \
--cc=rafael@kernel.org \
--cc=rientjes@google.com \
--cc=roberto.sassu@huawei.com \
--cc=rppt@kernel.org \
--cc=somasundaram.a@hpe.com \
--cc=tanxiaofei@huawei.com \
--cc=tony.luck@intel.com \
--cc=wanghuiqiang@huawei.com \
--cc=wbs@os.amperecomputing.com \
--cc=wschwartz@amperecomputing.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox