From: Jane Chu <jane.chu@oracle.com>
To: "Luck, Tony" <tony.luck@intel.com>,
Borislav Petkov <bp@alien8.de>,
"djwong@kernel.org" <djwong@kernel.org>
Cc: "x86@kernel.org" <x86@kernel.org>,
"linux-fsdevel@vger.kernel.org" <linux-fsdevel@vger.kernel.org>,
"linux-mm@kvack.org" <linux-mm@kvack.org>
Subject: Re: Is it possible to corrupt disk when writeback page with undetected UE?
Date: Fri, 16 Sep 2022 19:26:01 +0000 [thread overview]
Message-ID: <a9b7fce1-691c-6186-ae29-f22c7e30e93a@oracle.com> (raw)
In-Reply-To: <SJ1PR11MB6083C1CBA41CB53183600B0FFC489@SJ1PR11MB6083.namprd11.prod.outlook.com>
On 9/16/2022 9:17 AM, Luck, Tony wrote:
>> Were you using madvise to inject an error to a mmap'ed address?
>> or a different tool? Do you still have the test documented
>> somewhere?
>
> I was injecting with ACPI/EINJ (so tweaking some ECC bits in memory to create
> a real uncorrectable error). This was a long time back when I was just trying to
> get basic recovery from usermode access to poison working reliably. So I just
> noted the workaround ("make; sync; run_test") to keep making progress.
>
> Handling poison in the page cache has been on my TODO list for a long time.
> Someday it will make it to the top.
I see, looking forward to your patches.
>
>> And, aside from verifying every write with a read prior to sync,
>> any suggestion to minimize the window of such corruption?
>
> There's no cheap solution. As you point out the best that can be done
> is to reduce the window (since bits may get flipped after you perform
> your check but before DMS to storage).
Sounds like the disk controller is the last in the chain in terms
of detecting a late UE, so if the disk controller detection could
trickle up to a filesystem level action marking 'file:<offset,len>'
being bad and relate the information to user for repair, that might be
reasonable...
thanks,
-jane
>
> -Tony
>
prev parent reply other threads:[~2022-09-16 19:26 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-09-15 22:36 Jane Chu
2022-09-15 22:50 ` Luck, Tony
2022-09-16 0:26 ` Jane Chu
2022-09-16 0:30 ` Yang Shi
2022-09-16 0:51 ` Jane Chu
2022-09-16 16:17 ` Luck, Tony
2022-09-16 19:26 ` Jane Chu [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=a9b7fce1-691c-6186-ae29-f22c7e30e93a@oracle.com \
--to=jane.chu@oracle.com \
--cc=bp@alien8.de \
--cc=djwong@kernel.org \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=tony.luck@intel.com \
--cc=x86@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox