* [MAINTAINERS SUMMIT] re-think of richACLs in AI/LLM era @ 2025-09-08 8:33 Coly Li 2025-09-08 10:52 ` Jan Kara 0 siblings, 1 reply; 12+ messages in thread From: Coly Li @ 2025-09-08 8:33 UTC (permalink / raw) To: ksummit Hi folks, This is Coly Li. I’ve been maintaining bcache for a while and have met Linus, Greg, Ted, and other maintainers in person at many conferences. Yes, I am a sustained and reliable kernel developer. Recently, I joined a startup (https://fnnas.com) that provides AI/LLM capabilities for personal or micro-enterprise storage. We help users share and communicate AI/LLM-processed information from their stored data more conveniently. Our users can run highly compact LLMs on their own normal and inexpensive hardware to process photos, videos, and documents using AI. Of course, it’s slow but that’s expected and acceptable. They can even come back to check the results weeks later. In our use case, different people or roles store their personal and sensitive data in the same storage pool, with different access controls granted to AI/LLM processing tasks. When they share specific information or data with others within the same machine or over the internet, the access control hierarchy or rules become highly complicated and impossible to handle with POSIX ACLs. We tried bypassing access control to user space, which worked well except for scalability and performance: - As the number and size of files increase, storing all access control rules in user space memory doesn’t scale—especially on normal machines without huge memory resources. - For some hot data sets (a group of files and directories), checking access control rules in user space and hooking back to the kernel is highly inefficient. Therefore, the RichACL project comes back to mind. Of course, RichACL alone isn’t enough. A high-level policy agent (in user space) is still needed for task/session-oriented access and sharing policy control, but RichACL can help implement file system-level access control. This would give us a context-aware and highly efficient access control implementation. What I’d like to discuss is: - After almost 10 years, should we reconsider RichACL in the AI/LLM era? - What are the major barriers or remaining work needed to get RichACLs into upstream? Since our first public beta was released 13 months ago, we now have over one- million active installations running daily. This is a real workload for RichACL and represents real feature demand from end users. If you’re interested in this topic, we’d be happy to provide more details about the access control requirements in AI workloads and even show a live demo of the use case. Thanks in advance. Coly Li ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: [MAINTAINERS SUMMIT] re-think of richACLs in AI/LLM era 2025-09-08 8:33 [MAINTAINERS SUMMIT] re-think of richACLs in AI/LLM era Coly Li @ 2025-09-08 10:52 ` Jan Kara 2025-09-08 13:47 ` Coly Li 2025-09-08 15:39 ` Steven Rostedt 0 siblings, 2 replies; 12+ messages in thread From: Jan Kara @ 2025-09-08 10:52 UTC (permalink / raw) To: Coly Li; +Cc: ksummit Hi Coly! Maintainers summit is about process issues with the Linux kernel. Your proposal is for a technical discussion so it doesn't really fit there. More fitting would be a Kernel summit track at LPC which is for technical topics. Given your topic is mostly a discussion on a relatively specialized filesystem topic, I think the best venue for this would be LSFMM summit next spring where you get the biggest concentration of people that have something to say to this topic. (OTOH it's going to be in Europe so I understand the location needn't be as convenient for you as LPC). I'm not really on program committee for either a Maintainers summit or LPC so take this just as a friendly advice from a bystander :) Honza On Mon 08-09-25 16:33:54, Coly Li wrote: > This is Coly Li. I’ve been maintaining bcache for a while and have met Linus, > Greg, Ted, and other maintainers in person at many conferences. Yes, I am a > sustained and reliable kernel developer. > > Recently, I joined a startup (https://fnnas.com) that provides AI/LLM > capabilities for personal or micro-enterprise storage. We help users share and > communicate AI/LLM-processed information from their stored data more > conveniently. > > Our users can run highly compact LLMs on their own normal and inexpensive > hardware to process photos, videos, and documents using AI. Of course, it’s slow > but that’s expected and acceptable. They can even come back to check the results > weeks later. > > In our use case, different people or roles store their personal and sensitive > data in the same storage pool, with different access controls granted to AI/LLM > processing tasks. When they share specific information or data with others > within the same machine or over the internet, the access control hierarchy or > rules become highly complicated and impossible to handle with POSIX ACLs. > > We tried bypassing access control to user space, which worked well except for > scalability and performance: > - As the number and size of files increase, storing all access control rules in > user space memory doesn’t scale—especially on normal machines without huge > memory resources. > - For some hot data sets (a group of files and directories), checking access > control rules in user space and hooking back to the kernel is highly > inefficient. > > Therefore, the RichACL project comes back to mind. Of course, RichACL alone > isn’t enough. A high-level policy agent (in user space) is still needed for > task/session-oriented access and sharing policy control, but RichACL can help > implement file system-level access control. This would give us a context-aware > and highly efficient access control implementation. > > What I’d like to discuss is: > - After almost 10 years, should we reconsider RichACL in the AI/LLM era? > - What are the major barriers or remaining work needed to get RichACLs into > upstream? > > Since our first public beta was released 13 months ago, we now have over one- > million active installations running daily. This is a real workload for RichACL > and represents real feature demand from end users. If you’re interested in this > topic, we’d be happy to provide more details about the access control > requirements in AI workloads and even show a live demo of the use case. > > Thanks in advance. > > Coly Li > -- Jan Kara <jack@suse.com> SUSE Labs, CR ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: [MAINTAINERS SUMMIT] re-think of richACLs in AI/LLM era 2025-09-08 10:52 ` Jan Kara @ 2025-09-08 13:47 ` Coly Li 2025-09-08 15:39 ` Steven Rostedt 1 sibling, 0 replies; 12+ messages in thread From: Coly Li @ 2025-09-08 13:47 UTC (permalink / raw) To: Jan Kara; +Cc: ksummit Hi Jan, > 2025年9月8日 18:52,Jan Kara <jack@suse.cz> 写道: > > Hi Coly! > > Maintainers summit is about process issues with the Linux kernel. Your > proposal is for a technical discussion so it doesn't really fit there. More > fitting would be a Kernel summit track at LPC which is for technical Copied. I will resend the proposal by kernel summit track format. > topics. Given your topic is mostly a discussion on a relatively specialized > filesystem topic, I think the best venue for this would be LSFMM summit > next spring where you get the biggest concentration of people that have > something to say to this topic. (OTOH it's going to be in Europe so I > understand the location needn't be as convenient for you as LPC). > Yes this time it is better for both time and location. > I'm not really on program committee for either a Maintainers summit or LPC > so take this just as a friendly advice from a bystander :) > Thanks for your kind suggestion :-) Coly Li > Honza > > On Mon 08-09-25 16:33:54, Coly Li wrote: >> This is Coly Li. I’ve been maintaining bcache for a while and have met Linus, >> Greg, Ted, and other maintainers in person at many conferences. Yes, I am a >> sustained and reliable kernel developer. >> >> Recently, I joined a startup (https://fnnas.com) that provides AI/LLM >> capabilities for personal or micro-enterprise storage. We help users share and >> communicate AI/LLM-processed information from their stored data more >> conveniently. >> >> Our users can run highly compact LLMs on their own normal and inexpensive >> hardware to process photos, videos, and documents using AI. Of course, it’s slow >> but that’s expected and acceptable. They can even come back to check the results >> weeks later. >> >> In our use case, different people or roles store their personal and sensitive >> data in the same storage pool, with different access controls granted to AI/LLM >> processing tasks. When they share specific information or data with others >> within the same machine or over the internet, the access control hierarchy or >> rules become highly complicated and impossible to handle with POSIX ACLs. >> >> We tried bypassing access control to user space, which worked well except for >> scalability and performance: >> - As the number and size of files increase, storing all access control rules in >> user space memory doesn’t scale—especially on normal machines without huge >> memory resources. >> - For some hot data sets (a group of files and directories), checking access >> control rules in user space and hooking back to the kernel is highly >> inefficient. >> >> Therefore, the RichACL project comes back to mind. Of course, RichACL alone >> isn’t enough. A high-level policy agent (in user space) is still needed for >> task/session-oriented access and sharing policy control, but RichACL can help >> implement file system-level access control. This would give us a context-aware >> and highly efficient access control implementation. >> >> What I’d like to discuss is: >> - After almost 10 years, should we reconsider RichACL in the AI/LLM era? >> - What are the major barriers or remaining work needed to get RichACLs into >> upstream? >> >> Since our first public beta was released 13 months ago, we now have over one- >> million active installations running daily. This is a real workload for RichACL >> and represents real feature demand from end users. If you’re interested in this >> topic, we’d be happy to provide more details about the access control >> requirements in AI workloads and even show a live demo of the use case. >> >> Thanks in advance. >> >> Coly Li >> > -- > Jan Kara <jack@suse.com> > SUSE Labs, CR ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: [MAINTAINERS SUMMIT] re-think of richACLs in AI/LLM era 2025-09-08 10:52 ` Jan Kara 2025-09-08 13:47 ` Coly Li @ 2025-09-08 15:39 ` Steven Rostedt 2025-09-08 15:42 ` Coly Li 1 sibling, 1 reply; 12+ messages in thread From: Steven Rostedt @ 2025-09-08 15:39 UTC (permalink / raw) To: Jan Kara; +Cc: Coly Li, ksummit On Mon, 8 Sep 2025 12:52:48 +0200 Jan Kara <jack@suse.cz> wrote: > Hi Coly! > > Maintainers summit is about process issues with the Linux kernel. Your > proposal is for a technical discussion so it doesn't really fit there. More > fitting would be a Kernel summit track at LPC which is for technical > topics. Given your topic is mostly a discussion on a relatively specialized > filesystem topic, I think the best venue for this would be LSFMM summit > next spring where you get the biggest concentration of people that have > something to say to this topic. (OTOH it's going to be in Europe so I > understand the location needn't be as convenient for you as LPC). > > I'm not really on program committee for either a Maintainers summit or LPC > so take this just as a friendly advice from a bystander :) > As Jan has stated, this is a technical topic. But you can send it to this list too. From Ted's original email: Related to the Maintainer's Summit, the Kernel Summit is organized as a track which is run in parallel with the other tracks at the Linux Plumbers Conference (LPC), and is open to all registered attendees of LPC. The goal of the Kernel Summit track will be to provide a forum to discuss specific technical issues that would be easier to resolve in person than over e-mail. The program committee will also consider "information sharing" topics if they are clearly of interest to the wider development community (i.e., advanced training in topics that would be useful to kernel developers). To suggest a topic for the Kernel Summit, please do two things. by September 10th, 2025. First, please send e-mail with a subject prefix of [TECH TOPIC] to ksummit@lists.linux.dev. As before, please use a separate e-mail for each topic. Secondly, please create a topic at the Linux Plumbers Conference proposal submission site and target it to the Kernel Summit track: https://lpc.events/event/19/abstracts Please do both steps. I'll try to notice if someone forgets one or the other, but your chances of making sure your proposal gets the necessary attention and consideration are maximized by submitting both to the mailing list and the web site. The CfP is closing this Wednesday, so please submit it soon. -- Steve ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: [MAINTAINERS SUMMIT] re-think of richACLs in AI/LLM era 2025-09-08 15:39 ` Steven Rostedt @ 2025-09-08 15:42 ` Coly Li 2025-09-08 23:22 ` Randy Dunlap 0 siblings, 1 reply; 12+ messages in thread From: Coly Li @ 2025-09-08 15:42 UTC (permalink / raw) To: Steven Rostedt; +Cc: Jan Kara, ksummit > 2025年9月8日 23:39,Steven Rostedt <rostedt@goodmis.org> 写道: > > On Mon, 8 Sep 2025 12:52:48 +0200 > Jan Kara <jack@suse.cz> wrote: > >> Hi Coly! >> >> Maintainers summit is about process issues with the Linux kernel. Your >> proposal is for a technical discussion so it doesn't really fit there. More >> fitting would be a Kernel summit track at LPC which is for technical >> topics. Given your topic is mostly a discussion on a relatively specialized >> filesystem topic, I think the best venue for this would be LSFMM summit >> next spring where you get the biggest concentration of people that have >> something to say to this topic. (OTOH it's going to be in Europe so I >> understand the location needn't be as convenient for you as LPC). >> >> I'm not really on program committee for either a Maintainers summit or LPC >> so take this just as a friendly advice from a bystander :) >> > > As Jan has stated, this is a technical topic. But you can send it to this > list too. From Ted's original email: > > Related to the Maintainer's Summit, the Kernel Summit is organized as > a track which is run in parallel with the other tracks at the Linux > Plumbers Conference (LPC), and is open to all registered attendees of > LPC. The goal of the Kernel Summit track will be to provide a forum > to discuss specific technical issues that would be easier to resolve > in person than over e-mail. The program committee will also consider > "information sharing" topics if they are clearly of interest to the > wider development community (i.e., advanced training in topics that > would be useful to kernel developers). > > To suggest a topic for the Kernel Summit, please do two things. by > September 10th, 2025. First, please send e-mail with a subject prefix of > [TECH TOPIC] to ksummit@lists.linux.dev. As before, please use a separate > e-mail for each topic. > > Secondly, please create a topic at the Linux Plumbers Conference > proposal submission site and target it to the Kernel Summit track: > > https://lpc.events/event/19/abstracts > > Please do both steps. I'll try to notice if someone forgets one or > the other, but your chances of making sure your proposal gets the > necessary attention and consideration are maximized by submitting both > to the mailing list and the web site. > > > The CfP is closing this Wednesday, so please submit it soon. Yes, I already submitted the CfP on https://lpc.events/event/19/abstracts and got the confirmation email. Also I resend another proposal email subjected with [TECH TOPIC] and almost identical message body. Thanks for the hint! Coly Li ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: [MAINTAINERS SUMMIT] re-think of richACLs in AI/LLM era 2025-09-08 15:42 ` Coly Li @ 2025-09-08 23:22 ` Randy Dunlap 2025-09-09 1:03 ` Paul Moore 0 siblings, 1 reply; 12+ messages in thread From: Randy Dunlap @ 2025-09-08 23:22 UTC (permalink / raw) To: Coly Li, Steven Rostedt; +Cc: Jan Kara, ksummit Hi, On 9/8/25 8:42 AM, Coly Li wrote: >> 2025年9月8日 23:39,Steven Rostedt <rostedt@goodmis.org> 写道: >> >>> >>> I'm not really on program committee for either a Maintainers summit or LPC >>> so take this just as a friendly advice from a bystander :) >>> >> >> As Jan has stated, this is a technical topic. But you can send it to this >> list too. From Ted's original email: >> >> Related to the Maintainer's Summit, the Kernel Summit is organized as >> a track which is run in parallel with the other tracks at the Linux >> Plumbers Conference (LPC), and is open to all registered attendees of >> LPC. The goal of the Kernel Summit track will be to provide a forum >> to discuss specific technical issues that would be easier to resolve >> in person than over e-mail. The program committee will also consider >> "information sharing" topics if they are clearly of interest to the >> wider development community (i.e., advanced training in topics that >> would be useful to kernel developers). >> >> To suggest a topic for the Kernel Summit, please do two things. by >> September 10th, 2025. First, please send e-mail with a subject prefix of >> [TECH TOPIC] to ksummit@lists.linux.dev. As before, please use a separate >> e-mail for each topic. >> >> Secondly, please create a topic at the Linux Plumbers Conference >> proposal submission site and target it to the Kernel Summit track: >> >> https://lpc.events/event/19/abstracts >> >> Please do both steps. I'll try to notice if someone forgets one or >> the other, but your chances of making sure your proposal gets the >> necessary attention and consideration are maximized by submitting both >> to the mailing list and the web site. >> >> >> The CfP is closing this Wednesday, so please submit it soon. > > Yes, I already submitted the CfP on https://lpc.events/event/19/abstracts and got the confirmation email. > > Also I resend another proposal email subjected with [TECH TOPIC] and almost identical message body. > > Thanks for the hint! Hi, I don't object to such a topic, but I thought that LSMs were the wave of the future for extended attributes or special capability handling. ? -- ~Randy ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: [MAINTAINERS SUMMIT] re-think of richACLs in AI/LLM era 2025-09-08 23:22 ` Randy Dunlap @ 2025-09-09 1:03 ` Paul Moore 2025-09-10 13:32 ` Coly Li 0 siblings, 1 reply; 12+ messages in thread From: Paul Moore @ 2025-09-09 1:03 UTC (permalink / raw) To: Randy Dunlap; +Cc: Coly Li, Steven Rostedt, Jan Kara, ksummit On Mon, Sep 8, 2025 at 7:23 PM Randy Dunlap <rdunlap@infradead.org> wrote: > On 9/8/25 8:42 AM, Coly Li wrote: > >> 2025年9月8日 23:39,Steven Rostedt <rostedt@goodmis.org> 写道: > >> > >>> > >>> I'm not really on program committee for either a Maintainers summit or LPC > >>> so take this just as a friendly advice from a bystander :) > >>> > >> > >> As Jan has stated, this is a technical topic. But you can send it to this > >> list too. From Ted's original email: > >> > >> Related to the Maintainer's Summit, the Kernel Summit is organized as > >> a track which is run in parallel with the other tracks at the Linux > >> Plumbers Conference (LPC), and is open to all registered attendees of > >> LPC. The goal of the Kernel Summit track will be to provide a forum > >> to discuss specific technical issues that would be easier to resolve > >> in person than over e-mail. The program committee will also consider > >> "information sharing" topics if they are clearly of interest to the > >> wider development community (i.e., advanced training in topics that > >> would be useful to kernel developers). > >> > >> To suggest a topic for the Kernel Summit, please do two things. by > >> September 10th, 2025. First, please send e-mail with a subject prefix of > >> [TECH TOPIC] to ksummit@lists.linux.dev. As before, please use a separate > >> e-mail for each topic. > >> > >> Secondly, please create a topic at the Linux Plumbers Conference > >> proposal submission site and target it to the Kernel Summit track: > >> > >> https://lpc.events/event/19/abstracts > >> > >> Please do both steps. I'll try to notice if someone forgets one or > >> the other, but your chances of making sure your proposal gets the > >> necessary attention and consideration are maximized by submitting both > >> to the mailing list and the web site. > >> > >> > >> The CfP is closing this Wednesday, so please submit it soon. > > > > Yes, I already submitted the CfP on https://lpc.events/event/19/abstracts and got the confirmation email. > > > > Also I resend another proposal email subjected with [TECH TOPIC] and almost identical message body. > > > > Thanks for the hint! > > Hi, > I don't object to such a topic, but I thought that LSMs were the wave > of the future for extended attributes or special capability handling. ? I can't say I'm familiar with the RichACL concept, but generally speaking yes, the LSM framework exists as a way to implement access control mechanisms beyond the traditional Linux access controls (other things too, but those aren't really relevant here). -- paul-moore.com ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: [MAINTAINERS SUMMIT] re-think of richACLs in AI/LLM era 2025-09-09 1:03 ` Paul Moore @ 2025-09-10 13:32 ` Coly Li 2025-09-10 19:11 ` Paul Moore 0 siblings, 1 reply; 12+ messages in thread From: Coly Li @ 2025-09-10 13:32 UTC (permalink / raw) To: Paul Moore; +Cc: Randy Dunlap, Steven Rostedt, Jan Kara, ksummit On Mon, Sep 08, 2025 at 09:03:24PM +0800, Paul Moore wrote: > On Mon, Sep 8, 2025 at 7:23 PM Randy Dunlap <rdunlap@infradead.org> wrote: > > On 9/8/25 8:42 AM, Coly Li wrote: > > >> 2025年9月8日 23:39,Steven Rostedt <rostedt@goodmis.org> 写道: > > >> > > >>> > > >>> I'm not really on program committee for either a Maintainers summit or LPC > > >>> so take this just as a friendly advice from a bystander :) > > >>> > > >> > > >> As Jan has stated, this is a technical topic. But you can send it to this > > >> list too. From Ted's original email: > > >> > > >> Related to the Maintainer's Summit, the Kernel Summit is organized as > > >> a track which is run in parallel with the other tracks at the Linux > > >> Plumbers Conference (LPC), and is open to all registered attendees of > > >> LPC. The goal of the Kernel Summit track will be to provide a forum > > >> to discuss specific technical issues that would be easier to resolve > > >> in person than over e-mail. The program committee will also consider > > >> "information sharing" topics if they are clearly of interest to the > > >> wider development community (i.e., advanced training in topics that > > >> would be useful to kernel developers). > > >> > > >> To suggest a topic for the Kernel Summit, please do two things. by > > >> September 10th, 2025. First, please send e-mail with a subject prefix of > > >> [TECH TOPIC] to ksummit@lists.linux.dev. As before, please use a separate > > >> e-mail for each topic. > > >> > > >> Secondly, please create a topic at the Linux Plumbers Conference > > >> proposal submission site and target it to the Kernel Summit track: > > >> > > >> https://lpc.events/event/19/abstracts > > >> > > >> Please do both steps. I'll try to notice if someone forgets one or > > >> the other, but your chances of making sure your proposal gets the > > >> necessary attention and consideration are maximized by submitting both > > >> to the mailing list and the web site. > > >> > > >> > > >> The CfP is closing this Wednesday, so please submit it soon. > > > > > > Yes, I already submitted the CfP on https://lpc.events/event/19/abstracts and got the confirmation email. > > > > > > Also I resend another proposal email subjected with [TECH TOPIC] and almost identical message body. > > > > > > Thanks for the hint! > > Hi Paul and Randy, Thank you for suggesting LSM as another choice. > > Hi, > > I don't object to such a topic, but I thought that LSMs were the wave > > of the future for extended attributes or special capability handling. ? > > I can't say I'm familiar with the RichACL concept, but generally > speaking yes, the LSM framework exists as a way to implement access > control mechanisms beyond the traditional Linux access controls (other > things too, but those aren't really relevant here). Is it convenient for normal users or non-root processes (including the policy agent) to setup the LSM rules? We need to allow normal users to set their own access control policy for the data they owned. Also there is a special case, e.g. a file’s parent directory doesn’t allow write permission, but the file needs to grant delete permission. 4614 int do_unlinkat(int dfd, struct filename *name) 4615 { [snipped] 4645 inode = dentry->d_inode; 4646 ihold(inode); 4647 error = security_path_unlink(&path, dentry); 4648 if (error) 4649 goto exit3; 4650 error = vfs_unlink(mnt_idmap(path.mnt), path.dentry->d_inode, 4651 dentry, &delegated_inode); [snipped] 4563 int vfs_unlink(struct mnt_idmap *idmap, struct inode *dir, 4564 struct dentry *dentry, struct inode **delegated_inode) 4565 { 4566 struct inode *target = dentry->d_inode; 4567 int error = may_delete(idmap, dir, dentry, 0); 4568 4569 if (error) 4570 return error; [snipped] It seems in do_unlinkat() the security check security_path_unlink() is called before calling may_delete() inside vfs_unlink(). So even security rule permits to delete this file but the parent directory doesn't grant write permission, such access control still cannot be archieved. We are open to any possible method to serve the complicated access controls in current AI/LLM driven workloads. And the executive team decide to invest a few engineers and time to work with other community deveopers on the promissing technial choice, and help to make the effort to go into upstream kernel. Therefore we do want to hear all your voices and advices, to make sure the effort will be on the correct tack. Thanks again for your replies. Coly Li ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: [MAINTAINERS SUMMIT] re-think of richACLs in AI/LLM era 2025-09-10 13:32 ` Coly Li @ 2025-09-10 19:11 ` Paul Moore 2025-09-16 17:12 ` Coly Li 0 siblings, 1 reply; 12+ messages in thread From: Paul Moore @ 2025-09-10 19:11 UTC (permalink / raw) To: Coly Li; +Cc: Randy Dunlap, Steven Rostedt, Jan Kara, ksummit On Wed, Sep 10, 2025 at 9:32 AM Coly Li <colyli@fnnas.com> wrote: > On Mon, Sep 08, 2025 at 09:03:24PM +0800, Paul Moore wrote: ... > > I can't say I'm familiar with the RichACL concept, but generally > > speaking yes, the LSM framework exists as a way to implement access > > control mechanisms beyond the traditional Linux access controls (other > > things too, but those aren't really relevant here). > > Is it convenient for normal users or non-root processes (including the policy agent) to > setup the LSM rules? We need to allow normal users to set their own access control policy > for the data they owned. Management of an individual LSM's configuration is generally left to the individual LSM. Some LSMs restrict their configuration knobs behind capabilities or their own access controls, while others allow unprivileged access to the configuration; it depends on the LSM's security model. As an unprivileged example, Landlock allows applications, run by arbitrary users, to set their own Landlock security policy via the Landlock API. > Also there is a special case, e.g. a file’s parent directory doesn’t allow write > permission, but the file needs to grant delete permission. > 4614 int do_unlinkat(int dfd, struct filename *name) > 4615 { > [snipped] > 4645 inode = dentry->d_inode; > 4646 ihold(inode); > 4647 error = security_path_unlink(&path, dentry); > 4648 if (error) > 4649 goto exit3; > 4650 error = vfs_unlink(mnt_idmap(path.mnt), path.dentry->d_inode, > 4651 dentry, &delegated_inode); > [snipped] > 4563 int vfs_unlink(struct mnt_idmap *idmap, struct inode *dir, > 4564 struct dentry *dentry, struct inode **delegated_inode) > 4565 { > 4566 struct inode *target = dentry->d_inode; > 4567 int error = may_delete(idmap, dir, dentry, 0); > 4568 > 4569 if (error) > 4570 return error; > [snipped] > > It seems in do_unlinkat() the security check security_path_unlink() is called before > calling may_delete() inside vfs_unlink(). So even security rule permits to delete > this file but the parent directory doesn't grant write permission, such access control > still cannot be archieved. One of the important parts of the LSM framework as a whole is that LSMs can not grant access that would otherwise be blocked by the standard/discretionary access controls built into the Linux kernel; in other words, LSMs can only say "no" to an access, they can not grant access by themselves. Yes, this is by design, and no, I see no reason to change that design decision at this point in time (doing so would require a tremendous amount of work and likely introduce a fair number of security regressions for quite some time). -- paul-moore.com ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: [MAINTAINERS SUMMIT] re-think of richACLs in AI/LLM era 2025-09-10 19:11 ` Paul Moore @ 2025-09-16 17:12 ` Coly Li 2025-09-16 18:07 ` Randy Dunlap 2025-09-17 7:59 ` Jan Kara 0 siblings, 2 replies; 12+ messages in thread From: Coly Li @ 2025-09-16 17:12 UTC (permalink / raw) To: Paul Moore; +Cc: Randy Dunlap, Steven Rostedt, Jan Kara, ksummit On Wed, Sep 10, 2025 at 03:11:55PM +0800, Paul Moore wrote: > On Wed, Sep 10, 2025 at 9:32 AM Coly Li <colyli@fnnas.com> wrote: > > On Mon, Sep 08, 2025 at 09:03:24PM +0800, Paul Moore wrote: > > ... > > > > I can't say I'm familiar with the RichACL concept, but generally > > > speaking yes, the LSM framework exists as a way to implement access > > > control mechanisms beyond the traditional Linux access controls (other > > > things too, but those aren't really relevant here). > > > > Is it convenient for normal users or non-root processes (including the policy agent) to > > setup the LSM rules? We need to allow normal users to set their own access control policy > > for the data they owned. > > Management of an individual LSM's configuration is generally left to > the individual LSM. Some LSMs restrict their configuration knobs > behind capabilities or their own access controls, while others allow > unprivileged access to the configuration; it depends on the LSM's > security model. As an unprivileged example, Landlock allows > applications, run by arbitrary users, to set their own Landlock > security policy via the Landlock API. > I forward the suggestion to our application team, and they try to evaluate and replied the result. Currently they are using bpf to do the access control rules checking, and LSM access control method e.g. Landlock is quite similar to a rules based control method. They still need to persist all the rules on disk, and load the rules during system initlization time. When the number of rules increased, the maintennance is complicated and slow. Also the application team give me a use case, ask how to achieve the access control effiently. Let me describe by the followed text. Users store they photos on the system, and the compact AI module processes all their photos and groups all the photos into different categories like pizza, dogs, cats, foods or group photos. After the process done, users may see they photos in different categories that the AI module thinks they should be in. Then users may share the categories with photos to others. If indentical categories shared by different users, the shared photos can be combined all together. And AI module may continue to process the shared photos and generate new categaries from the shared photos, e.g. pizza in the same city, cats and dogs in closed location, group photos contains the most common people, etc. Now the differet categories are implemented by different directories in the publicly shared directory. In each category directory, photos with a category (or attribution) can be accessed as hard links to the original photo inodes and share the identical inodes. All these category directories are created by the AI module, although the photos are shared from each users. If a user is identified from a group photo, and this user is noticed that the photo is publicly shared. If this user doesn't want his face to be shared in public, for an optinal privacy protection right, this user can remove the hardlink of the photo which his or her face is in, that is he or she can remove the hardlink (dentry) under a publicly shared directory which this user doesn't have write permission. Because this user can be idnetified as owner of his or her face, and the photo has his face in, he or she should have write permission to delete the photo, but no write permission to other photos in same category directioy which his or her face is not in. The above example is one of the simple case just for photos processing and sharing in the AI context. The rules of access control are created or destoried dynamically and maybe only exist for a short period. And the number of rules are quite large. Current rules based access control is inefficient and complicated to implement for the above simple case, and the application team replies they don't see the rules based LSM method can make it be more simple. > > Also there is a special case, e.g. a file’s parent directory doesn’t allow write > > permission, but the file needs to grant delete permission. > > 4614 int do_unlinkat(int dfd, struct filename *name) > > 4615 { > > [snipped] > > 4645 inode = dentry->d_inode; > > 4646 ihold(inode); > > 4647 error = security_path_unlink(&path, dentry); > > 4648 if (error) > > 4649 goto exit3; > > 4650 error = vfs_unlink(mnt_idmap(path.mnt), path.dentry->d_inode, > > 4651 dentry, &delegated_inode); > > [snipped] > > 4563 int vfs_unlink(struct mnt_idmap *idmap, struct inode *dir, > > 4564 struct dentry *dentry, struct inode **delegated_inode) > > 4565 { > > 4566 struct inode *target = dentry->d_inode; > > 4567 int error = may_delete(idmap, dir, dentry, 0); > > 4568 > > 4569 if (error) > > 4570 return error; > > [snipped] > > > > It seems in do_unlinkat() the security check security_path_unlink() is called before > > calling may_delete() inside vfs_unlink(). So even security rule permits to delete > > this file but the parent directory doesn't grant write permission, such access control > > still cannot be archieved. > > One of the important parts of the LSM framework as a whole is that > LSMs can not grant access that would otherwise be blocked by the > standard/discretionary access controls built into the Linux kernel; in > other words, LSMs can only say "no" to an access, they can not grant > access by themselves. Yes, this is by design, and no, I see no reason > to change that design decision at this point in time (doing so would > require a tremendous amount of work and likely introduce a fair number > of security regressions for quite some time). I understand and agree the concern of security. But the reality is more and more similar or relative access control requirements will sprint up from the AI/LLM applications and use cases. We just want to solve the access control challenge, and the LSM rules based methods are not easy for application developers. Forgive me for the late response, because I wait for the feed back from our application development teams. Thanks. Coly Li ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: [MAINTAINERS SUMMIT] re-think of richACLs in AI/LLM era 2025-09-16 17:12 ` Coly Li @ 2025-09-16 18:07 ` Randy Dunlap 2025-09-17 7:59 ` Jan Kara 1 sibling, 0 replies; 12+ messages in thread From: Randy Dunlap @ 2025-09-16 18:07 UTC (permalink / raw) To: Coly Li, Paul Moore; +Cc: Steven Rostedt, Jan Kara, ksummit Hi Coly, On 9/16/25 10:12 AM, Coly Li wrote: > On Wed, Sep 10, 2025 at 03:11:55PM +0800, Paul Moore wrote: >> On Wed, Sep 10, 2025 at 9:32 AM Coly Li <colyli@fnnas.com> wrote: >>> On Mon, Sep 08, 2025 at 09:03:24PM +0800, Paul Moore wrote: >> >> ... >> >>>> I can't say I'm familiar with the RichACL concept, but generally >>>> speaking yes, the LSM framework exists as a way to implement access >>>> control mechanisms beyond the traditional Linux access controls (other >>>> things too, but those aren't really relevant here). >>> >>> Is it convenient for normal users or non-root processes (including the policy agent) to >>> setup the LSM rules? We need to allow normal users to set their own access control policy >>> for the data they owned. >> >> Management of an individual LSM's configuration is generally left to >> the individual LSM. Some LSMs restrict their configuration knobs >> behind capabilities or their own access controls, while others allow >> unprivileged access to the configuration; it depends on the LSM's >> security model. As an unprivileged example, Landlock allows >> applications, run by arbitrary users, to set their own Landlock >> security policy via the Landlock API. >> > > I forward the suggestion to our application team, and they try to evaluate > and replied the result. > > Currently they are using bpf to do the access control rules checking, and > LSM access control method e.g. Landlock is quite similar to a rules based > control method. They still need to persist all the rules on disk, and load > the rules during system initlization time. When the number of rules increased, > the maintennance is complicated and slow. > > Also the application team give me a use case, ask how to achieve the access > control effiently. Let me describe by the followed text. > > Users store they photos on the system, and the compact AI module processes all > their photos and groups all the photos into different categories like pizza, > dogs, cats, foods or group photos. After the process done, users may see they > photos in different categories that the AI module thinks they should be in. Then > users may share the categories with photos to others. If indentical categories > shared by different users, the shared photos can be combined all together. And > AI module may continue to process the shared photos and generate new categaries > from the shared photos, e.g. pizza in the same city, cats and dogs in closed > location, group photos contains the most common people, etc. Now the differet > categories are implemented by different directories in the publicly shared > directory. > > In each category directory, photos with a category (or attribution) can be > accessed as hard links to the original photo inodes and share the identical > inodes. All these category directories are created by the AI module, although > the photos are shared from each users. If a user is identified from a group > photo, and this user is noticed that the photo is publicly shared. If this user > doesn't want his face to be shared in public, for an optinal privacy protection > right, this user can remove the hardlink of the photo which his or her face is > in, that is he or she can remove the hardlink (dentry) under a publicly shared > directory which this user doesn't have write permission. Because this user can > be idnetified as owner of his or her face, and the photo has his face in, he or > she should have write permission to delete the photo, but no write permission to > other photos in same category directioy which his or her face is not in. What permissions/rights does the AI module have such that it can create a file in shared/photos/faces from my personal files? The shared file with my photo is still owned by the sharing user, correct? What are the permissions/owner of the parent directory? What permissions are required in my personal photos directory to allow files there to be shared? It sounds to me like the AI was aggressive in sharing; now you (the user) want to correct/fix that. The AI could be trained better, but that's too difficult. (just playing devil's advocate there) I haven't looked at this in any detail, but I'm wondering if an intermediate directory level with "my" permissions/ownership would allow a fix for this issue. Then I (the user) could remove the file (the hardlink) from the intermediate directory. However, that might leave some dangling link in the final shared directory. I don't know enough about it to answer that. > The above example is one of the simple case just for photos processing and > sharing in the AI context. The rules of access control are created or destoried > dynamically and maybe only exist for a short period. And the number of rules are > quite large. > > Current rules based access control is inefficient and complicated to implement > for the above simple case, and the application team replies they don't see the > rules based LSM method can make it be more simple. [snip] >> One of the important parts of the LSM framework as a whole is that >> LSMs can not grant access that would otherwise be blocked by the >> standard/discretionary access controls built into the Linux kernel; in >> other words, LSMs can only say "no" to an access, they can not grant >> access by themselves. Yes, this is by design, and no, I see no reason >> to change that design decision at this point in time (doing so would >> require a tremendous amount of work and likely introduce a fair number >> of security regressions for quite some time). > > I understand and agree the concern of security. But the reality is more and more > similar or relative access control requirements will sprint up from the AI/LLM > applications and use cases. We just want to solve the access control challenge, > and the LSM rules based methods are not easy for application developers. Do you have control over the AI that does this photo directory magic? Anyway, it sounds like a reasonable topic for the kernel summit (as a Tech Topic) instead of the maintainer's summit (process stuff). -- ~Randy ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: [MAINTAINERS SUMMIT] re-think of richACLs in AI/LLM era 2025-09-16 17:12 ` Coly Li 2025-09-16 18:07 ` Randy Dunlap @ 2025-09-17 7:59 ` Jan Kara 1 sibling, 0 replies; 12+ messages in thread From: Jan Kara @ 2025-09-17 7:59 UTC (permalink / raw) To: Coly Li; +Cc: Paul Moore, Randy Dunlap, Steven Rostedt, Jan Kara, ksummit On Wed 17-09-25 01:12:48, Coly Li wrote: > Users store they photos on the system, and the compact AI module processes all > their photos and groups all the photos into different categories like pizza, > dogs, cats, foods or group photos. After the process done, users may see they > photos in different categories that the AI module thinks they should be in. Then > users may share the categories with photos to others. If indentical categories > shared by different users, the shared photos can be combined all together. And > AI module may continue to process the shared photos and generate new categaries > from the shared photos, e.g. pizza in the same city, cats and dogs in closed > location, group photos contains the most common people, etc. Now the differet > categories are implemented by different directories in the publicly shared > directory. > > In each category directory, photos with a category (or attribution) can be > accessed as hard links to the original photo inodes and share the identical > inodes. All these category directories are created by the AI module, although > the photos are shared from each users. If a user is identified from a group > photo, and this user is noticed that the photo is publicly shared. If this user > doesn't want his face to be shared in public, for an optinal privacy protection > right, this user can remove the hardlink of the photo which his or her face is > in, that is he or she can remove the hardlink (dentry) under a publicly shared > directory which this user doesn't have write permission. Because this user can > be idnetified as owner of his or her face, and the photo has his face in, he or > she should have write permission to delete the photo, but no write permission to > other photos in same category directioy which his or her face is not in. Well, from what you describe I'd say that the category directories should just be AI owned rwxrwxrwt dirs (do notice the sticky bit set). This is how /tmp/ is usually setup. This means that everybody can read the dir, everybody can delete files but only if they are their owner, everybody can create files - this is the part you probably don't want but *that* is pretty easy to restrict by a LSM (practically any one can do this). Honza -- Jan Kara <jack@suse.com> SUSE Labs, CR ^ permalink raw reply [flat|nested] 12+ messages in thread
end of thread, other threads:[~2025-09-17 7:59 UTC | newest] Thread overview: 12+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2025-09-08 8:33 [MAINTAINERS SUMMIT] re-think of richACLs in AI/LLM era Coly Li 2025-09-08 10:52 ` Jan Kara 2025-09-08 13:47 ` Coly Li 2025-09-08 15:39 ` Steven Rostedt 2025-09-08 15:42 ` Coly Li 2025-09-08 23:22 ` Randy Dunlap 2025-09-09 1:03 ` Paul Moore 2025-09-10 13:32 ` Coly Li 2025-09-10 19:11 ` Paul Moore 2025-09-16 17:12 ` Coly Li 2025-09-16 18:07 ` Randy Dunlap 2025-09-17 7:59 ` Jan Kara
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox