memcg v1: provide read access to memory.pressure_level - linux.git

diff options

author	Florian Schmidt	2023-04-04 10:58:59 +0000
committer	Andrew Morton	2023-04-18 16:29:52 -0700
commit	6b0ba2abbeede5e1756d54277e811cf2783eb0a8 (patch)
tree	595b0978a9700d7ed41ab22029d241763c6fb934 /mm/madvise.c
parent	a2e17cc2efc72792c0d13d669d824fe9ab7155a1 (diff)

memcg v1: provide read access to memory.pressure_level

cgroups v1 has a unique way of setting up memory pressure notifications: the user opens "memory.pressure_level" of the cgroup they want to monitor for pressure, then open "cgroup.event_control" and write the fd (among other things) to that file. memory.pressure_level has no other use, specifically it does not support any read or write operations. Consequently, no handlers are provided, and cgroup_file_mode() sets the permissions to 000. However, to actually use the mechanism, the subscribing user must have read access to the file and open the fd for reading, see memcg_write_event_control(). This is all fine as long as the subscribing process runs as root and is otherwise unconfined by further restrictions. However, if you add strict access controls such as selinux, the permission bits will be enforced, and opening memory.pressure_level for reading will fail, preventing the process from subscribing, even as root. To work around this issue, introduce a dummy read handler. When memory.pressure_level is created, cgroup_file_mode() will notice the existence of a handler, and therefore add read permissions to the file. Link: https://lkml.kernel.org/r/20230404105900.2005-1-flosch@nutanix.com Signed-off-by: Florian Schmidt <flosch@nutanix.com> Acked-by: Michal Hocko <mhocko@suse.com> Cc: Johannes Weiner <hannes@cmpxchg.org> Cc: Muchun Song <muchun.song@linux.dev> Cc: Roman Gushchin <roman.gushchin@linux.dev> Cc: Shakeel Butt <shakeelb@google.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>

Diffstat (limited to 'mm/madvise.c')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: