ostree-finalize-staged fails with SELinux error
remember this blast from the past?
I originally had that problem on my Bazzite rig... which I've let go idle (it dual boots) because I couldn't solve it, and it seems I need to reinstall.
But now... I've run into it on a uCore(bOS) system! 😦
@M2 we should probably thread this as I'm now more motivated to try to understand this... it shouldn't be happening
Yay....
1) i really want a way to recover when this happens
Again for mine it was due to SELinux labels no longer existing in a new image
2) i'd like to prevent it from happening... and so far, it's been on my bOS images, so i guess I stopped installing something which had installed SELinux label configs?
Likely. For me specifically the cause was the following.
I had swtpm installed. I then mutated the file itself to fix the SELinux label.
Then I tried to remove swtpm from my image, but it couldn't finalize the deployment
I then readded swtpm back to the image and that seemed to fix it
I haven't removed swtpm since
I suppose this is one of the things composefs is likely trying to fix.
I also haven't had this happen since I started rechunking my images, but again my problem package of swtpm has been on every single image of mine since then
hmm... I'm rechunking, so I don't think that helps
ugh, once again, it doesn't tell me why
just bringing this link here for easy reference: https://discussion.fedoraproject.org/t/ostree-finalize-staged-fails-with-an-selinux-error/97734/8
My reading of this is that it's something to do with /usr/etc merge
i got busy with work, but thanks for that link too... i'm going to have to crack this nut eventually
i too have come across this.
oddly enough its with my ucore image. one system updated to that image with no problem, the other did not..
gives some output
to much to paste in here
good, we've both hit it on ucore and i've hit it on bazzite and m2 hit it on bluefin, so it's generic to ostree/container-native probably
and I realized I have this happening on a VM, so i've cloned the disk and I'm going to use that as a place to test/troubleshoot in a retryable manner
ya definitely an odd error
oh, that's interesting
granted those steps didn't do shit, just posting what i am finding
and of course this is happening on my supermicro board system.. so reboots are slllooooooooooooww
rpm -V selinux-policy selinux-policy-targeted
shows a ton of missing policies
eh nevermind, same as on my system that is ok. hmm
GitHub - hhd-dev/rechunk
maybe something with the rechunking that is causing this?
interesting, second system is now doing it as well.
M2 had a situation where he thought rechunking actually HELPED the problem
On my test VM, I just attempted a
bootc update
rebooted and got the error. Reverted to a snapshot I took specifically to be able to restore to the broken state and...
Yes. I can reproduce the issue with a revert.
This is all the time I can give at this moment, but it's progress to not just have a place to try to fix, but also to be able to reproduce.
@jmac are you rechunking your ucore image?
i was, i just disabled it in hopes of it offering a solution
so on one server with issues
i reverted to an old build
and just updated
that server is fixed
hmm i cannot for the life of me get the other system fixed though
ok i was able to get the other server up. i rolled back again and that seemed to work
really odd
im going to setup a vm to see if i can repeat this
going to reenable rechunking on my builds and see what happens
i cannot get it to error out again
Yeah. That’s why I got a reproducible place for the bug.
I’ve got some time to test today.
I’ll try disabling rechunking also
i cant say for sure if that is what fixed my issue
like i said, i rolled back and snagged a newer version
both seemed to be stuck on an early march version, so who knows how long the issue was persisting prior to me noticing i didn't have an update
i only found it because i was trying to get the 389 package in and ran across the issue.
i can't seem to even build my images when i disable rechunker
there's a lot of stuff going on in the Justfile which seems dependent on previous steps
You shouldn't need the rechunker. But if you are not using the rechunker, you need to not use the load images step
And not build the images as root
I'm assuming you are using my justfile
Yep. I figured out the not using rechunker or load images steps
It took me a while to realize to not use root
Then I still think there was a bug, but I updated from your latest Justfile changes. I think it’s better. Need to go verify my test
Nope I get this error https://github.com/bsherman/bos/actions/runs/14048060735/job/39333054037#step:7:1
error: Recipe get-tags could not be run because of an IO error while trying to create a temporary directory or write a file to that directory: Permission denied (os error 13) at path "/run/user/1001/just/just-dd8nhV"
Maybe I’m still doing sudo/root in the workflow
I would definitely say so
been working on this in between other things today splitting focus is not my strong suit 🙂
@M2 why this?
https://github.com/bsherman/bos/blob/main/Justfile#L185-L189
that seems odd
Oh I would remove the base image after building. Thoughts were to always pull the base image for builds + on the github action remove the base image once the build was complete
so that there would be space for rechunk
You'll also notice that the unrechunked image is removed here
https://github.com/bsherman/bos/blob/c612b7436deae17c265a766044236f887e4c2fbc/Justfile#L250
after you create the tree
right, but if I'm not rechunking...
If you're not rechunking.... then removing the base image doesn't really do anything
right
it removes the ref. But your build is just another layer on top of it, so removing it does nothing since your local image is built on top
i got a green build now, i'm good
bleh, no... it doesn't push
https://github.com/bsherman/bos/actions/runs/14048743564/job/39335032959#step:6:90
You only tag the build with the name and not the name + version.
bOS Build Server · bsherman/bos@c612b74
You probably should generate the tags and tag your original build with all of the tags
https://github.com/ublue-os/main/pull/770/files
This does that for main
chore: cleanup justfile by m2Giles · Pull Request #770 · ublue-os...
Gets this working again. Does not have an ISO builder yet as we are between several options right now. This also has most of the stuff necessary to convert the github workflow to use the justfile f...
sorry, i didn't get these notifications, but i've got it working; i realized what was missing
i was trying to minimize changes so i could easily switch back to rechunking
ok, so rechunked images are definitely smaller
but whatever
for my personal use
now to see if my ostree-selinux issue is resolved by not rechunking
nope removal of rechunker did NOT fix it
and rolling back to an image from december 31 and then upgrading... does not fix
how did you see this?
when you say this "reverted" did you "upgrade" (rpm-ostree rebase/bootc switch) to an old image, or just do a rollback?
i'm digging into the C code for ostree finalize-staged/sysroot-deploy
under the hood, it seems to be doing a bwrap exec of
semodule -N --refresh
in the staged ostree dir
hmm... come to think of it, i think i'm in the wrong section of code
yeah, what i'm seeing we aren't even hitting yet
ok, yeah this looks more correct...
it tries to do an ostree_sepolicy_restorecon
this will only last a day, but it's a diff of the changes i'm trying to apply, nothing stands out to me in removed package list
https://paste.centos.org/view/90e491a7
@Kyle Gospo just for context of what I was telling you about on our call the other night, this is the issue I'm trying to solve on a work VM server running ucore and on my bazzite desktop
I did just re read https://discussion.fedoraproject.org/t/ostree-finalize-staged-fails-with-an-selinux-error/97734/4 and the solution to Jean-Baptiste's issue was rebasing to upstream (sericea in his case) then back to custom image.
So... I'm going to try that. well, something like that.
no dice, i can't even go back to upstream coreos
I tried setting up bwrap to enter the staged deployment dir
but i am a bit lost, i end up in a context where sestatus
thinks that selinux is disabled, so I can't even try to restorecon etc
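roughly what i was attempting, for the record (stateroot/checksum path is a placeholder, and i'm surely missing some mounts):
# illustrative sketch only: get a shell inside the staged deployment root
sudo bwrap --bind /ostree/deploy/<stateroot>/deploy/<checksum>.0 / \
  --dev /dev --proc /proc --ro-bind /sys /sys \
  --bind /etc/selinux /etc/selinux \
  /bin/bash
# inside that shell, sestatus reports selinux disabled, so restorecon won't do anything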
and this is where i landed in the C code https://github.com/ostreedev/ostree/blob/4154366766fc910f54f4be5b0d816deaaade2379/src/libostree/ostree-sepolicy.c#L614
but not really getting anywhere
hmm... same thing if i do a naive chroot, something is odd, and i'm missing it
oh!
this ucore system has composefs
is that significant?
no wonder things seem a bit different than last time i dug deep
/me sigh
i was hoping that disabling composefs might at least change behavior... but no
i heard selinux so I popped in, but it seems yall are in deep
at any point did you have an AVC
if there is an AVC, it happens at shutdown and is not logged... but i can manually run the finalize operation and don't get one
see this output for what i get
https://discord.com/channels/1072614816579063828/1352341718942482615/1352349176775839746
not logged
it should be written to disk and readable on next boot? hmm, did you have anything layered? also, what triggered the issue, or do we not know that yet
also was this happening before policy 41.33

yep, I understand. I don't think one is happening... Based on reading the ostree code, and M2's experience, I think that there's a missing label, and it's trying to relabel/restore contexts, and it fails.
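for the record, this is how i've been looking for AVCs and coming up empty (assuming the records would even survive the shutdown):
# check for denials in the current boot and the previous (failed-finalize) boot
sudo ausearch -m avc -ts boot
sudo journalctl -b -1 --grep=avc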
no idea
I can boot to my bazzite machine to check but i'm sure it's not going to have the same versions of that package changing
right
i can reproduce this in that i have a VM disk image with a snapshot I pulled out of my work server
i guess im more asking, when did this first start occurring
so even if we destroy the disk or fix it
because there are a lot of selinux package changes in that diff
I personally first saw it on my bazzite image end of January/early Feb
also a massive ostree upgrade
ah okay nvm
and that was getting updated weekly
wait, is this related?
this VM's uCore... i just updated a few days ago after a few months with no update... (last successful update was december 31)
that's on Jmcgee's ucore instance
I don't know if/how it's related
it would be confusing because that avc is pointing to a var_run_t file
whereas your code link points to libostree/ostree-sepolicy.c
which would be unrelated
yeah, i'm not sure that's related at all
and i never had an avc on bazzite or my ucore instance
his is a getattr
whereas if this is happening in ostree_sepolicy_restorecon
it'd be a
relabel
oh and another answer to you... i don't layer locally... i'm big time against that 🙂 thus custom images
do you have the full stacktrace
right, this is where i suspect it's happening, but i'm not sure
ostree_sepolicy_restorecon is a helper function
no stack trace, I haven't been able to get any more logs
would be good to know what parameters it was called with
damn
let me check all the invocations
if i had strace, i thought i could run that, but it's not on the machine, maybe I could install live
isn't it already borked 🫠
yeah, i can't hurt it, i can always rollback to the snapshot
could you curl a portable strace?
to be safe
that'd work too
but, i think it's installing fine
hard to hurt it more 😉
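rough plan, assuming ostree admin finalize-staged is the manual equivalent of what the service runs at shutdown:
# trace the manual finalize, keeping file-related syscalls
sudo strace -f -e trace=file -o /tmp/finalize.trace ostree admin finalize-staged
# then hunt for the failing path / label lookups near the end
grep -En 'selinux|restorecon|= -1' /tmp/finalize.trace | tail -n 40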
also that was from this, right
this is a manual run
where is the ostree-sepolicy.c
from
the line number you had
that's more or less what happens on shutdown
asking cause ostree_sepolicy_restorecon isn't called inside ostree
i was just searching the code to try to find stuff related,
so it must be called by rpm-ostree or bootc?
oh i see
your error message says semodule failed, which could involve changing contexts but sounds more like it's trying to install a policy file and that's failing
ah it's both
i see
yeah
some filecon in some module
:thonk:
in every other reference i can find to this online (including M2's experience), there's a file and/or package referenced in that output
but not in mine
eg https://discussion.fedoraproject.org/t/ostree-finalize-staged-fails-with-an-selinux-error/97734/4
yeah yours is different in that these errors are unrelated to ostree @bsherman
these are strings from selinux
yep
we need like
1 hint
😅
i know
these error logs are not helpful
this is why i'm going crazy trying to figure ANYTHING out
"something is wrong"
i abandoned my bazzite install, though I haven't wiped and reinstalled because I just didn't have the energy
when you first saw this in january, was it identical
but when this issue popped up here... now i'm concerned
also, you never rebase between ucore and ublue, right?
i think so, there's some history of my chats in #💾ublue-dev
that's heresy, my friend
lol yeah just making sure
do you have the exact log
like
this
with a timestamp
sure
well, no
sorry
when this happens on shutdown it doesn't even get logged; i'll provide a fresh example in a moment
that message only shows when running manually
so, here's current state:
I'll reboot now.
hmmm
https://forums.almalinux.org/t/dnf-upgrade-problem-with-selinux-policy-targeted-38-1-11-2-el9-2-on-amavis-new/2428
a quote from this page...
So yeah, this aligns with your comment that it looks like an rpm install failing
can you post the diff
db diff
i want to see what those upgrades are
it's what was in the pastebin
wait but that selinux-policy is ancient
ah okay
i know, it's just an example of the same error messages
it could be one of the commits between 41.26 and 41.33
hmm...
without knowing which of these it is though, it's just guesswork
Problems processing filecon rules
Update the bootupd policy · fedora-selinux/selinux-policy@d307fa8
In particular, the following permissions were allowed:
- allow read files in /sysroot, which have root_t type
- allow read udev pid files in case lsblk was executed from bootupd
so no transition ...
there are months of commits between those versions
so... i could try upgrading to a specific version much closer to the date of the current deployed image
wdym
version of?
the oci image that's deployed is from December 31
ah
i could try
bootc switch
to an image that's only a week after
for example
yes
i suspect that would work
41.33 shipped in february
if that works, it would at least give some weight to the theory
ok,
and then
oh yeah, so it does have the timestamp 😄
i'll try it, no reason not to
which stamp
oh you mean 12-31
yeah
if you can, try upgrading one selinux policy version at a time
from here to 41.27, then 41.28, etc
until it breaks
if it is that package, that will narrow it down a lot
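something like this per weekly tag then, i figure (tag naming per my repo, this one's just an example):
# switch to the next dated build, reboot, see if finalize survives
sudo bootc switch ghcr.io/bsherman/bos:ucore-minimal-20250109
sudo systemctl reboot
# after boot, note which policy version landed
rpm -q selinux-policy selinux-policy-targeted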
yeah, i'm just going to the earliest image i can find for now
kk
really hoping at least the first one doesn't break
no selinux changes in that diff
Thank you for coming to look at this with me 🙂
and identical error?
damn.
well
that rules that out
:/
what is in that diff
Even if we don't solve it ... it's nice to have someone else's perspective
it worked!
⁉️
what worked!?
i thought it broke!?
what is this
⁉️
‼️
😛
that's the output from the last attempt to do an update to current image version
ohhh
previous deployment
this is the status after
bootc switch ghcr.io/bsherman/bos:ucore-minimal-20250109
okay okay
okay
so we're in business
and there was no selinux package change
so
and you booted into it okay okay
yep
bootc switch but don't reboot until the selinux-policy package upgrades
maybe 1-2 days at a time?
so it's at least possible now that the problem is the selinux-policy package itself
good idea
selinux-policy 41.27 shipped in december
i have weekly builds, so i'll try this and call out when i have a diff
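for the diff, i can stage the next tag and compare before rebooting (tag below is just an example of the next weekly build):
# stage the next image, then diff booted vs staged packages without rebooting
sudo bootc switch ghcr.io/bsherman/bos:ucore-minimal-20250116
rpm-ostree db diff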
shouldn't have taken too long to get through the pipeline
kk
yep
if we can narrow it down to a specific version we can probably ID the commit
yep
found it
selinux-policy-targeted 41.26-1.fc41 -> 41.27-1.fc41
rebooting on this image update
ostree finalize and reboot success
ahh shit
time for the next one
yep
but hey, this is progress
also
this is NOT possible on traditional distro 🙂
i'm amazed this can be done!
can you tell i've used this method before 😛
lol
selinux-policy-targeted 41.27-1.fc41 -> 41.28-1.fc41
🧑🔬
great success?
reboot success
getting closer to the bad one...
yep
theoretical bad one at least
what if we get to latest and it works 🫠
it's possible
i wonder too if there might be a scenario where if upgraded in sequence there's no problem, but you can't make a big leap
that there was some policy change that assumes some file exists or something
yup
still though
that shouldn't matter *in theory
i think 41.29 is the one that will break
@bsherman this may be it
I made some more investigations, and reported https://bugzilla.redhat.com/show_bug.cgi?id=2342260. This really is specific to updating selinux-policy together with some other foo-selinux in the same dnf run. Updating separately works. The observable difference other than the "policy rejections" is that "semodule -l" has an additional "extra_binsbin" policy in the broken case.
wait this explains it perfectly tbh
selinux-policy is probably not aware of policy changes made by other packages so then the new FCs clash
oh?
but they don't if the DB already contains them
where is this quote from ?
FEDORA-2025-e7a319968a: bugfix update for selinux-poli...
let's see
seems very plausible
yeah, i'm booting the bazzite to check versions there
considering it's literally a case of "you waited too long to upgrade and now you have multiple selinux packages upgrading at the same time"
versions on disk or in the db?
yes
it would need to be selinux-policy + some other *-selinux package in the db diff
but
i dont want to jump to conclusions too soon
ok here (still on the ucore instance) i have a selinux change but a different package
so rebooting for that one
this and an selinux change?
selinux-policy
or just that one
that was the only selinux reference in the diff
OK, rebooting for these updates
woah
that's a big jump
fail
i'm going to backup a week
were there any other policy changes in that diff
or just those
time to look at some commits 🔍
i think just those
hm okay scratch my previous theory
can you post the full db diff for my sanity
ok, paste bin with full diff as I don't trust my eyes right now (you messaged before i could finish typing the same thing)
https://paste.centos.org/view/72e1f47c
and reboot on that fails to finalize
the error is identical right
about filecons
yes
candidate commits (with fc changes):
- https://github.com/fedora-selinux/selinux-policy/commit/3314785460528c93f61de3bf032ece6d85e71b49
- https://github.com/fedora-selinux/selinux-policy/commit/fc05cc87d6c60206257940ef02376adc03cc8fb2
- https://github.com/fedora-selinux/selinux-policy/commit/7a19b319a1e9809c0765282a2eb06496c711a2de
- https://github.com/fedora-selinux/selinux-policy/commit/36e325960ba98edd13d5776b86bb4639f918862a
- https://github.com/fedora-selinux/selinux-policy/commit/72752d81857b85395d1d6b9822458b4ed9a649fa
- https://github.com/fedora-selinux/selinux-policy/commit/a3a297010c51eb4d8f1c4d446ed3712eaf344217
this one is interesting https://github.com/fedora-selinux/selinux-policy/commit/7a19b319a1e9809c0765282a2eb06496c711a2de
Label /usr/bin/dnf5 with rpm_exec_t · fedora-selinux/selinux-polic...
Unlike dnf 3, which uses the /usr/bin/dnf-3 as a filename,
dnf 5 started to use /usr/bin/dnf5. dnf 4 is just a symlink.
The file context pattern was simplified.
Additionally, /usr/lib/sysimage/dnf...
wtf is rpm-sequoia
And here's the current Bazzite install (from Feb 04) and it's next available update from Feb 09
nvm, unrelated librpm_sequoia
wait
so one failed going from 28 -> 31
similar, but not same version changes
yep
one from 28 -> 32?
yep
hold up
that eliminates some candidates
we are looking at 29, 30, 31, but not 32
yep
down to 6
this still assumes this is the package at fault
it's a reasonable assumption but still an assumption
yeah
wait okay it's almost certainly not this one https://github.com/fedora-selinux/selinux-policy/commit/72752d81857b85395d1d6b9822458b4ed9a649fa
contrib/thumb: fix thunar thumbnailer (rhbz#2315893) · fedora-seli...
For thunar, the path is ~/.local/share/thumbnails.
yep
this is just adding another homedir for thumbs
probably not switcheroo-control either?
🤷♂️
and the kerberos one just removes regex syntax
https://github.com/fedora-selinux/selinux-policy/commit/fc05cc87d6c60206257940ef02376adc03cc8fb2
https://github.com/fedora-selinux/selinux-policy/commit/7a19b319a1e9809c0765282a2eb06496c711a2de
https://github.com/fedora-selinux/selinux-policy/commit/a3a297010c51eb4d8f1c4d446ed3712eaf344217
Label /dev/pmem[0-9]+ with fixed_disk_device_t · fedora-selinux/se...
Non-Volatile Dual In-line Memory Modules (NVDIMM) is a persistent memory
technology which combines the durability of storage with the low access
latency and the high bandwidth of dynamic RAM. In th...
Label /usr/bin/dnf5 with rpm_exec_t · fedora-selinux/selinux-polic...
Unlike dnf 3, which uses the /usr/bin/dnf-3 as a filename,
dnf 5 started to use /usr/bin/dnf5. dnf 4 is just a symlink.
The file context pattern was simplified.
Additionally, /usr/lib/sysimage/dnf...
Label /dev/iio:device[0-9]+ devices · fedora-selinux/selinux-polic...
The Industrial I/O core offers a way for continuous data capture based
on a trigger source. Multiple data channels can be read at once from
/dev/iio:deviceX character device node, thus reducing the...
one of these three imo
confirmed that Bazzite failed the ostree finalize staged with this diff
dnf one is still interesting
same filecon error?
yes, same error
the weird thing is
if this is really a policy issue
why is it only happening on some hardware
right?
right, i use the same build process for my bluefin image, but it never had a problem updating
i wonder if it has something to do with the availability of these hardware devices
i think the bazzite update happens to include that massive changeover Kyle did in Feb
can you
ls -lZ /dev/iio
hmm... well, the VM has very little available hardware
oh vm
yeah..
my laptop and desktop have much more in common
hmmm
same for pmem
?
yeah
idk why the dnf one would be an issue...
does ostree use libdnf
oh....
bootc does, right?
i think bootc does yes, and thus rpm-ostree under the hood now
yeah that's expected prior to this upgrade
i wonder if
either
so that policy needs to change those labels
1. bootc/rpm-ostree fucks with the policy for those dirs in some way
or
there needs to be a relabel
2. changing the filecon for libs in-use is causing issues
when you upgrade, are you using bootc or rpm-ostree, or both
my bluefin
yeah i have this too
i haven't specifically tested with both, but mostly i've been doing bootc update/switch
can you try with rpm-ostree upgrade or rpm-ostree rebase for sanity
if it fails the same way, then it's unrelated
but if it doesn't then there's some bootc issue with libdnf
and i wonder if others didn't see this issue because rpm-ostreed just upgraded in the background without bootc
it worked??
that would be plausible
it failed the same way
aw
where's the fail
oh
on reboot
right
ok scratch that
yep, on reboot
all my theories 😔
i am at least validated that having this VM for testing is much easier than a full bazzite desktop
so much faster to iterate
and our set of changes can be narrowed somewhat
wait when you say fails at reboot
do you mean as it's shutting down
or as it's booting up the next time
also are these both custom images
and if so, can you link me the repo
i have a new (probably shitty) theory
the journal entry i keep sharing different versions of... i believe that comes from the ostree-finalize-staged.service which runs at host shutdown when there is a pending deployment
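for reference, this is where i've been looking for it, not that it says much:
# finalize runs during shutdown, so the failure (if logged) is in the previous boot's journal
journalctl -b -1 -u ostree-finalize-staged.service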
yes, custom images, all from same repo in fact
link pls
i now suspect it's one of your layered packages
cloud layered
i saw nfs-utils and got spooked
that package has caused me nightmares
oh?
nfs-utils is in ucore
upstream of my custom
yeah nvm
it's only on your server image anyhow
wait a fuckin sec
cockpit-selinux
well, it's on bazzite too
ah
i think maybe pulled in by libvirt stuff?
i assume you already confirmed that rebasing to "plain bazzite" works?
nope
that would be good to check
because rebasing to "plain coreos" does not work
at least, not current stable
plain coreos meaning fedora's or ucore
both
both fail in the same manner as we've seen
oh
i could try rebasing to a ucore that's closer to this timestamp than my custom
maybe get smaller incremental changes
this is so far the most perplexing part
it's one thing if it's something you're adding that isn't in a common install
but if it repros on an official image
with like 350 packages
ok i'm rebasing from my bos:ucore-minimal 20250302 to ucore-minimal-20250302
then it's not something you're doing
alright
im just bummed because this means i can throw that theory out the window
it doesn't make sense though
other people aren't seeing this
how is it not a package you're adding
:thonk:
my assumption (based on others' experiences like M2's) has always been that i've got a file with a label in a place not expected, and when the policy changes and that label no longer exists, the relabel fails somehow
but we would see that label disappearing in the policy commits no?
true
wait also
but this was M2's original experience, he removed a package which provided a policy
you're not using some unique filesystems or anything right?
xfs
stock
xfs 👀
that's CoreOS standard
oh
bazzite is btrfs
when did that change
it's always been that way
CoreOS has never defaulted to btrfs
one of the reasons we tell people not to rebase across 😄
reboot successful with this.
i'm going to move forward incrementally
oh i see yeah nvm it is xfs
ucore-minimal doesn't have cockpit stuff right?
not as much
it has
hmm
even further upstream would be even better to eliminate more variables
but this is fine i guess
there's a coreos-bootc image on quay right
yes, but also
quay.io/fedora/fedora-coreos:stable
which is what we base on
ah yeah
that works
but i feel like there's something going on with the "custom" images
you already verified it repros on fedora coreos though
because no one has reported this before ... just me, jmac, m2
the issue repros
that was my theory too but you said you can repro on upstream
when attempting to switch to latest stable image, yes
damn
yeah, but from a custom
wait
wait
yeah
ok, here's the interesting ucore update
rebase from custom:oldtag -> noncustom:oldtag
then noncustom:oldtag -> noncustom:latest
https://paste.centos.org/view/bc182d82
this one has the selinux policy jumps
and it failed?
and it WORKED
ah okay
yeah
so now...
you did this?
i bet i can go back to my custom and be good
yes but if you go back to oldtag on custom it will repro i think
what did you do exactly
what path was followed
custom:oldtag > custom:latest | fails
----
custom:oldtag > noncustom:oldtag | works
noncustom:oldtag > noncustom:newerbutnotlatest | works
this is where we are now
yep makes sense
so some change on custom breaks it
right
ima try rebasing bazzite to something oldtag
wait hold up
ok
do you still have the db diff from custom:oldtag > noncustom:oldtag | works
or is that gone
hold
wait a sec
could it be because you removed a file that had its context changed?
🙂
that should be conditional
it's at the top of build.sh
one moment
Unlike dnf 3, which uses the /usr/bin/dnf-3 as a filename, dnf 5 started to use /usr/bin/dnf5. dnf 4 is just a symlink. The file context pattern was simplified. Additionally, /usr/lib/sysimage/dnf and /usr/lib/sysimage/libdnf5 were labeled with rpm_var_lib_t, similar to /usr/lib/sysimage/rpm.
https://github.com/fedora-selinux/selinux-policy/commit/7a19b319a1e9809c0765282a2eb06496c711a2de
95% sure this is it
https://paste.centos.org/view/f0fcf81f - custom:old > noncustom:old WORKS
rm'ing that bin corrupted the selinux db i believe
yes, but
hmm
that's happening (or was happening) in all the ublue image builds
and you're sure that dir wouldn't exist at that time
m2os and I tested that before putting it into bluefin/main/bazzite, etc
i see
that dir only exists at buildtime if rpm-ostree wrap was used
it MIGHT be the issue
do the build logs for your build from 0305 still exist
i'm not sure
if that log is in there
lets look
then we can rule it out
https://github.com/bsherman/bos/actions/runs/13670135659/job/38218624134
i think upstream had already made that change
but
echo is absent though :thonk:
ah!
yep, and no -x
you are right
it ran that there
ok
so... now that i'm on that same version of policy
OMG that's it
what did you confirm
what am i looking at here
oh
it's there
i see
so... this shows that my current system is March09
yeah
yup
and the rpm-ostree/wrapped is present
BUT
this also has the new policy stuff
right
well
so, now... if i go back,
march 6th but yeah
back to custom image, should work
i think
lol
to the latest on the custom image?
rpm-ostree wrapped was cursed from day one
uhh
i hated that
the issue here is
rm'ing binaries that selinux is aware of 😅
you should remove those rm's 😛
it will make selinux angry
well, i dunno
you CAN rm a file
it's fine
also
rm'ing files provided by an rpm is kind of cursed anyhow
there's something else in play
wdym
they weren't though
dnf is
and selinux is trying to relabel it
because the filecon changed
right?
but the file is removed
so it fails?
rpm-ostree wrap literally created a directory
/usr/libexec/rpm-ostree/wrapped/
and moved the rpm files into it...
replacing them with stubs
my script just undoes that mess and tries to put things back the way rpm wanted it
that's why i say it was cursed
but
did the wrapped stubs have the right filecontext
i think the issue might be
you need to restorecon after you mv -f
actually can you check the filecon on the cliwrapped versions
i think this is probably the case...
oh you did
look here
they're bin_t
yeah that's the issue
yep
so this IS probably a global ublue issue for any image which is still using this ... and we can test it
im not entirely sure it's the issue tbf because dnf itself didn't have its label changed
but dnf5 did
yeah, but we can still test it
wait.. then why is no one else seeing this
first, i'm switching back to my custom image from non-custom:0309 to custom:0309
make sure this works
i think there was a small window where this was happening
at least in upstreams like bluefin/bazzite/main
the test will be... rollback my VM to known "bad" old custom
rebuild my custom image with a restorecon added in after the mv
see if upgrade works
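i.e. the un-cliwrap bit of build.sh would end up roughly like this (simplified sketch; the real script touches more paths):
# undo rpm-ostree cliwrap: put the real binaries back, then fix their labels
if [ -d /usr/libexec/rpm-ostree/wrapped ]; then
  mv -f /usr/libexec/rpm-ostree/wrapped/* /usr/bin/
  rm -rf /usr/libexec/rpm-ostree/wrapped
  # without this the moved binaries keep their bin_t label instead of e.g. rpm_exec_t
  restorecon -vR /usr/bin
fi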
welllll
but the issue might be in the old image
that's the thing
yeah, that's the concern
like it might be the label going from wrong -> right
but, even if... at least we have a workaround
right
find the point where this occurs, and rebase to that on upstream, upgrade, then switch back to custom
but i was LOST before
you have helped SO Much
no prob
i still think we dont fully have a root cause
because the change in the policy is on dnf5
not dnf
but also dnf5 is modified in that script
:thonk:
i'm installing dnf5
there's something more at play here
right...
ok,
test VM is on custom:latest
success
or maybe the modification to dnf triggers selinux to relabel the whole dir or something
essentially following the workaround solution
ok, so it looks like my latest build is still doing manual rpm-ostree unwrap
https://github.com/bsherman/bos/actions/runs/14049430082/job/39336910290#step:6:169
doing another build with -x to verify https://github.com/bsherman/bos/actions/runs/14052705620/job/39345858075
and then i'll add restorecon
and then try to upgrade from :oldtag to the new build?
exactly
hopefully that works but if it doesn't you're kinda SOL still because
there are too many moving variables
well, at least i won't be creating possibly bad images, and we'll know if we CAN restorecon
restorecon should also spit out info into the build logs if it's doing something relevant
i suspect it'll work though
since this is happening on the finalize step
i think it was just a matter of time until someone touched the selinux policy for dnf
and once that happened, this bug was bound to surface
that, combined with this script still running for some reason despite the cliwrap stuff supposedly having been removed
hmm

🙂
not really a swiss cheese model tbh
but
¯\_(ツ)_/¯
https://github.com/bsherman/bos/actions/runs/14052705620/job/39345858075#step:6:179 we have logging of it moving
this bug has been very de-motivating for me 🙂
sometimes problems are good challenges ... sometimes they sit in the back of your mind and haunt you
yup
and it didn't help i just didn't feel I had the time to sit down and tackle it for over a month
but when it popped up end of last week on a work server ... i was very concerned... because my gaming rig, ok, maybe I fiddled once and forgot
but i don't fiddle with the work server 😄
when in doubt, i made selinux angry somehow
it's always angry 😔
I am slowly learning
ive been writing policies for confining Trivalent
very fun
🫠
for so long i just turned it off
let's just say, the interfaces fedora provides are very useful
but i refused to do that from day one with ublue... before i even realized it could be very very bad to do so LOL
such as? 😄
LOL
Every time you run setenforce 0, you make Dan Walsh weep.
https://github.com/secureblue/secureblue/blob/live/files/scripts/selinux/trivalent/trivalent.te#L166
so many interfaces
so... many...
😵💫
chromium-based browsers need access to so much shit
it's crazy
but why
hmmm....
and yet it needs it
also pretty much everything in userspace is unconfined...
so
sad
the hell?
can you rebase and check if the label changed
if it didnt
then there goes that theory
🫠
testing an upgrade from custom:old to that custom:new image
i now expect failure 😭
and my expectation was correct
rats
but also
fix: should restorecon /usr/bin after un-cliwrap · bsherman/bos@f0...
bazzite does not run this code, at least not today
so now we're back to
"it's something custom, but not dnf"
no
ok that puts the nail on that one
i think it could still be this thing
?
how so
if bazzite isn't touching dnf
if the old image is wrong?
today
what about a month and a half ago
im lost
oh
you mean like
in the past if bazzite was tampering with dnf
well, damn
https://github.com/bsherman/bos/actions/runs/13223254675/job/36910884181#step:6:168 bazzite didn't have cliwrap here on 0209
but it DID on the last image I can update to https://github.com/bsherman/bos/actions/runs/13146338233/job/36685198902#step:6:174 0204
^^?
aha
so
well wait
0209 is what your machine is on right?
oh
bazzite and ucore have different dates
maybe the tampered filecon persisted
can you check the filecon for /usr/bin/dnf on the bazzite machine
yeah, so that's kinda my thought... if the old image has bad filecon
if it's bin_t that all but confirms it
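quick check, since per that commit dnf5 should be rpm_exec_t on a healthy system:
# on the old bazzite deployment
ls -Z /usr/bin/dnf /usr/bin/dnf5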
then even a good new image can't handle updating bad label and migrating policy at the same time?
Version: bazzite-nvidia-41.20250204.3 (2025-02-04T22:59:14Z)
that im not exactly sure about but selinux tends to shit itself at the slightest db corruption
that's why any rm /usr/bin/* makes me quake in my boots
not worth the risk of selinux db corruption
it's not exactly a solid theory
but
yeah we haven't confirmed it
but also
it's not ruled out exactly
its a plausible hypothesis maybe, with a workaround for the bug
yeah
it'd be nice to confirm it
do you have a time machine
let me check
this thread
pretty much
i'm just super, super happy to have a workaround
i'm going to test the bazzite update now
sweet
let me know how it goes
sudo bootc switch ghcr.io/ublue-os/bazzite-gnome-nvidia:stable-41.20250127
in the works
bazzite on desktop, so much slower to test than VM with ucore 😄
so while waiting for that real hardware to do things...
ucore test reproduced the workaround solution:
from ucore-minimal-custom:stable-20250209
bootc switch ghcr.io/ublue-os/ucore-minimal:stable-20250209
# reboot
bootc switch ghcr.io/ublue-os/ucore-minimal
#reboot
should now be on current ucore-minimal no problems
bootc switch ghcr.io/mycustom/ucore-minimal-custom:stable
#reboot
should now be on current custom ucore-minimal no problems
verified success
bazzite tested workaround solution:
from bazzite-custom:stable-20250204
bootc switch ghcr.io/ublue-os/bazzite-gnome-nvidia:stable-20250127
# reboot
bootc switch ghcr.io/ublue-os/bazzite-gnome-nvidia:stable
# reboot
should now be on current bazzite-gnome-nvidia no problems
bootc switch ghcr.io/mycustom/bazzite-custom:stable
# reboot
should now be on current custom bazzite no problems
see above, it works
Can you provide dates?
For stuff based on main in the past couple of months we removed cli-wrap from main images.
I also think this is different than my experience. But I'm now wondering if this bug is affecting more people and they simply don't know it
chore: remove cliwrap and call dracut directly (#679) · ublue-os/m...
Co-authored-by: Kyle Gospodnetich
20 Feb.
This also would have only affected your Bazzite build.
And Ucore I believe has never used cli-wrap
did a rollback
When i get home i can check my history, i think i just found that in journald when checking auditd issues
looks like it got solved and explains why a rollback fixed it, from what i could gather from quickly perusing the history of last night's chat. 🙂
I think there are likely multiple ways to reach the broken state. For example, I doubt Jean-Baptiste’s problem was tied to the un-cliwrap.
Dates are in thread. I can dig more closely.
I think the common factor is the upgrade of selinux-policy-targeted and the cliwrap stuff being on the old image.
At least that’s the hunch.
We found other examples online where people hit the same basic error message when updating the package even on older Alma. So if something was out of whack, it could happen
In ucore, yeah, I think we did/do cliwrap, as evidenced by my build logs.
And the timing of the issue on my server was waiting for an upgrade from an old image, but I could upgrade up until the date where that selinux policy package updated but not past it.
Similarly with Bazzite. When that package updated (incidentally during the big Bazzite stable upgrade in Feb which included removing cliwrap) that’s where I got stuck.
But the cliwrap stuff could be a red herring? However, it does correlate with and match a change in the selinux-policy package. And that package update definitely seems to be in play.
The failure seemed to come from a rpm upgrade’s post section failing on restorecon or something.
I feel like cli-wrap is a red herring.
I wonder if we've experienced different failure conditions
the main thing is it correlates with a policy change related to dnf paths in the package that updated, and my un-cliwrap script was mucking with those specific paths... but yeah, it's not conclusive
https://discord.com/channels/1072614816579063828/1352341718942482615/1353954043101446185
which is a quote from https://bodhi.fedoraproject.org/updates/FEDORA-2025-e7a319968a
indicates someone saw the same kind of problem elsewhere probably just in a dnf run, maybe not in an ostree context
just changes to selinux-policy-targeted and some other foo-selinux in the same dnf run
and I narrowed down the problem to both ucore and bazzite updating from:
selinux-policy-targeted 41.26-1.fc41
to 41.31-1.fc41
and 41.32-1.fc41
respectively
while also upgrading selinux-policy
it's definitely possible

testCreateUrlSource fails in updates-testing due to SELinux rejecti...
The job fedora-41/updates-testing failed on commit 2c51415. Log: https://cockpit-logs.us-east-1.linodeobjects.com/pull-0-2c514156-20250118-020715-fedora-41-updates-testing/log.html
cockpit is still another potential culprit
@bsherman do your machines pull in freeipa-selinux
