Nvidia dgpu laptop problems

Hi! I am having issues with nvidia-powerd.service failing to start after rebasing to bazzite-nvidia from silverblue-nvidia, I attached the output of systemctl status nvidia-powerd.service, any ideas how I should start with trying to diagnose this problem? Its a mobile RTX 2060. An nvidia-powerd.service error message that was also appearing on my laptop's screen every boot, which was how I knew to check, if you are wondering.
83 Replies
Kyle Gospo
Kyle Gospo2y ago
If you're using secure boot, have you enrolled the key? Powerd failing can be normal, not all GPUs support it Make sure the Nvidia kernel module is loaded
Leaf Eriksen
Leaf EriksenOP2y ago
not currently using secure boot, would you recommend enabling it? also when looking for how to check if the Nvidia kernel module is loaded, I discovered this cat: /proc/driver/nvidia/version: No such file or directory
Kyle Gospo
Kyle Gospo2y ago
Hmm, are there any issues besides powerd? What's your output for modinfo nvidia
Leaf Eriksen
Leaf EriksenOP2y ago
Kyle Gospo
Kyle Gospo2y ago
That looks good at least Let's try uhh systemctl status nvidia-powerd | fpaste
Leaf Eriksen
Leaf EriksenOP2y ago
https://paste.centos.org/view/ea541a07 woah, fpaste is new to me! thats pretty neat
Kyle Gospo
Kyle Gospo2y ago
rpm-ostree kargs | fpaste
Kyle Gospo
Kyle Gospo2y ago
There we go, you appear to be missing all the kargs Which is weird, those should auto apply There's a just command do it
Leaf Eriksen
Leaf EriksenOP2y ago
nvidia-set-kargs?
Kyle Gospo
Kyle Gospo2y ago
Yea
Leaf Eriksen
Leaf EriksenOP2y ago
ok, just a minute https://paste.centos.org/view/e1bdfc08 same output after running the commands and rebooting. hmm
Kyle Gospo
Kyle Gospo2y ago
Okay that is extremely weird You could try running the karg command manually See if there's any difference
Leaf Eriksen
Leaf EriksenOP2y ago
yeah, I ran it, it told me to systemctl reboot, and nothing happened
Kyle Gospo
Kyle Gospo2y ago
All right, give me just a bit here I'll poke around and see if I can find anything I don't believe we ever remove those settings, we only ever add them
Leaf Eriksen
Leaf EriksenOP2y ago
This is all, just starting to look like I just did one too many rebases. Especially since I was coming from the fedora 39 beta. if you think it could be relevant and hard to fix, I could do a reinstall instead of just trying to make rebasing work. Also, if all the secure boot stuff is working, I could reenable that. I only had it disabled because a while back ublue was having secure boot issues.
Kyle Gospo
Kyle Gospo2y ago
Yeah feel free to, it should all just work
Leaf Eriksen
Leaf EriksenOP2y ago
I’m confused. I’m still getting TPM issues when trying to install from the main image. should I keep TPM and secure boot stuff off during install?
Kyle Gospo
Kyle Gospo2y ago
If you can disable one and not the other I would just turn TPM off Otherwise there's nothing wrong with both off at all System 76 scheduler can't change your CFS parameters if secure boot is on, so it's probably objectively better on a laptop anyway
Leaf Eriksen
Leaf EriksenOP2y ago
Cool. Also, I’m not seeing bazzite in the main installer, is that normal or did I just miss the right menu?
Leaf Eriksen
Leaf EriksenOP2y ago
Well, I reinstalled, and the issues aren’t over yet
No description
Kyle Gospo
Kyle Gospo2y ago
If that's the first boot that's normal, it should set the kargs and then reboot again with Nvidia loading We can't set kargs in the installer yet
Leaf Eriksen
Leaf EriksenOP2y ago
OK, I’ll get back to you afterwards [leaf@fedora leaf]$ just enroll-secure-boot-key sudo mokutil --import /etc/pki/akmods/certs/akmods-ublue.der input password: input password again: echo 'Enter password "ublue-os" if prompted' Enter password "ublue-os" if prompted so... not only does the nvidia gpu not seem to work, I can't get the secure boot script to work I keep entering ublue-os as password, but it just doesn't like me.
Kyle Gospo
Kyle Gospo2y ago
That looks right, don't see any errors When you reboot it'll boot to your bios and enroll the key Or at least should @bsherman might need you on this one if you have time
Leaf Eriksen
Leaf EriksenOP2y ago
OK, I just thought the message was broken since after entering the password twice it told me what I should’ve entered and then shut down the script. I just rebooted, and it came back to the desktop. Should I run just bios after running the script again? Or just bios without running the script again
Kyle Gospo
Kyle Gospo2y ago
Script again I think Since once you leave that screen it's done trying to add the key
bsherman
bsherman2y ago
So, I think the timeline here is: 1. system was running ublue-os/silverblue-nvidia 2. rebased to ublue-os/bazzite-nvidia 3. nvidia-powerd service is failing 4. then enabled secureboot and disable dtpm in bios ? 5. then attempted to enroll secureboot keys?
Kyle Gospo
Kyle Gospo2y ago
Fresh install now With secure boot enabled
bsherman
bsherman2y ago
so, if user does not need SecureBoot and TPM, I'd at least start with them disabled in order to rule them out but before changing anything... I'd do a few debugging things:
rpm-ostree|fpaste
rpm-ostree kargs|fpaste
mokutil --list-enrolled |grep -A4 ublue|fpaste
modinfo nvidia|fpaste
systemctl status nvidia-powerd|fpaste
dmesg|fpaste
rpm-ostree|fpaste
rpm-ostree kargs|fpaste
mokutil --list-enrolled |grep -A4 ublue|fpaste
modinfo nvidia|fpaste
systemctl status nvidia-powerd|fpaste
dmesg|fpaste
at least with that info i can try to help gotta step away for a few, but i'll be back
Leaf Eriksen
Leaf EriksenOP2y ago
I just disabled secure boot to just eliminate any relevant issues, here are the outputs First No text to send Second https://paste.centos.org/view/d2c9dcc1 Third No text to send. Fourth https://paste.centos.org/view/00c7514d Fifth https://paste.centos.org/view/490a3c72 Sixth https://paste.centos.org/view/7caeb3ef I am in no rush, I have another working computer for non-gaming things, so feel free to de-prioritise this issue as needed Just tell me if you think another clean install with secure boot off could help, cause I am down to do it
bsherman
bsherman2y ago
First No text to send
I messed up this should have been rpm-ostree status|fpaste 😄
Second https://paste.centos.org/view/d2c9dcc1
not yet showing nvidia specific kargs, but i think that makes sense when i read the chat above, as i think you were having trouble with the mok enrollment
Third No text to send.
expected since apparently mok enrollment was not successful
Fourth https://paste.centos.org/view/00c7514d
this is good, shows you are running some derivative of ublue-os/silverblue-nvidia or have installed our akmods built nvidia kmod this shows that 1) the drivers is on the filesystem 2) it's signed by ublue's key
Fifth https://paste.centos.org/view/490a3c72
expected nvidia-powerd failure given the nvidia driver is not loaded
Sixth https://paste.centos.org/view/7caeb3ef
ok, dmesg boot output shows 1) intel CPU and nvidia card are present 2) secureboot is already disabled ... this MAY be related to failure to enroll the MOK... 3) TPM is likely disabled in bios because kernel can't find TPM chip at this stage, I'd do: just nvidia-set-kargs please report any output from that here... then reboot... i'd expect the nvidia driver to be loaded after rebooting could repeat some debugging
dmesg -T|fpaste
rpm-ostree kargs|fpaste
systemctl status nvidia-powerd|fpaste
lspci -k|fpaste
dmesg -T|fpaste
rpm-ostree kargs|fpaste
systemctl status nvidia-powerd|fpaste
lspci -k|fpaste
Leaf Eriksen
Leaf EriksenOP2y ago
ok, I can't for the life of me get notifications to work on this specific chat, so I just saw this. ran the 'just nvidia-set-kargs' command, rebooted and ran the debugging stuff https://paste.centos.org/view/27729c4d https://paste.centos.org/view/bee6cee2 https://paste.centos.org/view/20678332 https://paste.centos.org/view/1d2dc929 nvtop still isn't seeing the rtx gpu, and the nvidia-powerd issues persist. I know turned off secure boot before these last two rounds of diagnostics, but I am thinking something might have gone wrong because of me having it enabled during os install. also yeah, tpm is off to avoid weird installer bugs was havng.
bsherman
bsherman2y ago
I'll ping you more directly 🙂 @leaferiksen
https://paste.centos.org/view/bee6cee2
This shows the kargs are not set to blacklist nouveau . did you run just nvidia-set-kargs ? i mean, you say you did 😕 was there any output?
Leaf Eriksen
Leaf EriksenOP2y ago
NVK can't arrive soon enough
bsherman
bsherman2y ago
was there any output from trying to set the nvidia kargs? maybe try just -v nvidia-set-kargs ?
Leaf Eriksen
Leaf EriksenOP2y ago
oh shit just saw your message yes, it built the whole image, told me to reboot, and nothing had changed
bsherman
bsherman2y ago
just nvidia-set-kargs should not build an image now i'm curious if that recipie has a bug
Leaf Eriksen
Leaf EriksenOP2y ago
oh I just saw staging deployment, in my head connected the two concepts
bsherman
bsherman2y ago
i'm removing the kargs from my system... i'll do some rebooting and test that recipe
Leaf Eriksen
Leaf EriksenOP2y ago
ran this, doing a reboot as it suggested now also, I am unplugging the monitor in hopes of less variables to this puzzle
bsherman
bsherman2y ago
i'm pretty convinced the problem is you don't have the proper kargs i don't think you can mis-type just nvidia-set-kargs 🙂 so i'll test here i mean, i can mis-type it :-D, but it wouldn't give the output you mention
Leaf Eriksen
Leaf EriksenOP2y ago
lol never underestimate the power of users with terrible typing and weird computers same nvidia-powerd issues after a reboot
bsherman
bsherman2y ago
it should have done something like this:
$ just nvidia-set-kargs
/usr/bin/nvidia-smi
Checking out tree d4b7183... done
Writing OSTree commit... done
Staging deployment... done
Freed: 419.3 MB (pkgcache branches: 0)
Changes queued for next boot. Run "systemctl reboot" to start a reboot
$ just nvidia-set-kargs
/usr/bin/nvidia-smi
Checking out tree d4b7183... done
Writing OSTree commit... done
Staging deployment... done
Freed: 419.3 MB (pkgcache branches: 0)
Changes queued for next boot. Run "systemctl reboot" to start a reboot
ok, i've manually removed my nvidia/nouveau kargs... rebooted... my nvidia drivers were not loading as expected... then run the command above (with that output) and now it's working again... what are you r kargs now? rpm-ostree kargs
Leaf Eriksen
Leaf EriksenOP2y ago
[leaf@fedora leaf]$ just nvidia-set-kargs /usr/bin/nvidia-smi Staging deployment... done Changes queued for next boot. Run "systemctl reboot" to start a reboot
Kyle Gospo
Kyle Gospo2y ago
first time I've ever seen kargs not apply this is core ostree stuff
bsherman
bsherman2y ago
right... that's why i tested manually myself... wanted to rule out a problem with the recipe
Leaf Eriksen
Leaf EriksenOP2y ago
[leaf@fedora leaf]$ rpm-ostree kargs rd.luks.options=discard rhgb quiet root=UUID=a3b3db8c-d31e-4c5e-9791-8c2b7773c562 rootflags=subvol=root rw ostree=/ostree/boot.0/default/b3b5fe2f2136685150875a7fda59f73000e92bdad229b8205e65be2909dc4c19/0 rd.driver.blacklist=nouveau modprobe.blacklist=nouveau nvidia-drm.modeset=1
Kyle Gospo
Kyle Gospo2y ago
that looks better
Leaf Eriksen
Leaf EriksenOP2y ago
yeah, and I didn't get the nvidia-powerd.service errors on boot (not that they have been there with any consistency) but the service is still failing, and nvtop still doesn't see the gpu.
bsherman
bsherman2y ago
lspci -k | fpaste plz
bsherman
bsherman2y ago
and cat /proc/cmdline|fpaste ?
bsherman
bsherman2y ago
I don't know how to reconcile your debugging here... Here: https://discord.com/channels/1072614816579063828/1155551944555888760/1155646717761433671 it's clear you are booting WITHOUT the kargs which blacklist nouveau Here: https://discord.com/channels/1072614816579063828/1155551944555888760/1155637710309040158 you show them present in ostree...
Leaf Eriksen
Leaf EriksenOP2y ago
This is all very weird. I can’t re-create the successful blacklisting of nouveau drivers. Thank you for all the help, i’ll be trying another clean installation, with secure boot off from the start
bsherman
bsherman2y ago
you can try it manually
sudo rpm-ostree kargs \
--append-if-missing=rd.driver.blacklist=nouveau \
--append-if-missing=modprobe.blacklist=nouveau \
--append-if-missing=nvidia-drm.modeset=1 \
--delete-if-present=nomodeset
sudo rpm-ostree kargs \
--append-if-missing=rd.driver.blacklist=nouveau \
--append-if-missing=modprobe.blacklist=nouveau \
--append-if-missing=nvidia-drm.modeset=1 \
--delete-if-present=nomodeset
Leaf Eriksen
Leaf EriksenOP2y ago
I ran that, rebooted when it told me to, and came back to no blacklist added to the rpm-ostree kargs. My system is broken in a really special sort of way. Btw, I did a clean install, and things are still being broken. I’ll get back in contact if I ever figure out a fix, in case it could be useful to others on particularly weird Nvidia laptops.
Nathaniel
Nathaniel2y ago
Just wanted to chime in to say that I have that Nvidia powerd message on reboot too, but in my case the drivers load, so I think that can be safely disregarded.
bsherman
bsherman2y ago
which nvidia card are you running? i think nvidia-powerd fails by design (thanks for nothing Nvidia, Inc) on older cards ... and by old I mean 10-series and 16-series... even though they work with the current driver... i'm pretty sure 20-series and newer is where it should succeed but yeah, regardless, the real test is if the drivers load... and nvidia-smi is one of the simplest ways to validate that
Leaf Eriksen
Leaf EriksenOP2y ago
What’s the trick to force an app to run on the descrete gpu on kde? I’ve been just checking to see if nvtop can see the card like it used to on silverblue-nvidia, but I honestly haven’t tried launching an app on the card since the install just cause I don’t know how
Kyle Gospo
Kyle Gospo2y ago
You need those kargs sorted first No point even trying to use your gpu until they work
Nathaniel
Nathaniel2y ago
That would be a 1070, so that makes sense
tryhardsoccermomswag
GitHub
Default steam.desktop has BAD options causing it to close and ope...
Your system information Steam client version (build number or date): Jun 21 2023 21:17:38 Distribution (e.g. Ubuntu): Ubuntu 23.10 KDE Plasma Opted into Steam client beta?: No Have you checked for ...
GitHub
New steam UI does not open if run with DRI_PRIME=1 · Issue #9383 · ...
Your system information Steam client version (build number or date): 1682634349 Distribution (e.g. Ubuntu): Fedora Silverblue 38 Opted into Steam client beta?: [Yes/No] Yes Have you checked for sys...
GitHub
Remove PrefersNonDefaultGPU=true from .desktop file of Steam · Iss...
counter-part to #7089 PrefersNonDefaultGPU is broken by design, intended to mean "Use the Discrete GPU if possible" but instead generalized to mean "Use anything that we aren't u...
Leaf Eriksen
Leaf EriksenOP2y ago
Thanks
Kyle Gospo
Kyle Gospo2y ago
Try updating to the latest version Your issue may be solved
Leaf Eriksen
Leaf EriksenOP2y ago
holy shit yall fixed it, thank you so much i'm gonna do one more reinstall probably, just cause I want to play around with whether I have consistency with all this, but it is working right now! So happy rn Happy to report that the Nvidia stuff continues to work after another reinstall. The bazzite setup portal didn’t auto start on the first boot, which was a little weird, but nothing else is misbehaving. (Other than all my usual issues, like reboots turning into shutdowns when the lid is closed and other dumb laptop nonsense) Thanks for all y’all’s hard work to make this a great user experience!
Gyromancer
Gyromancer2y ago
I'm still having this problem on the latest version of bazzite-nvidia desktop. The kernal arguments are present and secure boot is set up. Nvidia 1060 6GB
Kyle Gospo
Kyle Gospo2y ago
If you're just seeing a message in terminal with no other issues the latest update may correct that
Gyromancer
Gyromancer2y ago
I just ran rpm-ostree upgrade and rebooted. I'm on 38.20230930
Kyle Gospo
Kyle Gospo2y ago
What are your symptoms?
Gyromancer
Gyromancer2y ago
The terminal tells me the Nvidia driver is not loaded Nvtop also says that there is no GPU to monitor Even though neofetch lists the GPU
Kyle Gospo
Kyle Gospo2y ago
lsmod | grep nvidia
Gyromancer
Gyromancer2y ago
Returns nothing
Kyle Gospo
Kyle Gospo2y ago
dmesg | grep nvidia | fpaste
Kyle Gospo
Kyle Gospo2y ago
I'm not sure what's going on here, this show's the correct kargs and that the driver is loading
Gyromancer
Gyromancer2y ago
Weirdo Firmware Security in the Info Center says Secure Boot is enabled Says Linux kernel is tainted 🤔
Kyle Gospo
Kyle Gospo2y ago
That's normal with NVIDIA The license itself taints it
Gyromancer
Gyromancer2y ago
Ah
Kyle Gospo
Kyle Gospo2y ago
And your dmesg doesn't show the errors you'd get it it was being blocked by lack of key for secure boot Probably a quirk of dual GPU? Which is to say I'd check the arch wiki page on it and try some of their suggestions Mainly the GPU switching stuff
Gyromancer
Gyromancer2y ago
This is a system with a Xeon, so no iGPU
Kyle Gospo
Kyle Gospo2y ago
Ah I missed that This is REALLY weird then
Gyromancer
Gyromancer2y ago
I was in Hybrid mode, so I just switched to fully integrated We'll see if that fixes it Wait, no It should be fully discrete Wtf Okay, back to hybrid Says dGPU power active

Did you find this page helpful?