Hello guys.
First of all, big thanks to all the team for the great job on this project.
I've been using your distro for 3 months now and it rocks, guys. Lately I've had an issue with one of my rigs: a faulty riser causes the rig to freeze without being able to complete its reboot routine.
As Doftorul pointed out, more often than not a GPU dropping off the bus triggers a kernel panic.
So I'm sharing my workaround for those who have the same issue and only have remote access to the rig:
Create a magic reboot script (magicreboot.sh) that contains:
#!/bin/bash
echo 1 > /proc/sys/kernel/sysrq
echo s > /proc/sysrq-trigger
echo u > /proc/sysrq-trigger
echo s > /proc/sysrq-trigger
echo b > /proc/sysrq-trigger
Edit 5watchdog: change all occurrences of sudo reboot to
sudo magicreboot.sh
Do the same in 6tempcontrol.
For more reading about SysRq:
https://en.wikipedia.org/wiki/Magic_SysRq_key
Hope this helps.
Big thanks to Doftorul, sizzlephizzle and all the nvOC team
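P.S. The search-and-replace in 5watchdog and 6tempcontrol can be scripted. This is only a minimal sketch: swap_reboot is a made-up helper name, and it assumes every reboot call in those files is spelled exactly "sudo reboot" and that magicreboot.sh is somewhere sudo can find it. It keeps a .bak copy of each file it touches.

```shell
#!/bin/bash
# Hypothetical helper (not part of nvOC): replace every "sudo reboot"
# in the given file with "sudo magicreboot.sh", keeping a .bak backup.
swap_reboot() {
    local file="$1"
    # -i.bak edits in place and saves the original as <file>.bak
    sed -i.bak 's/sudo reboot/sudo magicreboot.sh/g' "$file"
}
```

Running swap_reboot 5watchdog and swap_reboot 6tempcontrol from the directory that holds those scripts should do it; check the result against the .bak files before relying on it.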
Nice idea
Have you tested it?
I once tried to implement R.E.I.S.U.B with a different approach but was not successful.
Does it need to sync twice?
(u) remounts the filesystems read-only, so I don't think it can sync data to disk after u.
Isn't it better to do the full REISUB sequence?
Edit:
If you have a faulty GPU that causes the system to freeze, can you please test this full REISUB sequence:
#!/bin/bash
echo 1 > /proc/sys/kernel/sysrq
# (un*R*aw) Takes back control of keyboard from X
echo r > /proc/sysrq-trigger
# (t*E*rminate) Send SIGTERM to all processes.
echo e > /proc/sysrq-trigger
# (k*I*ll) Send SIGKILL to all processes.
echo i > /proc/sysrq-trigger
# (*S*ync) Sync all cached disk operations to disk
echo s > /proc/sysrq-trigger
# (*U*nmount) Remounts all mounted filesystems read-only
echo u > /proc/sysrq-trigger
# (re*B*oot) Reboots the system
echo b > /proc/sysrq-trigger
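One thing to watch for when testing: as far as I know, e and i signal every process except init, and that includes the shell running the script, so it may get killed before it ever reaches s, u and b. Here is a rough sketch of a helper that sends whichever keys you pass it with a pause in between, so you can try the sequence with and without e and i. send_sysrq_sequence and the SYSRQ_TRIGGER override are names I made up, and the override only exists so the thing can be dry-run against a scratch file instead of the real trigger.

```shell
#!/bin/bash
# Hypothetical helper: write each SysRq key to the trigger file in order,
# sleeping <delay> seconds after each one so sync/remount get time to finish.
# SYSRQ_TRIGGER is an assumed override for dry runs; default is the real trigger.
send_sysrq_sequence() {
    local delay="$1"; shift
    local trigger="${SYSRQ_TRIGGER:-/proc/sysrq-trigger}"
    local key
    for key in "$@"; do
        printf 'sysrq %s\n' "$key"         # log which key is being sent
        printf '%s\n' "$key" > "$trigger"  # fire the SysRq function
        sleep "$delay"
    done
}
```

Run as root, something like send_sysrq_sequence 2 s u b would sync, remount read-only and reboot with two seconds between steps. The 2 is an arbitrary value, not something I have tested on a rig.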