I'm using version 16.04 on a Dell XPS 15 with a Nvidia GTX1050i graphics card. I have several versions of nvidia driver installed, but 410.79 is the one used. The version of my kernel is 4.15, which is, I believe, the last one.
I've recently started using tensorflow-gpu with cuda 9.0 (the latest version compatible with tensorflow) and since then the computer has frozen every two hours, seemingly unannounced. The cursor disappears and it does not respond to any command and I have to force the shutdown. The fan also starts to turn very loud and the laptop heats up, as it runs a very trying process. Once, it froze by displaying htop, and I have not seen any unusual processes listed.
There is a suggestion here to install modprobe, but I already have the latest version. I've also tried the solution here, but this gave errors when tensorflow tried using the gpu.
More recently, I have removed and reinstalled all of the following:
- cuda-9.0, from the tensorflow statements, from the deb (local) option,
follow 4 patches
- cudnn, from nvidia webiste, run option (deb)
in that order, and that did not help.
I would be very happy if someone could help me find a solution. Thank you.