Comment 8 for bug 1916640

Revision history for this message
Ejmarkow (ejmarkow) wrote : Re: Nvidia 340.108 fails to install with kernels 5.11.x - new patches needeed

Hi Butterfly (kelebek333),

On the contrary, an Nvidia 340.108 patch is probably required for Linux Kernel 5.12.

I've been testing Linux kernel 5.12 with the all the Nvidia 340.108 patches including for kernel 5.11) and I'm getting random 'core dump' crashes. It works fine with kernel 5.11, but not with 5.12.

Here is some info:

uname -a:
Linux galicia 5.12.0-rc5-git-5e46d1b-Pogorze-Karpackie-CUSTOM-KERNEL #3 SMP Wed Mar 31 08:36:13 CEST 2021 x86_64 GNU/Linux

dmesg:
NVRM: loading NVIDIA UNIX x86_64 Kernel Module 340.108 Wed Dec 11 11:06:58 PST 2019

nvidia_uvm: Loaded the UVM driver, major device number 243

lsmod:
Both nvidia & nvidia_uvm modules are loaded fine

The issue:

Interesting thing is, the shim build for nvidia.ko and nvidia_uvm.ko is compiling FINE, with NO ERRORS during or after the build. So, no problem there.

The problem is, I'm getting a random 'core dump' when navigating in the Google Chrome browser. Chrome isn't the problem though because it is not mentioned in the 'core dump', but "nvidia_drv.so" is mentioned.

What happens is: Chrome will close first, Xorg will crash and then I'm logged out of Lightdm.

Both Xorg.bin and Lightdm are mentioned in the following 'core dump', along with Nvidia. Here is the output for journalctl and dmesg (also link to pastebin [ https://pastebin.com/raw/9DXR6MY1 ]:

# journalctl

Apr 09 01:15:50 galicia systemd[1]: Starting Hostname Service...
Apr 09 01:15:51 galicia dbus-daemon[457]: [system] Successfully activated service 'org.freedesktop.hostname1'
Apr 09 01:15:51 galicia systemd[1]: Started Hostname Service.
Apr 09 01:16:21 galicia systemd[1]: systemd-hostnamed.service: Succeeded.
Apr 09 01:20:30 galicia nscd[476]: 476 checking for monitored file `/etc/services': No such file or directory
Apr 09 01:20:45 galicia nscd[476]: 476 checking for monitored file `/etc/services': No such file or directory
Apr 09 01:24:13 galicia smartd[463]: Device: /dev/sda [SAT], SMART Usage Attribute: 195 Hardware_ECC_Recovered changed from 71 to 70
Apr 09 01:27:55 galicia systemd[1]: Created slice system-systemd\x2dcoredump.slice.
Apr 09 01:27:55 galicia systemd[1]: Started Process Core Dump (PID 16933/UID 0).
Apr 09 01:27:58 galicia systemd-coredump[16934]: Process 1126 (Xorg.bin) of user 0 dumped core.

                                                 Stack trace of thread 1126:
                                                 #0 0x00007f34154764a5 raise (libc.so.6 + 0x3d4a5)
                                                 #1 0x00007f341545f864 abort (libc.so.6 + 0x26864)
                                                 #2 0x0000556b3271138c OsAbort (Xorg.bin + 0x1c838c)
                                                 #3 0x0000556b32712130 FatalError (Xorg.bin + 0x1c9130)
                                                 #4 0x0000556b32714df9 n/a (Xorg.bin + 0x1cbdf9)
                                                 #5 0x00007f341561ba30 __restore_rt (libpthread.so.0 + 0x13a30)
                                                 #6 0x00007f34147fd986 n/a (libglx.so + 0xa1f986)
                                                 #7 0x00007f3410ad23c7 n/a (nvidia_drv.so + 0x6d13c7)
                                                 #8 0x00007f3410ad2cdc n/a (nvidia_drv.so + 0x6d1cdc)
                                                 #9 0x0000556b32593f0d n/a (Xorg.bin + 0x4af0d)
                                                 #10 0x00007f3415460b25 __libc_start_main (libc.so.6 + 0x27b25)
                                                 #11 0x0000556b325942ce _start (Xorg.bin + 0x4b2ce)

                                                 Stack trace of thread 1142:
                                                 #0 0x00007f34155396fe epoll_wait (libc.so.6 + 0x1006fe)
                                                 #1 0x0000556b32717ba7 n/a (Xorg.bin + 0x1ceba7)
                                                 #2 0x0000556b3270dee1 n/a (Xorg.bin + 0x1c4ee1)
                                                 #3 0x00007f3415611299 start_thread (libpthread.so.0 + 0x9299)
                                                 #4 0x00007f34155393b3 __clone (libc.so.6 + 0x1003b3)
Apr 09 01:27:58 galicia systemd[1]: systemd-coredump@0-16933-0.service: Succeeded.
Apr 09 01:27:58 galicia at-spi-bus-launcher[1420]: X connection to :0 broken (explicit kill or server shutdown).
Apr 09 01:27:58 galicia su[1870]: pam_unix(su:session): session closed for user root
Apr 09 01:27:58 galicia lightdm[1196]: pam_unix(lightdm:session): session closed for user ejm
Apr 09 01:27:58 galicia systemd-logind[552]: Session 2 logged out. Waiting for processes to exit.
Apr 09 01:27:58 galicia kernel: show_signal_msg: 34 callbacks suppressed
Apr 09 01:27:58 galicia kernel: lightdm[16941]: segfault at 8 ip 00007fdea653cbba sp 00007ffeb03c9568 error 4 in libc-2.33.so[7fdea64c3000+14e000]
Apr 09 01:27:58 galicia kernel: Code: f3 0f 1e fa 66 0f ef c0 66 0f ef c9 66 0f ef d2 66 0f ef db 48 89 f8 48 89 f9 48 81 e1 ff 0f 00 00 48 81 f9 cf 0f 00 00 77 66 <f3> 0f >
Apr 09 01:27:58 galicia systemd[1217]: pulseaudio.service: Succeeded.
Apr 09 01:27:58 galicia systemd[1]: Started Process Core Dump (PID 16942/UID 0).
Apr 09 01:27:59 galicia kernel: resource sanity check: requesting [mem 0x000c0000-0x000fffff], which spans more than PCI Bus 0000:00 [mem 0x000c0000-0x000dffff window]
Apr 09 01:27:59 galicia kernel: caller _nv000788rm+0xe4/0x1c0 [nvidia] mapping multiple BARs
Apr 09 01:27:59 galicia systemd-coredump[16945]: Process 16941 (lightdm) of user 0 dumped core.

                                                 Stack trace of thread 16941:
                                                 #0 0x00007fdea653cbba __GI___strlen_sse2 (libc.so.6 + 0x9fbba)
                                                 #1 0x00007fdea64dc6f9 __add_to_environ (libc.so.6 + 0x3f6f9)
                                                 #2 0x000055630fada246 n/a (lightdm + 0x18246)
                                                 #3 0x000055630fada499 n/a (lightdm + 0x18499)
                                                 #4 0x000055630fae1906 n/a (lightdm + 0x1f906)
                                                 #5 0x00007fdea69572ee n/a (libgobject-2.0.so.0 + 0x152ee)
                                                 #6 0x00007fdea696fac9 g_signal_emit_valist (libgobject-2.0.so.0 + 0x2dac9)
                                                 #7 0x00007fdea697003f g_signal_emit (libgobject-2.0.so.0 + 0x2e03f)
                                                 #8 0x000055630fad982f n/a (lightdm + 0x1782f)
                                                 #9 0x00007fdea6861254 n/a (libglib-2.0.so.0 + 0x51254)
                                                 #10 0x00007fdea6864f57 g_main_context_dispatch (libglib-2.0.so.0 + 0x54f57)
                                                 #11 0x00007fdea68652d8 n/a (libglib-2.0.so.0 + 0x552d8)
                                                 #12 0x00007fdea68655cb g_main_loop_run (libglib-2.0.so.0 + 0x555cb)
                                                 #13 0x000055630facc52d n/a (lightdm + 0xa52d)
                                                 #14 0x00007fdea64c4b25 __libc_start_main (libc.so.6 + 0x27b25)
                                                 #15 0x000055630face60e n/a (lightdm + 0xc60e)
Apr 09 01:27:59 galicia systemd[1]: systemd-coredump@1-16942-0.service: Succeeded.
Apr 09 01:28:00 galicia systemd[1]: Created slice User Slice of UID 482.
Apr 09 01:28:00 galicia systemd[1]: Starting User Runtime Directory /run/user/482...
Apr 09 01:28:00 galicia systemd-logind[552]: New session 3 of user lightdm.
Apr 09 01:28:01 galicia systemd[1]: Finished User Runtime Directory /run/user/482.
Apr 09 01:28:01 galicia systemd[1]: Starting User Manager for UID 482...
Apr 09 01:28:01 galicia systemd[16966]: pam_warn(systemd-user:setcred): function=[pam_sm_setcred] flags=0x8002 service=[systemd-user] terminal=[<unknown>] user=[lightdm] ru>
Apr 09 01:28:01 galicia systemd[16966]: pam_unix(systemd-user:session): session opened for user lightdm(uid=482) by (uid=0)

-----

$dmesg

[20057.313160] show_signal_msg: 34 callbacks suppressed
[20057.313170] lightdm[16941]: segfault at 8 ip 00007fdea653cbba sp 00007ffeb03c9568 error 4 in libc-2.33.so[7fdea64c3000+14e000]
[20057.313271] Code: f3 0f 1e fa 66 0f ef c0 66 0f ef c9 66 0f ef d2 66 0f ef db 48 89 f8 48 89 f9 48 81 e1 ff 0f 00 00 48 81 f9 cf 0f 00 00 77 66 <f3> 0f 6f 20 66 0f 74 e0 66 0f d7 d4 85 d2 74 04 0f bc c2 c3 48 83
[20057.972305] resource sanity check: requesting [mem 0x000c0000-0x000fffff], which spans more than PCI Bus 0000:00 [mem 0x000c0000-0x000dffff window]

=====