bnxt_en NIC driver crashes IO_PAGE_FAULT

Asked by aft2d on 2021-06-03

Hi all,

We received a bunch of new servers with a Supermicro H12SSL-NT mainboard that has an embedded Broadcom BCM57416 NIC.

On all those servers we observe crashes of the NIC driver (bnxt_en) from time to time. We're not able to manually reproduce this issue, it just occurs at some point. Also our monitoring does not show any irregularities(DDoS or sth. like this).

Syslog: https://paste.steinh.art/ezagesivim.log

All servers are running with up-to-date packages:
$ lsb_release -rd
Description: Ubuntu 20.04.2 LTS
Release: 20.04
$ uname -r
5.4.0-73-generic ### It also happened on older kernel versions. The oldest we tried was 5.4.0-66.74

Thanks in advance.
~ Roman

Question information

Language:
English Edit question
Status:
Answered
For:
Ubuntu Edit question
Assignee:
No assignee Edit question
Last query:
Last reply:
Revision history for this message
actionparsnip (andrew-woodhead666) said :
#1

If you use a different kernel is it different?

Revision history for this message
Manfred Hampl (m-hampl) said :
#2

Suggestions:

1. create a bug report (against "linux")
and/or
2. try switching to the hwe kernel (by installing linux-image-generic-hwe-20.04 or linux-generic-hwe-20.04, kernel version 5.8.*)

Can you help with this problem?

Provide an answer of your own, or ask aft2d for more information if necessary.

To post a message you must log in.