Unable to install ibacm under ubuntu 18.04 (bionic)

Asked by Marinela Selseth

Attempted to install ibacm under ubuntu 18.04.
I get thee errors:
++++++++
Setting up ibacm (17.1-1) ...
Job for ibacm.service failed because the control process exited with error code.
See "systemctl status ibacm.service" and "journalctl -xe" for details.
invoke-rc.d: initscript ibacm, action "start" failed.
● ibacm.service - InfiniBand Address Cache Manager Daemon
   Loaded: loaded (/lib/systemd/system/ibacm.service; enabled; vendor preset: enabled)
   Active: failed (Result: exit-code) since Tue 2018-08-14 12:20:51 CDT; 4ms ago
     Docs: man:ibacm
           file:/etc/rdma/ibacm_opts.cfg
  Process: 11881 ExecStart=/usr/sbin/ibacm --systemd (code=exited, status=255)
 Main PID: 11881 (code=exited, status=255)

Aug 14 12:20:51 Inspiron-5566-Ubuntu systemd[1]: Starting InfiniBand Address Cache Manager Daemon...
Aug 14 12:20:51 Inspiron-5566-Ubuntu systemd[1]: ibacm.service: Main process exited, code=exited, status=255/n/a
Aug 14 12:20:51 Inspiron-5566-Ubuntu systemd[1]: ibacm.service: Failed with result 'exit-code'.
Aug 14 12:20:51 Inspiron-5566-Ubuntu systemd[1]: Failed to start InfiniBand Address Cache Manager Daemon.
dpkg: error processing package ibacm (--configure):
 installed ibacm package post-installation script subprocess returned error exit status 1
Errors were encountered while processing:
 ibacm
E: Sub-process /usr/bin/dpkg returned an error code (1)
++++++

Appears to fail on configuration:
State: partially configured

Few notes:
There is no file /etc/rdma/ibacm_opts.cfg on my system.

Output from journalctl -xe:
Aug 14 12:27:09 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: [12] Replay Timer Timeout
Aug 14 12:27:15 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: AER: Corrected error received: id=00e0
Aug 14 12:27:15 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitte
Aug 14 12:27:15 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: device [8086:9d14] error status/mask=00001000/00002000
Aug 14 12:27:15 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: [12] Replay Timer Timeout
Aug 14 12:27:17 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: AER: Corrected error received: id=00e0
Aug 14 12:27:17 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitte
Aug 14 12:27:17 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: device [8086:9d14] error status/mask=00001000/00002000
Aug 14 12:27:17 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: [12] Replay Timer Timeout
Aug 14 12:27:18 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: AER: Multiple Corrected error received: id=00e0
Aug 14 12:27:18 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitte
Aug 14 12:27:18 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: device [8086:9d14] error status/mask=00003000/00002000
Aug 14 12:27:18 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: [12] Replay Timer Timeout
Aug 14 12:27:32 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: AER: Corrected error received: id=00e0
Aug 14 12:27:32 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitte
Aug 14 12:27:32 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: device [8086:9d14] error status/mask=00001000/00002000
Aug 14 12:27:32 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: [12] Replay Timer Timeout
Aug 14 12:27:32 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: AER: Corrected error received: id=00e0
Aug 14 12:27:32 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitte
Aug 14 12:27:32 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: device [8086:9d14] error status/mask=00001000/00002000
Aug 14 12:27:32 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: [12] Replay Timer Timeout
Aug 14 12:27:36 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: AER: Corrected error received: id=00e0
Aug 14 12:27:36 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitte
Aug 14 12:27:36 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: device [8086:9d14] error status/mask=00001000/00002000
Aug 14 12:27:36 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: [12] Replay Timer Timeout
Aug 14 12:28:03 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: AER: Corrected error received: id=00e0
Aug 14 12:28:03 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitte
Aug 14 12:28:03 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: device [8086:9d14] error status/mask=00001000/00002000
Aug 14 12:28:03 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: [12] Replay Timer Timeout
Aug 14 12:28:15 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: AER: Corrected error received: id=00e0
Aug 14 12:28:15 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitte
Aug 14 12:28:15 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: device [8086:9d14] error status/mask=00001000/00002000
Aug 14 12:28:15 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: [12] Replay Timer Timeout
Aug 14 12:28:16 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: AER: Corrected error received: id=00e0
Aug 14 12:28:16 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitte
Aug 14 12:28:16 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: device [8086:9d14] error status/mask=00001000/00002000
Aug 14 12:28:16 Inspiron-5566-Ubuntu kernel: pcieport 0000:00:1c.0: [12] Replay Timer Timeout

+++++++++++++++++++++++++++++++++++++++++++++++++++++++

How do I start debugging this failure?

Question information

Language:
English Edit question
Status:
Expired
For:
Ubuntu Edit question
Assignee:
No assignee Edit question
Last query:
Last reply:
Revision history for this message
Marinela Selseth (mselseth) said :
#1

Manually created /etc/rdma/ibacm_opts.cfg using:
ib_acme -O

Also loaded ib_umad manually using:
sudo modprobe ib_umad

Remaining failures:
++++
systemctl status ibacm.service
● ibacm.service - InfiniBand Address Cache Manager Daemon
   Loaded: loaded (/lib/systemd/system/ibacm.service; enabled; vendor preset: enabled)
   Active: failed (Result: exit-code) since Wed 2018-08-15 12:40:18 CDT; 17min ago
     Docs: man:ibacm
           file:/etc/rdma/ibacm_opts.cfg
 Main PID: 8179 (code=exited, status=255)

Aug 15 12:40:18 Inspiron-5566-Ubuntu systemd[1]: Starting InfiniBand Address Cache Manager Daemon...
Aug 15 12:40:18 Inspiron-5566-Ubuntu systemd[1]: ibacm.service: Main process exited, code=exited, status=255/n/a
Aug 15 12:40:18 Inspiron-5566-Ubuntu systemd[1]: ibacm.service: Failed with result 'exit-code'.
Aug 15 12:40:18 Inspiron-5566-Ubuntu systemd[1]: Failed to start InfiniBand Address Cache Manager Daemon.

Revision history for this message
Launchpad Janitor (janitor) said :
#2

This question was expired because it remained in the 'Open' state without activity for the last 15 days.