Bug #1505564: Soft lockup with "block nbdX: Attempted send on closed socket" spam (linux package, Ubuntu)

Revision history for this message

Junien F (axino) wrote on 2015-10-13: BootDmesg.txt

#1

BootDmesg.txt Edit (86.3 KiB, text/plain)

apport information

tags:	added: apport-collected trusty uec-images
description:	updated

Revision history for this message

Junien F (axino) wrote on 2015-10-13: CRDA.txt

#2

CRDA.txt Edit (257 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: CurrentDmesg.txt

#3

CurrentDmesg.txt Edit (245.0 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: Lspci.txt

#4

Lspci.txt Edit (84.8 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: Lsusb.txt

#5

Lsusb.txt Edit (490 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: ProcCpuinfo.txt

#6

ProcCpuinfo.txt Edit (35.3 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: ProcInterrupts.txt

#7

ProcInterrupts.txt Edit (26.0 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: ProcModules.txt

#8

ProcModules.txt Edit (5.2 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: UdevDb.txt

#9

UdevDb.txt Edit (203.9 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: UdevLog.txt

#10

UdevLog.txt Edit (439.9 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: WifiSyslog.txt

#11

WifiSyslog.txt Edit (33.3 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13:

#12

Second host now

tags:	added: staging
description:	updated

Revision history for this message

Junien F (axino) wrote on 2015-10-13: BootDmesg.txt

#13

BootDmesg.txt Edit (102.3 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: CRDA.txt

#14

CRDA.txt Edit (277 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: CurrentDmesg.txt

#15

CurrentDmesg.txt Edit (424.0 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: Lspci.txt

#16

Lspci.txt Edit (189.2 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: Lsusb.txt

#17

Lsusb.txt Edit (398 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: ProcCpuinfo.txt

#18

ProcCpuinfo.txt Edit (23.0 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: ProcInterrupts.txt

#19

ProcInterrupts.txt Edit (21.3 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: ProcModules.txt

#20

ProcModules.txt Edit (4.7 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: UdevDb.txt

#21

UdevDb.txt Edit (198.4 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: UdevLog.txt

#22

UdevLog.txt Edit (432.6 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-13: WifiSyslog.txt

#23

WifiSyslog.txt Edit (4.8 KiB, text/plain)

apport information

Revision history for this message

Brad Figg (brad-figg) wrote on 2015-10-13: Status changed to Confirmed

#24

This change was made by a bot.

Changed in linux (Ubuntu):
status:	New → Confirmed

Revision history for this message

Junien F (axino) wrote on 2015-10-13:

#25

I think that this may be a duplicate of #1500739, the symptoms certainly look the same.

Rafael David Tinoco (rafaeldtinoco) on 2015-10-15

Changed in linux (Ubuntu):
assignee:	nobody → Rafael David Tinoco (inaddy)

Revision history for this message

Junien F (axino) wrote on 2015-10-28: BootDmesg.txt

#26

BootDmesg.txt Edit (85.2 KiB, text/plain)

apport information

description:	updated

Revision history for this message

Junien F (axino) wrote on 2015-10-28: CRDA.txt

#27

CRDA.txt Edit (322 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-28: CurrentDmesg.txt

#28

CurrentDmesg.txt Edit (8.5 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-28: Lspci.txt

#29

Lspci.txt Edit (189.3 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-28: Lsusb.txt

#30

Lsusb.txt Edit (398 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-28: ProcCpuinfo.txt

#31

ProcCpuinfo.txt Edit (22.8 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-28: ProcInterrupts.txt

#32

ProcInterrupts.txt Edit (16.4 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-28: ProcModules.txt

#33

ProcModules.txt Edit (4.5 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-28: UdevDb.txt

#34

UdevDb.txt Edit (270.6 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-28: UdevLog.txt

#35

UdevLog.txt Edit (660.1 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-28: WifiSyslog.txt

#36

WifiSyslog.txt Edit (438.2 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-28:

#37

This issue just hit us again; this time I sent an NMI to the server to get a dump. It's available at https://chinstrap.canonical.com/~axino/201510281259.crash.lp1505564.tar.xz

apport information post-reboot is available above.

We've been trying to see if the issue appeared somewhere in the 3.13 series, hence the 3.13.0-29-generic kernel version.

Thanks !

Revision history for this message

Junien F (axino) wrote on 2015-10-29:

#38

I'm just now realizing that the crashdump above may have been taken too late (when the kernel wasn't locked up anymore), because I could ssh to the server when I took it.

I was seeing the "block nbdX: Attempted send on closed socket" kernel log spam on the serial when I sent the NMI, but _perhaps_ these messages were just earlier messages that the serial was still catching up with.

Anyway, I got 2 new dumps, and these 2 were triggered automatically by kernel.softlockup_panic so they might be better.

Revision history for this message

Junien F (axino) wrote on 2015-10-29:

#39

First dump + apport (post reboot) below

description:	updated

Revision history for this message

Junien F (axino) wrote on 2015-10-29: BootDmesg.txt

#40

BootDmesg.txt Edit (90.8 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: CRDA.txt

#41

CRDA.txt Edit (322 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: CurrentDmesg.txt

#42

CurrentDmesg.txt Edit (240.4 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: Lspci.txt

#43

Lspci.txt Edit (85.4 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: Lsusb.txt

#44

Lsusb.txt Edit (490 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: ProcCpuinfo.txt

#45

ProcCpuinfo.txt Edit (34.5 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: ProcInterrupts.txt

#46

ProcInterrupts.txt Edit (19.6 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: ProcModules.txt

#47

ProcModules.txt Edit (4.9 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: UdevDb.txt

#48

UdevDb.txt Edit (299.9 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: UdevLog.txt

#49

UdevLog.txt Edit (745.7 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29:

#50

dump is at https://chinstrap.canonical.com/~axino/201510281618.crash.lp1505564.druk.tar.xz

Revision history for this message

Junien F (axino) wrote on 2015-10-29:

#51

Second apport+dump below

description:	updated

Revision history for this message

Junien F (axino) wrote on 2015-10-29: BootDmesg.txt

#52

BootDmesg.txt Edit (85.2 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: CRDA.txt

#53

CRDA.txt Edit (322 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: CurrentDmesg.txt

#54

CurrentDmesg.txt Edit (221.9 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: Lspci.txt

#55

Lspci.txt Edit (189.3 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: Lsusb.txt

#56

Lsusb.txt Edit (398 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: ProcCpuinfo.txt

#57

ProcCpuinfo.txt Edit (22.8 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: ProcInterrupts.txt

#58

ProcInterrupts.txt Edit (16.4 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: ProcModules.txt

#59

ProcModules.txt Edit (4.4 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: UdevDb.txt

#60

UdevDb.txt Edit (268.5 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29: UdevLog.txt

#61

UdevLog.txt Edit (660.5 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-29:

#62

dump at https://chinstrap.canonical.com/~axino/201510281951.crash.lp1505564.orlo.tar.xz

Revision history for this message

Junien F (axino) wrote on 2015-10-29:

#63

sha1 sums for all 3 dumps below :
6b63d74566b6df0671ba9e79dca724ddc6d8d6df 201510281259.crash.lp1505564.tar.xz <= may have been taken after the lockup occurred
3a8cbdd9e51af4f6eaba4ff0aacc6f956c706961 201510281618.crash.lp1505564.druk.tar.xz
1ebd57dea13cf655e7ef442951da2aedc33d0046 201510281951.crash.lp1505564.orlo.tar.xz
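
For anyone downloading the dumps, the sums listed above can be checked with a short script. This is only a sketch: the helper names `parse_sums` and `sha1_of` are made up for illustration, and it assumes each line starts with the 40-character digest followed by the file name (anything after the file name, such as the "<=" note, is ignored).

```python
import hashlib


def parse_sums(text):
    """Map file name -> expected SHA-1 from 'digest name [note]' lines."""
    expected = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 2:
            continue
        digest = fields[0].lower()
        # Keep only lines whose first field looks like a SHA-1 hex digest.
        if len(digest) == 40 and all(c in "0123456789abcdef" for c in digest):
            expected[fields[1]] = digest
    return expected


def sha1_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so multi-gigabyte dumps are not read into memory."""
    digest = hashlib.sha1()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

A downloaded file matches when `sha1_of(name) == parse_sums(listing)[name]`.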

Revision history for this message

Junien F (axino) wrote on 2015-10-30:

#64

Upgraded all the kernels to lts-vivid (3.19.0-31-generic), and got a new crashdump overnight, from the same server just above (orlo). apport + dump below.

description:	updated

Revision history for this message

Junien F (axino) wrote on 2015-10-30: BootDmesg.txt

#65

BootDmesg.txt Edit (102.6 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: CRDA.txt

#66

CRDA.txt Edit (277 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: CurrentDmesg.txt

#67

CurrentDmesg.txt Edit (95.4 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: Lspci.txt

#68

Lspci.txt Edit (189.2 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: Lsusb.txt

#69

Lsusb.txt Edit (398 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: ProcCpuinfo.txt

#70

ProcCpuinfo.txt Edit (23.0 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: ProcInterrupts.txt

#71

ProcInterrupts.txt Edit (21.3 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: ProcModules.txt

#72

ProcModules.txt Edit (4.7 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: UdevDb.txt

#73

UdevDb.txt Edit (198.0 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: UdevLog.txt

#74

UdevLog.txt Edit (432.6 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: WifiSyslog.txt

#75

WifiSyslog.txt Edit (4.0 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30:

#76

crashdump available at https://chinstrap.canonical.com/~axino/201510292103.crash.lp1505564.orlo.tar.xz - sha1sum is 366c0460cceed5938f2a19fc4b925380a33c18a6

Revision history for this message

Junien F (axino) wrote on 2015-10-30: BootDmesg.txt

#77

BootDmesg.txt Edit (102.7 KiB, text/plain)

apport information

description:	updated

Revision history for this message

Junien F (axino) wrote on 2015-10-30: CRDA.txt

#78

CRDA.txt Edit (277 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: CurrentDmesg.txt

#79

CurrentDmesg.txt Edit (19.0 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: Lspci.txt

#80

Lspci.txt Edit (189.2 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: Lsusb.txt

#81

Lsusb.txt Edit (398 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: ProcCpuinfo.txt

#82

ProcCpuinfo.txt Edit (23.0 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: ProcInterrupts.txt

#83

ProcInterrupts.txt Edit (21.3 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: ProcModules.txt

#84

ProcModules.txt Edit (4.7 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: UdevDb.txt

#85

UdevDb.txt Edit (200.0 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: UdevLog.txt

#86

UdevLog.txt Edit (432.6 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30: WifiSyslog.txt

#87

WifiSyslog.txt Edit (313.5 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-10-30:

#88

Yet another crash, on another node this time (still a 100% Nova compute node). apport information is above, crashdump is at https://chinstrap.canonical.com/~axino/201510301227.crash.lp1505564.phianna.tar.xz - sha1sum 71353f8c70d009369a61de811c90d6199b341543

Thanks !

Revision history for this message

Rafael David Tinoco (rafaeldtinoco) wrote on 2015-10-30:

#89

Junien, I'm on it right now.. will update here asap.

Revision history for this message

Rafael David Tinoco (rafaeldtinoco) wrote on 2015-10-30:

#91

I'm attaching the crash tool output from the 3.13 kernel dump.

Most likely related to the situation already found in the following case:
-> https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1413540

Handled by Chris Arges and me in LKML discussions with Ingo and Linus:
-> http://www.kernelhub.org/?p=2&msg=683682

FOR NOW, it is LIKELY that I'll rely on already known recommendations for Proliant (including the ones related to X2APIC mode): 
-> https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1417580

So we can TRY TO GUARANTEE that there are no LOST IRQs (IPIs) with the firmware you're using. Hopefully with the proper APIC mode set, as HP recommends, we will not have those IPI problems.

OBS: Whenever IPIs are lost (we've seen this on some nested KVMs and some buggy HW) we can be locked up in the SMP callback state machine. This means that the state machine loses IPI ACKs and loops forever trying to shut down the CPU so that the SMP task queue can continue.

I'll SOON provide a comment with SUGGESTIONS and a request for FEEDBACK.

################

For now, from the 3.13 kernel dump, the most interesting part:

We had 7 CPUs executing the migration kernel thread (for the SMP callback state machine execution):

#### migration tasks (state machine loop)

>    93      2   4  ffff8808147b47d0  RU   0.0       0      0  [migration/4]
>   118      2   9  ffff881814a2c7d0  RU   0.0       0      0  [migration/9]
>   123      2  10  ffff88081404c7d0  RU   0.0       0      0  [migration/10]
>   128      2  11  ffff881814a4c7d0  RU   0.0       0      0  [migration/11]
>   138      2  13  ffff881814a647d0  RU   0.0       0      0  [migration/13]
>   165      2  18  ffff8810149ec7d0  RU   0.0       0      0  [migration/18]
>   195      2  24  ffff881014a647d0  RU   0.0       0      0  [migration/24]
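
A listing like the one above can be reduced to the set of affected CPUs with a few lines of Python. This is a rough sketch, assuming the usual column order of the crash tool's ps output (PID, PPID, CPU, TASK, ST, %MEM, VSZ, RSS, COMM); the function name `running_migration_cpus` is hypothetical.

```python
def running_migration_cpus(ps_output):
    """Return the CPU numbers whose migration kthread is in the RU state."""
    cpus = []
    for line in ps_output.splitlines():
        # Drop the leading '>' marker and padding, then split on whitespace.
        fields = line.lstrip("> ").split()
        if (
            len(fields) == 9
            and fields[4] == "RU"
            and fields[8].startswith("[migration/")
        ):
            cpus.append(int(fields[2]))
    return cpus
```

Fed the seven lines above, it should return the CPU column of each line; lines in any other shape are skipped.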

This logic tries to migrate tasks from one CPU to another. For that to happen, it relies on the state machine logic of shutting CPUs down before migrating the tasks (turning off IRQs, etc). The state machine, which shuts down the CPUs in phases, relies on the SMP callbacks below.

We had 3 CPUs in a part of the kernel that we have already identified to be problematic under certain conditions and/or HW.

** > 17247      1  23  ffff881007055fc0  RU   1.6 7358428 2192548  qemu-system-x86

PID: 17247  TASK: ffff881007055fc0  CPU: 23  COMMAND: "qemu-system-x86"
 #0 [ffff88203eac6e58] crash_nmi_callback at ffffffff8103fb72
 #1 [ffff88203eac6e68] nmi_handle at ffffffff8171f188
 #2 [ffff88203eac6ec8] do_nmi at ffffffff8171f350
 #3 [ffff88203eac6ef0] end_repeat_nmi at ffffffff8171e5f1
    [exception RIP: generic_exec_single+130]
    RIP: ffffffff810db712  RSP: ffff8810ea7c96e0  RFLAGS: 00000202
    RAX: 0000000000000010  RBX: 0000000000000010  RCX: 0000000000000202
    RDX: ffff8810ea7c96e0  RSI: 0000000000000018  RDI: 0000000000000001
    RBP: ffffffff810db712   R8: ffffffff810db712   R9: 0000000000000018
    R10: ffff8810ea7c96e0  R11: 0000000000000202  R12: ffffffffffffffff
    R13: 0000000000000206  R14: 000000007bc87bc6  R15: ffff8814959f76c0
    ORIG_RAX: ffff8814959f76c0  CS: 0010  SS: 0018
--- <NMI exception stack> ---
 #4 [ffff8810ea7c96e0] generic_exec_single at ffffffff810db712

!!!! CSD_FLAG logic discussed with Linus

108             while (csd->flags & CSD_FLAG_LOCK)
   0xffffffff810db712 <+130>:   testb  $0x1,0x20(%rbx)
   0xffffffff810db716 <+134>:   jne    0xffffffff810db710 <generic_exec_single+128>

109                     cpu_relax();
110     }

** > 21036      1  27  ffff8810b69947d0  RU   1.0 7484828 1401940  qemu-system-x86

PID: 21036  TASK: ffff8810b69947d0  CPU: 27  COMMAND: "qemu-system-x86"
 #0 [ffff88203eb46e58] crash_nmi_callback at ffffffff8103fb72
 #1 [ffff88203eb46e68] nmi_handle at ffffffff8171f188
 #2 [ffff88203eb46ec8] do_nmi at ffffffff8171f350
 #3 [ffff88203eb46ef0] end_repeat_nmi at ffffffff8171e5f1
    [exception RIP: generic_exec_single+130]
    RIP: ffffffff810db712  RSP: ffff8814959f7670  RFLAGS: 00000202
    RAX: 0000000000000010  RBX: 0000000000000010  RCX: 0000000000000202
    RDX: ffff8814959f7670  RSI: 0000000000000018  RDI: 0000000000000001
    RBP: ffffffff810db712   R8: ffffffff810db712   R9: 0000000000000018
    R10: ffff8814959f7670  R11: 0000000000000202  R12: ffffffffffffffff
    R13: 0000000000000282  R14: 0000000000000000  R15: 0000000000000100
    ORIG_RAX: 0000000000000100  CS: 0010  SS: 0018
--- <NMI exception stack> ---
 #4 [ffff8814959f7670] generic_exec_single at ffffffff810db712

!!!! CSD_FLAG logic discussed with Linus

108             while (csd->flags & CSD_FLAG_LOCK)
   0xffffffff810db712 <+130>:   testb  $0x1,0x20(%rbx)
   0xffffffff810db716 <+134>:   jne    0xffffffff810db710 <generic_exec_single+128>

109                     cpu_relax();
110     }

** > 18516      1  31  ffff881dd54a2fe0  RU   1.6 7358428 2192548  qemu-system-x86

PID: 18516  TASK: ffff881dd54a2fe0  CPU: 31  COMMAND: "qemu-system-x86"
 #0 [ffff88203ebc6e58] crash_nmi_callback at ffffffff8103fb72
 #1 [ffff88203ebc6e68] nmi_handle at ffffffff8171f188
 #2 [ffff88203ebc6ec8] do_nmi at ffffffff8171f350
 #3 [ffff88203ebc6ef0] end_repeat_nmi at ffffffff8171e5f1
    [exception RIP: generic_exec_single+130]
    RIP: ffffffff810db712  RSP: ffff881dd55597a0  RFLAGS: 00000202
    RAX: 0000000000000010  RBX: 0000000000000010  RCX: 0000000000000202
    RDX: ffff881dd55597a0  RSI: 0000000000000018  RDI: 0000000000000001
    RBP: ffffffff810db712   R8: ffffffff810db712   R9: 0000000000000018
    R10: ffff881dd55597a0  R11: 0000000000000202  R12: ffffffffffffffff
    R13: 0000000000000206  R14: 000000007bca7bc8  R15: ffff8814959f76c0
    ORIG_RAX: ffff8814959f76c0  CS: 0010  SS: 0018
--- <NMI exception stack> ---
 #4 [ffff881dd55597a0] generic_exec_single at ffffffff810db712

!!!! CSD_FLAG logic discussed with Linus

108             while (csd->flags & CSD_FLAG_LOCK)
   0xffffffff810db712 <+130>:   testb  $0x1,0x20(%rbx)
   0xffffffff810db716 <+134>:   jne    0xffffffff810db710 <generic_exec_single+128>

109                     cpu_relax();
110     }
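
To illustrate why a lost IPI leaves these three CPUs spinning, here is a toy user-space model of the wait loop shown above. It is not kernel code: `CallSingleData`, `send_and_wait`, `ipi_arrives_after` and `max_spins` are invented for this sketch, and the spin bound exists only so the model terminates (the real loop at line 108 has no bound). The flag value 0x01 is taken from the `testb $0x1` in the disassembly.

```python
CSD_FLAG_LOCK = 0x01  # bit tested by `testb $0x1,0x20(%rbx)` above


class CallSingleData:
    """Toy stand-in for the per-call data; only the flags word is modelled."""

    def __init__(self):
        self.flags = 0


def send_and_wait(csd, ipi_arrives_after, max_spins=10_000):
    """Lock csd, 'send' an IPI, then spin until the target clears the lock.

    ipi_arrives_after is the spin count at which the simulated target CPU
    handles the IPI and unlocks csd, or None to simulate a lost IPI.
    Returns the number of spins, or None if max_spins was reached.
    """
    csd.flags |= CSD_FLAG_LOCK
    spins = 0
    while csd.flags & CSD_FLAG_LOCK:
        if ipi_arrives_after is not None and spins >= ipi_arrives_after:
            # Target CPU ran the callback and released the lock.
            csd.flags &= ~CSD_FLAG_LOCK
            continue
        spins += 1
        if spins >= max_spins:
            return None  # the model's equivalent of a soft lockup
    return spins
```

With a delivered IPI the wait ends after a few spins; with `ipi_arrives_after=None` nothing ever clears the flag, which is the situation the three backtraces above appear to show.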

Revision history for this message

Rafael David Tinoco (rafaeldtinoco) wrote on 2015-10-30:

#92

lp1505564-3.13-kdump-crash-output.txt Edit (23.6 KiB, text/plain)

Rafael David Tinoco (rafaeldtinoco) on 2015-10-30

Changed in linux (Ubuntu):
status:	Confirmed → In Progress

Revision history for this message

Rafael David Tinoco (rafaeldtinoco) wrote on 2015-11-02: Re: [Bug 1505564] Re: Soft lockup with "block nbdX: Attempted send on closed socket" spam

#93

Hello Junien,

(recommendations with *)

I'm replying to you and to the LP bug so it gets proper documentation.

Under comment #91:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1505564/comments/91

You can see my kernel dump analysis, where I show that the OS is stuck in a "migration thread", possibly because of a lack of IPI synchronisation (maybe even an IPI being lost). We have already seen cases like this, especially in nested virtualisation environments, and this has been discussed on LKML.

Before we move further I need you to follow some "best practices" for Proliant servers:

1 - NMIs caused during MWAIT instruction (caused by intel_idle module)
& HP Proliant Servers - Kernel Panic - NMI - DL360 & DL380 - HPWDT module loaded
(https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1417580)
(https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1432837)
* Firmware: Configure a maximum of a C3 c-state for CPU savings (CPU C-STATES)
* Firmware: Disable packed CPU c-state
* Firmware: Disable Cooperative Power Management
* Make sure NOT TO LOAD HPWDT kernel module (LP: #1432837 Fix Released 3.13.0-49.81)

2 - Recently discovered NMIs caused by a BUG in Intel microcode
(https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1416414)
* If you have Intel-based Proliant servers, use at least 3.13.0-35.61 because of the Intel microcode issue.

3 - X2APIC support for HP Proliant Servers
(https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1398497)
* For Proliant prior to G8 (<= G7) add "nox2apic intremap=off" to the grub cmdline
* For Proliant G8 add "intremap=no_x2apic_optout" to the grub cmdline

4 - HP Proliant Latest Firmware
MOST IMPORTANT
* Upgrade server firmware to the latest version. There were numerous firmware fixes from HP.

---> If we are facing a firmware problem related to IPIs (inter-processor interrupts) being missed, we have to make sure it is reproducible on the latest firmware in order to work together with the HP ROM engineering team.

Summary:

Could you follow all these steps and provide feedback ? I understand this might take a while if you have a large number of servers. If so, I would take a statistical approach here: change only half of the servers and keep the other half as the "control group" for future comparisons.

Is this feasible ? Looking forward to hearing your feedback.

Best Regards
Rafael Tinoco
Sustaining Engineering

Revision history for this message

Chris Stratford (chris-gondolin) wrote on 2015-11-12:

#94

Hi Rafael,

I've been continuing Junien's investigations into this problem. The machines have had all the BIOS and firmware updates I could find on HP's website (although in the case of a DL385-G7 the latest appears to be February 2014!) One of them only lasted a day before crashing again.

So, step 2 was to add "nox2apic intremap=off" to the DL385-G7s. I added it to only one of them initially. That machine lasted 9 days before we had another kernel panic ("NMI watchdog: BUG: soft lockup - CPU#27 stuck for 23s! [migration/27:200]"), but after the panic it seems to have settled back down again (without any reboot).

I've also added "intremap=no_x2apic_optout" to one of the DL360-G8s after it crashed a couple of days ago. So far, it's doing ok.

I'm tempted to try upgrading them to linux-image-generic-lts-wily (currently 4.2.0.18.13) unless there's any information from the current setup that could be useful.

Revision history for this message

Rafael David Tinoco (rafaeldtinoco) wrote on 2015-11-12:

#95

Hello Chris,

Could you clarify the following statement:

"""
So, step 2 was to add "nox2apic intremap=off" to the DL385-G7s. I added it to only one of them initially. That machine lasted 9 days before we had another kernel panic ("NMI watchdog: BUG: soft lockup - CPU#27 stuck for 23s! [migration/27:200]"), but after the panic it seems to have settled back down again (without any reboot).
"""

So, I'm not sure if you are "panic'ing on hung tasks" (sysctl option). The way I read this is that the machine showed a soft lockup BUT the kernel did not crash and recovered after some time. This might indicate that, after workload was reduced, the kernel could get back on track with migration kthread. Could you clarify this ?

You did right.

< G8 cmdline == "nox2apic intremap=off"
>= G8 cmdline == "intremap=no_x2apic_optout"
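
The mapping above is easy to check on each host against /proc/cmdline. The following is a small sketch, not an official tool: `recommended_params` and `missing_params` are names made up here, and the generation number has to be supplied by hand.

```python
def recommended_params(generation):
    """Kernel parameters suggested in this thread for a Proliant generation."""
    if generation < 8:
        return ["nox2apic", "intremap=off"]
    return ["intremap=no_x2apic_optout"]


def missing_params(cmdline, generation):
    """Return the suggested parameters that are absent from a kernel cmdline."""
    present = set(cmdline.split())
    return [p for p in recommended_params(generation) if p not in present]
```

For example, on a G7 one could call `missing_params(open("/proc/cmdline").read(), 7)`; an empty list means both options are present.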

So, if the kernel (G7) had a soft lockup warning but had no "hard lockups" (race conditions), then we are good. Judging by the G8, it looks like it is still running after the change. Could you clarify if you changed the c-states (min and packing) firmware options ?

I would recommend staying on 3.13 if the machines prove stable after the firmware version/options and cmdline were changed. This way we have a way to "compare" things. As long as they don't have HARD lockups, I think we will be good.

Let me know if you need any other clarification.

Cheers!

Rafael Tinoco

Revision history for this message

Junien F (axino) wrote on 2015-11-17:

#96

Hi Rafael,

For starters, the server Chris mentioned above didn't panic because the kernel.softlockup_panic wasn't set to 1 on reboot. This is now fixed.
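
A quick way to catch this after a reboot is to read the current value back from /proc. A minimal sketch follows; the function name `softlockup_panic_enabled` is made up for illustration, and the path parameter exists only so the check can be pointed at a test file.

```python
def softlockup_panic_enabled(path="/proc/sys/kernel/softlockup_panic"):
    """Return True if the running kernel will panic on a soft lockup."""
    with open(path) as handle:
        return handle.read().strip() == "1"
```

To make the setting survive reboots, the line `kernel.softlockup_panic = 1` normally goes into a file under /etc/sysctl.d/ (the file name is up to the admin).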

Then, we're still running 3.19 (all the nodes got rebooted to 3.19.0-33-generic). Let me know if you wish us to get back to 3.13.

I verified that the firmware on every node was at the most recent version.

I rebooted all the nodes with the proper x2apic kernel options. I also disabled all C-States, and also set everything relevant to "performance". You can see the changes here : http://paste.ubuntu.com/13312776/ (this paste is showing all possible settings in G7 and Gen8, I of course could only apply the settings that existed on each infrastructure).

Unfortunately, even with all this, we had a G7 that panic'ed and crashdump'ed about 1h after I set it back in the compute pool. You will find the apport and crashdump below.

Let me know what the next steps are.

Thanks !

Revision history for this message

Junien F (axino) wrote on 2015-11-17: BootDmesg.txt

#97

BootDmesg.txt Edit (87.3 KiB, text/plain)

apport information

description:	updated

Revision history for this message

Junien F (axino) wrote on 2015-11-17: CRDA.txt

#98

CRDA.txt Edit (277 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-17: CurrentDmesg.txt

#99

CurrentDmesg.txt Edit (23.4 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-17: Lspci.txt

#100

Lspci.txt Edit (84.8 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-17: Lsusb.txt

#101

Lsusb.txt Edit (490 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-17: ProcCpuinfo.txt

#102

ProcCpuinfo.txt (35.3 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-17: ProcInterrupts.txt

#103

ProcInterrupts.txt (26.0 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-17: ProcModules.txt

#104

ProcModules.txt (5.2 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-17: UdevDb.txt

#105

UdevDb.txt (208.4 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-17: UdevLog.txt

#106

UdevLog.txt (443.7 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-17: WifiSyslog.txt

#107

WifiSyslog.txt (540.2 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-17:

#108

apport above, crash dump is at https://chinstrap.canonical.com/~axino/201511171222.crash.lp1505564.druk.tar.xz - sha1sum 93ae006186b6bc7298afd37d3f759effe08d7ba3

Joseph Salisbury (jsalisbury) on 2015-11-17

tags:	added: kernel-key
Changed in linux (Ubuntu):
importance:	Undecided → High

Revision history for this message

Rafael David Tinoco (rafaeldtinoco) wrote on 2015-11-17:

#109

Thanks Junien, I'm downloading the crash dump (10GB) and will update you as soon as I open it.

Revision history for this message

Rafael David Tinoco (rafaeldtinoco) wrote on 2015-11-19:

#110

Hello Junien,

After your last crash - similar to previous ones - one thing caught my attention: for the first time we had an RCU stall on one CPU detected by another CPU. This made me think that the problem wasn't only related to the SMP logic, as I had believed, and that the stall was also occurring somewhere else.

----
[ 5792.466770] INFO: rcu_sched detected stalls on CPUs/tasks: { 7} (detected by 15, t=15003 jiffies, g=182379, c=182378, q=0)
----

And this stall happened before Async I/O callbacks started to be suppressed:

----
[ 5793.190218] block nbd6: Attempted send on closed socket
[ 5793.190221] blk_update_request: 1154 callbacks suppressed
[ 5793.190223] blk_update_request: I/O error, dev nbd6, sector 125828992
[ 5793.190226] buffer_io_error: 1151 callbacks suppressed
[ 5793.190227] Buffer I/O error on dev nbd6, logical block 125828992, async page read
[ 5793.190235] block nbd6: Attempted send on closed socket
[ 5793.190237] blk_update_request: I/O error, dev nbd6, sector 125828993
[ 5793.190238] Buffer I/O error on dev nbd6, logical block 125828993, async page read
[ 5793.190242] block nbd6: Attempted send on closed socket
[ 5793.190243] blk_update_request: I/O error, dev nbd6, sector 125828994
[ 5793.190245] Buffer I/O error on dev nbd6, logical block 125828994, async page read
[ 5793.190248] block nbd6: Attempted send on closed socket
----

Digging upstream (from 3.13 to HEAD) I could see there was not a huge number of fixes:

----
$ git log --pretty=oneline v3.13..HEAD -- drivers/block/nbd.c | wc -l
31
----

Among the nbd.c changes, I identified an improvement to nbd timeout handling:

----
commit 7e2893a16d3e71035a38122a77bc55848a29f0e4
Author: Markus Pargmann <email address hidden>
Date: Mon Aug 17 08:20:00 2015 +0200

nbd: Fix timeout detection
----

This fix is pretty recent (4.3) and it fits the case: a 3.18 kernel facing the same issue.

Later I found out that Debian had a similar bug:

----
https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=770479
https://lists.debian.org/debian-kernel/2015/05/msg00054.html
----

for kernel 3.16, complaining about messages like this:

----
[ 5793.190242] block nbd6: Attempted send on closed socket
----

and about the lack of a proper timeout for nbd connections (now based on a timeout after I/O submission).

SO...

The backport should be easy* and I'll probably make a PPA containing a 3.18 kernel (+ this patch) available for you tomorrow.

* 2 out of 12 hunks FAILED -- saving rejects to file drivers/block/nbd.c.rej
* Debian has a 3.16 version already

Thank you

Rafael Tinoco

Revision history for this message

Junien F (axino) wrote on 2015-11-19:

#111

Thanks for your update Rafael. Since nova-compute doesn't do anything useful with qemu-nbd anyway, I'm going to try to "soft-disable" it (divert + symlink to /bin/true), and we'll see if we can repro the crashes. I'll keep you posted.

I'll also try your patched kernel as soon as it's ready, of course :)

Revision history for this message

Junien F (axino) wrote on 2015-11-20:

#112

Hi Rafael,

With qemu-nbd symlinked to /bin/true, no crash so far...

Revision history for this message

Rafael David Tinoco (rafaeldtinoco) wrote on 2015-11-20:

#113

Junien,

I hit minor issues with the backport yesterday, and today is a holiday in Brazil. I'll get back to this soon. Nevertheless, it is good feedback that this "qemu-nbd" workaround is probably making the system more stable.

I'll get back to you soon.

Thank you

Rafael

Rafael David Tinoco (rafaeldtinoco) on 2015-11-22

description:	updated

Revision history for this message

Rafael David Tinoco (rafaeldtinoco) wrote on 2015-11-22:

#114

vivid-nbd-restructure-debugging-prints.patch (7.9 KiB, text/plain)

Revision history for this message

Rafael David Tinoco (rafaeldtinoco) wrote on 2015-11-22:

#115

vivid-nbd-fix-timeout-detection.patch (7.6 KiB, text/plain)

Ubuntu Foundations Team Bug Bot (crichton) on 2015-11-23

tags:	added: patch

Revision history for this message

Rafael David Tinoco (rafaeldtinoco) wrote on 2015-11-23:

#116

Testing patches I have attached above:

inaddy@sf00090138trusty(~)$ sudo qemu-img create -f qcow2 -o preallocation=metadata ./test.qcow2 1G
Formatting './test.qcow2', fmt=qcow2 size=1073741824 encryption=off cluster_size=65536 preallocation='metadata' lazy_refcounts=off

inaddy@sf00090138trusty(~)$ sudo qemu-nbd --connect=/dev/nbd0 ./test.qcow2

[ 34.348125] nbd: registered device at major 43
[ 317.034493] nbd0: unknown partition table

inaddy@sf00090138trusty(~)$ sudo fdisk /dev/nbd0

Device Boot Start End Blocks Id System
/dev/nbd0p1 2048 2097151 1047552 83 Linux

inaddy@sf00090138trusty(~)$ sudo mkfs.ext3 /dev/nbd0p1

inaddy@sf00090138trusty(~)$ sudo mount /dev/nbd0p1 /mnt

inaddy@sf00090138trusty(/mnt)$ dd if=/dev/zero of=./teste bs=1M count=256 oflag=direct
256+0 records in
256+0 records out
268435456 bytes (268 MB) copied, 1.15586 s, 232 MB/s

Hopefully they won't cause any regression. The PPA will be provided soon.

Revision history for this message

Rafael David Tinoco (rafaeldtinoco) wrote on 2015-11-23:

#117

Hello Junien,

Based on my previous feedback, I've created the following PPA:

https://launchpad.net/~inaddy/+archive/ubuntu/lp1505564

With a Trusty HWE kernel (vivid) + 2 patches:

nbd: Restructure debugging prints
nbd: Fix timeout detection

For you to use and provide me feedback.

I've done minor tests and it looks like there are no regressions.
Hopefully these patches will address the problem.

If they do, I'll work on fixing Trusty, Vivid, Wily and Xenial.

Cheers

Rafael Tinoco

PS: I'm still finishing kernel compilation and will copy packages
to the PPA as soon as it is ready (it might take a few min/hours).

Revision history for this message

Rafael David Tinoco (rafaeldtinoco) wrote on 2015-11-23:

#118

Okay,

PPA is ready:

https://launchpad.net/~inaddy/+archive/ubuntu/lp1505564/+packages

Please upgrade kernel to:

linux-lts-vivid - 3.19.0-36.41~14.04.1hf00090138v20151122b1

By doing:

$ sudo add-apt-repository ppa:inaddy/lp1505564
$ sudo apt-get update
$ sudo apt-get install linux-image-3.19.0-36-generic linux-image-extra-3.19.0-36-generic linux-headers-3.19.0-36-generic

And make sure packages are being installed from PPA. Then reboot server using the hotfixed kernel.

I'm looking forward to hearing whether this kernel mitigates the issues.

Cheers

Rafael Tinoco

Revision history for this message

Junien F (axino) wrote on 2015-11-23:

#119

Hi Rafael,

I applied the patch earlier today.
No crash so far, which was nearly impossible before !

This looks very promising, I'll keep you posted tomorrow.

Revision history for this message

Rafael David Tinoco (rafaeldtinoco) wrote on 2015-11-24:

#120

Junien,

That is good feedback. I also received another request to backport this to 3.13, so I'll be providing the hotfixed kernel in the same PPA soon (tomorrow morning most likely).

Attaching the 3.13 patches (just for reference since the SRU process requires me to send all those patches to kernel-team mailing list).

Let's see if things stay good. If, by any chance, you are able to test this 3.13 kernel - maybe on another server - please send me feedback as well.

Thank you very much

Cheers

Rafael Tinoco

Revision history for this message

Rafael David Tinoco (rafaeldtinoco) wrote on 2015-11-24:

#123

Note to self:

The commit being backported to 3.19 and 3.13 has to contain this race fix:

commit dcc909d90ccdbb73226397ff6d298f7af35b0e11
Author: Markus Pargmann <email address hidden>
Date: Tue Oct 6 20:03:54 2015 +0200

nbd: Add locking for tasks

    The timeout handling introduced in
        7e2893a16d3e (nbd: Fix timeout detection)
    introduces a race condition which may lead to killing of tasks that are
    not in nbd context anymore. This was not observed or reproducable yet.

    This patch adds locking to critical use of task_recv and task_send to
    avoid killing tasks that already left the NBD thread functions. This
    lock is only acquired if a timeout occures or the nbd device
    starts/stops.

    Reported-by: Ben Hutchings <email address hidden>
    Signed-off-by: Markus Pargmann <email address hidden>
    Reviewed-by: Ben Hutchings <email address hidden>
    Fixes: 7e2893a16d3e ("nbd: Fix timeout detection")
    Signed-off-by: Jens Axboe <email address hidden>

Also.

Revision history for this message

Junien F (axino) wrote on 2015-11-24: BootDmesg.txt

#124

BootDmesg.txt (84.6 KiB, text/plain)

apport information

description:	updated

Revision history for this message

Junien F (axino) wrote on 2015-11-24: CRDA.txt

#125

CRDA.txt (277 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-24: CurrentDmesg.txt

#126

CurrentDmesg.txt (13.2 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-24: Lspci.txt

#127

Lspci.txt (83.6 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-24: Lsusb.txt

#128

Lsusb.txt (490 bytes, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-24: ProcCpuinfo.txt

#129

ProcCpuinfo.txt (22.8 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-24: ProcInterrupts.txt

#130

ProcInterrupts.txt (20.1 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-24: ProcModules.txt

#131

ProcModules.txt (4.6 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-24: UdevDb.txt

#132

UdevDb.txt (185.9 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-24: UdevLog.txt

#133

UdevLog.txt (403.3 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-24: WifiSyslog.txt

#134

WifiSyslog.txt (242.6 KiB, text/plain)

apport information

Revision history for this message

Junien F (axino) wrote on 2015-11-24:

#135

Unfortunately, one server managed to crashdump, even with your patched kernel. apport is above, crashdump is at https://private-fileshare.canonical.com/~axino/201511241217.crash.lp1505564.matar.tar.xz - sha1sum 056fae2554e52989a24094945b297c0c5906be7c

I've diverted qemu-nbd again.

Please let me know the next steps.

Thanks !

Revision history for this message

Rafael David Tinoco (rafaeldtinoco) wrote on 2015-12-07:

#136

Junien,

Sorry for the delay. After some time dealing with other priorities, I'm coming back to this. I'm downloading the dump and will take a look. Let's see what this bug is related to.

Thanks for providing it. Will report something back soon.

Revision history for this message

Dan Streetman (ddstreet) wrote on 2015-12-11:

#137

I've dl'ed the dump and I'm reviewing it.

Revision history for this message

Dan Streetman (ddstreet) wrote on 2015-12-15:

#138

Ok, here's my analysis of the latest dump.

There are 3 kernel migrate threads waiting; this is the cause of the softlockup - specifically pid 101 on cpu 13 is where the softlockup (and then panic, due to panic on softlockup enabled) happens, and the other 2 migrate threads (pid 79 and 151) are also waiting. All are waiting for multi_cpu_stop to finish. The way multi_cpu_stop works is: the caller sets up one or more cpus to coordinate stopping; in multi_cpu_stop, the state machine moves from MULTI_STOP_PREPARE through disable irqs, to run (the provided function), to exit when done. However, only the specified cpus (in the cpumask) will run the function. The state machine doesn't proceed to the next step until all cpus have processed the current state.

This is where the problem comes in. In this case, it's a migration of tasks from one numa node to another, via numa rebalancing. Here, there are 3 rebalancing events happening: cpu 3 and cpu 10, cpu 3 and cpu 13, cpu 3 and cpu 20. The migrate threads on cpus 10, 13, and 20 are running multi_cpu_stop, but they are stuck waiting because cpu 3 still has it in its queue.
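To illustrate why one busy cpu holds up all the others, here is a minimal single-threaded model of that state machine in C. It is only a sketch: the names `stop_machine`, `stop_machine_init` and `stop_machine_ack` are made up for this example and are not the kernel's code, and the caller is assumed to call `stop_machine_ack` exactly once per participating cpu per state.

```c
#include <assert.h>

/* Hypothetical, simplified model of the states described above. */
enum stop_state { STOP_PREPARE, STOP_DISABLE_IRQ, STOP_RUN, STOP_EXIT };

struct stop_machine {
    enum stop_state state; /* state every participant must process */
    int num_cpus;          /* number of participating cpus */
    int pending_ack;       /* cpus that have not processed `state` yet */
};

static void stop_machine_init(struct stop_machine *sm, int num_cpus)
{
    assert(num_cpus > 0);
    sm->state = STOP_PREPARE;
    sm->num_cpus = num_cpus;
    sm->pending_ack = num_cpus;
}

/*
 * One participating cpu reports that it has processed the current state.
 * Only the last cpu to report moves the machine to the next state, so a
 * single cpu that never gets to run keeps everyone else spinning.
 */
static void stop_machine_ack(struct stop_machine *sm)
{
    assert(sm->pending_ack > 0);
    if (--sm->pending_ack == 0 && sm->state != STOP_EXIT) {
        sm->state = (enum stop_state)(sm->state + 1);
        sm->pending_ack = sm->num_cpus;
    }
}
```

With two participants (say cpu 13 and cpu 3), one call to `stop_machine_ack` leaves the state at `STOP_PREPARE`; only the second call, from the cpu that was busy, moves it on to `STOP_DISABLE_IRQ`.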

cpu 3 is writing bytes to the serial port, and currently waiting for confirmation that the serial port write completed. This wait is done via checking the serial port register for CTS, then if it's not set delaying for 1us, and trying again. However, this is all inside a held spinlock, with irqs disabled. So while this serial port r/w is being done, nothing else will run on this cpu. But - the code limits this to 1 second, so presumably it shouldn't lock up the cpu for longer than 1 second or so (I haven't dug too far into this, so the function may be called multiple times with the lock held).
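As a hedged illustration of the parenthetical above, the C sketch below models a bounded poll loop and shows how a per-call limit does not bound the total time if the wait is repeated for every character. All names (`bounded_wait`, `write_chars`, `never_ready`) are hypothetical, each failed poll stands in for a 1us delay, and the per-character repetition is an assumption, not something confirmed from the driver source.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical callback: returns true once the port reports ready. */
typedef bool (*ready_fn)(void *ctx);

/*
 * Poll until the port is ready or max_polls attempts have failed.
 * Returns the number of failed polls, i.e. roughly the microseconds
 * spent waiting if every failed poll is followed by a 1us delay.
 */
static unsigned long bounded_wait(ready_fn ready, void *ctx,
                                  unsigned long max_polls)
{
    unsigned long polls = 0;

    assert(ready != NULL);
    while (polls < max_polls && !ready(ctx))
        polls++; /* stands in for the 1us delay */
    return polls;
}

/* Total failed polls when the bounded wait is repeated per character. */
static unsigned long write_chars(ready_fn ready, void *ctx,
                                 unsigned long nchars,
                                 unsigned long max_polls)
{
    unsigned long total = 0;

    for (unsigned long i = 0; i < nchars; i++)
        total += bounded_wait(ready, ctx, max_polls);
    return total;
}

/* A port that never becomes ready (worst case). */
static bool never_ready(void *ctx)
{
    (void)ctx;
    return false;
}
```

With a port that never reports ready and a limit of 1,000,000 polls, `write_chars` for an 80-character line returns 80,000,000 polls, i.e. roughly 80 seconds at 1us per poll, even though each individual wait is capped at about 1 second.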

For whatever reason, that serial port r/w seems to be taking a long time. The migrate threads on the other cpus are waiting for it to finish, so that the migrate thread on cpu 3 can run, and move the multi_cpu_stop state machine along. But that doesn't happen in time to avoid the softlockup detector.

The multi_cpu_stop function could arguably use the addition of touch_nmi_watchdog(), since it intentionally spins on the cpu with interrupts disabled - doing so would avoid the softlockup detector (but would not change the system behavior). However, it's not really its fault, since the real cause is the other cpu(s) it's waiting for being locked.

Back on cpu 3 (the one the others are waiting on), that delay is implemented using the TSC. Unfortunately, the TSC is a generally unreliable clock source, so it's possible there is a problem in the delay function.

To determine that, can you please boot with the "notsc" parameter, which will change the udelay function to use a simple loop instead of the TSC, and reproduce the softlockup?

Changed in linux (Ubuntu):
assignee:	Rafael David Tinoco (inaddy) → Dan Streetman (ddstreet)

Revision history for this message

Junien F (axino) wrote on 2015-12-23:

#139

Hi Dan,

Thanks for your investigation. Sorry for the delay, but finally I managed to reboot the compute nodes with the "notsc" kernel parameter. I also disabled the qemu-nbd workaround.

Once that was done, it didn't take long for a node to crash, which would indicate that notsc didn't fix the problem. However, the host got stuck and didn't dump anything. OK then. It happened a second time a few minutes after on a different host, so I thought I'd investigate this more.

It turns out, the kernel booted through kexec fails to boot, probably because of the notsc option : https://pastebin.canonical.com/146714/

I'm a bit worried about the following line :
[ 0.000000] tsc: Kernel compiled with CONFIG_X86_TSC, cannot disable TSC completely

which is also displayed during "regular" boots (i.e. not through kexec).

I guess I can remove "notsc" from the kexec command line, but this will take additional time. I thought I'd let you know the current status in the meantime.

Cheers

Revision history for this message

Dan Streetman (ddstreet) wrote on 2015-12-24:

#140

> It turns out, the kernel booted through kexec fails booting probably because of the notsc option :
> https://pastebin.canonical.com/146714/

hmm, that's weird, but if notsc is all that changed i assume it is the problem.

> I'm a bit worried about the following line :
> [ 0.000000] tsc: Kernel compiled with CONFIG_X86_TSC, cannot disable TSC completely

that's normal with notsc, the tsc is still there, it's just not used for the udelay function. but if it doesn't help the problem, no need to keep it.

> I guess I can remove "notsc" from the kexec command line, but this will take additional time.
> I thought I'd let you know the current status in the meantime.

ok thanks. I'll be out next week for the holidays, but will continue looking at this Jan 1.

Revision history for this message

Junien F (axino) wrote on 2015-12-31:

#141

Re-reading comment #138, Dan, I realize that we may be investigating a symptom and not the root cause.

Whenever the soft-lockup happens, the serial console does get flooded with "block nbdX: Attempted send on closed socket". If the serial console getting flooded causes soft lockups, then it is indeed a concerning issue, but shouldn't we focus, in this bug, on making nbd not flood the console in the first place ?

Revision history for this message

Dan Streetman (ddstreet) wrote on 2016-01-04:

#142

Well, yes I agree, it does look like the serial port causing the softlockup is probably a separate problem from the nbd closed socket errors, although triggered by them. However, the serial port output definitely shouldn't be causing a softlockup - no matter how much data it has to send, the serial port driver in the kernel should be scheduling itself during operation, so that it doesn't hog a single cpu for a long time. It's more likely that the general system "freezing" you are seeing is due to the serial port driver refusing to schedule off its cpu, and not any problem with the nbdX failure.

I'll look into the nbd code also though, to see where that error is coming from and what that problem may be.

Revision history for this message

Nick Moffitt (nick-moffitt) wrote on 2016-01-06:

#143

This problem has caused more serious damage recently. When nbd dies and printk()s like mad, the serial console is not fast enough to display it.

The kernel keeps allocating buffer space for serial output, which we see as 13G kmalloc-256 or kmalloc-512 kernel threads.

Eventually the OOMkiller tries to free up space, but it can only kill userspace programs so ultimately the system dies altogether.

This is more dire than mere CPU load or lockup warning messages.

Revision history for this message

Nick Moffitt (nick-moffitt) wrote on 2016-01-06:

#144

This memory leak we have so far only seen on arm64, to be clear.

Revision history for this message

Dan Streetman (ddstreet) wrote on 2016-01-07:

#145

axino or nick, can either of you attach an sosreport from an affected system? The crashdump doesn't include any userspace data so I can't see what exactly the qemu-nbd userspace program is doing, nor can i see what params it's started with. I'll need that info to be able to debug the qemu-nbd side of this.

Revision history for this message

Dan Streetman (ddstreet) wrote on 2016-01-07:

#146

Ok, nm about the sosreport - I got the info from some older emails from axino: nova is using qemu-nbd to locally mount images and access the partitions inside them. I was able to trivially reproduce this simply by creating an image, attaching it with qemu-nbd to /dev/nbd0, partitioning it, running mkfs on its p1 and mounting it, then, while copying a file to it, running qemu-nbd -d to detach it from /dev/nbd0. That causes the spam of "Attempted..." error messages.

So this appears to be a simple case of nova calling qemu-nbd -d while there is still I/O to the image. The right thing to do is simply ratelimit the error messages (and they really should be anyway, as they're printing directly inside a loop). The messages themselves do not indicate any kernel error, simply that the nbd device was removed while being written to.

Can you try this kernel PPA to see if it fixes the problem? You will still see the error messages, but only a few lines since they'll be ratelimited.

Of course there is still the (probably more serious) problem of the serial port driver hanging a cpu and eating up memory; that probably deserves its own bug, since it's triggered by this but is a separate issue.

Revision history for this message

Junien F (axino) wrote on 2016-01-07:

#147

Except that what happens on the compute nodes is that, when creating an instance, nova attaches the image with qemu-nbd (say to /dev/nbd0), and then tries to mount /dev/nbd0 somewhere, except that doesn't work because the image has partitions, and so the root device is actually on /dev/nbd0p1. So the "mount" commands return an error, and nova then detaches the image with qemu-nbd -d.

Overall, as far as nova logs show, there is 0 write on the nbd device and very few reads (probably just the MBR ?). Could that still cause inflight I/O when qemu-nbd -d is run ?

I'll happily test your kernel PPA, but as far as I can see, you don't mention where it actually is :)

Thanks !

Revision history for this message

Dan Streetman (ddstreet) wrote on 2016-01-07:

#148

> Overall, as far as nova logs show, there is 0 write on the nbd device and very few reads (probably just the MBR ?).
> Could that still cause inflight I/O when qemu-nbd -d is run ?

"very few" > 0
:-)

and it could be coming from elsewhere...but we don't need to account for where the IO is coming from, as the simple fact that it's there is enough. Also it's not just data IO, it's any "request", including metadata/control requests. Network-backed devices can disappear at any time, and the driver must be able to handle that. Spamming endless messages to the log isn't a good idea in that case.

To clarify the exact code in this situation:

while ((req = blk_fetch_request(q)) != NULL) {
...
        if (unlikely(!nbd->sock)) {
                dev_err(disk_to_dev(nbd->disk), "Attempted send on closed socket\n");
...
                continue;
        }

so, as soon as the connection (socket) is gone, there will be an "Attempted..." message printed for every request in the queue, as the queue is cleared.
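The remedy discussed in this thread is to ratelimit that message. As a rough userspace sketch of the idea (not the kernel's implementation, which has its own ratelimiting helpers), the limiter below allows at most `burst` messages per `interval` and counts the rest as suppressed. The names `ratelimit`, `ratelimit_init`, `ratelimit_allow` and `drain_queue`, and the values used in the example, are assumptions for illustration only.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical userspace model of a burst/interval rate limiter. */
struct ratelimit {
    long interval; /* window length, in caller-defined time units */
    int burst;     /* messages allowed per window */
    long begin;    /* start time of the current window */
    int printed;   /* messages allowed so far in this window */
    int missed;    /* messages suppressed so far in this window */
};

static void ratelimit_init(struct ratelimit *rl, long interval, int burst)
{
    assert(interval > 0 && burst > 0);
    rl->interval = interval;
    rl->burst = burst;
    rl->begin = 0;
    rl->printed = 0;
    rl->missed = 0;
}

/* Returns true if a message at time `now` may be emitted. */
static bool ratelimit_allow(struct ratelimit *rl, long now)
{
    if (now - rl->begin >= rl->interval) {
        /* Window expired: start a new one and reset the counters. */
        rl->begin = now;
        rl->printed = 0;
        rl->missed = 0;
    }
    if (rl->printed < rl->burst) {
        rl->printed++;
        return true;
    }
    rl->missed++;
    return false;
}

/*
 * Models clearing a queue of `nreq` requests on a closed socket at time
 * `now`; returns how many error messages would actually be emitted.
 */
static int drain_queue(struct ratelimit *rl, int nreq, long now)
{
    int emitted = 0;

    for (int i = 0; i < nreq; i++)
        if (ratelimit_allow(rl, now))
            emitted++;
    return emitted;
}
```

With an interval of 5 and a burst of 10, draining 1154 queued requests at time 0 emits 10 messages and suppresses 1144, instead of printing one line per request.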

> I'll happily test your kernel PPA, but as far as I can see, you don't mention where it actually is :)

ha, forgot to paste it in, sorry :-)

https://launchpad.net/~ddstreet/+archive/ubuntu/lp1505564

Revision history for this message

Junien F (axino) wrote on 2016-01-08:

#149

I applied the patch, and it saved a reboot twice already, I think. dmesg from one server : http://pastebin.ubuntu.com/14438525/

I have to stop the tests for the weekend though, I'll resume on Monday.

Revision history for this message

Junien F (axino) wrote on 2016-01-13:

#150

I resumed the tests on Monday, and so far we're looking good. Your change has prevented about 10 lockups so far, it would seem.

Revision history for this message

Dan Streetman (ddstreet) wrote on 2016-01-13:

#151

Great. I'll send the patch upstream, and open a new bug for the serial port hanging issue. Thanks!

Alvaro Uria (aluria) on 2016-01-14

tags:

added: canonical-bootstack

Revision history for this message

Dan Streetman (ddstreet) wrote on 2016-01-14:

#152

opened bug 1534216 to track the serial port issue.

Revision history for this message

Launchpad Janitor (janitor) wrote on 2016-02-18:

#153

This bug was fixed in the package linux - 4.4.0-6.21

---------------
linux (4.4.0-6.21) xenial; urgency=low

  [ Tim Gardner ]

  * Release Tracking Bug
    - LP: #1546283

  * Naples/Zen, NTB Driver (LP: #1542071)
    - [Config] CONFIG_NTB_AMD=m
    - NTB: Add support for AMD PCI-Express Non-Transparent Bridge

  * [Hyper-V] kernel panic occurs when installing Ubuntu Server x32 (LP: #1495983)
    - SAUCE: storvsc: use small sg_tablesize on x86

  * Enable arm64 emulation of removed ARMv7 instructions (LP: #1545542)
    - [Config] CONFIG_ARMV8_DEPRECATED=y

  * Surelock-GA2:kernel panic/ exception @ pcibios_set_pcie_reset_state+0x118/0x280 + cxl_reset+0x5c/0xc0 (LP: #1545037)
    - powerpc/eeh: Fix stale cached primary bus

  * Miscellaneous Ubuntu changes
    - SAUCE: fs: Add user namesapace member to struct super_block
    - SAUCE: fs: Limit file caps to the user namespace of the super block
    - SAUCE: Smack: Add support for unprivileged mounts from user namespaces
    - SAUCE: block_dev: Support checking inode permissions in lookup_bdev()
    - SAUCE: block_dev: Check permissions towards block device inode when mounting
    - SAUCE: fs: Treat foreign mounts as nosuid
    - SAUCE: selinux: Add support for unprivileged mounts from user namespaces
    - SAUCE: userns: Replace in_userns with current_in_userns
    - SAUCE: Smack: Handle labels consistently in untrusted mounts
    - SAUCE: fs: Check for invalid i_uid in may_follow_link()
    - SAUCE: cred: Reject inodes with invalid ids in set_create_file_as()
    - SAUCE: fs: Refuse uid/gid changes which don't map into s_user_ns
    - SAUCE: fs: Update posix_acl support to handle user namespace mounts
    - SAUCE: fs: Ensure the mounter of a filesystem is privileged towards its inodes
    - SAUCE: fs: Don't remove suid for CAP_FSETID in s_user_ns
    - SAUCE: fs: Allow superblock owner to access do_remount_sb()
    - SAUCE: capabilities: Allow privileged user in s_user_ns to set security.* xattrs
    - SAUCE: fuse: Add support for pid namespaces
    - SAUCE: fuse: Support fuse filesystems outside of init_user_ns
    - SAUCE: fuse: Restrict allow_other to the superblock's namespace or a descendant
    - SAUCE: fuse: Allow user namespace mounts
    - SAUCE: mtd: Check permissions towards mtd block device inode when mounting
    - SAUCE: fs: Update i_[ug]id_(read|write) to translate relative to s_user_ns
    - SAUCE: quota: Convert ids relative to s_user_ns
    - SAUCE: evm: Translate user/group ids relative to s_user_ns when computing HMAC
    - SAUCE: fs: Allow CAP_SYS_ADMIN in s_user_ns to freeze and thaw filesystems
    - SAUCE: quota: Treat superblock owner as privilged
    - SAUCE: ima/evm: Allow root in s_user_ns to set xattrs
    - SAUCE: block_dev: Forbid unprivileged mounting when device is opened for writing
    - SAUCE: ext4: Add support for unprivileged mounts from user namespaces
    - SAUCE: ext4: Add module parameter to enable user namespace mounts
    - SAUCE: fuse: Add module parameter to enable user namespace mounts

  * Miscellaneous upstream changes
    - megaraid: Fix possible NULL pointer deference in mraid_mm_ioctl
    - libahci: Implement the capability to override th...

This bug was fixed in the package linux - 4.2.0-34.39

---------------
linux (4.2.0-34.39) wily; urgency=low

[ Brad Figg ]

  * Release Tracking Bug
    - LP: #1555821

[ Florian Westphal ]

  * SAUCE: [nf] netfilter: x_tables: check for size overflow
    - LP: #1555353
  * SAUCE: [nf,v2] netfilter: x_tables: don't rely on well-behaving
    userspace
    - LP: #1555338

linux (4.2.0-33.38) wily; urgency=low

[ Brad Figg ]

  * Release Tracking Bug
    - LP: #1554649

[ Upstream Kernel Changes ]

  * Revert "drm/radeon: call hpd_irq_event on resume"
    - LP: #1554608
  * cxl: Fix PSL timebase synchronization detection
    - LP: #1532914

linux (4.2.0-32.37) wily; urgency=low

[ Kamal Mostafa ]

  * Release Tracking Bug
    - LP: #1550045

[ Kamal Mostafa ]

  * Merged back Ubuntu-4.2.0-31.36

linux (4.2.0-31.36) wily; urgency=low

[ Brad Figg ]

  * Release Tracking Bug
    - LP: #1548579

[ Andy Whitcroft ]

  * [Debian] hv: hv_set_ifconfig -- convert to python3
    - LP: #1506521
  * [Debian] hv: hv_set_ifconfig -- switch to approved indentation
    - LP: #1540586
  * [Debian] hv: hv_set_ifconfig -- fix numerous parameter handling issues
    - LP: #1540586

[ Carol L Soto ]

  * SAUCE: IB/IPoIB: Do not set skb truesize since using one linearskb
    - LP: #1541326

[ Dan Streetman ]

  * SAUCE: nbd: ratelimit error msgs after socket close
    - LP: #1505564

[ Tim Gardner ]

  * Revert "SAUCE: (noup) cxlflash: Fix to avoid virtual LUN failover
    failure"
    - LP: #1541635
  * Revert "SAUCE: (noup) cxlflash: Fix to escalate LINK_RESET also on port
    1"
    - LP: #1541635
  * [Config] ARMV8_DEPRECATED=y
    - LP: #1545542

[ Upstream Kernel Changes ]

  * x86/xen/p2m: hint at the last populated P2M entry
    - LP: #1542941
  * mm: add dma_pool_zalloc() call to DMA API
    - LP: #1543737
  * sctp: Prevent soft lockup when sctp_accept() is called during a timeout
    event
    - LP: #1543737
  * xen-netback: respect user provided max_queues
    - LP: #1543737
  * xen-netfront: respect user provided max_queues
    - LP: #1543737
  * xen-netfront: update num_queues to real created
    - LP: #1543737
  * iio: adis_buffer: Fix out-of-bounds memory access
    - LP: #1543737
  * KVM: PPC: Fix emulation of H_SET_DABR/X on POWER8
    - LP: #1543737
  * KVM: PPC: Fix ONE_REG AltiVec support
    - LP: #1543737
  * x86/irq: Call chip->irq_set_affinity in proper context
    - LP: #1543737
  * drm/amdgpu: fix tonga smu resume
    - LP: #1543737
  * perf kvm record/report: 'unprocessable sample' error while
    recording/reporting guest data
    - LP: #1543737
  * hrtimer: Handle remaining time proper for TIME_LOW_RES
    - LP: #1543737
  * timerfd: Handle relative timers with CONFIG_TIME_LOW_RES proper
    - LP: #1543737
  * posix-timers: Handle relative timers with CONFIG_TIME_LOW_RES proper
    - LP: #1543737
  * itimers: Handle relative timers with CONFIG_TIME_LOW_RES proper
    - LP: #1543737
  * drm/amdgpu: Use drm_calloc_large for VM page_tables array
    - LP: #1543737
  * drm/amdgpu: fix amdgpu_bo_pin_restricted VRAM placing v2
    - LP: #1543737
  * drm/radeon: properly byte swap vce firmware setup
    - LP: #1543737
  * ACPI: Revert "ACPI / video: Add Dell Inspiron 5737 to the blacklist"
    - LP: #1543737
  * ACPI / PCI / hotplug: unlock in error path in acpiphp_enable_slot()
    - LP: #1543737
  * hwmon: (dell-smm) Blacklist Dell Studio XPS 8000
    - LP: #1543737
  * usb: cdc-acm: handle unlinked urb in acm read callback
    - LP: #1543737
  * usb: cdc-acm: send zero packet for intel 7260 modem
    - LP: #1543737
  * cdc-acm:exclude Samsung phone 04e8:685d
    - LP: #1543737
  * usb: hub: do not clear BOS field during reset device
    - LP: #1543737
  * USB: cp210x: add ID for IAI USB to RS485 adaptor
    - LP: #1543737
  * USB: visor: fix null-deref at probe
    - LP: #1543737
  * USB: serial: visor: fix crash on detecting device without write_urbs
    - LP: #1543737
  * USB: serial: option: Adding support for Telit LE922
    - LP: #1543737
  * ALSA: seq: Fix incorrect sanity check at snd_seq_oss_synth_cleanup()
    - LP: #1543737
  * ALSA: seq: Degrade the error message for too many opens
    - LP: #1543737
  * USB: serial: ftdi_sio: add support for Yaesu SCU-18 cable
    - LP: #1543737
  * arm64: kernel: fix architected PMU registers unconditional access
    - LP: #1543737
  * USB: option: fix Cinterion AHxx enumeration
    - LP: #1543737
  * ALSA: compress: Disable GET_CODEC_CAPS ioctl for some architectures
    - LP: #1543737
  * ALSA: usb-audio: Fix TEAC UD-501/UD-503/NT-503 usb delay
    - LP: #1543737
  * virtio_pci: fix use after free on release
    - LP: #1543737
  * ALSA: bebob: Use a signed return type for get_formation_index
    - LP: #1543737
  * arm64: errata: Add -mpc-relative-literal-loads to build flags
    - LP: #1533009, #1543737
  * arm64: mm: avoid calling apply_to_page_range on empty range
    - LP: #1543737
  * x86/mm: Fix types used in pgprot cacheability flags translations
    - LP: #1543737
  * powerpc/eeh: Fix PE location code
    - LP: #1543737
  * SCSI: fix crashes in sd and sr runtime PM
    - LP: #1543737
  * tty: Fix unsafe ldisc reference via ioctl(TIOCGETD)
    - LP: #1543737
  * n_tty: Fix unsafe reference to "other" ldisc
    - LP: #1543737
  * staging/speakup: Use tty_ldisc_ref() for paste kworker
    - LP: #1543737
  * tick/nohz: Set the correct expiry when switching to nohz/lowres mode
    - LP: #1543737
  * irqchip/atmel-aic: Fix wrong bit operation for IRQ priority
    - LP: #1543737
  * seccomp: always propagate NO_NEW_PRIVS on tsync
    - LP: #1543737
  * drm/radeon: cleaned up VCO output settings for DP audio
    - LP: #1543737
  * drm/radeon: Add a common function for DFS handling
    - LP: #1543737
  * drm/radeon: fix DP audio support for APU with DCE4.1 display engine
    - LP: #1543737
  * cpufreq: Fix NULL reference crash while accessing policy->governor_data
    - LP: #1543737
  * cpufreq: pxa2xx: fix pxa_cpufreq_change_voltage prototype
    - LP: #1543737
  * ALSA: dummy: Disable switching timer backend via sysfs
    - LP: #1543737
  * drm/vmwgfx: respect 'nomodeset'
    - LP: #1543737
  * Staging: speakup: Fix getting port information
    - LP: #1543737
  * x86/mm/pat: Avoid truncation when converting cpa->numpages to address
    - LP: #1543737
  * serial: 8250_pci: Add Intel Broadwell ports
    - LP: #1543737
  * perf annotate browser: Fix behaviour of Shift-Tab with nothing focussed
    - LP: #1543737
  * perf hists: Fix HISTC_MEM_DCACHELINE width setting
    - LP: #1543737
  * powerpc/perf: Remove PPMU_HAS_SSLOT flag for Power8
    - LP: #1543737
  * Linux 4.2.8-ckt4
    - LP: #1543737
  * cxlflash: Resolve oops in wait_port_offline
    - LP: #1541635
  * cxlflash: Fix to resolve cmd leak after host reset
    - LP: #1541635
  * cxlflash: Removed driver date print
    - LP: #1541635
  * cxlflash: drop unlikely before IS_ERR_OR_NULL
    - LP: #1541635
  * powerpc/powernv: Panic on unhandled Machine Check
    - LP: #1541635
  * cxlflash: Fix to avoid virtual LUN failover failure
    - LP: #1541635
  * cxlflash: Fix to escalate LINK_RESET also on port 1
    - LP: #1541635
  * IB/ipoib: Suppress warning for send only join failures
    - LP: #1542444
  * IB/ipoib: Expire sendonly multicast joins
    - LP: #1542444
  * IB/ipoib: increase the max mcast backlog queue
    - LP: #1542444
  * IB/ipoib: For sendonly join free the multicast group on leave
    - LP: #1542444
  * qeth: initialize net_device with carrier off
    - LP: #1541907
  * mwifiex: remove USB8897 chipset support
    - LP: #1494593
  * powerpc/powernv: Fix stale PE primary bus
    - LP: #1546145
  * ALSA: usb-audio: avoid freeing umidi object twice
    - LP: #1546177
    - CVE-2016-2384

-- Brad Figg <brad.figg@canonical.com>  Thu, 10 Mar 2016 13:46:44 -0800

Changed in linux (Ubuntu Wily):
status:	Fix Committed → Fix Released

Launchpad Janitor (janitor) wrote on 2016-03-14:

#160

This bug was fixed in the package linux - 3.19.0-56.62

---------------
linux (3.19.0-56.62) vivid; urgency=low

[ Brad Figg ]

  * Release Tracking Bug
    - LP: #1555832

[ Florian Westphal ]

  * SAUCE: [nf,v2] netfilter: x_tables: don't rely on well-behaving
    userspace
    - LP: #1555338

linux (3.19.0-55.61) vivid; urgency=low

[ Brad Figg ]

  * Release Tracking Bug
    - LP: #1554708

[ Upstream Kernel Changes ]

  * Revert "drm/radeon: call hpd_irq_event on resume"
    - LP: #1554608

linux (3.19.0-54.60) vivid; urgency=low

[ Luis Henriques ]

  * Release Tracking Bug
    - LP: #1552337

[ Upstream Kernel Changes ]

  * Revert "firmware: dmi_scan: Fix UUID endianness for SMBIOS >= 2.6"
    - LP: #1551419

linux (3.19.0-53.59) vivid; urgency=low

[ Kamal Mostafa ]

  * Release Tracking Bug
    - LP: #1550576

[ Kamal Mostafa ]

  * Merged back 3.19.0-52.58

linux (3.19.0-52.58) vivid; urgency=low

[ Brad Figg ]

  * Release Tracking Bug
    - LP: #1548548

[ Dan Streetman ]

  * SAUCE: nbd: ratelimit error msgs after socket close
    - LP: #1505564

[ Upstream Kernel Changes ]

  * Revert "ACPI / LPSS: allow to use specific PM domain during ->probe()"
    - LP: #1542457
  * Revert "workqueue: make sure delayed work run in local cpu"
    - LP: #1546320
  * net: ipmr: fix static mfc/dev leaks on table destruction
    - LP: #1542457
  * drm/nouveau/nv46: Change mc subdev oclass from nv44 to nv4c
    - LP: #1542457
  * ovl: allow zero size xattr
    - LP: #1542457
  * ovl: use a minimal buffer in ovl_copy_xattr
    - LP: #1542457
  * [media] vb2: fix a regression in poll() behavior for output,streams
    - LP: #1542457
  * [media] gspca: ov534/topro: prevent a division by 0
    - LP: #1542457
  * [media] media: dvb-core: Don't force CAN_INVERSION_AUTO in oneshot mode
    - LP: #1542457
  * tools lib traceevent: Fix output of %llu for 64 bit values read on 32
    bit machines
    - LP: #1542457
  * KVM: x86: expose MSR_TSC_AUX to userspace
    - LP: #1542457
  * KVM: x86: correctly print #AC in traces
    - LP: #1542457
  * drm/radeon: call hpd_irq_event on resume
    - LP: #1542457
  * xhci: refuse loading if nousb is used
    - LP: #1542457
  * arm64: Clear out any singlestep state on a ptrace detach operation
    - LP: #1542457
  * time: Avoid signed overflow in timekeeping_get_ns()
    - LP: #1542457
  * ovl: root: copy attr
    - LP: #1542457
  * Bluetooth: Add support of Toshiba Broadcom based devices
    - LP: #1522949, #1542457
  * rtlwifi: fix memory leak for USB device
    - LP: #1542457
  * wlcore/wl12xx: spi: fix oops on firmware load
    - LP: #1542457
  * ovl: check dentry positiveness in ovl_cleanup_whiteouts()
    - LP: #1542457
  * EDAC, mc_sysfs: Fix freeing bus' name
    - LP: #1542457
  * EDAC: Robustify workqueues destruction
    - LP: #1542457
  * arm64: mm: ensure that the zero page is visible to the page table
    walker
    - LP: #1542457
  * powerpc: Make value-returning atomics fully ordered
    - LP: #1542457
  * powerpc: Make {cmp}xchg* and their atomic_ versions fully ordered
    - LP: #1542457
  * dm space map metadata: remove unused variable in brb_pop()
    - LP: #1542457
  * dm thin: fix race condition when destroying thin pool workqueue
    - LP: #1542457
  * futex: Drop refcount if requeue_pi() acquired the rtmutex
    - LP: #1542457
  * arm64: mdscr_el1: avoid exposing DCC to userspace
    - LP: #1542457
  * arm64: kernel: enforce pmuserenr_el0 initialization and restore
    - LP: #1542457
  * drm/radeon: Fix off-by-one errors in radeon_vm_bo_set_addr
    - LP: #1542457
  * drm/radeon: clean up fujitsu quirks
    - LP: #1542457
  * mmc: sdio: Fix invalid vdd in voltage switch power cycle
    - LP: #1542457
  * mmc: sdhci: Fix DMA descriptor with zero data length
    - LP: #1542457
  * mmc: sdhci: Fix sdhci_runtime_pm_bus_on/off()
    - LP: #1542457
  * udf: limit the maximum number of indirect extents in a row
    - LP: #1542457
  * [media] rc: sunxi-cir: Initialize the spinlock properly
    - LP: #1542457
  * nfs: Fix race in __update_open_stateid()
    - LP: #1542457
  * USB: cp210x: add ID for ELV Marble Sound Board 1
    - LP: #1542457
  * NFSv4: Don't perform cached access checks before we've OPENed the file
    - LP: #1542457
  * NFS: Ensure we revalidate attributes before using execute_ok()
    - LP: #1542457
  * Thermal: initialize thermal zone device correctly
    - LP: #1542457
  * Thermal: handle thermal zone device properly during system sleep
    - LP: #1542457
  * Thermal: do thermal zone update after a cooling device registered
    - LP: #1542457
  * posix-clock: Fix return code on the poll method's error path
    - LP: #1542457
  * rtlwifi: rtl8723be: Fix module parameter initialization
    - LP: #1542457
  * rtlwifi: rtl8723ae: Fix initialization of module parameters
    - LP: #1542457
  * rtlwifi: rtl8821ae: Fix errors in parameter initialization
    - LP: #1542457
  * rtlwifi: rtl8188ee: Fix module parameter initialization
    - LP: #1542457
  * rtlwifi: rtl8192de: Fix incorrect module parameter descriptions
    - LP: #1542457
  * rtlwifi: rtl8192se: Fix module parameter initialization
    - LP: #1542457
  * rtlwifi: rtl8192ce: Fix handling of module parameters
    - LP: #1542457
  * rtlwifi: rtl8192cu: Add missing parameter setup
    - LP: #1542457
  * NFS: Fix attribute cache revalidation
    - LP: #1542457
  * rtlwifi: rtl_pci: Fix kernel panic
    - LP: #1542457
  * bcache: fix a livelock when we cause a huge number of cache misses
    - LP: #1542457
  * bcache: Add a cond_resched() call to gc
    - LP: #1542457
  * bcache: clear BCACHE_DEV_UNLINK_DONE flag when attaching a backing
    device
    - LP: #1542457
  * bcache: fix a leak in bch_cached_dev_run()
    - LP: #1542457
  * bcache: unregister reboot notifier if bcache fails to unregister device
    - LP: #1542457
  * bcache: allows use of register in udev to avoid "device_busy" error.
    - LP: #1542457
  * bcache: Change refill_dirty() to always scan entire disk if necessary
    - LP: #1542457
  * wlcore/wl12xx: spi: fix NULL pointer dereference (Oops)
    - LP: #1542457
  * Input: i8042 - add Fujitsu Lifebook U745 to the nomux list
    - LP: #1542457
  * libxfs: pack the agfl header structure so XFS_AGFL_SIZE is correct
    - LP: #1542457
  * x86/xen: don't reset vcpu_info on a cancelled suspend
    - LP: #1542457
  * udf: Prevent buffer overrun with multi-byte characters
    - LP: #1542457
  * udf: Check output buffer length when converting name to CS0
    - LP: #1542457
  * drm/dp/mst: process broadcast messages correctly
    - LP: #1542457
  * drm/dp/mst: always send reply for UP request
    - LP: #1542457
  * drm/dp/mst: fix in MSTB RAD initialization
    - LP: #1542457
  * drm/dp/mst: fix in RAD element access
    - LP: #1542457
  * PCI: Fix minimum allocation address overwrite
    - LP: #1542457
  * PCI: host: Mark PCIe/PCI (MSI) IRQ cascade handlers as IRQF_NO_THREAD
    - LP: #1542457
  * iwlwifi: update and fix 7265 series PCI IDs
    - LP: #1542457
  * locks: fix unlock when fcntl_setlk races with a close
    - LP: #1542457
  * ASoC: compress: Fix compress device direction check
    - LP: #1542457
  * dm snapshot: fix hung bios when copy error occurs
    - LP: #1542457
  * uml: fix hostfs mknod()
    - LP: #1542457
  * uml: flush stdout before forking
    - LP: #1542457
  * drm/nouveau/kms: take mode_config mutex in connector hotplug path
    - LP: #1542457
  * ALSA: usb: Add native DSD support for Oppo HA-1
    - LP: #1542457
  * ALSA: hda - Fixup inverted internal mic for Lenovo E50-80
    - LP: #1529624, #1542457
  * x86/mm: Add barriers and document switch_mm()-vs-flush synchronization
    - LP: #1542457
  * x86/boot: Double BOOT_HEAP_SIZE to 64KB
    - LP: #1542457
  * s390: fix normalization bug in exception table sorting
    - LP: #1542457
  * xfs: inode recovery readahead can race with inode buffer creation
    - LP: #1542457
  * xfs: handle dquot buffer readahead in log recovery correctly
    - LP: #1542457
  * clocksource/drivers/vt8500: Increase the minimum delta
    - LP: #1542457
  * Input: elantech - mark protocols v2 and v3 as semi-mt
    - LP: #1542457
  * x86/reboot/quirks: Add iMac10,1 to pci_reboot_dmi_table[]
    - LP: #1542457
  * ALSA: seq: Fix missing NULL check at remove_events ioctl
    - LP: #1542457
  * ALSA: seq: Fix race at timer setup and close
    - LP: #1542457
  * virtio_balloon: fix race by fill and leak
    - LP: #1542457
  * virtio_balloon: fix race between migration and ballooning
    - LP: #1542457
  * parisc: Fix __ARCH_SI_PREAMBLE_SIZE
    - LP: #1542457
  * scripts/recordmcount.pl: support data in text section on powerpc
    - LP: #1542457
  * powerpc/module: Handle R_PPC64_ENTRY relocations
    - LP: #1542457
  * ALSA: hda - fix the headset mic detection problem for a Dell laptop
    - LP: #1533461, #1542457
  * ALSA: usb-audio: Fix mixer ctl regression of Native Instrument devices
    - LP: #1542457
  * drm/i915: intel_hpd_init(): Fix suspend/resume reprobing
    - LP: #1542457
  * x86/mm: Improve switch_mm() barrier comments
    - LP: #1542457
  * ALSA: timer: Fix double unlink of active_list
    - LP: #1542457
  * dmaengine: dw: fix cyclic transfer setup
    - LP: #1542457
  * dmaengine: dw: fix cyclic transfer callbacks
    - LP: #1542457
  * mmc: mmci: fix an ages old detection error
    - LP: #1542457
  * ALSA: timer: Fix race among timer ioctls
    - LP: #1542457
  * sparc64: fix incorrect sign extension in sys_sparc64_personality
    - LP: #1542457
  * cifs: Ratelimit kernel log messages
    - LP: #1542457
  * cifs: fix race between call_async() and reconnect()
    - LP: #1542457
  * cifs_dbg() outputs an uninitialized buffer in cifs_readdir()
    - LP: #1542457
  * m32r: fix m32104ut_defconfig build fail
    - LP: #1542457
  * dma-debug: switch check from _text to _stext
    - LP: #1542457
  * scripts/bloat-o-meter: fix python3 syntax error
    - LP: #1542457
  * ocfs2/dlm: ignore cleaning the migration mle that is inuse
    - LP: #1542457
  * ALSA: timer: Harden slave timer list handling
    - LP: #1542457
  * zram/zcomp: use GFP_NOIO to allocate streams
    - LP: #1542457
  * zram: try vmalloc() after kmalloc()
    - LP: #1542457
  * mm: soft-offline: check return value in second __get_any_page() call
    - LP: #1542457
  * memcg: only free spare array when readers are done
    - LP: #1542457
  * panic: release stale console lock to always get the logbuf printed out
    - LP: #1542457
  * kernel/panic.c: turn off locks debug before releasing console lock
    - LP: #1542457
  * printk: do cond_resched() between lines while outputting to consoles
    - LP: #1542457
  * ALSA: hda - Fix bass pin fixup for ASUS N550JX
    - LP: #1542457
  * crypto: af_alg - Disallow bind/setkey/... after accept(2)
    - LP: #1542457
  * crypto: af_alg - Fix socket double-free when accept fails
    - LP: #1542457
  * crypto: af_alg - Add nokey compatibility path
    - LP: #1542457
  * crypto: hash - Add crypto_ahash_has_setkey
    - LP: #1542457
  * crypto: af_alg - Allow af_af_alg_release_parent to be called on nokey
    path
    - LP: #1542457
  * crypto: af_alg - Forbid bind(2) when nokey child sockets are present
    - LP: #1542457
  * ALSA: hrtimer: Fix stall by hrtimer_cancel()
    - LP: #1542457
  * ALSA: pcm: Fix snd_pcm_hw_params struct copy in compat mode
    - LP: #1542457
  * ALSA: seq: Fix snd_seq_call_port_info_ioctl in compat mode
    - LP: #1542457
  * ALSA: control: Avoid kernel warnings from tlv ioctl with numid 0
    - LP: #1542457
  * ARM: debug-ll: fix BCM63xx entry for multiplatform
    - LP: #1542457
  * xfs: log mount failures don't wait for buffers to be released
    - LP: #1542457
  * crypto: algif_skcipher - Load TX SG list after waiting
    - LP: #1542457
  * crypto: crc32c - Fix crc32c soft dependency
    - LP: #1542457
  * IB/qib: fix mcast detach when qp not attached
    - LP: #1542457
  * IB/qib: Support creating qps with GFP_NOIO flag
    - LP: #1542457
  * ideapad-laptop: Add Lenovo ideapad Y700-17ISK to no_hw_rfkill dmi list
    - LP: #1542457
  * iscsi-target: Fix potential dead-lock during node acl delete
    - LP: #1542457
  * ALSA: hda - Flush the pending probe work at remove
    - LP: #1542457
  * ALSA: timer: Handle disconnection more safely
    - LP: #1542457
  * ocfs2: NFS hangs in __ocfs2_cluster_lock due to race with
    ocfs2_unblock_lock
    - LP: #1542457
  * MAINTAINERS: return arch/sh to maintained state, with new maintainers
    - LP: #1542457
  * ideapad-laptop: Add Lenovo Yoga 700 to no_hw_rfkill dmi list
    - LP: #1542457
  * drm/i915: On fb alloc failure, unref gem object where it gets refed
    - LP: #1542457
  * [media] rc: allow rc modules to be loaded if rc-main is not a module
    - LP: #1542457
  * SCSI: initio: remove duplicate module device table
    - LP: #1542457
  * clk: xgene: Fix divider with non-zero shift value
    - LP: #1542457
  * clk: st: avoid uninitialized variable use
    - LP: #1542457
  * ASoC: Intel: pass correct parameter in sst_alloc_stream_mrfld()
    - LP: #1542457
  * ath9k_htc: check for underflow in ath9k_htc_rx_msg()
    - LP: #1542457
  * mtd: nand: fix ONFI parameter page layout
    - LP: #1542457
  * mtd: nand: denali: add missing nand_release() call in denali_remove()
    - LP: #1542457
  * ALSA: fm801: propagate TUNER_ONLY bit when autodetected
    - LP: #1542457
  * pinctrl: bcm2835: Fix memory leak in error path
    - LP: #1542457
  * mmc: sdhci: restore behavior when setting VDD via external regulator
    - LP: #1542457
  * x86/LDT: Print the real LDT base address
    - LP: #1542457
  * sysrq: Fix warning in sysrq generated crash.
    - LP: #1542457
  * kconfig: return 'false' instead of 'no' in bool function
    - LP: #1542457
  * perf/x86: Fix filter_events() bug with event mappings
    - LP: #1542457
  * power: test_power: correctly handle empty writes
    - LP: #1542457
  * firmware: actually return NULL on failed request_firmware_nowait()
    - LP: #1542457
  * target: Fix a memory leak in target_dev_lba_map_store()
    - LP: #1542457
  * m68k/atari, m68k/sun3: Fix SCSI platform device registration when
    driver is modular
    - LP: #1542457
  * um: Fix build error and kconfig for i386
    - LP: #1542457
  * kbuild: Demote 'sign-compare' warning to W=2
    - LP: #1542457
  * mmc: sd: limit SD card power limit according to cards capabilities
    - LP: #1542457
  * net: tcp_memcontrol: properly detect ancestor socket pressure
    - LP: #1542457
  * vmstat: make vmstat_updater deferrable again and shut down on idle
    - LP: #1542457
  * Btrfs: clean up an error code in btrfs_init_space_info()
    - LP: #1542457
  * btrfs: initialize the seq counter in struct btrfs_device
    - LP: #1542457
  * printk: help pr_debug and pr_devel to optimize out arguments
    - LP: #1542457
  * ARM: dts: armadillo800eva Correct extal1 frequency to 24 MHz
    - LP: #1542457
  * mmc: debugfs: correct wrong voltage value
    - LP: #1542457
  * IB/mlx4: Initialize hop_limit when creating address handle
    - LP: #1542457
  * net/mlx4: Remove unused macro
    - LP: #1542457
  * veth: don’t modify ip_summed; doing so treats packets with bad
    checksums as good.
    - LP: #1542457
  * net: cdc_ncm: avoid changing RX/TX buffers on MTU changes
    - LP: #1542457
  * sctp: sctp should release assoc when sctp_make_abort_user return NULL
    in sctp_close
    - LP: #1542457
  * connector: bump skb->users before callback invocation
    - LP: #1542457
  * unix: properly account for FDs passed over unix sockets
    - LP: #1542457
  * bridge: Only call /sbin/bridge-stp for the initial network namespace
    - LP: #1542457
  * vxlan: fix test which detect duplicate vxlan iface
    - LP: #1542457
  * net: sctp: prevent writes to cookie_hmac_alg from accessing invalid
    memory
    - LP: #1542457
  * tcp_yeah: don't set ssthresh below 2
    - LP: #1542457
  * udp: disallow UFO for sockets with SO_NO_CHECK option
    - LP: #1542457
  * net: preserve IP control block during GSO segmentation
    - LP: #1542457
  * bonding: Prevent IPv6 link local address on enslaved devices
    - LP: #1542457
  * phonet: properly unshare skbs in phonet_rcv()
    - LP: #1542457
  * net: bpf: reject invalid shifts
    - LP: #1542457
  * ipv6: update skb->csum when CE mark is propagated
    - LP: #1542457
  * bridge: fix lockdep addr_list_lock false positive splat
    - LP: #1542457
  * batman-adv: Avoid recursive call_rcu for batadv_bla_claim
    - LP: #1542457
  * batman-adv: Avoid recursive call_rcu for batadv_nc_node
    - LP: #1542457
  * batman-adv: Drop immediate batadv_orig_ifinfo free function
    - LP: #1542457
  * batman-adv: Drop immediate batadv_neigh_node free function
    - LP: #1542457
  * batman-adv: Drop immediate neigh_ifinfo free function
    - LP: #1542457
  * batman-adv: Drop immediate batadv_hard_iface free function
    - LP: #1542457
  * batman-adv: Drop immediate orig_node free function
    - LP: #1542457
  * team: Replace rcu_read_lock with a mutex in team_vlan_rx_kill_vid
    - LP: #1542457
  * sctp: Prevent soft lockup when sctp_accept() is called during a timeout
    event
    - LP: #1542457
  * xen-netback: respect user provided max_queues
    - LP: #1542457
  * xen-netfront: respect user provided max_queues
    - LP: #1542457
  * xen-netfront: update num_queues to real created
    - LP: #1542457
  * Linux 3.19.8-ckt14
    - LP: #1542457
  * x86/xen/p2m: hint at the last populated P2M entry
    - LP: #1542941
  * qeth: initialize net_device with carrier off
    - LP: #1541907
  * HID: i2c-hid: Prevent sending reports from racing with device reset
    - LP: #1535513
  * iio: adis_buffer: Fix out-of-bounds memory access
    - LP: #1546183
  * KVM: PPC: Fix emulation of H_SET_DABR/X on POWER8
    - LP: #1546183
  * KVM: PPC: Fix ONE_REG AltiVec support
    - LP: #1546183
  * x86/irq: Call chip->irq_set_affinity in proper context
    - LP: #1546183
  * ACPI: Revert "ACPI / video: Add Dell Inspiron 5737 to the blacklist"
    - LP: #1546183
  * ACPI / PCI / hotplug: unlock in error path in acpiphp_enable_slot()
    - LP: #1546183
  * usb: cdc-acm: handle unlinked urb in acm read callback
    - LP: #1546183
  * usb: cdc-acm: send zero packet for intel 7260 modem
    - LP: #1546183
  * cdc-acm:exclude Samsung phone 04e8:685d
    - LP: #1546183
  * usb: hub: do not clear BOS field during reset device
    - LP: #1546183
  * USB: cp210x: add ID for IAI USB to RS485 adaptor
    - LP: #1546183
  * USB: visor: fix null-deref at probe
    - LP: #1546183
  * USB: serial: visor: fix crash on detecting device without write_urbs
    - LP: #1546183
  * USB: serial: option: Adding support for Telit LE922
    - LP: #1546183
  * ALSA: seq: Fix incorrect sanity check at snd_seq_oss_synth_cleanup()
    - LP: #1546183
  * ALSA: seq: Degrade the error message for too many opens
    - LP: #1546183
  * USB: serial: ftdi_sio: add support for Yaesu SCU-18 cable
    - LP: #1546183
  * arm64: kernel: fix architected PMU registers unconditional access
    - LP: #1546183
  * USB: option: fix Cinterion AHxx enumeration
    - LP: #1546183
  * ALSA: compress: Disable GET_CODEC_CAPS ioctl for some architectures
    - LP: #1546183
  * ALSA: usb-audio: Fix TEAC UD-501/UD-503/NT-503 usb delay
    - LP: #1546183
  * ALSA: bebob: Use a signed return type for get_formation_index
    - LP: #1546183
  * arm64: errata: Add -mpc-relative-literal-loads to build flags
    - LP: #1533009, #1546183
  * arm64: mm: avoid calling apply_to_page_range on empty range
    - LP: #1546183
  * x86/mm: Fix types used in pgprot cacheability flags translations
    - LP: #1546183
  * powerpc/eeh: Fix PE location code
    - LP: #1546183
  * SCSI: fix crashes in sd and sr runtime PM
    - LP: #1546183
  * tty: Fix unsafe ldisc reference via ioctl(TIOCGETD)
    - LP: #1546183
  * n_tty: Fix unsafe reference to "other" ldisc
    - LP: #1546183
  * staging/speakup: Use tty_ldisc_ref() for paste kworker
    - LP: #1546183
  * irqchip/atmel-aic: Fix wrong bit operation for IRQ priority
    - LP: #1546183
  * seccomp: always propagate NO_NEW_PRIVS on tsync
    - LP: #1546183
  * ALSA: dummy: Disable switching timer backend via sysfs
    - LP: #1546183
  * drm/vmwgfx: respect 'nomodeset'
    - LP: #1546183
  * Staging: speakup: Fix getting port information
    - LP: #1546183
  * x86/mm/pat: Avoid truncation when converting cpa->numpages to address
    - LP: #1546183
  * serial: 8250_pci: Add Intel Broadwell ports
    - LP: #1546183
  * perf annotate browser: Fix behaviour of Shift-Tab with nothing focussed
    - LP: #1546183
  * perf hists: Fix HISTC_MEM_DCACHELINE width setting
    - LP: #1546183
  * powerpc/perf: Remove PPMU_HAS_SSLOT flag for Power8
    - LP: #1546183
  * [media] usbvision fix overflow of interfaces array
    - LP: #1546183
  * [media] usbvision: fix leak of usb_dev on failure paths in
    usbvision_probe()
    - LP: #1546183
  * [media] usbvision: fix crash on detecting device with invalid
    configuration
    - LP: #1546183
  * ASN.1: Fix non-match detection failure on data overrun
    - LP: #1546183
  * Linux 3.19.8-ckt15
    - LP: #1546183
  * netfilter: bridge: don't use nf_bridge_info data to store mac header
    - LP: #1463911
  * netfilter: bridge: restore vlan tag when refragmenting
    - LP: #1463911
  * netfilter: bridge: forward IPv6 fragmented packets
    - LP: #1463911
  * netfilter: bridge: Use __in6_dev_get rather than in6_dev_get in
    br_validate_ipv6
    - LP: #1463911
  * mwifiex: remove USB8897 chipset support
    - LP: #1494593
  * ALSA: usb-audio: avoid freeing umidi object twice
    - LP: #1546177
    - CVE-2016-2384
  * vmstat: explicitly schedule per-cpu work on the CPU we need it to run
    on
    - LP: #1546320

-- Brad Figg <brad.figg@canonical.com>  Thu, 10 Mar 2016 14:11:41 -0800

Changed in linux (Ubuntu Vivid):
status:	Fix Committed → Fix Released

Revision history for this message

Launchpad Janitor (janitor) wrote on 2016-03-14:

#161

This bug was fixed in the package linux - 3.13.0-83.127

---------------
linux (3.13.0-83.127) trusty; urgency=low

[ Brad Figg ]

  * Release Tracking Bug
    - LP: #1555839

[ Florian Westphal ]

  * SAUCE: [nf,v2] netfilter: x_tables: don't rely on well-behaving
    userspace
    - LP: #1555338

linux (3.13.0-82.126) trusty; urgency=low

[ Brad Figg ]

  * Release Tracking Bug
    - LP: #1554732

[ Upstream Kernel Changes ]

  * Revert "drm/radeon: call hpd_irq_event on resume"
    - LP: #1554608
  * net: generic dev_disable_lro() stacked device handling
    - LP: #1547680

linux (3.13.0-81.125) trusty; urgency=low

[ Luis Henriques ]

  * Release Tracking Bug
    - LP: #1552316

[ Upstream Kernel Changes ]

  * Revert "firmware: dmi_scan: Fix UUID endianness for SMBIOS >= 2.6"
    - LP: #1551419
  * bcache: Fix a lockdep splat in an error path
    - LP: #1551327

linux (3.13.0-80.124) trusty; urgency=low

[ Brad Figg ]

  * Release Tracking Bug
    - LP: #1548519

[ Andy Whitcroft ]

  * [Debian] hv: hv_set_ifconfig -- convert to python3
    - LP: #1506521
  * [Debian] hv: hv_set_ifconfig -- switch to approved indentation
    - LP: #1540586
  * [Debian] hv: hv_set_ifconfig -- fix numerous parameter handling issues
    - LP: #1540586

[ Dan Streetman ]

  * SAUCE: nbd: ratelimit error msgs after socket close
    - LP: #1505564

[ Upstream Kernel Changes ]

  * Revert "workqueue: make sure delayed work run in local cpu"
    - LP: #1546320
  * [media] gspca: ov534/topro: prevent a division by 0
    - LP: #1542497
  * [media] media: dvb-core: Don't force CAN_INVERSION_AUTO in oneshot mode
    - LP: #1542497
  * tools lib traceevent: Fix output of %llu for 64 bit values read on 32
    bit machines
    - LP: #1542497
  * KVM: x86: correctly print #AC in traces
    - LP: #1542497
  * drm/radeon: call hpd_irq_event on resume
    - LP: #1542497
  * xhci: refuse loading if nousb is used
    - LP: #1542497
  * arm64: Clear out any singlestep state on a ptrace detach operation
    - LP: #1542497
  * time: Avoid signed overflow in timekeeping_get_ns()
    - LP: #1542497
  * rtlwifi: fix memory leak for USB device
    - LP: #1542497
  * wlcore/wl12xx: spi: fix oops on firmware load
    - LP: #1542497
  * EDAC, mc_sysfs: Fix freeing bus' name
    - LP: #1542497
  * EDAC: Don't try to cancel workqueue when it's never setup
    - LP: #1542497
  * EDAC: Robustify workqueues destruction
    - LP: #1542497
  * powerpc: Make value-returning atomics fully ordered
    - LP: #1542497
  * powerpc: Make {cmp}xchg* and their atomic_ versions fully ordered
    - LP: #1542497
  * dm space map metadata: remove unused variable in brb_pop()
    - LP: #1542497
  * dm thin: fix race condition when destroying thin pool workqueue
    - LP: #1542497
  * futex: Drop refcount if requeue_pi() acquired the rtmutex
    - LP: #1542497
  * drm/radeon: clean up fujitsu quirks
    - LP: #1542497
  * mmc: sdio: Fix invalid vdd in voltage switch power cycle
    - LP: #1542497
  * mmc: sdhci: Fix sdhci_runtime_pm_bus_on/off()
    - LP: #1542497
  * udf: limit the maximum number of indirect extents in a row
    - LP: #1542497
  * nfs: Fix race in __update_open_stateid()
    - LP: #1542497
  * USB: cp210x: add ID for ELV Marble Sound Board 1
    - LP: #1542497
  * NFSv4: Don't perform cached access checks before we've OPENed the file
    - LP: #1542497
  * NFS: Fix attribute cache revalidation
    - LP: #1542497
  * posix-clock: Fix return code on the poll method's error path
    - LP: #1542497
  * rtlwifi: rtl8192de: Fix incorrect module parameter descriptions
    - LP: #1542497
  * rtlwifi: rtl8192se: Fix module parameter initialization
    - LP: #1542497
  * rtlwifi: rtl8192ce: Fix handling of module parameters
    - LP: #1542497
  * rtlwifi: rtl8192cu: Add missing parameter setup
    - LP: #1542497
  * bcache: fix a livelock when we cause a huge number of cache misses
    - LP: #1542497
  * bcache: Add a cond_resched() call to gc
    - LP: #1542497
  * bcache: clear BCACHE_DEV_UNLINK_DONE flag when attaching a backing
    device
    - LP: #1542497
  * bcache: fix a leak in bch_cached_dev_run()
    - LP: #1542497
  * bcache: unregister reboot notifier if bcache fails to unregister device
    - LP: #1542497
  * bcache: add mutex lock for bch_is_open
    - LP: #1542497
  * bcache: allows use of register in udev to avoid "device_busy" error.
    - LP: #1542497
  * bcache: Change refill_dirty() to always scan entire disk if necessary
    - LP: #1542497
  * wlcore/wl12xx: spi: fix NULL pointer dereference (Oops)
    - LP: #1542497
  * Input: i8042 - add Fujitsu Lifebook U745 to the nomux list
    - LP: #1542497
  * libxfs: pack the agfl header structure so XFS_AGFL_SIZE is correct
    - LP: #1542497
  * x86/xen: don't reset vcpu_info on a cancelled suspend
    - LP: #1542497
  * udf: Prevent buffer overrun with multi-byte characters
    - LP: #1542497
  * udf: Check output buffer length when converting name to CS0
    - LP: #1542497
  * PCI: host: Mark PCIe/PCI (MSI) IRQ cascade handlers as IRQF_NO_THREAD
    - LP: #1542497
  * iwlwifi: update and fix 7265 series PCI IDs
    - LP: #1542497
  * locks: fix unlock when fcntl_setlk races with a close
    - LP: #1542497
  * ASoC: compress: Fix compress device direction check
    - LP: #1542497
  * dm snapshot: fix hung bios when copy error occurs
    - LP: #1542497
  * uml: fix hostfs mknod()
    - LP: #1542497
  * uml: flush stdout before forking
    - LP: #1542497
  * drm/nouveau/kms: take mode_config mutex in connector hotplug path
    - LP: #1542497
  * x86/boot: Double BOOT_HEAP_SIZE to 64KB
    - LP: #1542497
  * s390: fix normalization bug in exception table sorting
    - LP: #1542497
  * xfs: inode recovery readahead can race with inode buffer creation
    - LP: #1542497
  * clocksource/drivers/vt8500: Increase the minimum delta
    - LP: #1542497
  * Input: elantech - mark protocols v2 and v3 as semi-mt
    - LP: #1542497
  * x86/reboot/quirks: Add iMac10,1 to pci_reboot_dmi_table[]
    - LP: #1542497
  * ALSA: seq: Fix missing NULL check at remove_events ioctl
    - LP: #1542497
  * ALSA: seq: Fix race at timer setup and close
    - LP: #1542497
  * virtio_balloon: fix race by fill and leak
    - LP: #1542497
  * virtio_balloon: fix race between migration and ballooning
    - LP: #1542497
  * parisc: Fix __ARCH_SI_PREAMBLE_SIZE
    - LP: #1542497
  * scripts/recordmcount.pl: support data in text section on powerpc
    - LP: #1542497
  * powerpc/module: Handle R_PPC64_ENTRY relocations
    - LP: #1542497
  * ALSA: timer: Fix double unlink of active_list
    - LP: #1542497
  * dmaengine: dw: fix cyclic transfer setup
    - LP: #1542497
  * dmaengine: dw: fix cyclic transfer callbacks
    - LP: #1542497
  * mmc: mmci: fix an ages old detection error
    - LP: #1542497
  * ALSA: timer: Fix race among timer ioctls
    - LP: #1542497
  * sparc64: fix incorrect sign extension in sys_sparc64_personality
    - LP: #1542497
  * cifs: Ratelimit kernel log messages
    - LP: #1542497
  * cifs: fix race between call_async() and reconnect()
    - LP: #1542497
  * cifs_dbg() outputs an uninitialized buffer in cifs_readdir()
    - LP: #1542497
  * m32r: fix m32104ut_defconfig build fail
    - LP: #1542497
  * dma-debug: switch check from _text to _stext
    - LP: #1542497
  * scripts/bloat-o-meter: fix python3 syntax error
    - LP: #1542497
  * ocfs2/dlm: ignore cleaning the migration mle that is inuse
    - LP: #1542497
  * ALSA: timer: Harden slave timer list handling
    - LP: #1542497
  * mm: soft-offline: check return value in second __get_any_page() call
    - LP: #1542497
  * memcg: only free spare array when readers are done
    - LP: #1542497
  * panic: release stale console lock to always get the logbuf printed out
    - LP: #1542497
  * kernel/panic.c: turn off locks debug before releasing console lock
    - LP: #1542497
  * printk: do cond_resched() between lines while outputting to consoles
    - LP: #1542497
  * ALSA: hda - Fix bass pin fixup for ASUS N550JX
    - LP: #1542497
  * crypto: af_alg - Disallow bind/setkey/... after accept(2)
    - LP: #1542497
  * crypto: af_alg - Fix socket double-free when accept fails
    - LP: #1542497
  * crypto: af_alg - Add nokey compatibility path
    - LP: #1542497
  * crypto: hash - Add crypto_ahash_has_setkey
    - LP: #1542497
  * crypto: af_alg - Allow af_af_alg_release_parent to be called on nokey
    path
    - LP: #1542497
  * crypto: af_alg - Forbid bind(2) when nokey child sockets are present
    - LP: #1542497
  * ALSA: hrtimer: Fix stall by hrtimer_cancel()
    - LP: #1542497
  * ALSA: pcm: Fix snd_pcm_hw_params struct copy in compat mode
    - LP: #1542497
  * ALSA: seq: Fix snd_seq_call_port_info_ioctl in compat mode
    - LP: #1542497
  * ALSA: control: Avoid kernel warnings from tlv ioctl with numid 0
    - LP: #1542497
  * crypto: algif_skcipher - Load TX SG list after waiting
    - LP: #1542497
  * crypto: crc32c - Fix crc32c soft dependency
    - LP: #1542497
  * IB/qib: fix mcast detach when qp not attached
    - LP: #1542497
  * iscsi-target: Fix potential dead-lock during node acl delete
    - LP: #1542497
  * ocfs2: NFS hangs in __ocfs2_cluster_lock due to race with
    ocfs2_unblock_lock
    - LP: #1542497
  * [media] rc: allow rc modules to be loaded if rc-main is not a module
    - LP: #1542497
  * SCSI: initio: remove duplicate module device table
    - LP: #1542497
  * clk: xgene: Fix divider with non-zero shift value
    - LP: #1542497
  * ath9k_htc: check for underflow in ath9k_htc_rx_msg()
    - LP: #1542497
  * mtd: nand: fix ONFI parameter page layout
    - LP: #1542497
  * ALSA: fm801: propagate TUNER_ONLY bit when autodetected
    - LP: #1542497
  * pinctrl: bcm2835: Fix memory leak in error path
    - LP: #1542497
  * kconfig: return 'false' instead of 'no' in bool function
    - LP: #1542497
  * perf/x86: Fix filter_events() bug with event mappings
    - LP: #1542497
  * power: test_power: correctly handle empty writes
    - LP: #1542497
  * firmware: actually return NULL on failed request_firmware_nowait()
    - LP: #1542497
  * mmc: sd: limit SD card power limit according to cards capabilities
    - LP: #1542497
  * Btrfs: clean up an error code in btrfs_init_space_info()
    - LP: #1542497
  * batman-adv: Avoid recursive call_rcu for batadv_bla_claim
    - LP: #1542497
  * batman-adv: Avoid recursive call_rcu for batadv_nc_node
    - LP: #1542497
  * batman-adv: Drop immediate orig_node free function
    - LP: #1542497
  * printk: help pr_debug and pr_devel to optimize out arguments
    - LP: #1542497
  * mmc: debugfs: correct wrong voltage value
    - LP: #1542497
  * IB/mlx4: Initialize hop_limit when creating address handle
    - LP: #1542497
  * veth: don’t modify ip_summed; doing so treats packets with bad
    checksums as good.
    - LP: #1542497
  * sctp: sctp should release assoc when sctp_make_abort_user return NULL
    in sctp_close
    - LP: #1542497
  * connector: bump skb->users before callback invocation
    - LP: #1542497
  * unix: properly account for FDs passed over unix sockets
    - LP: #1542497
  * bridge: Only call /sbin/bridge-stp for the initial network namespace
    - LP: #1542497
  * net: sctp: prevent writes to cookie_hmac_alg from accessing invalid
    memory
    - LP: #1542497
  * tcp_yeah: don't set ssthresh below 2
    - LP: #1542497
  * bonding: Prevent IPv6 link local address on enslaved devices
    - LP: #1542497
  * phonet: properly unshare skbs in phonet_rcv()
    - LP: #1542497
  * ipv6: update skb->csum when CE mark is propagated
    - LP: #1542497
  * team: Replace rcu_read_lock with a mutex in team_vlan_rx_kill_vid
    - LP: #1542497
  * Linux 3.13.11-ckt34
    - LP: #1542497
  * qeth: initialize net_device with carrier off
    - LP: #1541907
  * umount: Do not allow unmounting rootfs.
    - LP: #1541313
  * [media] usbvision fix overflow of interfaces array
    - LP: #1546273
  * [media] usbvision: fix leak of usb_dev on failure paths in
    usbvision_probe()
    - LP: #1546273
  * [media] usbvision: fix crash on detecting device with invalid
    configuration
    - LP: #1546273
  * tty: Fix unsafe ldisc reference via ioctl(TIOCGETD)
    - LP: #1546273
  * USB: serial: visor: fix crash on detecting device without write_urbs
    - LP: #1546273
  * ASN.1: Fix non-match detection failure on data overrun
    - LP: #1546273
  * iio: adis_buffer: Fix out-of-bounds memory access
    - LP: #1546273
  * x86/irq: Call chip->irq_set_affinity in proper context
    - LP: #1546273
  * usb: cdc-acm: handle unlinked urb in acm read callback
    - LP: #1546273
  * usb: cdc-acm: send zero packet for intel 7260 modem
    - LP: #1546273
  * cdc-acm:exclude Samsung phone 04e8:685d
    - LP: #1546273
  * usb: hub: do not clear BOS field during reset device
    - LP: #1546273
  * USB: cp210x: add ID for IAI USB to RS485 adaptor
    - LP: #1546273
  * USB: visor: fix null-deref at probe
    - LP: #1546273
  * USB: serial: option: Adding support for Telit LE922
    - LP: #1546273
  * ALSA: seq: Fix incorrect sanity check at snd_seq_oss_synth_cleanup()
    - LP: #1546273
  * ALSA: seq: Degrade the error message for too many opens
    - LP: #1546273
  * USB: serial: ftdi_sio: add support for Yaesu SCU-18 cable
    - LP: #1546273
  * USB: option: fix Cinterion AHxx enumeration
    - LP: #1546273
  * ALSA: compress: Disable GET_CODEC_CAPS ioctl for some architectures
    - LP: #1546273
  * ALSA: usb-audio: Fix TEAC UD-501/UD-503/NT-503 usb delay
    - LP: #1546273
  * arm64: errata: Add -mpc-relative-literal-loads to build flags
    - LP: #1533009, #1546273
  * SCSI: fix crashes in sd and sr runtime PM
    - LP: #1546273
  * n_tty: Fix unsafe reference to "other" ldisc
    - LP: #1546273
  * ALSA: dummy: Disable switching timer backend via sysfs
    - LP: #1546273
  * drm/vmwgfx: respect 'nomodeset'
    - LP: #1546273
  * x86/mm/pat: Avoid truncation when converting cpa->numpages to address
    - LP: #1546273
  * perf annotate browser: Fix behaviour of Shift-Tab with nothing focussed
    - LP: #1546273
  * powerpc/perf: Remove PPMU_HAS_SSLOT flag for Power8
    - LP: #1546273
  * Linux 3.13.11-ckt35
    - LP: #1546273
  * netfilter: bridge: don't use nf_bridge_info data to store mac header
    - LP: #1463911
  * netfilter: bridge: restore vlan tag when refragmenting
    - LP: #1463911
  * netfilter: bridge: forward IPv6 fragmented packets
    - LP: #1463911
  * netfilter: bridge: Use __in6_dev_get rather than in6_dev_get in
    br_validate_ipv6
    - LP: #1463911
  * ALSA: usb-audio: avoid freeing umidi object twice
    - LP: #1546177
    - CVE-2016-2384
  * vmstat: explicitly schedule per-cpu work on the CPU we need it to run
    on
    - LP: #1546320

-- Brad Figg <brad.figg@canonical.com>  Thu, 10 Mar 2016 14:41:56 -0800

Changed in linux (Ubuntu Trusty):
status:	Fix Committed → Fix Released

Revision history for this message

Paul Gear (paulgear) wrote on 2016-04-21:

#162

For posterity: If https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1505564/comments/143 is the cause of this issue for you, dmesg -D (which turns off console logging of kernel messages) might be a viable workaround until you can reboot.
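
A minimal sketch of the workaround described above, assuming a util-linux `dmesg` that supports `-D`/`-E` and a root shell. The helper function names are made up for illustration and are not part of any package.

```shell
#!/bin/sh
# Sketch of the console-logging workaround; helper names are hypothetical.
# Both commands need root. Kernel messages still go to the ring buffer
# (readable with plain `dmesg`); only printing to the console is toggled.

console_log_off() {
    dmesg -D    # long form: dmesg --console-off
}

console_log_on() {
    dmesg -E    # long form: dmesg --console-on
}
```

Calling `console_log_off` on an affected host stops the nbd messages from being written to the console; `console_log_on` restores console output. The setting is runtime state only and does not persist across a reboot.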

Soft lockup with "block nbdX: Attempted send on closed socket" spam

Affects	Status	Importance	Assigned to
linux (Ubuntu)	Fix Released	High	Dan Streetman
Trusty	Fix Released	Undecided	Unassigned
Vivid	Fix Released	Undecided	Unassigned
Wily	Fix Released	Undecided	Unassigned

Changed in linux (Ubuntu):
status:	In Progress → Fix Released

Changed in linux (Ubuntu Trusty):
status:	New → Fix Committed

tags:	added: verification-needed-trusty
tags:	added: verification-needed-vivid