CPU failure, memory controller, blacklist modules, patch kernel for DELL 12G PowerEdge

Asked by Robert McGrath

Our DELL server running Ubuntu 12.04 are erroneously reporting CPU failures.
DELL support have responded that this is an OS problem, and supplied supporting web-links indicating kernel patches that could be applied.

We require Canonical to urgently address this issue for our production systems running Ubuntu 12.04

Supporting info follows;
------------------------------------
kernel patch for sb_edac driver:
    http://lkml.indiana.edu/hypermail/linux/kernel/1112.2/03268.html
ACPI PAD driver bugzilla:
    https://bugzilla.kernel.org/show_bug.cgi?id=42981
Information on mei driver:
    http://ubuntuforums.org/showthread.php?t=1970325

http://en.community.dell.com/techcenter/b/techcenter/archive/2012/08/27/ubuntu-on-dell-12g-poweredge-servers.aspx

Revision history for this message
actionparsnip (andrew-woodhead666) said :
#1

Have you tried other distributions (not Debian based). Have you tried installing Windows to test there too?

Revision history for this message
Robert McGrath (robert-mcgrath) said :
#2

Installing other Distributions or Windows is not appropriate, these are production class systems, and we need support on Ubuntu 12.04 LTS as per our support agreement with Canonical.
Specifically the kernel and module fixes indicated by DELL (see links previously supplied) could be a good place to begin examining this serious issue.

Please advise of progress regularly, as this is a serious issue for us.

Revision history for this message
Robert McGrath (robert-mcgrath) said :
#3

Further information forwarded to us from DELL support is this website link,
    http://en.community.dell.com/techcenter/b/techcenter/archive/2012/09/14/follow-up-ubuntu-on-dell-12g-poweredge-servers.aspx
which directly references Ubuntu/Canonical/launchpad bug ID's
#1007061
#1035216
#1041164

Please advise when fixes for these bug will be available via normal Ubuntu repositories.

Revision history for this message
Thomas Krüger (thkrueger) said :
#4

You should be aware that this platform is for community support. If you have a support contract with Canonical, this is not the right place to post your request. Please use the support channels Canonical made available to you with the contract.

Revision history for this message
Robert McGrath (robert-mcgrath) said :
#5

Apologies for the inappropriate tone on a community forum, I've raised it via our correct support channel, thanks.
And thank you all for the community help.

Revision history for this message
Darryl Weaver (dweaver) said :
#6

FYI: Should be fixed in kernel linux-image-3.2.0-34-generic which is now available on precise.