CPU failure, memory controller, blacklist modules, patch kernel for DELL 12G PowerEdge

Asked by Robert McGrath on 2012-11-29

Our DELL server running Ubuntu 12.04 are erroneously reporting CPU failures.
DELL support have responded that this is an OS problem, and supplied supporting web-links indicating kernel patches that could be applied.

We require Canonical to urgently address this issue for our production systems running Ubuntu 12.04

Supporting info follows;
------------------------------------
kernel patch for sb_edac driver:
    http://lkml.indiana.edu/hypermail/linux/kernel/1112.2/03268.html
ACPI PAD driver bugzilla:
    https://bugzilla.kernel.org/show_bug.cgi?id=42981
Information on mei driver:
    http://ubuntuforums.org/showthread.php?t=1970325

http://en.community.dell.com/techcenter/b/techcenter/archive/2012/08/27/ubuntu-on-dell-12g-poweredge-servers.aspx

Have you tried other distributions (not Debian based). Have you tried installing Windows to test there too?

Installing other Distributions or Windows is not appropriate, these are production class systems, and we need support on Ubuntu 12.04 LTS as per our support agreement with Canonical.
Specifically the kernel and module fixes indicated by DELL (see links previously supplied) could be a good place to begin examining this serious issue.

Please advise of progress regularly, as this is a serious issue for us.

Further information forwarded to us from DELL support is this website link,
    http://en.community.dell.com/techcenter/b/techcenter/archive/2012/09/14/follow-up-ubuntu-on-dell-12g-poweredge-servers.aspx
which directly references Ubuntu/Canonical/launchpad bug ID's
#1007061
#1035216
#1041164

Please advise when fixes for these bug will be available via normal Ubuntu repositories.

Thomas Kr├╝ger (thkrueger) said : #4

You should be aware that this platform is for community support. If you have a support contract with Canonical, this is not the right place to post your request. Please use the support channels Canonical made available to you with the contract.

Apologies for the inappropriate tone on a community forum, I've raised it via our correct support channel, thanks.
And thank you all for the community help.

Darryl Weaver (dweaver) said : #6

FYI: Should be fixed in kernel linux-image-3.2.0-34-generic which is now available on precise.