XVM

XVM Nagios monitoring sucks

Bug #307357 reported by Evan Broder
4
Affects Status Importance Assigned to Milestone
XVM
Confirmed
Critical
Unassigned

Bug Description

We currently have an absolutely minimal Nagios setup for monitoring XVM. We should expand its configuration to monitor for specific services - the DNS server, the VNC proxy server, etc.

Revision history for this message
Duane Hinnen (duanedesign) wrote :

Thank you for taking the time to make Ubuntu better. Since what you submitted is not really a bug, or a problem, but rather an Feature Request to improve Ubuntu, you are invited to post your idea in Ubuntu Brainstorm at [WWW] https://brainstorm.ubuntu.com/ where it can be discussed, voted by the community and reviewed by developers. Thanks for taking the time to share your opinion!

Revision history for this message
Evan Broder (broder) wrote : Re: [Bug 307357] Re: XVM Nagios monitoring sucks

Hi - um...huh? What? This is /not/ an Ubuntu bug, this is a bug with the
XVM project I'm not entirely clear why you responded to it.

- Evan

duanedesign wrote:
> Thank you for taking the time to make Ubuntu better. Since what you
> submitted is not really a bug, or a problem, but rather an Feature
> Request to improve Ubuntu, you are invited to post your idea in Ubuntu
> Brainstorm at [WWW] https://brainstorm.ubuntu.com/ where it can be
> discussed, voted by the community and reviewed by developers. Thanks for
> taking the time to share your opinion!
>
>

Revision history for this message
Evan Broder (broder) wrote :

Two of XVM's three outages were discovered by fortunate accident, when they should have been discovered by monitoring. This is a serious issue.

Changed in xvm:
importance: Undecided → Critical
status: New → Confirmed
Revision history for this message
Greg Price (gregprice) wrote :

On Sat, Dec 27, 2008 at 07:06:11AM -0000, Evan Broder wrote:
> Two of XVM's three outages were discovered by fortunate accident, when
> they should have been discovered by monitoring. This is a serious issue.

E.g., we should check that the hosts can read their disks, which is
what failed last night.

Greg

Revision history for this message
Quentin Smith (quentin-mit) wrote :

As of tonight, we are now monitoring DNS, DHCP, and VNC. Things that we still need to configure monitoring for include:

- xend
- conserver
- the RAID

All of these require running something on each machine being monitored, which means they need to run a daemon of some kind. The canonical solution is called NRPE. We should figure out what a secure configuration looks like and set it up.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.