Author Archive for Mike Guthrie

Page 2 of 8

Building a Nagios 4 / Nagios XI Prototype Box

So after an awesome set of presentations at the Nagios World Conference 2012, one of the hot topics for discussion was clearly the upcoming Nagios Core 4 release. Andreas Ericsson has been hard at work overhauling the Core engine to optimize performance and reduce Disk and CPU usage for Nagios, and initial tests are showing his work has paid off in a substantial way. For this experiment, we’re going to use a system with the following specs:

  • Virtual Machine running under Vmware Workstation 8
  • 2GB of RAM
  • 1 CPU, 4 Cores
  • 80GB Hard Disk
  • Nagios XI Installed
  • Nagios binaries replace with Nagios 4 monitoring engine
  • ndoutils binaries replaced with with the latest SVN code for ndoutils: nagios/ndoutils/branches/ndoutils-2-0
  • No initial performance tweaks other than Nagios 4 and ndoutils 2

I’ll post setup instructions below for users who also want to play around with this setup. Note: This setup is not intended for production installs, use this in test environments only!

Start with Nagios XI installed, either through the pre-installed VM or with a manual installation. I chose a manual installation for this demo so I could set up the hardware to my liking and give it sufficient hard drive space to test a LOT of hosts. My first attempt at the prototype only had 10GB on the box, and filled up quite quickly because of performance data. .I ran the following commands after initial Nagios XI installation and setup was completed.

From the command-line:

You can verify the upgrade succeeded by reviewing the /usr/local/nagios/var/nagios.log file. There should be some new warnings about obsolete definitions like “failure_prediction_enabled”, which we won’t worry about for now. For now I’d like to see what kind of performance impact I can expect for a large number of checks being run on this machine, so I need to quickly create a large number of checks.  I’ll achieve this by running a tools script that we include with every installation of Nagios XI.

I chose to use static configs instead of the CCM for this benchmark for ease of setup time, and also easy removal later on. This also creates a list of checks with 25% of the services showing up as critica, which is useful in testing a system stressed with alerts and notifications. However, I’m also going to turn off notifications and event handlers during this setup phase just to make sure I don’t bottleneck somewhere and tank the entire box. Now lets restart Nagios to start using the new configs.

After adding 1000 hosts and 4000 services all at a 5mn interval the CPU load is running at a nominal level, averaging anywhere from .30 – .70, which is pretty impressive for a 4 core system! There is still some Disk IO because performance data processing is happening for each service, and this will likely be one of the noticeable bottlenecks as we add more checks to this system. After the system levels out and all of the checks are settled into a hard state, I turn on notifications and event handlers and begin watching the system and testing for bottlenecks. I’ll post back with some results soon! If there are any XI users out there who want to give this a shot in their test environments and post back with their results we’d love to hear what you find!

 

 

 

Nagios XI 2012 Public Beta

After many months of anticipation and hard work, we are pleased to announce Nagios XI 2012 Beta is ready for public testing! XI 2012 is loaded with a variety of new features, and we’d love your help in making sure that 2012 is ready for full release as soon as possible. A key difference with the release of 2012 is that there will be both an Enterprise Edition and a Standard Edition, with the Enterprise Edition being targeted at users with larger environments.

Continue reading ‘Nagios XI 2012 Public Beta’

Nagios Incident Manager Public Beta

Nagios Incident Manager is an enterprise-class incident management application that allows teams and individuals to track and solve problems faster using a powerful web-based application that offers security, mobility, third-party integration, and tools for collaboration. Nagios Incident Manager can be integrated easily with Nagios XI or Nagios Core’s built-in event handling, or any other third party tool with an easy-to-use web API for creating and managing tickets. Nagios Incident Manager is currently available for public beta testing, with install instructions posted below. Supported installation platforms are CentOS and RHEL 5/6, and we recommend installing the beta on a clean system.

Beta Installation Instructions

Documentation and tutorials are coming soon!

For Nagios XI users, see documentation on Integrating Nagios IM with Nagios XI.

 

 

Automated Host Management In Nagios XI

This document describes how to automate adding and removing hosts and services in Nagios XI from the command line. Some administrators may have need to automate the process of adding and removing hosts and services in Nagios XI for use with cloud computing or large environments where solutions like Puppet or Chef may already be implemented. The procedures below outline how administrators can create their own automation solutions to safely add and remove hosts and services in Nagios XI while still maintaining the integrity of the monitoring environment.

Automated Host Management Documentation