A new release of NDOUtils (1.5) is now available from SourceForge. This version has a number of performance enhancements including a patch to insert data into the database asynchronously.
Enjoy!
A new release of NDOUtils (1.5) is now available from SourceForge. This version has a number of performance enhancements including a patch to insert data into the database asynchronously.
Enjoy!
The new Nagios XI Operations Center Component provides a NOC screen-style view of all unhandled host and service problems. The screen automatically refreshes every 30 seconds to show the latest problem events. This is one of two NOC-style screens recently created, along with the Nagios XI Operations Screen Component. Users can pick a NOC screen to suit their visual tastes that will keep a close eye on the latest problems in their environment.
Download Nagios XI Operations Center Component
A new release of NSCA (2.9.1) is now available from SourceForge. Thanks to Daniel Wittenberg for a patch that allows nsca to receive packets with a future time stamp. Thanks to a suggestion on the mailing list by John Rouillard, the server (nsca) now allows packets with both the new (>= 2.9), larger plugin output buffer and the older (<2.9) plugin buffer.
Enjoy!
We are pleased to announce that you can now easily launch your Nagios XI monitoring server easily in the Amazon Elastic Compute Cloud (EC2). We have available clean centOS 6 with Nagios XI pre-installed available for public and customer use. This makes it extremely easy for Nagios XI Administrators to start additional server without the need to procure or invest in hardware. Additionally, those wishing to demo Nagios XI can easily do so using the cloud.
The full how-to documentation is available in the Nagios Library article Using Nagios XI In Amazon EC2 Cloud
This document describes how to launch a new pre-installed Nagios XI server in the Amazon EC2 cloud and is intended for Nagios XI Administrators who would like to bring up new Nagios XI instances in the Amazon Elastic Compute Cloud (EC2) as well as those who are moving existing installations to the cloud. Start a trial copy of Nagios XI in the cloud!
If you are new to Nagios XI, or would like to see it in action, this would be a fast and efficient way to give it a test run.
Nagios Mobile is a lightweight web interface, based on the Teeny Nagios project by Hirose Masaaki. Nagios Mobile is a PHP web-based application designed for Mobile and touch-screen devices.
Key Features:
- User-level authorization for hosts, services, and commands that match Nagios Core.
- Filtered lists to quickly identify and respond to unhandled problems
- Acknowledge problems, Disable/Enable Notifications, or Schedule Downtime for authorized hosts and services
- Works with any Nagios 3.x installation
- Support for APC data caching for faster page loads
- Support for both webkit and non-webkit enabled devices
My favorite kinds of development projects always end up being on the front-end, and I certainly can’t claim much on the interface design for this project, as that goes to community member Hirose Masaaki using the JQuery Mobile framework. We loved the front-end design that he came up with for the Teeny Nagios project, so we did some revisions to the server-side code underneath to allow for host and service filtering by state, more complex permissions, data caching, and improved scalability for larger installations. We also added some code to allow Nagios Mobile to work from essentially any mobile browser.
One of the most challenging, but also rewarding projects that I’ve worked on so far during my time at Nagios is the Nagios Business Process Intelligence (BPI) project. Nagios BPI was created as a way to visualize business process health by grouping hosts and services together, and creating rules to discern the true health of the network infrastructure as it relates to the business. An admin can define rules for each BPI group, and monitor the health of the group’s state based on what has been defined. Version 1.x of BPI got a lot of positive feedback from users, and quite a few feature requests. However, as time went on it became clear that in order for BPI to be more suitable for enterprise environments, more advanced permissions needed to be implemented, as well as several other usability issues resolved. I’ve spent the last 6 weeks or so doing some seriously overhauling to the code in order to support a lot of the new features I wanted to add to a new version of BPI. I’m excited about the changes in this new version, and I also really think that this is an add-on to Nagios that can really do some good in a lot of monitoring environments. I think the future of monitoring is going to highlight the idea of monitoring within the context of the business, and this project allows users to turn host and service monitoring into actual business process monitoring. Currently this project is in a beta stage and only works with Nagios XI, and we plan to implement this as a feature of our 2012 release. A community version of Core will follow sometime later in 2012, but the intention is to pilot a lot of these new features in the XI environment, and later the code can be adapted to allow for use with Core installs as well. Here’s a highlight of the new features in BPI v2.0
See the updated documentation for BPI v2 here.
The code for this new version has not yet been released. Feel free to contact me if you’re interested in beta testing before the 2012 release of Nagios XI. Here are a few screenshots from the new version.
A question our sales team often gets from potentials customers is: “How many hosts/services can I monitor with a single Nagios XI license?” As much as we’d like to be able to give people a concrete answer to the question, it ultimately comes down to either “We don’t know,” or “That depends on….”. So as a side project, I decided to attempt my second benchmark test with Nagios XI, and see how hard we can push the software, having learned a few things since my first test almost a year ago. Most of my findings from that first test were outlined in the document Maximizing Nagios XI Performance. Since writing this, we’ve learned a few tricks from both Core and XI users that have been done in larger environments, and we’ve also played with a few ideas we’d never tried before. So here’s the rundown on what we’re using for a test machine, the tweaks I tried, and the results I found. Special thanks to Nagios Community members Daniel Wittenberg, Jeff Sly, Nate Broderick, and Max Schubert for your large installation tips.
Nagios XI Server (An older physical desktop we converted to a test machine).
The Results:
The CPU load generally hovers around a comfortable 1.75 to 2.5, and the real page load times for the XI interface range from about 1-7 seconds, depending on the page. Below is a list of tweaks that I found actually made a noticeable impact on the server’s performance
##experimental DB tweaks tmp_table_size=524288000 max_heap_table_size=524288000 table_cache=768 set-variable=max_connections=100 wait_timeout=7800 query_cache_size=12582912 query_cache_limit=80000 thread_cache_size=4 join_buffer_size=128k
#!/bin/sh ntpdate pool.ntp.org /sbin/service httpd restart /sbin/service postgresql restart psql nagiosxi nagiosxi -c "vacuum;" psql nagiosxi nagiosxi -c "vacuum analyze;" psql nagiosxi nagiosxi -c "vacuum full;"
The next stage of our benchmark testing will be to offload the checks themselves to slave machines using either DNX or Mod Gearman to distribute the check load. We’re also going to upgrade our benchmark box once more, so my hope is to to able to load a single XI instance to 20-30k checks every 5 minutes, but I’m sure we’ll discover our share of new complications and bottlenecks as we continue to scale XI to a larger install. We’ll keep you posted on what you find! If you have suggestions for further tweaking an XI install, post a comment because we’d love to hear them!
A new release of NPRE (2.13) is now available from SourceForge. Thanks to the guys at Kaspersky labs, the allowed_hosts directive in the nrpe.cfg file now allows IPv4 subnets. Also, thanks to a post by Kevin Pendleton at UtahSysAdmin.com, NRPE now compiles on Solaris 10.
Enjoy!
We had a cool meeting last week with one of our users, and he gave us some great suggestions for tweaks and improvements. One of his ideas was adding the ability to “schedule immediate checks” in bulk, so that once problems are fixed, admins can quickly cross hosts and services off the list of problems. As our user said, “I’m not so much interested in what is working as I am what’s not working.” Since the logic and filtering was already in place with the Mass Acknowledgment Component, I decided to simply add “schedule immediate check” to the list of options with this component. Thanks user TL for the suggestion! As developers, we love feedback, and usually our best ideas come from users, so keep the ideas coming! : )
We’ve had a few users request the use of the “action_ur”l and” notes_url” config options to be accessible from the XI details screen. Nothing terribly fancy with this component, just another option for those users who need it. As Nagios developers, we love feedback, and we love to know what you guys need from our software, so keep the suggestions coming!