Zenoss published the LISA 06 Survey Results on Open Source Network & Systems Monitoring:

The survey results included responses from over 100 LISA attendees both on paper surveys and via an electronic survey mechanism. Surveys were offered in the Zenoss exhibition booth, at the LISA Birds of a Feather Session: “Solving IT Management Headaches with Open Source Software “.

My main takeaways of this survey are:

  • Most pressing issues faced by administrators are monitoring, configuration, Patching & provisioning.
  • 65 % of the respondents plan to use OSS tools in the monitoring space and 36% for configuration.
  • the most popular tools are Nagios, MRTG/RRDTool, Snort and Nessus, of which none are for configuration management.

While looking at some open source projects in the area of configuration management and automation, I stumbled upon several projects and links, which I think might be useful to the community.

Bcfg2, developed in the Argonne National Laboratory. Licensed under 2-clause BSD-style license

Bcfg2 allows you to describe and deploy complex configurations across pools of GNU/Linux and Unix systems, leading to a consistent, reproducible, and verifiable description of your environment. Bcfg2′s visualization and reporting tools aid in your day-to-day administrative tasks. Its unique analysis features help you cope with the ever-increasing complexity of your networks.

Cfengine developed by Mark Burgess from the Oslo University College, Norway. Licensed under GPL2

It is used to implement policy-based configuration management on open systems (Unix-like environments), through the interpretation of it’s own declarative language.

The program focuses on a few key areas that scripts tend to mishandle. From a single configuration files (or set of files) you specify, using classes, your network configuration; cfengine then parses the file and carries out the instructions, warning you about errors (or fixing them) as it goes.

You can think of cfengine as a very high level language, higher than Perl or shell: a single command can result in many hundreds of operations being performed on multiple hosts. You can also use it as a net-wide front end for ‘cron.’

Puppet developed by Luke Kanies, who founded Reductive Labs. Licensed under GPL

Puppet lets you centrally manage every important aspect of your system using a cross-platform specification language that manages all the separate elements normally aggregated in different files, like users, cron jobs, and hosts, along with obviously discrete elements like packages, services, and files.

LCFG developed by Paul Anderson from the University of Edinburg. Licensed under GPL.

LCFG is a system for automatically installing and managing the configuration of large numbers of Unix systems. It is particularly suitable for sites with very diverse and rapidly changing configurations.

For a discussion of the merit of each, see this post on The Changelog.

For some more information on the Large Scale System Configuration Workgroup, see the lssconf website.

Or the following article: Automating Network Administration, by Luke A. Kanies.

Compuware announced yesterday that it has acquired Proxima Technology.

Proxima is the maker of Proxima Centauri, a Service Level Management tool with interesting capabilities, specially in the way they aggregate key performance indicators into business service metrics.

Proxima has also a unique way to combine Six Sigma methodology and ITIL, to, for example, normalize the events generated by the monitoring of KPIs into Defects Per Millions of Opportunities, rolling them up into service metrics as Sigma numbers.

I don’t know how Compuware is going to use this technology, but one issue with the Proxima approach is that the service impact relationships, defining how IT components are contributing to a service, is defined within Centauri, using rules instead of model relationships .

Without a tight integration with a CMDB, SLM/BSM tools are required to duplicate the CMDB items and relationships, sometimes using two different models, providing a weak synchronization.

As a comparison, BMC Service Impact Manager is using the BMC Atrium CMDB as the model repository, adding service impact relationships on top of the exiting items, making sure that there is a tight synchronization between assets and business services. And since Compuware does not have a real CMDB, or a discovery solution (they use the Collation/IBM solution), it’s unlikely that they will be able to provide this level of integration.

In September 2006 Lokomo Systems announced the release of an open source CMDB called OneCMDB.

This is a very good start, but I would point to several issues, at first glance:

  • The model used seems to be developed in house, without any link to DMTF CIM : Why re-invent a model which already exists elsewhere ?
  • The reconciliation of discovered data with existing data seems not existent.
  • The security model seems inexistent (or I did not find it): no role based access.

I don’t want to be too harsh, but one thing I discovered when I joined BMC is that a CMDB is more than just a persistence store for an object model. It’s actually very complex, specifically the reconciliation and federation of various information sources. The CMDB is a virtually centralized repository for information residing in many, many different locations.

Another key ITIL recommendation for the CMDB, is that all modifications of the CMDB should be performed under change management control. Therefore, configuration management and change management are really tied at the hip.

I’ll try to play with this open source CMDB and explain in a bit more details the need for reconciliation, and integration with change management in some upcoming posts.

Anyway, thanks to Lokomo for releasing their sources under GPL.

Today, the VMWare Fusion Beta program opened :

The new VMware desktop product for the Mac, codenamed Fusion, allows Intel-based Macs to run x86 operating systems, such as Windows, Linux, NetWare and Solaris, in virtual machines at the same time as Mac OS X. It is built on VMware’s rock-solid and advanced desktop virtualization platform that is used by over four million users today.

I’ve tried to start a windows 2003 VM Image that was created on a PC, without any problem.

fusion

I have been trying for the past few days to run Solaris 10/11 in parallels on a new macbook. The main issue I had was the JDS/gnome resolution adaptation with the 1280×800 resolution of the macbook.

Here are my steps :

  • I installed the new parallels beta (build 3036)
  • downloaded the free Solaris 10 update 3 from Sun’s web site. Download the DVD since it can be directly mounted in parallels.
  • create a new VM with the Solaris 10 type, but, before to finish, un-select the option to start the Solaris installation and edit the VM configuration to add a custom screen resolution of 1280×800.
  • start the VM to launch the install
  • log in an create a /etc/X11/xorg.conf with something like the one attached here (xorg.conf). Basically you need to add the Modeline lines (generated with /usr/X11/bin/gtf) and put 1280×800 in the appropriate Display subsections.

It should work. However, I’m still experiencing some duplicate keystrokes and garbled screen after VM resume (I have solved the later issue by switching multiple times between full screen and os window).

If anyone has any solution for these two problems, let me know.

Today, I received the results from my ITIL foundation Certification test. Some questions were really tricky, but I passed.

I was going to just go to the test, but BMC Business School has an ITIL foundation training which includes the EXIN exam at the end.

I must say that I was surprised by the quality of the class, specifically the airport simulation. Within the 3 days class, there is one day dedicated to a simulation game applying “experiential learning” techniques:

In the simulation the airport has grown rapidly requiring the addition of numerous business services. With an increasingly complex IT infrastructure, a “Service Desk” has been introduced to support the business, along with a “Technical Specialist” team to maintain the infrastructure and resolve any issues that may arise. A supplier function is also included to provide additional infrastructure and consultancy. All students play a role in this, whether in the business or in the IT team ensuring SLA’s with the business are met.

Really not a boring way to learn ITIL (I’m sure that you CAN find really boring ITIL classes, this is definitely not one). If you are interested by the class, here is the link in the catalog. and you can even watch a video of the simulation.

itil-pin

Today, I was corrected by one of my new coworker because I had the impression that all hardware within a given environment would be in the Definitive Hardware Store (DHS), while in fact ITIL defines it as :

An area should be set aside for the secure storage of definitive hardware spares. These are spare components and assemblies that are maintained at the same level as the comparative systems within the live environment. Details of these components and their respective builds and contents should be comprehensively recorded in the CMDB. These can then be used in a controlled manner when needed for additional systems or in the recovery from major Incidents. Once their (temporary) use has ended, they should be returned to the DHS or replacements obtained.

This basically means that it’s a set of standby systems used for failing over services in case of incidents. It seems that more practically, the systems in the DHS should be ready to be re-purposed on demand to solve any upcoming capacity or availability issue (instead of being pre-configured to match all possible systems in the environment).

Also, with new technologies like virtualization would this mean that virtual server images in standby could be stored in the DHS ? Is a virtual machine image closer to an hardware component or to a software package ? Any comment ?

Last month, CA acquired Cendura to complement its CMDB offering. This comes after a string of consolidations

in the discovery space (see my previous post about Applications and Asset discovery landscape).

It will be interesting to watch how CA integrates Cendura’s Cohesion with its own Sonar technology :

Sonar watches and analyzes traffic on the network, and understands more than 1,700 protocols and information sources. It builds accurate maps and keeps those maps updated as resource allocations change.

Also, Cohesion is coming with an impressive set of rules that could be applied to ensure the compliance of configurations

with best practices. Integrating these rules on top of a CMDB could provide an interesting way of performing

configuration management (configuration audit) or even problem management (root cause analysis).

CA is expecting to integrate Cendura’s technology within 120 days. Lets see.

New job

After a long break of more than a month, it is time to contribute an entry to this blog. This break coincide also with a change of employer. After 11 years with Sun, working on projects like Solstice Bandwidth Allocator, Solaris Bandwidth Manager, and the IPQOS feature of Solaris 10 (well, 9 8/03 more exactly), and more recently on N1, the change is really welcomed.

I’m joining BMC Software to look after the architecture and technical strategy of the Datacenter Management Solutions. This sounds really exiting as BMC has already all the components required to manage the full lifecycle of datacenter infrastructure, applications and services. Bringing together the Remedy Service Management , BMC Atrium CMDB , and Marimba product lines into consistent solutions has the potential to solve many of the challenges I have described in this previous post.

As before, I’ll try to be as much as possible impartial, and will speak only for me, and not BMC, remember :

The opinions expressed herein are not necessarily those of my employer,
not necessarily mine, and probably not necessary altogether.

« Older entries § Newer entries »