Jeff Boles has written a very interesting and very informative piece on the importance of virtual infrastructure management, and particularly its optimization in the virtualized datacenter. He refers to the issue as virtual infrastructure optimization. Being in the business I'm in ... next generation virtual system management for configuration and fault management ... I really appreciate the attention to virtual infrastructure (or, perhaps, more accurate virtualized infrastructure). He's said all of the right things about optimization.
While he correctly identifies many of the problem sources -- interdependencies of the virtual and physical systems being key in the argument -- he stresses performance as the aspect or dimension on which to focus. He then makes the statement:
The catastrophic impacts of which he speaks are rarely just a matter of performance optimization. They are generally issues of infrastructure configuration errors and mismatches ... often mismatches between the virtualized infrastructure (e.g. the virtual switches, portgroups, and VLAN connectivity) and the physical infrastructure (the "hard goods" ... switches, storage systems, firewalls, etc.) In fact, recent studies by Andi Mann at Enterprise Management Associates regarding virtual system management indicate that the greatest source of unplanned downtime in the virtualized datacenter (ranging from 60 - 70% of all outage incidents) is attributable to configuration errors and mismatch. This single fact should bring to light the unpleasant truth about the way in which configuration design and on-going configuration management need to be rethought.
Jeff points out that "
I liked the Q&A that Jeff uses to get across the key messages of VIA. The one I have some trouble with, simply because I don't believe he treats misconfiguration in its entirety, is this one:
In a sense, what is missing from the article and from his list of virtual systems management products is infrastructure assurance. It's not just the data required to avoid misconfiguration once an error is detected. It's also that set of tools and systems deployed for the initial design, at the time there's a major change / revision to the datacenter, and (unlike conventional datacenters) revisited every time a "golden master" is called upon:
- the starting configuration is well designed for the very fluid, dynamic environment of a virtualized datacenter,
- that the configuration is the combined physical and virtual infrastructure elements, treated as a unified infrastructure rather than as separately designed and managed "physical configurations" and "virtual configurations"
- that the instrumentation supports on-going monitoring, analysis and prescriptive actions to counteract "infrastructure drift" and eliminate (or at least significantly reduce) the major causes of catastrophic failure and performance problems.
To Boles' list of key technologies (instrumentation and infrastructure optimization) I would have to add the unified (virtual and physical) infrastructure analysis and directed re-configuration. The result is virtual infrastructure assurance (VIA) to which VIO then is applied. These are the technologies (of which Replicate's RDA is one) which provide visibility into the causes and prescriptive actions that actively and continuously reduce virtualized datacenter failure. These fall logically into his requirement for a holistic view of the entire virtual infrastructure and into the configuration diagnostics that include remedial actions. Needless to say, I'd like to see these types of discovery, analysis and guidance tools invited to the party.
InfoStor : What is virtual infrastructure optimization (VIO)?, March 2009 Page 1
While the IT practitioner's every day is a swim through waves of invisible bits, there has long been some comfort to be found in the "physicality" and accessibility of key devices. When problems arise, administrators have always been able to identify a switch port for examination, a server at the end of a wire that might be causing problems, an HBA for inspection, or any number of other physical things for further examination. But in today's data center, that comfort has vanished.
In part, this is due to virtualization, and while this trend is spearheaded by server virtualization, variations include application virtualization, network device virtualization, I/O virtualization, storage virtualization, and more.
