I ended my last blog post with "Integrate and prosper." Little did I know that Cisco, EMC, and VMware were about to unveil a Virtual Computing Environment (VCE) early the following week, the biggest cross-vendor integration project yet seen in the world of computing. Yes there were rumors about a Cisco/EMC joint venture that would sell Cisco servers packaged with EMC storage, but none that I heard captured the boldness and scope of VCE.
The core VCE compute platform is called a "Vblock," an integrated, pre-packaged IT solution consisting of server and networking resources from Cisco; storage, security, and software-based management tools from EMC; and an OS platform (vSphere) from VMware. To market and support Vblocks, the coalition has created two separate entities: a Solution Support Team staffed and funded by the coalition partners that will do presales and provide other marketing resources, and Acadia chartered to build and, if needed, operate Vblocks on premises for a customer. Who will you buy Vblocks from? Just about everyone except HP, IBM, and OracleSun.
Not to be outdone, the day following the VCE announcement, HP announced Converged Infrastructure (CI). CI is integrated server, networking, and storage resources too, but doesn't need a coalition. It takes most of what it needs from HP's own product lines. I say most because HP doesn't own a server virtualization platform--a key ingredient. Not to worry. Despite the fact that VMware is invested in Vblocks and Acadia, it likes CI too. And then of course there's the HyperV alternative out there...somewhere...
Its decidedly unclear at this point how successful these integrated compute stacks will be in an IT marketplace that's undergoing multiple transitions--from physical to virtual, from stove pipe to cloud, from decentralized to consolidated. The purveyors of integrated compute stacks are driven by a central belief: that your CIO wants to make the transition to virtual/cloud/consolidated simpler by wrapping up servers, switches, and storage arrays into one neat, pre-integrated package.
As storage pro, you may find that ironic. You may be old enough to remember when many of these same vendors were selling decentralized client server computing as the better, simpler way. Now the Three Musketeers--consolidation, centralization, and virtualization--are here to vanquish the complexity created by the-network-is-the-computer computing. Please shoot me if I even suggest that integrated compute stacks are the new mainframe.
So, as a storage professional slaving away within the bowels of corporate IT infrastructure, is the integrated compute stack about to change your life? I'd say yes if the powers that be like the Vblock concept. And if they like Vblocks then your choice of storage is EMC's. If CI wins, you get HP's flavor of the month array. Simple, right? And life could get simpler still when the IT operations group reorganizes around Vblocks. You may get to know the people in the server and network administration groups much better than you know them now as they start doing some of the things you do.
On the other hand, Vblocks and CIs may well be a tough sell in your organization. Cisco has yet to make its mark in the server world and buying a million-dollar anything from a coalition of vendors that want to run your critical applications on their collaborative platform is untried to say the least. But the biggest hurdle standing in the way of the integrated compute stack as it approaches your IT operations group may well be the following retort: "We just don't do things that way here."
Given that the phrase "our current economy" has such a negative connotation, EMC's third-quarter earnings report last week was downright upbeat.
EMC surpassed its own optimistic guidance for the fiscal quarter just ended by 4 percent. Wow. Other phrases like "increasing confidence among customer in spending their IT budgets" and "very weak first quarter progressing to more normal third and fourth quarters (of 2009)" were heard on the conference call with analysts. Cool. Happy days are here again for the storage industry, right? Well, it depends.
EMC is still regarded as a bellwether for the rest of the storage industry. Indeed, Information Storage as a line item in EMC's third-quarter 2009 financial report accounted for 65 percent of EMC's Information Infrastructure revenues. Upbeat forecasts are often extrapolated to other publicly and privately held storage companies. But EMC is changing. While it's still "where information lives," strategic initiatives include cloud computing, security, broader IT services, and of course, virtualization. It's a very different storage company than the rest of the pack in that all of these initiatives now contribute directly and indirectly to the growth of its Information Storage division. No other storage company operates under EMC's model.
Additionally, there are forces currently at work in the marketplace that tend to mitigate against either raw storage growth or profit margins. Data deduplication technologies are now in the mainstream of IT operational processes. Reducing physical storage requirements by a factor of 20:1 for a growing number of data types is increasingly commonplace, reducing the demand for raw physical storage. And while cloud storage is the latest craze, it's a tough place to generate profit dollars. Cloud storage buyers like commodity hardware, open-source software, and a DIY attitude.
I agree with EMC's executives, who are predicting that the storage industry can expect to see purchasing activity returning to somewhat normal levels in the coming quarter and extending into 2010. But I don't think we'll see a rising tide that automatically floats all boats. Rather, positioning and an ability to integrate with other things--platforms, applications, processes--will be key predictors of success.
Again, let's look at EMC. No doubt, VMware is now an indirect growth contributor to EMC's Information Storage division. The same is true for 3Par, Dell/EqualLogic, HP/LeftHand, and NetApp. It should by now be clear that integration with server virtualization is goodness if you're a storage vendor.
Positioning in the storage cloud is chic but risky at the moment. Nevertheless, if you as a storage vendor can survive the harsh profit environment, your reward will come in due course. Cloud service providers are getting away with the data center equivalent of murder right now. Security is all but nonexistent and incidents of data loss are increasing. Yet providers are still able to sign-up customers who ignore the fine print in the service agreement that absolves them of all liability. That will change. And when it does, the demand for infrastructure that supports a higher quality of service will rise.
Then there are the "stack" vendors--those who are now pursuing an end-to-end, application-to-disk and everything-in-between product integration strategy. These include IBM, HP, Oracle/Sun (yes the deal gets done), and Cisco/EMC. The move by enterprise IT to virtualization and by inference, the cloud, drives this integration. And one interesting and perhaps intended consequence is this: In the world of the integrated processing stack, all becomes visible. A vendor like Cisco supplying new data center network infrastructure to a customer (Data Center Ethernet for example) can also see and respond to needs for upgraded storage (from partner EMC), new server connection devices like NICs and HBAs, upgraded management software (again from partner EMC). Virtualization gives stack vendors a sales opportunity vantage point that plays directly to the integrated stack strategy. Should smaller point-product vendors be pursuing an integration strategy with them? You bet.
There once was a point in time when the storage industry stood proud, broad-shouldered, and fiercely independent. Remember when storage mavens scoffed at Scott McNealy's "storage is a mere feature of the server" pronouncement? Sorry, but those days are gone. Integrate and prosper.
At Storage Networking World in Phoenix this week, there was a buzz in the hallways and over breakfast tables about the T-Mobile Sidekick outage that was due, according to Microsoft, to "a system failure that created data loss in the core database and the back-up." And why not? There are about 800 enterprise-level storage administrators here. The backup process is squarely in their space as is data recovery and data integrity. Some of their colleagues and some vendors represented on the show floor were at Sidekick ground zero pulling data from the wreckage.
SNW attendees knew that there were many fingers pointing in many different directions over this and my finger shan't be one. However I will go out on a limb and say that my understanding of the situation is that it was not a result of sabotage as was once rumored. Rather it was due to failure of two coincidental processes, in this case a data migration failure that was preceded by a backup failure.
Microsoft now says that little if any data was actually lost. T-Mobile Sidekicks are being restored and all or almost all will be made right again. Life will go on normally as if nothing really happened.
Really? Put yourself in the shoes of a Sidekick user for a minute or two. Do you know where your data is and I mean all of it? And, to borrow a line from an old Dustin Hoffman movie: is it safe?
Take an inventory. You have data on your desktop, laptop, Palm device, smartphone, entertainment center, home network...Then ask yourself: how much of this data could you lose without caring whether you ever used it again? Certainly some, perhaps a lot of it, will fall into the data dumpster category. But the T-Mobile scare is yet another reminder that each of us owns data that has become critical to our daily activities. Could you function if someone grabbed your smartphone and ran away? For an increasing number of us, the answer is yes, but with ever greater difficulty. Some other data about us, our medical records for example, are life-critical.
Next, try to figure out how much of that critical data you actually have control over and then back it up. Immediately. Don't trust others to do it for you. Take control and make copies locally and/or using one of the many online backup services.
As one of my Twitter compatriots SEPATONjay observed over breakfast this week, if the service level agreement between T-Mobile and Microsoft couldn't prevent this failure, how good are the SLAs between any of the rest of us consumers and our services providers. Take an inventory of the services providers that hold your data, then read their contracts, (assuming you can find them). I'll bet all of them indemnify the vendor against the loss of your data. If you can't protect that data, don't assume they will. Use a service that offers you a way to protect the data you deem critical.
Think your patient record is beyond your control? News item: you own your patient record no matter what your health care provider might say to the contrary. Get a copy and keep that copy up to date. I'm even going to go so far as to suggest that sometime in the next five years you have your genome sequenced. Store a copy of that in a safe place as well.
We are the mistresses and masters of our data domains. We can cry foul when someone else loses our data. We can even sue. But when data is destroyed--as in gone forever--no outcry no matter how loud will get it back. Protect it and win applause from the storage administrators who assembled here in Phoenix this week.
For the last two decades, RAID (redundant array of inexpensive disks) controllers have ruled the storage world. RAID has been required for data protection in disk arrays. RAID schemes (RAID 0,1,6 10, etc.) reside on RAID controllers baked into disk arrays with many billions sold to date. But perhaps more important from the standpoint of making money, the RAID controller has also delivered differentiated value for storage vendors. Data copy and migration, snap shot, deduplication, and the list of controller-based functions goes on--all have been loaded on to the RAID controller.
It's becoming increasingly clear that the traditional RAID controller is coming to the end of its life cycle, at least within the enterprise data center. Types of applications now common to the Web 2.0 community are now populating the enterprise data center--applications that require scalability into the petabyte range. Traditional RAID controllers start to show their shortcomings at this scale level. Drive rebuild times elongate to the point where RAID data protection is no longer protection.
We can argue (and I have) over how much longer the RAID controller will survive. For sure, it's nowhere near dead and will continue on as the workhorse of the storage industry for some time. But its shortcomings are becoming increasingly obvious and are driving the creation of the next generation of storage devices. Indeed one of those devices is no "device" at all. Rather, it's software running on a collection of commodity servers and server-attached disk, both traditional and solid state disk. Think of this new "device" as software-defined storage where all of the functionality is defined and delivered in software. So as a user, when you buy a software-defined storage device, you're simply buying code. What you run it on is up to you.
MaxiScale is an interesting example of software-defined storage. MaxiScale's FLEX storage platform runs on standard servers with SATA disk, and uses standard Ethernet interconnections. It is implemented as clustered nodes--servers plus disk. I/O performance and capacity scales linearly as processing nodes and disk drives are added to the cluster.
So the storage value-delivery model is decidedly different here. You as the user buy software and essentially roll you own array. But what else is different here? First, while the RAID controller is gone, the absolute requirement to preserve data is not. Data protection is also implemented in software.
Second, the system assumes that individual nodes within the cluster will go off line or fail for one reason or another. That's OK. The FLEX storage cluster continues to function, perhaps at some degraded state for some period of time until the full cluster is restored. But the point is that once you power up the cluster, you can keep it running for years--decades if you want. Hardware is added and replaced without disruption. Software is upgraded without disruption. It's perpetual storage.
Third, FLEX is an expression of the state of the art in single or global namespace file system technology. It's this core technology that delivers the value-added storage services rather than the RAID controller.
MaxiScale is not alone in this emerging space. Other software-defined storage solutions include ParaScale's cloud storage software and Symantec's FileStore. Other traditional hardware and software players will follow with software-defined storage offerings in the coming months. Include database vendors in this space as well. Some will position their solutions as cloud storage, others as data protection and archival storage.
Will software defined storage replace traditional RAID storage? Not immediately. Not dramatically. But to me a new model is emerging. Scalability, hardware independence, and system longevity are the more compelling features when compared to traditional RAID-based storage arrays. But perhaps the most compelling feature will be an ability to buy big array performance and scalability at a fraction of the cost of big array RAID.
Earlier this week, EMC revealed that it has attracted longtime Intel executive Pat Gelsinger to run its storage business.
Gelsinger is set to become president and chief operating officer of EMC's Information Infrastructure Products (virtually all in EMC's product group except VMware), including the Enterprise Storage Division, RSA Information Security, Content Management and Archiving, and Ionix IT Management. His direct reports will be Frank Hauck, who now leads ESD, Mark Lewis of CMA, Art Coviello of RSA, and Jay Mastaj of Ionix.
A Wall Street Journal blog post quotes Gelsinger as ultimately wanting to be Intel's president, but that wasn't something that was going to happen anytime soon. In that the move is effective immediately, Gelsinger will likely not be a keynote speaker at the Intel Developer Forum, as originally planned.
Pat Gelsinger
(Credit: Intel)Gelsinger is an interesting acquisition for EMC, as it diversifies its way toward encompassing more and more traditional IT infrastructure products and services, as well as IT virtualization and emerging cloud-computing models. His chip geek credentials are solid. He wrote the book on programming the 80386, and he designed the original 486 processor. At EMC, he will move to bring to market products based on a tighter integration between existing product lines, as well as those from VMware and some key partners (read Cisco Systems).
While Gelsinger is a hard-driving executive, he reveals an actively spiritual side of himself in his book "The Juggling Act: Bringing Balance to your Faith, Family, and Work."
While the Gelsinger move seems to have attracted the most media attention, there's way more to this story. As part of the executive personnel announcement, Howard Elias was promoted to president and chief operating officer of information infrastructure and cloud services at EMC.
Previously, Elias was president of EMC Global Services and EMC's Ionix IT management group. He will now be responsible for all of EMC's service groups, including those attached to the products groups. Elias also gets to champion EMC's moves into cloud services.
For me, the hidden word here is "services." IBM spawned IGS. HP bought EDS. There's a void here that I think Elias has to fill. Can Elias give EMC a services powerhouse? Can he successfully blend the more traditional IT services with the newly emerging cloud-computing models? As I see it, those are Elias' challenges.
What's going on at the top? First, Tucci believes that the current executive management structure needs to be enhanced and expended, as it progresses from $14 billion in annual revenue to a $20 billion to $25 billion IT infrastructure company capable of competing with the likes of IBM and Hewlett-Packard. Hence the division of responsibility along the line of products vs. services.
The Wall Street Journal reports that Tucci has, with this new management structure, set up something of a three-way, three-year competition to become his successor, as Tucci also announced that he plans to remain president and CEO through 2012. Now in the running, according to the Journal report, are Gelsinger, Elias, and David Goulden, EMC's current executive vice president and chief financial officer.
Tucci is certainly laying his cards on the table for Wall Street to see, and Wall Street appears to like the move, pushing EMC shares upward over the last week. His latest moves help discourage rumors that EMC is in play as a takeover candidate.
For a storage guy, last week's VMworld 2009 in San Francisco was a great show. All the familiar storage vendors were there and then some. Walking the show floor, I found them to be uniformly positive about traffic and the response they were getting from attendees.
Digging a bit deeper I found that storage vendors were getting attention from a broad range of IT specialists including server, network, architecture, and of course, storage administrators.
Wait a minute. VMworld isn't supposed to be a storage show. And yet storage vendors were, in general, more positively impressed with VMworld 2009 than many of the previously attended storage-focused shows they have been to in the recent past.
Server virtualization is now reordering the IT landscape, and the ground storage vendors have stood on for years is moving under their feet.
At varying levels, storage vendors feel the motion. They know the server virtualization thing is huge opportunity. Said another way, they fear that they could eventually disappear if they don't position themselves properly in the eyes of IT buyers now driving toward near complete if not total virtualization of the enterprise IT function.
Decades ago, storage was a mere peripheral, a feature of the server as Scott McNealy once famously quipped. But as he made that pronouncement, storage was getting connected to its own network and creeping out from behind the shadow of the server into a limelight all its own. EMC perhaps said it best: Storage--Where Information Lives.
Now that networked storage is mature, the ground is moving once again. Data and storage management is heading back toward the server running VMware. Data replication and storage provisioning functions are now features of the VMware server with more to come.
Beyond IT architecture, the architecture of the virtualized IT operations department is undergoing perhaps an even more profound change. The boundaries that once defined operational "silos"--server, network, and storage administration--are breaking down as vCenter becomes the focal point for VMware-managed IT. Hence, storage vendors here at VMworld 2009 get visitors from all walks of VMware operational life.
What am I taking away from VMworld 2009?
- VMware needs to resist playing favorites with storage vendors, especially the one that owns them. To VMware's credit, I saw much evidence that they get this imperative. VMware is democratically exposing APIs and vCenter plug-in opportunities to any storage vendor that wants to use them.
- VMware administrators will be increasingly challenged to choose between data and storage management functions that reside on the VMware platform or live within the storage environment. Larger IT environments will likely settle on a combination of both. Smaller shops may well opt to manage data and storage from the vantage point of the VMware platform.
- Storage-focused shows may no longer be able to support themselves. The nature of the storage buyer is changing. The nature of the storage environment is changing. Both are becoming more diverse and less narrowly focused on issues that only pertain to storage.
VMworld 2010 will likely be another great storage show.
NetApp's new CEO is Tom Georgens. Georgens steps in as Dan Warmenhoven, NetApp's CEO since 1994, moves on to the position of chairman of the board and a partnership development role under the direction of Georgens.
Warmenhoven's accomplishments were many, but he may be remembered most for turning the small niche-market opportunity that NAS once was as a dedicated file server attached to a LAN into the major networked storage platform NAS has become. Along the way, he built NetApp up to a 3.4 billion dollar company with 8,000-plus employees focused on storage.
Tom Georgens
(Credit: NetApp)NetApp co-founder Dave Hitz tells us that one of Warmenhoven's personal goals has been to retire at age 60. He's one year away from that milestone. Rather than continue to lead NetApp into a new phase that will be focused on scalable NAS, virtualization, and cloud computing, Warmenhoven has decided that Georgens' time has come.
Georgens' storage roots go back to the early to mid-1990s at EMC, where he was tapped to develop a midrange storage product to complement the Symmetrix line and exploit the growing Windows storage opportunity. That project was torpedoed internally, and Georgens went on to take on the storage business at LSI. EMC subsequently bought Data General, jettisoned DG's server business, but propelled Clariion to its current position of dominance in the midrange.
At LSI, Georgens surrounded himself with some very able executives who helped him establish the Engenio storage brand as the dominant OEM storage play, selling to the likes of IBM, STK, and Sun. He attempted to take Engenio public, but pulled back when both he and the executives at LSI decided that they couldn't get what they believed to be the true value of Engenio via an IPO. Not long thereafter, NetApp came calling. Georgens stepped in and later took on the position of COO, a move many analysts interpreted as one that placed him next in line for the CEO spot.
Now is a pivotal time in NetApp's history. NetApp has successfully transitioned from NAS-only to a broader range of storage and data management software products. And it is the only major independent and publicly held storage company left standing. STK was acquired by Sun. EMC has diversified to the point where it now calls itself an IT infrastructure player. That singular position in the eyes of some makes NetApp a takeover target. Here's why I think a takeover of NetApp is now less likely.
Georgens hates to lose. Selling-out now would be tantamount to losing.
How do I know? This may sound a bit odd but Georgens and I both participate in a not well-known activity called radiosport. Radiosport is practiced by ham radio operators worldwide. On certain weekends during the year, ham radio contestants try to make as many contacts with other hams in as many countries as they can during a 48-hour period. I do it because I've been a ham since my teen years and it's still fun to copy Morse code at something like 35 words per minute. Georgens probably enjoys this, too, but he's in radiosport to take all the marbles. Unlike me, Georgens is a world-class competitor. He has won numerous worldwide competitions, often from a station on the island of Barbados, and holds several North American records. In addition, he has represented the United States in the World Radiosport Team Championships.
So what, you say? Try to send and receive high-speed code for 48 hours with only occasional short breaks and maybe an hour of sleep in between. It takes dedication and an absolute desire to win to match Georgens' achievements.
Georgens didn't go to NetApp to sell the company. He went, I believe, because he wanted continue on NetApp's growth trajectory established years ago by Warmenhoven, Tom Mendoza, and Hitz. Selling would be letting someone else win. That's not in character for Georgens.
There is a big disconnect between how long people think they should be storing data and how long they actual can. One group of vendors and academics is trying to change that.
Two years ago, the Storage Networking Industry Association's Data Management Forum reported the results of a landmark study that looked at the state of long-term storage, i.e. preserving a digital object for more than 10 years. Some disturbing results jumped out.
The study suggested that we live in a digital version of the Dark Ages. I'm talking about it now because I think the messages from the study are still very relevant to both IT administrators and consumers.
A whopping 80 percent of the 276 organizations included in the study reported a need to retain electronic records for more than 50 years, so let's start there. How many of you storage administrators out there actually think you can do 50 years of electronic records retention given current technology? Without data loss? OK, so you won't be doing the same job 50 years from now, so why care? Next question: How many of you think that you can do more than three migrations of archival data from one storage media to the next without data loss? According to the study, the answer was very few of you.
Here's one for consumers: How many of you using Internet photo services sites think that your digitized images will still be there 50 years from now? You haven't thought about that, right? You and your spouse take pictures of the newborn today, you store them online, and maybe you store them at home, too. Here's a suggestion: make sure to print them and preserve the prints for as long as you can because if the enterprise-level storage administrators who have been doing digital storage for decades have little confidence in their ability to do long-term digital preservation, you shouldn't have much confidence either.
So there's a big gap here. A group of concerned vendors and academic advisers have formed the 100 Year Archive Task Force under the auspices of the Storage Networking Industry Association's Data Management Forum wants to start filling the gap. You can follow their progress or become involved yourself here.
One more result from the study still has me puzzled. Slightly more than half of the 276 organizations surveyed reported the need for "permanent" storage. What might fall into the permanent category? I thought of the Founding Fathers writing the U.S. Constitution and wondered what that process would have been like if they were all using a collaborative work-flow tool like Microsoft SharePoint. For sure, they'd print out the final version for all to see--on parchment maybe? But what about all the draft versions and messaging back and forth--in short, all the supporting documentation that clue us in on their state of mind and tell us what they really intended? Would they have printed out all of that, too? I dare say that insight would be gone forever.
We rarely, if ever, think of saving our digitized thoughts for the sake of posterity. But for the sake of historians, lawmakers, sociologists, and scientists yet to be born, we should--or people centuries from now might look back on this as the digital version of the Dark Age centuries from now.
I responded recently to an e-mail from a good friend who works for one of the big database applications vendors. He wrote me expressing discomfort and dismay over hype that swirls around cloud computing. He pointed out that no one vendor had yet put forth a full-scale vision for a shared-responsibility, open-access "cloud." He went on further to say that vendors wouldn't know how they would make money if a true open-access cloud were to come along.
My response was that the appeal of the "plug in, turn on, get IT service" mantra is compelling in an economy that forces companies to look away from capital acquisition in favor of off-balance sheet methods like leasing and the cloud "pay-as-you-go" model. Users will be drawn to both compute and storage clouds and vendors will want to sell them clouds and/or cloud infrastructure stuff. Concern over whether they are "open" has yet to put the brakes on this accelerating force. But there is growing concern among users about cloud vendor lock-in.
Openness aside, there are a number of assumptions floating around about storage clouds that have yet to be tested. These are assumptions like infinite scalability, unlimited availability, and increased responsiveness. Nothing in computing is infinitely scalable. There have been and will continue to be storage cloud outages. Response times are not deterministic. And nobody seems to be really talking about security all that much, let alone legal issues surrounding personal data in the cloud, SEC and other compliance issues, and potential chain of custody issues.
The biggest concern I have right now is for independent cloud storage vendors, particularly some of the small but rapidly growing start-ups. At CloudCamp Boston, someone provocatively mentioned that compute clouds commoditize IT operations. Interesting observation. One could argue that IT operational commoditization at least started when the help desk went overseas. But let's push that thought forward a bit more with regard to the storage cloud.
Amazon's S3 service was carved out of an existing IT operation that supports a different core business--retail. In essence what Amazon did originally was to monetize both spare capacity and the operational support staff which presumably was already being paid for by the retail side of the house. Is Amazon the sole owner of a secret formula that turns internal storage into cloud storage gold? No. Any reasonably large IT shop with an entrepreneurial bent could do the same.
Enron (remember Enron, the energy company?) was once a storage service provider (SSP), but not like their many SSP rivals at the time. Enron wanted to create an arbitrage opportunity around the excess capacity held by their SSP contemporaries. They proposed to buy and sell storage capacity in the same way they were buying and selling energy at the time--as a fungible commodity. Yes, Enron believed that storage sitting on a raised floor inside a data center or a co-lo was fungible.
Too bad for the smartest guys in the room. They were about eight years too early. It took an Amazon to prove that storage is indeed fungible. Could other big retailers do an S3 equivalent? Why not? There's nothing really stopping them. And why not financial services firms? Manufacturers? This list goes on. Nothing stopping them either. And I believe we will soon see brokers of cloud services in the same way we saw brokers of time-sharing services back in the day of the mainframe.
Hybrid clouds--ones that are both internal and external to the enterprise--will eventually dominate. There is no reason why a large internal IT services provider couldn't do what Amazon is doing, namely partition-off some idle capacity and offer it up to a cloud-based services consortium (the "open" cloud?) for a fee. Huge investments in computing and communications infrastructure have already been made. We will see an open cloud for example when a group of entrepreneurial IT executives form a consortium that offers utility cloud-computing services. These will not only be open, but their services will be priced to market. I hesitate to say "commodity" but that's where I think utility cloud services are going.
I believe it will happen this way because a historical precedent has already been set. Remember the old mainframe time share days? When IT represents a significant investment to the enterprise, IT executives look to maximizing its efficiency. One of the ways they did this back in the day was to offer up idle mainframe compute cycles for sale. They sold cycles decades ago. Today they can sell compute cycles, storage capacity, and the operations staff to manage it, all delivered over the wire. Amazon isn't the first, nor will they be the one and only.
This economy will force big enterprise IT to look at replicating the Amazon EC2/S3 model for two reasons:
Amazon can deliver IT services more cheaply than most large enterprise IT departments so the IT executives managing large IT shops will want some of what Amazon is smoking. They'll imitate the infrastructure. They'll buy cloud services. They'll sell cloud services.
Amazon has proven it can make money off its idle compute capacity. Every IT shop runs through computing peaks and valleys. Why not sell the idle capacity during the off-peak times just like Amazon. An "open" cloud would allow them to do this.
In the end, computing vendors make money from clouds by selling the underlying infrastructure. Enterprises make money from clouds by selling access to idle infrastructure offered up to the "open" cloud. Brokers of cloud-computing services make money by inserting themselves in between the buyers and sellers of cloud-computing resources. And, all of the foregoing means that the independent cloud services providers will live in an increasingly commoditized world where the price of the commodity is set not by those selling services as a core business, but by those selling cloud services as an adjunct to their core businesses.
That in my mind is the threat that the smaller independents may or may not see coming. As an independent, having an application wrapped around a cloud infrastructure is one way to forge a more sustainable cloud business model.
During the last two weeks we saw two acquisitions of relatively small purveyors of scalable file systems by big storage players. First, HP finally pulled in its partner IBRIX. Only days later, LSI made a surprise acquisition of ONStor. If both IBRIX and ONStor offer platforms upon which one can build scalable network attached storage (NAS), do these back-to-back deals indicate some sort of emerging trend? Yes and no. Yes it is in that, if you're a major NAS vendor and want to compete with NetApp who is readying GX8, scalability is now a must-have. But IBRIX extends capabilities HP already has whereas for LSI, ONStor represents their first ever venture into the NAS world.
Amazing is the amount of blogosphere and Twitter chatter that was generated by HP's announcement that it intended to acquire IBRIX for an undisclosed sum. No offense IBRIX people (all 53 of you), but you're not exactly a household name. It looks like HP is about to make you one however. And yes, you deserve all the attention you are getting, finally. You had a "next-gen" parallel file system before many knew they would even need one. You knew that Big Data users needed a file system that was system-agnostic and that would scale to the petabyte range. At the time however, they were in a niche-y place called high-performance computing (HPC). Now, Big Data users are cropping up everywhere. You count AOL, Caterpillar, Dreamworks, JP Morgan Chase, and Pixar among your 175 customers. Who knows where this cloud thing will take you.
Big is a relative term. In the storage world, what is big today will be table stakes tomorrow. The Petabyte-scale file system is becoming a must have for storage vendors. NetApp bought Spinnaker a while back. Sun developed ZFS. IBM has GPFS, and HP bought PolyServe last year two years ago but has chosen to position it in the Windows SQL Server space where it gets the most traction. IBRIX, with its many performance and data management capabilities, represents a much larger market opportunity to HP. And LSI has chosen to enter the NAS market as scalable from the get go.
IBRIX is headquartered in what was once a Honeywell Bull facility in Billerica, Mass. When they appeared in 2000 with a unique parallel file system called Fusion, the question was how to bring this to market? Who buys a parallel, scalable file system when file systems normally come bundled with or embedded in something else? IBRIX answered that question by forming remarketing relationships with big names: Dell, EMC, HP, and IBM who bundled/embedded IBRIX with their servers and storage. Dell and EMC packaged Fusion with PowerEdge servers and Clarrion storage, presenting the package to high-performance computing (HPC) customers. HP embedded Fusion in HP Blade and ProLiant server racks.
So what exactly does HP have planned for IBRIX? According to HP's Paul Perez, "HP will put the U in unified storage." OK, but that's a bit cryptic. Short term, HP will keep on keepin'-on with blade server/blade storage and scalable ProLiant/IBRIX NAS implementations. Longer term we may well see HP use IBRIX to approach cloud computing and archival storage opportunities.
Unified storage with a capitol "U" is a bit more of a challenge to understand. Typically the term has been applied to disk arrays that support fiber channel and Ethernet connectivity. HP likely means that kind of unification plus something more. IBRIX is typically used by its partners to create scale-out NAS subsystems using Fusion as the software engine that powers a NAS platform consisting of industry standard servers as the NAS front end, and SAN or direct-attached (DAS) RAID storage on the backend. As such, the combination presents scalable file storage to applications but uses block-based SAN or DAS storage. NAS is typically characterized as file storage, while SAN is block storage. It's a distinction that traditionally has had many application implications and ramifications. What HP's big U for Unified message may also be signaling is the introduction of a file/block converged storage product bundled with new hardware form factors sometime in the near future. For HP that likely means some combination of HP StorageWorks SANs, ProLiant rack-mount and blade servers, and ProCurve Ethernet switches powered by Fusion.
It's interesting that HP has chosen to announce a marriage now. After all, they've been dating for at least four years. But NetApp, after fussing like forever with the scalable file system it acquired from Spinnaker, is finally ready to go mainstream with it as ONTAP GX8. IBM is making more noise about GPFS. Then there's ZFS and its new owner--Oracle.
Which brings us to LSI and ONStor. LSI's Engenio Storage Group wasn't in NAS until now. It is in RAID arrays and storage virtualization. Now it's in scale-out NAS and NAS/SAN gateways too. Why? LSI/Engenio sells exclusively through original equipment manufacturers. IBM is a major reseller as is (was?) Sun. Dell is also in the mix. IBM's DS3000, 4000, and 5000 series arrays are all originally produced by LSI/Engenio.
But there is much repositioning going on among the big IT vendors these days. The future of Sun's hardware business is still a matter of debate in spite of Larry Ellison's assurance that Oracle will sell hardware too, and Dell is on record as in the hunt for companies worth buying. Is another storage acquisition possible for them? I think so. As a result, the number of large OEMs that LSI/Engenio can sell through is not growing and the future is unclear with regard to OEM sales of traditional RAID arrays via the big names in IT.
So NAS to LSI/Engenio represents new growth and possibly substantial growth if they can compete effectively. HP's acquisition of IBRIX potentially leaves something of a hole in the scale out NAS product lines of Dell, IBM, and EMC that LSI/Engenio. Dell and SGI may also need NAS/SAN gateways. IBM might like to have a second source for the NAS boxes they get from NetApp because ONStor is both scale out and scale up. And let's not count out Sun/Oracle either. Whereas IBRIX had established prominence in HPC computing, ONStor went after more mainstream applications and could be a better fit in that space for the big OEM partners.
So put LSI/Engenio on the list of buyers looking for things to buy as opposed to the other way around. They didn't just suddenly decide to write a $25 million check to the owners of ONStor after learning of HP/IBRIX. Their executives have assured me that they've been looking to add NAS to the portfolio for months. Market conditions and a desire to make a "we're here to stay" statement drove the timing of the ONStor deal.
Suddenly a somewhat dormant space in the storage world is erupting with activity. Why? The Big Data apps are here and they generate big system opportunities as well as big Unified storage opportunities.






