• On mySimon: Shop for summer: double strollers
advertisement
October 7, 2009 11:58 AM PDT

MaxiScale and the emergence of software-defined storage

by John Webster

For the last two decades, RAID (redundant array of inexpensive disks) controllers have ruled the storage world. RAID has been required for data protection in disk arrays. RAID schemes (RAID 0,1,6 10, etc.) reside on RAID controllers baked into disk arrays with many billions sold to date. But perhaps more important from the standpoint of making money, the RAID controller has also delivered differentiated value for storage vendors. Data copy and migration, snap shot, deduplication, and the list of controller-based functions goes on--all have been loaded on to the RAID controller.

It's becoming increasingly clear that the traditional RAID controller is coming to the end of its life cycle, at least within the enterprise data center. Types of applications now common to the Web 2.0 community are now populating the enterprise data center--applications that require scalability into the petabyte range. Traditional RAID controllers start to show their shortcomings at this scale level. Drive rebuild times elongate to the point where RAID data protection is no longer protection.

We can argue (and I have) over how much longer the RAID controller will survive. For sure, it's nowhere near dead and will continue on as the workhorse of the storage industry for some time. But its shortcomings are becoming increasingly obvious and are driving the creation of the next generation of storage devices. Indeed one of those devices is no "device" at all. Rather, it's software running on a collection of commodity servers and server-attached disk, both traditional and solid state disk. Think of this new "device" as software-defined storage where all of the functionality is defined and delivered in software. So as a user, when you buy a software-defined storage device, you're simply buying code. What you run it on is up to you.

MaxiScale is an interesting example of software-defined storage. MaxiScale's FLEX storage platform runs on standard servers with SATA disk, and uses standard Ethernet interconnections. It is implemented as clustered nodes--servers plus disk. I/O performance and capacity scales linearly as processing nodes and disk drives are added to the cluster.

So the storage value-delivery model is decidedly different here. You as the user buy software and essentially roll you own array. But what else is different here? First, while the RAID controller is gone, the absolute requirement to preserve data is not. Data protection is also implemented in software.

Second, the system assumes that individual nodes within the cluster will go off line or fail for one reason or another. That's OK. The FLEX storage cluster continues to function, perhaps at some degraded state for some period of time until the full cluster is restored. But the point is that once you power up the cluster, you can keep it running for years--decades if you want. Hardware is added and replaced without disruption. Software is upgraded without disruption. It's perpetual storage.

Third, FLEX is an expression of the state of the art in single or global namespace file system technology. It's this core technology that delivers the value-added storage services rather than the RAID controller.

MaxiScale is not alone in this emerging space. Other software-defined storage solutions include ParaScale's cloud storage software and Symantec's FileStore. Other traditional hardware and software players will follow with software-defined storage offerings in the coming months. Include database vendors in this space as well. Some will position their solutions as cloud storage, others as data protection and archival storage.

Will software defined storage replace traditional RAID storage? Not immediately. Not dramatically. But to me a new model is emerging. Scalability, hardware independence, and system longevity are the more compelling features when compared to traditional RAID-based storage arrays. But perhaps the most compelling feature will be an ability to buy big array performance and scalability at a fraction of the cost of big array RAID.

John, a senior partner at Evaluator Group, has 30 years of experience in enterprise IT storage, spanning mainframe and open systems environments. He has served as principal IT adviser at Illuminata and has held analyst positions at IDC and Yankee Group Research. He also co-authored the book "Inescapable Data Harnessing the Power of Convergence." John is a member of the CNET Blog Network and is not an employee of CNET.
Recent posts from Data-driven
David Scott of 3Par, take a victory lap
The public cloud: Friend or foe for storage vendors?
EMC builds new data computing division around Greenplum
A challenge for storage entrepreneurs
A world without records
A report from Compellent C-Drive
The emergence of applications-managed storage
EMC's Gelsinger plans to deliver application fluidity
Add a Comment (Log in or register)
by ghaff October 7, 2009 8:18 PM PDT
Hi John,

The big question for me with higher-level sw based approaches (whether in storage or fault-tolerant systems, etc.) is what about the edge cases? When weird things happen, does sane recovery happen? Which is especially important when talking about data.
Reply to this comment
advertisement
CNET River
  • brian_tong: I just got messed up watching Marley and Me with @bombdarby on TV...something is wrong with my eyes.

  • danackerman: Almost got attacked by an agressive swan on our Hamptons jaunt. http://yfrog.com/61c2zbj

  • cnetfalcone: "Set-top box" was always a bad term, but now it's inaccurate, too: TVs are too thin for anything to sit on top of them. http://bit.ly/9Av50K

  • caro: Paired with a Dogfish Punkin Ale #TheRiseOfTheodoreRoosevelt http://bit.ly/bZQxMB

  • natalidelconte: Really people, you don't want to follow me on Ping. My music tastes are not that interesting.

  • caro: Also, the @blissspa itinerary that they send you post-booking looks so official that I'm tempted to send it to Tripit.

  • caro: Booking a massage to address aftereffects of this week's hill runs and hikes. Must say @blissspa's online reservation system is impressive.

  • raygun01: BTW if my out of warranty Mac Pro requires replacement, I might just punch myself in the face. It's only 2 years old!! http://bit.ly/988ozc

  • stshank: Spoiled by Netflix. In UK trying Lovefilm and already angry. 1. Plan descriptions opaque & misleading. 2. Is there no way to queue videos?

  • raygun01: My Mac Pro wont even stay alive long enough to boot from a CD. This sounds bad. And expensive.

  • cnetfalcone: Amazon VOD is already a strong iTunes competitor, but it needs to be supported on more devices. Android, Xbox, & PS3 would be a nice start.

  • jetscott: Bad timing on the white iPhone 4...this would have been the last day to use it.

  • loricnet: LOL RT @1001noisycamera: Last day to wear white cameras :)

  • jetscott: I'm Looking Through This Thin Slice of Nova #beatlejews

  • mollywood: I'm so shocked I blogged it. The AT&T third-party eBill verification process, in 50 easy steps. http://themolly.com/blog/?p=194

  • mollywood: I am absolutely flabbergasted at how hard AT&T makes it to sign up for online billing. It's taken weeks. Everyone else? You just. click.

  • loricnet: RT @planetMitch: Check this video out -- Richard Schleuning Of Zeiss Explains T-Stop Vs. F-Stop. http://t.co/L6N2wBO via @goforjared

  • loricnet: RT @sonyalpharumors (SR5) Firmware upgrade for NEX will deliver AF for SAM & SSM lenses! http://bit.ly/cA2Oq0

  • natalidelconte: How personal is too personal on social networking? I wrote an article about it for AOL's @mydailyuk: http://bit.ly/dAun0e

Chrome reshapes the browser market

The influence that Google's browser has had on the market is broader than its actual use. On Chrome's second anniversary, Google releases the sixth stable version.

Apple rolls out new iPods, social iTunes

A new version of Apple TV is also coming soon, as Apple follows its usual September playbook in refreshing its iPod lineup and the iTunes software.
• Roundup: New iPods, iTunes, TV?

About Data-driven

Storage is more--way more--than a mere peripheral. In Data-driven, John Webster probes into storage technologies, the vendors behind them, and how customers use them in the context of market drivers such as Web 2.0, cloud computing, and the need to get meaningful information from the data fire hose that is now part of our daily life.

John is a senior partner at Evaluator Group. He has served as principal IT adviser at Illuminata and has held analyst positions at IDC and Yankee Group Research. He also co-authored the book "Inescapable Data Harnessing the Power of Convergence." John is a member of the CNET Blog Network and is not an employee of CNET.

Add this feed to your online news reader

Data-driven topics

advertisement
Click Here

Inside CNET News

Scroll Left Scroll Right