| By Maureen O'Gara | Article Rating: |
|
| June 8, 2009 09:15 PM EDT | Reads: |
1,479 |
Forrester analyst Jim Kobielus has predicted that data warehousing will evolve into a “virtualized, cloud-based, supremely scalable distributed platform.”
Greenplum, the massively parallel open source data warehouse company, says it’s already happening and that companies like Fox Interactive Media, Zions Bank and Future Group, the big Indian retailer, have already built early iterations of so-called Enterprise Data Clouds (EDC) using its latest widgetry.
It also figures that the Enterprise Data Cloud will displace the data warehouse appliance architectures that Oracle is so fond of, one of the reasons it’s supposedly buying Sun.
Greenplum claims that Oracle is already way behind and playing catch-up with its relatively new Exadata data warehouse appliance; Greenplum and Netezza have been offering appliances for years.
But it says hardware-based solutions such as Teradata and Netezza aren’t suited to the commodity hardware-based cloud infrastructure.
The cloud supposedly needs a software-only solution and that’s exactly what Greenplum’s got.
eBay, the world’s largest database, a hefty 6.5 petabytes, runs on Greenplum, which has collected 70 paying customers in the last two-and-a-half years. Netezza is supposed to have 200 customers and Teradata, the old man of data warehouses, has 900.
In Greenplum’s experience the 20-year-old mainframe approach of trying to create one single corporate-wide logical database is an idle and expensive exercise for a company to engage in.
Corporate units inevitably want things their way and so create silos – a psychological reality that the 10-year-old data warehouse appliance plays to – but then the data is fragmented and federated silos usually prove brittle when they aren’t having problems scaling.
The alternative is self-service, which means getting data into the cloud and out to the business teams as quickly as possible and letting analysts and DBAs instantly deploy all the data marts and data warehouses and run all the analyses on the data that they want.
Greenplum claims this “model less, iterate more” approach optimized for operations rather than performance and based on a common pool of physical, virtual or public cloud infrastructure (think VMware to start) is the right compromise.
Users get the control they want and IT gets to manage the pool as one infrastructure, increasing efficiencies and delivering predictable SLAs. Plus all the data, both the stable data and the volatile data that the mainframe approach invariably ignores, will actually be in one place.
Pieces of it will simply be broken off for any new warehouse without lots of process and upfront modeling; it’s supposed to be easy to share newly loaded data or analysis results.
The EDC approach implies elastic scale and massively parallel processing as well as a large-scale data collections and fast turnaround.
Greenplum’s new Database 3.3, now generally available, introduces key EDC features such as online warehouse expansion, which means it can be resized as needed across new servers added while the system is online and responding to queries.
Each additional server of course adds more storage capacity, query performance and loading performance.
Published June 8, 2009 Reads 1,479
Copyright © 2009 SYS-CON Media, Inc. — All Rights Reserved.
Syndicated stories and blog feeds, all rights reserved by the author.
More Stories By Maureen O'Gara
Maureen O'Gara the most read technology reporter for the past 20 years, is the Cloud Computing and Virtualization News Desk editor of SYS-CON Media. She is the publisher of famous "Billygrams" and the editor-in-chief of "Client/Server News" for more than a decade. One of the most respected technology reporters in the business, Maureen can be reached by email at maureen(at)sys-con.com or paperboy(at)g2news.com, and by phone at 516 759-7025.
- The Top 150 Players in Cloud Computing
- 4th International Cloud Computing Conference & Expo Starts Today
- Yahoo! to Keynote 4th Cloud Expo: Accelerating Innovation with Cloud Computing
- SYS-CON.TV: Cloud Computing Expo Power Panel
- Exclusive Q&A with Rich Marcello - Unisys President, Systems & Technology
- The Economics of Cloud Computing Analyzed
- Commercial vs Federal Cloud Computing
- An Interview with Federal CIO Nominee Vivek Kundra
- Deputy CIO of the CIA to Keynote 1st Annual GovIT Expo
- 1st Annual Government IT Conference & Expo: Themes & Topics
- CIA was Headed to an Enterprise Cloud All Along: Jill Tummler Singer
- Industry Experts Discuss the State of Cloud Computing
- The Top 150 Players in Cloud Computing
- 4th International Cloud Computing Conference & Expo Starts Today
- Cloud CEOs, CTOs & SVPs to Speak at 4th International Cloud Computing Expo
- Unisys President To Keynote Cloud Computing Expo
- Yahoo! Named “Platinum Sponsor” of Cloud Computing Expo
- Yahoo! to Keynote 4th Cloud Expo: Accelerating Innovation with Cloud Computing
- SYS-CON.TV: Cloud Computing Expo Power Panel
- Exclusive Q&A with Rich Marcello - Unisys President, Systems & Technology
- Unisys Named “Platinum Sponsor” of Cloud Computing Expo
- The Economics of Cloud Computing Analyzed
- Commercial vs Federal Cloud Computing
- An Interview with Federal CIO Nominee Vivek Kundra
- Virtualization Conference Keynote Webcast Live on SYS-CON.TV
- The Top 150 Players in Cloud Computing
- SOA 2 Point Oh No!
- The Top 250 Players in the Cloud Computing Ecosystem
- What is Cloud Computing?
- Cloud Computing Expo Europe 2009 in Prague: Themes & Topics
- IBM's Got Its Head in the Clouds
- Cloud Computing Expo 2009 West: Call for Papers Now Closed
- Red Hat Named "Platinum Sponsor" of Virtualization Conference & Expo
- As Google's SaaS Assault Begins, Move Over Microsoft Office?
- From Enterprise to Cloud, Virtualization Today on SYS-CON.TV
- Twenty-One Experts Define Cloud Computing


































