
EMC Introduces World's Most Powerful Hadoop Distribution: Pivotal HD

Unlocks Hadoop as Key to Big Data's Transformational Potential for Data-Driven Enterprises; Delivers Over 100X Performance Improvement with HAWQ

SAN FRANCISCO, Feb. 25, 2013 /PRNewswire/ --

News Summary:

  • EMC's new Pivotal HD™ is an Apache Hadoop distribution that natively integrates the industry-leading EMC Greenplum massively parallel processing (MPP) database technology with the Apache Hadoop framework.
  • The engineering innovation and technology called HAWQ™ brings Greenplum's decade of research and development on the industry's best MPP analytical database platform to Apache Hadoop, the open-source Big Data platform.
  • New HAWQ technology brings more than 100X performance improvement (in some cases up to 600X) to a wide range of query types and workloads, making Pivotal HD the single most powerful Hadoop distribution in the industry.
  • Pivotal HD with HAWQ represents the world's first true SQL processing for Hadoop, expanding the platform's reach to SQL programmers and developers, as well as enabling compatibility with traditional BI tools.

Full Story:

EMC® Corporation (NYSE: EMC) today announced a new distribution of Apache Hadoop: Pivotal HD. Pivotal HD features native integration of EMC's industry-leading Greenplum® massively parallel processing (MPP) database with Apache Hadoop, the most cost-effective and flexible open source Big Data platform ever developed. The new EMC Greenplum-developed HAWQ™ technology brings 10 years of large-scale data management research and development to Hadoop and delivers more than 100X performance improvements when compared to existing SQL-like services on top of Hadoop, making Pivotal HD the single most powerful Hadoop distribution in the industry.

Hadoop has rapidly emerged as the preferred solution for Big Data analytics applications that grapple with vast repositories of unstructured data. It is flexible, scalable, inexpensive, and fault-tolerant, and it enjoys rapid adoption and a rich ecosystem backed by massive investment. However, customers face high hurdles to broadly adopting Hadoop as their single data repository because it lacks useful interfaces and high-level tooling for Business Intelligence and data mining, components that are critical to data analytics and to building a data-driven enterprise. As the world's first true SQL processing for Hadoop, Pivotal HD addresses these challenges.

Because Pivotal HD offers the full spectrum of the SQL interface and, by extension, the entire ecosystem of products that support SQL, customers no longer need an army of developers to build a dashboard or run a report. Unlike competing Hadoop distributions, Pivotal HD does this without moving data between systems or using connectors that require users to store the data twice. Pivotal HD cuts out the complexity of using Hadoop, thus expanding the platform's potential and productivity, and allowing customers to enjoy the benefits of the most cost-effective and flexible data processing platform ever developed.

About HAWQ

HAWQ (pronounced "hawk") represents the EMC Greenplum engineering effort that brings 10 years of large-scale data management research and development to the Apache Hadoop framework. Leveraging the feature richness and maturity of the industry-leading Greenplum MPP analytical database, this innovation has resulted in the world's first true SQL parallel database on top of the Hadoop Distributed File System (HDFS). HAWQ is the key differentiating technology in making Pivotal HD the world's most powerful Hadoop distribution. Capabilities of note include Dynamic Pipelining, a world-class query optimizer, horizontal scaling, SQL compliance, interactive query, deep analytics, and support for common Hadoop formats.

Pivotal HD and HAWQ Deliver:

  • True SQL Query Capabilities – With Pivotal HD's advanced database services (HAWQ), enterprises can now unlock the potential of Hadoop's scalable, fault-tolerant storage capabilities by bringing the vast pool of "data worker" tools and languages into the Hadoop ecosystem. With Pivotal HD's support for true, SQL-standards-compliant query interfaces, data mining tools, SQL-trained data analysts, and standard BI tools can now easily connect to, query, and analyze data sets stored in the Hadoop file system (HDFS).
  • Unprecedented Query Performance – Bringing over 10 years of parallel database processing technology to Hadoop, Pivotal HD delivers query response times that are up to 600X faster than current SQL-like interfaces for Hadoop.
  • Robust Operational Support – Command Center enables administrators and developers to easily install and manage large clusters from interactive web user interfaces. Command Center also exposes a command-line interface for scripting and a programmer-friendly web services API for complex automation tasks. Using Command Center, administrators can deploy large clusters, configure services and roles, manage services, and monitor HDFS jobs and tasks.

Hadoop -- The Foundation for Change

EMC believes that Hadoop has the potential to reach beyond Big Data to catalyze new levels of business productivity and transformation. As the foundation for change in business, Hadoop represents an unprecedented opportunity to improve how organizations can get the most value from large amounts of data. Businesses that rely on Hadoop as the core of their infrastructure can not only do analytics on top of vast amounts of data, but can also go beyond analytics and the foundation for that data layer to build applications that are meaningful, and that have a very tightly coupled relationship with the data. Consumer Internet companies have reaped the benefits of this approach, and EMC believes more traditional enterprises will adopt the same model as they evolve and transform their businesses.

Pivotal HD is expected to be available at the end of the first quarter of this year as a software-only or appliance-based solution, backed by EMC's global 24x7 support infrastructure.

Executive Quotes:

Scott Yara, senior vice president of products, Greenplum, a division of EMC

"Hadoop is a big deal and poised, for many enterprises, as the key to unlock Big Data's transformational potential. Marrying the extraordinary capabilities of the Greenplum technology—essentially the Greenplum crown jewels—with this amazing open-source phenomenon has been no small feat. But we're 'all in' — investing in a manner that no other company is — to help catapult Hadoop into wide-scale adoption."

Steven Hirsch, chief data officer, SVP Global Data Services, NYSE Euronext

"With Pivotal HD, we can check off many of the items on our Hadoop wish-list—things like plug-in support for the ecosystem of tools, improved data management and greater elasticity in terms of the storage and compute layer. But above all, it provides true SQL query interfaces for data workers and tools—not a superficial implementation of the kind that's so common today, but a native implementation that delivers the capability of real and true SQL processing and optimization. Having a single Hadoop infrastructure for Big Data investigation and analysis changes everything. Now add to all of this functionality the fact that the SQL performance is up to 100x faster than other offerings and you have an environment that we at NYSE Euronext are extremely excited about."

Jeff Kelly, Big Data analyst, The Wikibon Project

"By integrating the Greenplum database natively within Hadoop, Pivotal HD opens up Big Data analytics to a new class of practitioner – the savvy business user – providing both real-time query responsiveness via a familiar SQL interface and eliminating the need to move data back and forth between Hadoop and separate analytic databases. Most importantly, by providing both real-time and batch analytics capabilities in a single environment, Pivotal HD moves Hadoop several steps closer to realizing its full potential as the de facto, comprehensive Big Data platform of the 21st century."

John Myers, senior analyst, Enterprise Management Associates 

"The Pivotal HD release reflects the results of recent Enterprise Management Associates Big Data Research and our vision for the Hybrid Data Ecosystem.  With the sharing of a common data store via HDFS and processing information based on business requirements via either a SQL or NoSQL access, Pivotal recognizes that different analytical challenges require different processing/access layers."  

Sam Grocott, vice president of marketing and product management, EMC Isilon

"The introduction of Pivotal HD, combined with EMC Isilon's native integration of the Hadoop Distributed File System (HDFS) protocol, continues the evolution of the industry's first and only enterprise-proven Hadoop solution on a scale-out NAS architecture. This powerful combination succeeds in reducing the complexities traditionally associated with Hadoop deployments and allows enterprises to easily extract business value from unstructured data."

About Greenplum, a division of EMC

Greenplum, a division of EMC, is driving the future of Big Data analytics with breakthrough products that harness the skills of data science teams to help global organizations realize the full promise of business agility and become data-driven, predictive enterprises. The division's products include Greenplum® Unified Analytics Platform, Greenplum® Data Computing Appliance, Greenplum® Database, Greenplum® Analytics Lab, Greenplum® HD and Greenplum® Chorus™. They embody the power of open systems, cloud computing, virtualization and social collaboration, enabling global organizations to gain greater insight and value from their data than ever before possible. Learn more at www.greenplum.com.

About EMC

EMC Corporation is a global leader in enabling businesses and service providers to transform their operations and deliver IT as a service. Fundamental to this transformation is cloud computing. Through innovative products and services, EMC accelerates the journey to cloud computing, helping IT departments to store, manage, protect and analyze their most valuable asset — information — in a more agile, trusted and cost-efficient way. Additional information about EMC can be found at www.EMC.com.

EMC, Greenplum, HAWQ, Pivotal HD, and Chorus are trademarks or registered trademarks of EMC Corporation in the United States and/or other countries. All other trademarks used are the property of their respective owners.

This release contains "forward-looking statements" as defined under the Federal Securities Laws.  Actual results could differ materially from those projected in the forward-looking statements as a result of certain risk factors, including but not limited to: (i) adverse changes in general economic or market conditions; (ii) delays or reductions in information technology spending; (iii) the relative and varying rates of product price and component cost declines and the volume and mixture of product and services revenues; (iv) competitive factors, including but not limited to pricing pressures and new product introductions; (v) component and product quality and availability; (vi) fluctuations in VMware, Inc.'s operating results and risks associated with trading of VMware stock; (vii) the transition to new products, the uncertainty of customer acceptance of new product offerings and rapid technological and market change; (viii) risks associated with managing the growth of our business, including risks associated with acquisitions and investments and the challenges and costs of integration, restructuring and achieving anticipated synergies; (ix) the ability to attract and retain highly qualified employees; (x) insufficient, excess or obsolete inventory; (xi) fluctuating currency exchange rates; (xii) threats and other disruptions to our secure data centers or networks; (xiii) our ability to protect our proprietary technology; (xiv) war or acts of terrorism; and (xv) other one-time events and other important factors disclosed previously and from time to time in EMC's filings with the U.S. Securities and Exchange Commission.  EMC disclaims any obligation to update any such forward-looking statements after the date of this release.

SOURCE EMC Corporation

Copyright © 2007 PR Newswire. All rights reserved. Republication or redistribution of PRNewswire content is expressly prohibited without the prior written consent of PRNewswire. PRNewswire shall not be liable for any errors or delays in the content, or for any actions taken in reliance thereon.
