Welcome!

@CloudExpo Authors: Liz McMillan, Zakia Bouachraoui, Yeshim Deniz, Pat Romanski, Elizabeth White

Related Topics: @CloudExpo

@CloudExpo: Article

How Enterprise Big Data Will Affect Organizations in 2012

RainStor's predictions focus on how enterprise Big Data will affect organizations in 2012

As the new year begins, global companies face the coming year's most prominent IT and business challenge: Big Data.

The focus for IT will be to provide high performance analytics capabilities at the lowest cost, as business users need to tap into volumes of multi-structured data about their customers and markets to gain competitive advantage.

RainStor, a provider of Big Data management software, has released five predictions focused on how enterprise Big Data will affect organizations in 2012. Based on client and partner experience, market research and conversations with industry experts, here are RainStor's five predictions for Big Data in 2012:

Prediction #1: Big Data will Transition from Technology "Buzz" to a Real Business Challenge Affecting Many Large Global Enterprises
Big Data is largely centered on leveraging the open source Apache Hadoop analytics platform and the growing ecosystem of other vendor solutions surrounding this innovative data management technology. According to 451 Research, between 2008 and the end of 2010, $95.8m had been invested in the various Apache Hadoop- and NoSQL-related vendors. That figure now stands at more than $350.8m, up 266 percent(1).

Organizations that focus on solving the enterprise Big Data problem will advance further during 2012. Leading Hadoop distributors like Cloudera have succeeded because of their enterprise-ready management features, skilled professional services, training and support. As companies in banking, financial services and communications sectors roll out Hadoop projects to meet new business demands and implementations reach multi-terabyte scale, IT executives will have to address the operational costs of running Hadoop and how it ultimately impacts the budget and the top line. Hadoop, while open source, still requires skilled resources, other supporting proprietary technologies and a multitude of servers.

Prediction #2: In Key Industries, Machine-Generated Data Growth Will Outpace Social Media and other Human-Generated Data Sources
In 2012, Big Data will become a real business issue debated at the executive and board room level and will move beyond the focus of social media data sources and customer sentiment data. For specific industries, machine-generated data will exceed that created by social networks such as Facebook and Twitter.

The industries impacted most by Big Data growth will be communications, financial services and utilities/smart grids -- sectors governed by external regulators. These industries must not only keep detailed data online for specific timeframes, but also provide data scientists ongoing access to months and years of detailed data to develop better algorithms and conduct predictive analytics.

Prediction #3: Hadoop Big Data Analytics Capabilities will Converge with Traditional Data Warehouses
As the Hadoop ecosystem expands and adoption takes off, end users will realize business value from analysis that was previously executed in a traditional data warehouse. The combination of a warehouse's structured data analysis and Hadoop's multi-structured data analysis will give enterprises greater flexibility and higher productivity at a lower cost. We have already witnessed data warehouse providers investing in Hadoop related technologies such as Teradata's acquisition of Aster Data. Over time we will see more standardization within the Hadoop ecosystem to better meet enterprise expectations for resilience, security and high performance analytics.

Prediction #4: IT Developers will Require a Wider Range of Skills and New Tools to Manage Big Data Environments
As Big Data projects are rolled out using a mix of new Hadoop capabilities in combination with existing data management environments, IT will inevitably need more dedicated training on MapReduce skills. Additionally, IT resources will have an increased desire to use standard SQL in conjunction with MapReduce to improve productivity levels and maintain operational costs, which will be the ultimate goal of most IT organizations in 2012.

Prediction #5: As Big Data Becomes the Enterprise Focus, In-Memory Database Capabilities will Rise in Popularity
As compute and processing power becomes more sophisticated, in-memory will experience resurgence in popularity directly related to Big Data. This trend is already supported by SAP, with the introduction of HANA, an in-memory data analytics platform handling massive amounts of data up to 3,600 times faster for instant business insights. Innovation from both large vendors and technology start-ups will help to revitalize in-memory databases and data warehouses in the coming year.

(1) The 451 Group, November 14, 2011. 'Big Money for Big Data' by Matt Aslett

More Stories By Liz McMillan

News Desk compiles and publishes breaking news stories, press releases and latest news articles as they happen.

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


CloudEXPO Stories
With more than 30 Kubernetes solutions in the marketplace, it's tempting to think Kubernetes and the vendor ecosystem has solved the problem of operationalizing containers at scale or of automatically managing the elasticity of the underlying infrastructure that these solutions need to be truly scalable. Far from it. There are at least six major pain points that companies experience when they try to deploy and run Kubernetes in their complex environments. In this presentation, the speaker will detail these pain points and explain how cloud can address them.
The deluge of IoT sensor data collected from connected devices and the powerful AI required to make that data actionable are giving rise to a hybrid ecosystem in which cloud, on-prem and edge processes become interweaved. Attendees will learn how emerging composable infrastructure solutions deliver the adaptive architecture needed to manage this new data reality. Machine learning algorithms can better anticipate data storms and automate resources to support surges, including fully scalable GPU-centric compute for the most data-intensive applications. Hyperconverged systems already in place can be revitalized with vendor-agnostic, PCIe-deployed, disaggregated approach to composable, maximizing the value of previous investments.
When building large, cloud-based applications that operate at a high scale, it's important to maintain a high availability and resilience to failures. In order to do that, you must be tolerant of failures, even in light of failures in other areas of your application. "Fly two mistakes high" is an old adage in the radio control airplane hobby. It means, fly high enough so that if you make a mistake, you can continue flying with room to still make mistakes. In his session at 18th Cloud Expo, Lee Atchison, Principal Cloud Architect and Advocate at New Relic, discussed how this same philosophy can be applied to highly scaled applications, and can dramatically increase your resilience to failure.
Machine learning has taken residence at our cities' cores and now we can finally have "smart cities." Cities are a collection of buildings made to provide the structure and safety necessary for people to function, create and survive. Buildings are a pool of ever-changing performance data from large automated systems such as heating and cooling to the people that live and work within them. Through machine learning, buildings can optimize performance, reduce costs, and improve occupant comfort by sharing information within the building and with outside city infrastructure via real time shared cloud capabilities.
As Cybric's Chief Technology Officer, Mike D. Kail is responsible for the strategic vision and technical direction of the platform. Prior to founding Cybric, Mike was Yahoo's CIO and SVP of Infrastructure, where he led the IT and Data Center functions for the company. He has more than 24 years of IT Operations experience with a focus on highly-scalable architectures.