Welcome!

@CloudExpo Authors: Mark Herring, Gopala Krishna Behara, Sridhar Chalasani, Tirumala Khandrika, John Katrick

Related Topics: Microservices Expo, @CloudExpo, Apache, Cloud Security

Microservices Expo: Article

The Data Explosion

Is data is growing out of control?

Data explosion is one of the biggest issues facing IT today. The amount of data that organizations store has grown exponentially in the last 10 years. According to Gartner research director April Adams, data capacity on average in enterprises grows at 40 percent to 60 percent year over year.

Data is the lifeblood of any business, and companies of all sizes are struggling with the increasing amount of data stored on their networks. Because storage capacity has increased and costs have declined, many IT administrators have become more lax about what they allow their users to store on the corporate network and for how long. While the ability to store increasing amounts of data empowers organizations, it also presents them with the challenge of managing all of that information. As network storage grows, users are also adding an additional layer of complexity as they become increasingly dependent on ubiquitous access: they want to be able to access their data from wherever they are and from a variety of devices, including smartphones, tablets and laptops.

One approach is to just back everything up, but this tactic actually impedes your ability to get operations back up and running when a failure takes place. Going through mounds of unorganized data just isn't feasible and can cause companies to waste valuable time during a disaster. Businesses simply can't afford to treat all data equally, and prioritization is key. Companies may encounter serious issues if they store huge amounts of data onto tapes or into the cloud indiscriminately.

In sum, tougher recovery demands compound the problem of growing data. Organizations are intolerant of any data loss or downtime, putting a lot of pressure on IT managers, who are working in environments in flux thanks to evolving technologies and a growing variety of endpoints that need to be protected.

The 10 Percent Rule
Not all data is created equal. There is some critical data that, when lost, will bring a business to a halt. On average, only 10 percent of an organization's data is critical. "Critical" means that a file is in active use or changes frequently. That's typically about 10% of a company's information and represents the items they access daily and need immediately when a disaster strikes. Critical varies from organization to organization, but every minute spent recovering this data means lost productivity and lost revenue.

Of course, this doesn't mean that you don't need to protect the other 90 percent. It just means that you should prioritize. Arguably, all data is important, but organizations need a structured or tiered approach to ensure critical applications and systems are operational first in the event of data failure. They should plan and prioritize their information in advance, ideally with the help of professional data support personnel, so that they can recover information efficiently in the event of a disaster.

This approach will reduce downtime in the event of a widespread failure. If data is not prioritized, much time will be squandered recovering non-critical data, extending the length of a down period.

A Real Life Example
The benefits of a well-planned recovery strategy are best illustrated using a real world scenario. Let's consider a management consulting firm that has over one terabyte of data. Some of that data is Microsoft Exchange email, some resides on a file server and some of it is from a proprietary application for their business, which runs on a SCO UNIX server.

Using the 10 percent rule as a guide, the firm determines that if it were to experience data loss as the result of a server crash or other disaster, they would need to recover the last three months of their email, the last year of their file server data and the last three months of their UNIX data in order to get their business back up and running immediately. The rest of their data could be restored a day or two later without interruption to their productivity.

Armed with this information in advance, the organization uses a cloud-based backup vendor to design the backup and construct archiving rules to reflect their recovery time objective (RTO):

  1. Local Storage for Instant Recovery

This firm has a dedicated network storage location, so their cloud vendor pushes a copy of the backups to this location while simultaneously sending encrypted data to its data center facility. Using local storage, the organization can restore files from the local copy over its local area network, making recovery as fast as a file transfer.

  1. Time-Based Archiving Rules

In order to control the amount of critical data that remains in the cloud vendor's online vault and manage costs, they create rules that automatically push older data to archive after a specified period of time.

  1. Delta Blocking for Short Backup Windows

Although the cloud vendor is protecting over 1TB of data for them, nightly backups usually run in under one hour, sometimes as fast as 20 minutes. This is due to delta-blocking technology, which identifies changes made to a file and backs up only those changes, rather than the entire file.

By designating which data needs to be restored immediately and which does not, the organization receives a customized backup and recovery strategy that fits their recovery objectives and cost requirements.

Conclusion
Putting together a comprehensive recovery strategy like the one outlined above requires a certain amount of expertise and lots of upfront planning. While the "set it, and forget it" mentality is very attractive, data is growing too quickly and technology is changing too rapidly for companies to simply entrust their backups to just any cloud provider. You may have access only to a written Q&A or a junior technology staff member reading from a script when you need help restoring your critical data. Recovery could take a long time if you try to bring back all of your data at the same time. That's why advance prioritization of data is so essential.

When disaster strikes, the last thing an IT administrator wants is to fill out online forms or talk to someone who's reading from a script. Companies need competent providers who know their data environment, understand their business needs and can help walk them through the process.

More Stories By Jennifer Walzer

Jennifer Walzer, CEO and Founder of BUMI (www.BUMI.com), has an extensive background in technology and business strategy consulting. Prior to founding BUMI, she spent her career helping organizations of all sizes (from start ups to Fortune 1000 companies) with their back office systems and online web presence. She also successfully launched and sold a software development company focused on developing interactive voice response systems for multi-employer benefit funds. She has been invited to speak on various topics such as disaster recovery and data security at major conferences across the country.

Jennifer is a 2011 graduate of The Entrepreneurial Masters Program (EMP), an executive educational program jointly hosted by the MIT Enterprise Forum and Entrepreneurs’ Organization (EO).

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


@CloudExpo Stories
With tough new regulations coming to Europe on data privacy in May 2018, Calligo will explain why in reality the effect is global and transforms how you consider critical data. EU GDPR fundamentally rewrites the rules for cloud, Big Data and IoT. In his session at 21st Cloud Expo, Adam Ryan, Vice President and General Manager EMEA at Calligo, examined the regulations and provided insight on how it affects technology, challenges the established rules and will usher in new levels of diligence arou...
In his general session at 21st Cloud Expo, Greg Dumas, Calligo’s Vice President and G.M. of US operations, discussed the new Global Data Protection Regulation and how Calligo can help business stay compliant in digitally globalized world. Greg Dumas is Calligo's Vice President and G.M. of US operations. Calligo is an established service provider that provides an innovative platform for trusted cloud solutions. Calligo’s customers are typically most concerned about GDPR compliance, application p...
"I focus on what we are calling CAST Highlight, which is our SaaS application portfolio analysis tool. It is an extremely lightweight tool that can integrate with pretty much any build process right now," explained Andrew Siegmund, Application Migration Specialist for CAST, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
As many know, the first generation of Cloud Management Platform (CMP) solutions were designed for managing virtual infrastructure (IaaS) and traditional applications. But that's no longer enough to satisfy evolving and complex business requirements. In his session at 21st Cloud Expo, Scott Davis, Embotics CTO, explored how next-generation CMPs ensure organizations can manage cloud-native and microservice-based application architectures, while also facilitating agile DevOps methodology. He expla...
SYS-CON Events announced today that Evatronix will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Evatronix SA offers comprehensive solutions in the design and implementation of electronic systems, in CAD / CAM deployment, and also is a designer and manufacturer of advanced 3D scanners for professional applications.
SYS-CON Events announced today that Synametrics Technologies will exhibit at SYS-CON's 22nd International Cloud Expo®, which will take place on June 5-7, 2018, at the Javits Center in New York, NY. Synametrics Technologies is a privately held company based in Plainsboro, New Jersey that has been providing solutions for the developer community since 1997. Based on the success of its initial product offerings such as WinSQL, Xeams, SynaMan and Syncrify, Synametrics continues to create and hone inn...
Cloud Expo | DXWorld Expo have announced the conference tracks for Cloud Expo 2018. Cloud Expo will be held June 5-7, 2018, at the Javits Center in New York City, and November 6-8, 2018, at the Santa Clara Convention Center, Santa Clara, CA. Digital Transformation (DX) is a major focus with the introduction of DX Expo within the program. Successful transformation requires a laser focus on being data-driven and on using all the tools available that enable transformation if they plan to survive ov...
A strange thing is happening along the way to the Internet of Things, namely far too many devices to work with and manage. It has become clear that we'll need much higher efficiency user experiences that can allow us to more easily and scalably work with the thousands of devices that will soon be in each of our lives. Enter the conversational interface revolution, combining bots we can literally talk with, gesture to, and even direct with our thoughts, with embedded artificial intelligence, whic...
To get the most out of their data, successful companies are not focusing on queries and data lakes, they are actively integrating analytics into their operations with a data-first application development approach. Real-time adjustments to improve revenues, reduce costs, or mitigate risk rely on applications that minimize latency on a variety of data sources. In his session at @BigDataExpo, Jack Norris, Senior Vice President, Data and Applications at MapR Technologies, reviewed best practices to ...
Continuous Delivery makes it possible to exploit findings of cognitive psychology and neuroscience to increase the productivity and happiness of our teams. In his session at 22nd Cloud Expo | DXWorld Expo, Daniel Jones, CTO of EngineerBetter, will answer: How can we improve willpower and decrease technical debt? Is the present bias real? How can we turn it to our advantage? Can you increase a team’s effective IQ? How do DevOps & Product Teams increase empathy, and what impact does empath...
DevOps promotes continuous improvement through a culture of collaboration. But in real terms, how do you: Integrate activities across diverse teams and services? Make objective decisions with system-wide visibility? Use feedback loops to enable learning and improvement? With technology insights and real-world examples, in his general session at @DevOpsSummit, at 21st Cloud Expo, Andi Mann, Chief Technology Advocate at Splunk, explored how leading organizations use data-driven DevOps to close th...
Smart cities have the potential to change our lives at so many levels for citizens: less pollution, reduced parking obstacles, better health, education and more energy savings. Real-time data streaming and the Internet of Things (IoT) possess the power to turn this vision into a reality. However, most organizations today are building their data infrastructure to focus solely on addressing immediate business needs vs. a platform capable of quickly adapting emerging technologies to address future ...
Most technology leaders, contemporary and from the hardware era, are reshaping their businesses to do software. They hope to capture value from emerging technologies such as IoT, SDN, and AI. Ultimately, irrespective of the vertical, it is about deriving value from independent software applications participating in an ecosystem as one comprehensive solution. In his session at @ThingsExpo, Kausik Sridhar, founder and CTO of Pulzze Systems, discussed how given the magnitude of today's application ...
There is a huge demand for responsive, real-time mobile and web experiences, but current architectural patterns do not easily accommodate applications that respond to events in real time. Common solutions using message queues or HTTP long-polling quickly lead to resiliency, scalability and development velocity challenges. In his session at 21st Cloud Expo, Ryland Degnan, a Senior Software Engineer on the Netflix Edge Platform team, will discuss how by leveraging a reactive stream-based protocol,...
Mobile device usage has increased exponentially during the past several years, as consumers rely on handhelds for everything from news and weather to banking and purchases. What can we expect in the next few years? The way in which we interact with our devices will fundamentally change, as businesses leverage Artificial Intelligence. We already see this taking shape as businesses leverage AI for cost savings and customer responsiveness. This trend will continue, as AI is used for more sophistica...
In his session at 21st Cloud Expo, Raju Shreewastava, founder of Big Data Trunk, provided a fun and simple way to introduce Machine Leaning to anyone and everyone. He solved a machine learning problem and demonstrated an easy way to be able to do machine learning without even coding. Raju Shreewastava is the founder of Big Data Trunk (www.BigDataTrunk.com), a Big Data Training and consulting firm with offices in the United States. He previously led the data warehouse/business intelligence and B...
Digital transformation is about embracing digital technologies into a company's culture to better connect with its customers, automate processes, create better tools, enter new markets, etc. Such a transformation requires continuous orchestration across teams and an environment based on open collaboration and daily experiments. In his session at 21st Cloud Expo, Alex Casalboni, Technical (Cloud) Evangelist at Cloud Academy, explored and discussed the most urgent unsolved challenges to achieve f...
"Digital transformation - what we knew about it in the past has been redefined. Automation is going to play such a huge role in that because the culture, the technology, and the business operations are being shifted now," stated Brian Boeggeman, VP of Alliances & Partnerships at Ayehu, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
You know you need the cloud, but you're hesitant to simply dump everything at Amazon since you know that not all workloads are suitable for cloud. You know that you want the kind of ease of use and scalability that you get with public cloud, but your applications are architected in a way that makes the public cloud a non-starter. You're looking at private cloud solutions based on hyperconverged infrastructure, but you're concerned with the limits inherent in those technologies. What do you do?
"We started a Master of Science in business analytics - that's the hot topic. We serve the business community around San Francisco so we educate the working professionals and this is where they all want to be," explained Judy Lee, Associate Professor and Department Chair at Golden Gate University, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.