Welcome!

@CloudExpo Authors: Elizabeth White, Yeshim Deniz, Liz McMillan, Pat Romanski, William Schmarzo

Related Topics: Microservices Expo, @CloudExpo, Apache, Cloud Security

Microservices Expo: Article

The Data Explosion

Is data is growing out of control?

Data explosion is one of the biggest issues facing IT today. The amount of data that organizations store has grown exponentially in the last 10 years. According to Gartner research director April Adams, data capacity on average in enterprises grows at 40 percent to 60 percent year over year.

Data is the lifeblood of any business, and companies of all sizes are struggling with the increasing amount of data stored on their networks. Because storage capacity has increased and costs have declined, many IT administrators have become more lax about what they allow their users to store on the corporate network and for how long. While the ability to store increasing amounts of data empowers organizations, it also presents them with the challenge of managing all of that information. As network storage grows, users are also adding an additional layer of complexity as they become increasingly dependent on ubiquitous access: they want to be able to access their data from wherever they are and from a variety of devices, including smartphones, tablets and laptops.

One approach is to just back everything up, but this tactic actually impedes your ability to get operations back up and running when a failure takes place. Going through mounds of unorganized data just isn't feasible and can cause companies to waste valuable time during a disaster. Businesses simply can't afford to treat all data equally, and prioritization is key. Companies may encounter serious issues if they store huge amounts of data onto tapes or into the cloud indiscriminately.

In sum, tougher recovery demands compound the problem of growing data. Organizations are intolerant of any data loss or downtime, putting a lot of pressure on IT managers, who are working in environments in flux thanks to evolving technologies and a growing variety of endpoints that need to be protected.

The 10 Percent Rule
Not all data is created equal. There is some critical data that, when lost, will bring a business to a halt. On average, only 10 percent of an organization's data is critical. "Critical" means that a file is in active use or changes frequently. That's typically about 10% of a company's information and represents the items they access daily and need immediately when a disaster strikes. Critical varies from organization to organization, but every minute spent recovering this data means lost productivity and lost revenue.

Of course, this doesn't mean that you don't need to protect the other 90 percent. It just means that you should prioritize. Arguably, all data is important, but organizations need a structured or tiered approach to ensure critical applications and systems are operational first in the event of data failure. They should plan and prioritize their information in advance, ideally with the help of professional data support personnel, so that they can recover information efficiently in the event of a disaster.

This approach will reduce downtime in the event of a widespread failure. If data is not prioritized, much time will be squandered recovering non-critical data, extending the length of a down period.

A Real Life Example
The benefits of a well-planned recovery strategy are best illustrated using a real world scenario. Let's consider a management consulting firm that has over one terabyte of data. Some of that data is Microsoft Exchange email, some resides on a file server and some of it is from a proprietary application for their business, which runs on a SCO UNIX server.

Using the 10 percent rule as a guide, the firm determines that if it were to experience data loss as the result of a server crash or other disaster, they would need to recover the last three months of their email, the last year of their file server data and the last three months of their UNIX data in order to get their business back up and running immediately. The rest of their data could be restored a day or two later without interruption to their productivity.

Armed with this information in advance, the organization uses a cloud-based backup vendor to design the backup and construct archiving rules to reflect their recovery time objective (RTO):

  1. Local Storage for Instant Recovery

This firm has a dedicated network storage location, so their cloud vendor pushes a copy of the backups to this location while simultaneously sending encrypted data to its data center facility. Using local storage, the organization can restore files from the local copy over its local area network, making recovery as fast as a file transfer.

  1. Time-Based Archiving Rules

In order to control the amount of critical data that remains in the cloud vendor's online vault and manage costs, they create rules that automatically push older data to archive after a specified period of time.

  1. Delta Blocking for Short Backup Windows

Although the cloud vendor is protecting over 1TB of data for them, nightly backups usually run in under one hour, sometimes as fast as 20 minutes. This is due to delta-blocking technology, which identifies changes made to a file and backs up only those changes, rather than the entire file.

By designating which data needs to be restored immediately and which does not, the organization receives a customized backup and recovery strategy that fits their recovery objectives and cost requirements.

Conclusion
Putting together a comprehensive recovery strategy like the one outlined above requires a certain amount of expertise and lots of upfront planning. While the "set it, and forget it" mentality is very attractive, data is growing too quickly and technology is changing too rapidly for companies to simply entrust their backups to just any cloud provider. You may have access only to a written Q&A or a junior technology staff member reading from a script when you need help restoring your critical data. Recovery could take a long time if you try to bring back all of your data at the same time. That's why advance prioritization of data is so essential.

When disaster strikes, the last thing an IT administrator wants is to fill out online forms or talk to someone who's reading from a script. Companies need competent providers who know their data environment, understand their business needs and can help walk them through the process.

More Stories By Jennifer Walzer

Jennifer Walzer, CEO and Founder of BUMI (www.BUMI.com), has an extensive background in technology and business strategy consulting. Prior to founding BUMI, she spent her career helping organizations of all sizes (from start ups to Fortune 1000 companies) with their back office systems and online web presence. She also successfully launched and sold a software development company focused on developing interactive voice response systems for multi-employer benefit funds. She has been invited to speak on various topics such as disaster recovery and data security at major conferences across the country.

Jennifer is a 2011 graduate of The Entrepreneurial Masters Program (EMP), an executive educational program jointly hosted by the MIT Enterprise Forum and Entrepreneurs’ Organization (EO).

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


@CloudExpo Stories
In order to meet the rapidly changing demands of today’s customers, companies are continually forced to redefine their business strategies in order to meet these needs, stay relevant and continue to see profitable growth. IoT deployment and development is integral in this transformation, and today businesses are increasingly seeing the value of investing their resources into IoT deployments. These technologies are able increase ROI through projects such as connecting supply chains or enabling sm...
Every successful software product evolves from an idea to an enterprise system. Notably, the same way is passed by the product owner's company. In his session at 20th Cloud Expo, Oleg Lola, CEO of MobiDev, will provide a generalized overview of the evolution of a software product, the product owner, the needs that arise at various stages of this process, and the value brought by a software development partner to the product owner as a response to these needs.
SYS-CON Events announced today that Tappest will exhibit MooseFS at SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. MooseFS is a breakthrough concept in the storage industry. It allows you to secure stored data with either duplication or erasure coding using any server. The newest – 4.0 version of the software enables users to maintain the redundancy level with even 50% less hard drive space required. The software func...
SYS-CON Events announced today that EARP will exhibit at SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. "We are a software house, so we perfectly understand challenges that other software houses face in their projects. We can augment a team, that will work with the same standards and processes as our partners' internal teams. Our teams will deliver the same quality within the required time and budget just as our partn...
SYS-CON Events announced today that Interoute, owner-operator of one of Europe's largest networks and a global cloud services platform, has been named “Bronze Sponsor” of SYS-CON's 20th Cloud Expo, which will take place on June 6-8, 2017 at the Javits Center in New York, New York. Interoute is the owner-operator of one of Europe's largest networks and a global cloud services platform which encompasses 12 data centers, 14 virtual data centers and 31 colocation centers, with connections to 195 add...
Most technology leaders, contemporary and from the hardware era, are reshaping their businesses to do software in the hope of capturing value in IoT. Although IoT is relatively new in the market, it has already gone through many promotional terms such as IoE, IoX, SDX, Edge/Fog, Mist Compute, etc. Ultimately, irrespective of the name, it is about deriving value from independent software assets participating in an ecosystem as one comprehensive solution.
SYS-CON Events announced today that delaPlex will exhibit at SYS-CON's @ThingsExpo, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. delaPlex pioneered Software Development as a Service (SDaaS), which provides scalable resources to build, test, and deploy software. It’s a fast and more reliable way to develop a new product or expand your in-house team.
Amazon started as an online bookseller 20 years ago. Since then, it has evolved into a technology juggernaut that has disrupted multiple markets and industries and touches many aspects of our lives. It is a relentless technology and business model innovator driving disruption throughout numerous ecosystems. Amazon’s AWS revenues alone are approaching $16B a year making it one of the largest IT companies in the world. With dominant offerings in Cloud, IoT, eCommerce, Big Data, AI, Digital Assista...
You know you need the cloud, but you’re hesitant to simply dump everything at Amazon since you know that not all workloads are suitable for cloud. You know that you want the kind of ease of use and scalability that you get with public cloud, but your applications are architected in a way that makes the public cloud a non-starter. You’re looking at private cloud solutions based on hyperconverged infrastructure, but you’re concerned with the limits inherent in those technologies.
SYS-CON Events announced today that Systena America will exhibit at SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. Systena Group has been in business for various software development and verification in Japan, US, ASEAN, and China by utilizing the knowledge we gained from all types of device development for various industries including smartphones (Android/iOS), wireless communication, security technology and IoT serv...
SYS-CON Events announced today that Outscale will exhibit at SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. Outscale's technology makes an automated and adaptable Cloud available to businesses, supporting them in the most complex IT projects while controlling their operational aspects. You boost your IT infrastructure's reactivity, with request responses that only take a few seconds.
Everywhere we turn in our industry we can find strong opinions about the direction, type and nature of cloud’s impact on computing and business. Another word that is used in every context in our industry is “hybrid.” In his session at 20th Cloud Expo, Alvaro Gonzalez, Director of Technical, Partner and Field Marketing at Peak 10, will use a combination of a few conceptual props and some research recently commissioned by Peak 10 to offer a real-world consideration of how the various categories of...
DevOps at Cloud Expo – being held October 31 - November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA – announces that its Call for Papers is open. Born out of proven success in agile development, cloud computing, and process automation, DevOps is a macro trend you cannot afford to miss. From showcase success stories from early adopters and web-scale businesses, DevOps is expanding to organizations of all sizes, including the world's largest enterprises – and delivering real r...
Cloud applications are seeing a deluge of requests to support the exploding advanced analytics market. “Open analytics” is the emerging strategy to deliver that data through an open data access layer, in the cloud, to be directly consumed by external analytics tools and popular programming languages. An increasing number of data engineers and data scientists use a variety of platforms and advanced analytics languages such as SAS, R, Python and Java, as well as frameworks such as Hadoop and Spark...
Cloud promises the agility required by today’s digital businesses. As organizations adopt cloud based infrastructures and services, their IT resources become increasingly dynamic and hybrid in nature. Managing these require modern IT operations and tools. In his session at 20th Cloud Expo, Raj Sundaram, Senior Principal Product Manager at CA Technologies, will discuss how to modernize your IT operations in order to proactively manage your hybrid cloud and IT environments. He will be sharing bes...
Five years ago development was seen as a dead-end career, now it’s anything but – with an explosion in mobile and IoT initiatives increasing the demand for skilled engineers. But apart from having a ready supply of great coders, what constitutes true ‘DevOps Royalty’? It’ll be the ability to craft resilient architectures, supportability, security everywhere across the software lifecycle. In his keynote at @DevOpsSummit at 20th Cloud Expo, Jeffrey Scheaffer, GM and SVP, Continuous Delivery Busine...
Most DevOps journeys involve several phases of maturity. Research shows that the inflection point where organizations begin to see maximum value is when they implement tight integration deploying their code to their infrastructure. Success at this level is the last barrier to at-will deployment. Storage, for instance, is more capable than where we read and write data. In his session at @DevOpsSummit at 20th Cloud Expo, Josh Atwell, a Developer Advocate for NetApp, will discuss the role and value...
SYS-CON Events announced today that Outscale, a global pure play Infrastructure as a Service provider and strategic partner of Dassault Systèmes, will exhibit at SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. Founded in 2010, Outscale simplifies infrastructure complexities and boosts the business agility of its customers. Outscale delivers a secure, reliable and industrial strength solution for its customers, which in...
SYS-CON Events announced today that Peak 10, Inc., a national IT infrastructure and cloud services provider, will exhibit at SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. Peak 10 provides reliable, tailored data center and network services, cloud and managed services. Its solutions are designed to scale and adapt to customers’ changing business needs, enabling them to lower costs, improve performance and focus intern...
SYS-CON Events announced today that CollabNet, a global leader in enterprise software development, release automation and DevOps solutions, will be a Bronze Sponsor of SYS-CON's 20th International Cloud Expo®, taking place from June 6-8, 2017, at the Javits Center in New York City, NY. CollabNet offers a broad range of solutions with the mission of helping modern organizations deliver quality software at speed. The company’s latest innovation, the DevOps Lifecycle Manager (DLM), supports Value S...