Welcome!

Cloud Expo Authors: Jeremy Geelan, Helen Ching, Adrian Bridgwater, Pat Romanski, Jim Kaskade

Related Topics: Cloud Expo

Cloud Expo: Article

Real-World Use Cases: Cloud Storage Workloads

What is your data workload?

In my previous article, "Cloud Computing Public or Private? How to Choose Cloud Storage," we covered choosing between public and private cloud storage and the appropriate data types for cloud storage. This month we will dig deeper into the workloads and file creation patterns that best fit cloud storage with a focus on private clouds. Rather than file types, the discussion will cover how files are managed and where cloud storage fits, along with a few real-world use cases.

When choosing any storage solution it's important to consider the workload and data usage patterns. This even goes beyond storage - application workloads drive server, network and all IT infrastructure decisions. Sure, most vendors will tell you that their product is the best solution for any workload, and when choices were few, that was somewhat accurate. However, today there are many different offerings, each with strengths and weaknesses in different situations.  This article will review six workload scenarios and identify where cloud storage is a good fit and where it is a poor fit.

Rapidly Changing Single File Workloads
Examples of a rapidly changing single file workload would include I/O patterns of a database, source code repository, or an active spreadsheet. In this workload there is either a very powerful single server, or many users sharing a single file. In both cases, updates to a single file are constant and rapid, driving the need for a tier-one class of storage. To facilitate this workload, the system should have lots of memory; fast, hard drives; and the ability to create snapshots for instant data protection. Today this market is well served by Enterprise NAS vendors such as EMC and NetApp.

Data Ingestion Workloads
The best example of a data ingestion workload is video surveillance. Consider, for example, the city of London and its thousands of cameras, each streaming write operations to storage. Every camera creates its own set of files and needs fast access to storage. This is an excellent workload for private cloud storage. A private storage cloud has many storage nodes that can ingest streams of information independently so there is no data bottleneck. A camera-to-storage node ratio can be established, say 10 cameras per node, and then replicated out to hundreds of nodes, and enabling thousands of cameras. Since the cloud is centrally managed, a single administrator can easily manage the video surveillance storage for the entire city.

Read-Intensive Workloads
Video streaming and online video sharing are categorized as read-intensive workloads. Consider the example of the Beijing Olympics last summer. There was unbelievable demand for online video of the events, and in the U.S. the focus was on men's swimming. When the U.S. relay team won by a fraction of a second, everybody wanted to watch. Millions of people flocked to the web and video servers churned out views. This creates a unique storage demand. With thousands of web servers trying to read a single file, the architecture must support parallel reads. With hundreds of independent nodes serving out many copies of the same file, cloud storage provides the ideal solution to read intensive workloads.

High Performance Computing (HPC) Workloads
HPC workloads are similar to data ingestion workloads with one important difference - access to a single file. Rather than every client creating a unique file, hundreds or thousands of systems access a single file that is striped across many nodes for performance. This workload requires tight coordination between every node in the cluster to ensure data integrity, file locking, and cache coherence. HPC storage is used extensively in oil and gas exploration and financial data modeling where complex transactions are processed by compute clusters. There are a number of established HPC storage vendors include Panasas, Isilon and NetApp GX.

Single Producer, Many Consumer Workloads
In June 2008, the NASA Phoenix Mars Lander discovered ice crystals on the surface of Mars. The world reacted, scientists and religious organizations confirmed their unique theories about the universe, and everybody wanted access to the data. Given the challenges of landing on Mars and collecting soil samples, it's safe to say this is an example of a write once, consume many workload. Other examples include genomic sequence findings and quarterly business results. All share a single creation event with demand for multiple points of read access. Cloud storage protects data by replicating files to one or more nodes. This same activity can create many access points, enabling a single creation event to be easily shared amongst many consumers.

Archive or Content Depot Workloads
In most cases as data ages it becomes less active. Whether it is corporate information or media content, it is important that this data be kept available, but at a cost relative to its value. Private cloud storage economics and scale capabilities are designed to address this use case. Data can be copied to the cloud to free up more expensive tier-one storage devices and delay costly infrastructure upgrades. Cloud storage can be expanded on demand using the latest (or oldest) commodity hardware and a few simple mouse clicks. When it comes time to retire cloud hardware, it can be removed without downtime, preserving access and enabling 50 year archives.

What Is Your Data Workload?
When considering storage choices, ignore the "we can do everything" vendors and think about your workload. Once you understand your requirements and how the data will be used, your answer will emerge.

More Stories By Mike Maxey

Mike Maxey is director of product management for ParaScale, a Silicon Valley startup focused on addressing the exploding bulk storage requirements for digital content and archival data.

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


Cloud Expo Breaking News
With Cloud Expo 2012 New York (10th Cloud Expo) now under four months away, what better time to start introducing you in greater detail to the distinguished individuals in our incredible Speaker Faculty for the technical and strategy sessions at the conference... We have technical and strategy sessions for you every day from June 11 through June 14 dealing with every nook and cranny of Cloud Computing and Big Data, but what of those who are presenting? Who are they, where do they work, what e...
2011 was a year of rapid adoption for public and private cloud services. Instant and on-demand server provisioning was the driving force behind the massive growth. On top, cloud server templates and script automation simplified application installation for simple and pre-defined application stacks, but have not targeted more complex enterprise application environments. In his session at the 10th International Cloud Expo, John Yung, CEO of Appcara, will discuss how 2012 will be the year for app...
"Having been in the IT field for many years, I believe the cloud computing chapter in the industry is an exciting one and I am proud to be a part of it," said National Reconaissance Office (NRO) Chief Information Officer Jill T. Singer Tuesday, as it was announced that she was one of 10 winners of the 2012 CloudNOW "Top Ten Women in Cloud" Awards.
As more enterprises are adopting clouds, the nature of cloud computing is changing. Previously, clouds were used to test applications or for non-mission critical applications. Today, enterprises are using clouds for cost-saving advantages and launching more mission critical applications that have defined performance needs. In his session at the 10th International Cloud Expo, Eric Shepcaro, CEO and Chairman of the Board of Telx, will discuss how distributed computing has many advantages. It wou...
With Cloud Expo 2012 New York (10th Cloud Expo) just four months away, what better time to start introducing you in greater detail to the distinguished individuals in our incredible Speaker Faculty for the technical and strategy sessions at the conference... We have technical and strategy sessions for you every day from June 11 through June 14 dealing with every nook and cranny of Cloud Computing and Big Data, but what of those who are presenting? Who are they, where do they work, what else h...
Building a cloud computing environment with on-demand access to compute, network, and storage resources requires an elastic infrastructure at multiple levels. Virtualization combined with x86 servers has transformed the way we scale out compute resources. Unfortunately, legacy Fibre Channel and iSCSI storage architectures are rooted in rigid mainframe-era designs, and are fundamentally mismatched with the dynamic, shared modern data center. In his session at the 10th International Cloud Expo, ...
With Cloud Expo 2012 New York (10th Cloud Expo) now under four months away, what better time to start introducing you in greater detail to the distinguished individuals in our incredible Speaker Faculty for the technical and strategy sessions at the conference... We have technical and strategy sessions for you every day from June 11 through June 14 dealing with every nook and cranny of Cloud Computing and Big Data, but what of those who are presenting? Who are they, where do they work, what e...
With Big Data Expo 2012 New York (co-located with 10th Cloud Expo) now under four months away, what better time to start introducing you in greater detail to the distinguished individuals in our incredible Speaker Faculty for the technical and strategy sessions at the conference... We have technical and strategy sessions for you every day from June 11 through June 14 dealing with every nook and cranny of Cloud Computing and Big Data, but what of those who are presenting? Who are they, where ...
With Big Data Expo 2012 New York (co-located with 10th Cloud Expo) just four months away, what better time to start introducing you in greater detail to the distinguished individuals in our incredible Speaker Faculty for the technical and strategy sessions at the conference...
Can you bring services from the cloud to your customers faster and have them adopt it with ease of use or bring the power of bundled services to the fingertips of your clients without creating new rigid ‘apps stove pipes'? Do you want to prevent your business running away to public and unmanageably immature cloud services? In his session at the 10th International Cloud Expo, Hans van de Koppel, Sr. Enterprise Architect at Capgemini, will take Cloud Expo delegates to the developing world of clou...