Amazon’s Elastic Block Store Opens Up S3 and The Cloud

The Big SAN in the Sky

Cloud infrastructure providers like Amazon are putting out the technology that the enterprise and SaaS providers need to move beyond testing the waters and take advantage of the Cloud today. The latest, and most important from the data storage perspective, is Amazon’s Elastic Block Store, or EBS.

Over the years we’ve witnessed a shift to hosted IT infrastructure, where all the issues surrounding the physical plant are consolidated and managed by a specialist service. In the past six months cloud computing has taken off at an incredible rate, and it now allows businesses to shed the problems of ordering, racking and maintaining servers and disk storage systems.

The public cloud is now knocking down the barriers to a broader business audience that has seen the advantages of “pay as you go” IT and of not having to build or rent another data center. Why do that when you can instantly spin up 10 or 1,000 virtual server instances at a fraction of the cost?

Datasets, Throughput and Snapshots
In short, EBS is a SAN (Storage Area Network) in the cloud that works with Amazon’s existing Elastic Compute Cloud (EC2) and Simple Storage Service (S3). One hurdle for many businesses has been the data storage and throughput limits for each instance. Now you can allocate a disk volume of 1GB to 1TB from what is a virtually endless SAN in the cloud, and attach it to an instance running in EC2. The volume is stored on redundant disks and has a lifetime that's separate from any instance on which it is mounted. This is important because, previously, data on an instance's local disk was lost when the instance was terminated. Now you can unmount the volume and later remount it on another instance. We’ll look at how to get very large datasets into the cloud using EBS.
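The lifecycle described above (allocate a volume, attach it, detach it, attach it elsewhere) can be sketched as a small in-memory model. This is only an illustration under stated assumptions: the `Volume` class, its methods and the instance ids are hypothetical names invented here, and none of this is the AWS API.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional

MIN_GB, MAX_GB = 1, 1024  # 1GB to 1TB, the range given in the article


@dataclass
class Volume:
    """Toy stand-in for an EBS volume (hypothetical, not the AWS API)."""
    size_gb: int
    data: Dict[str, bytes] = field(default_factory=dict)
    attached_to: Optional[str] = None  # instance id, or None when detached

    def __post_init__(self) -> None:
        if not MIN_GB <= self.size_gb <= MAX_GB:
            raise ValueError(f"size must be between {MIN_GB} and {MAX_GB} GB")

    def attach(self, instance_id: str) -> None:
        # A volume is mounted on at most one instance at a time in this model.
        if self.attached_to is not None:
            raise RuntimeError(f"already attached to {self.attached_to}")
        self.attached_to = instance_id

    def detach(self) -> None:
        # The data stays with the volume, not with the instance.
        self.attached_to = None


def demo() -> bytes:
    vol = Volume(size_gb=100)
    vol.attach("instance-a")          # hypothetical instance ids
    vol.data["db/table1"] = b"rows"   # written while mounted on instance-a
    vol.detach()                      # instance-a could now go away
    vol.attach("instance-b")
    return vol.data["db/table1"]      # still readable from instance-b
```

The point of the model is that `data` belongs to the volume object, so nothing is lost when the first instance disappears.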

Another benefit of EBS is its snapshotting feature. You can snapshot a volume to S3, where it is stored with the redundancy and durability of all objects on S3. Moreover, successive snapshots are incremental, providing a very powerful and efficient backup capability for volumes. Taking a consistent snapshot can be complex, but RightScale provides scripts that make it easier to freeze all data access while the snapshot is taken, so that the data on the snapshot is consistent.
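To make "incremental" concrete, here is a minimal sketch of the general idea: each snapshot records only the blocks that changed since the previous one, and a restore replays the recorded changes in order. It assumes a volume can be modeled as a mapping from block number to contents and that blocks are overwritten but never removed. The `SnapshotStore` class is a hypothetical illustration and does not describe how Amazon actually implements EBS snapshots.

```python
from typing import Dict, List, Tuple

Blocks = Dict[int, bytes]  # block number -> block contents


class SnapshotStore:
    """Toy incremental snapshot store (illustrative only)."""

    def __init__(self) -> None:
        self.deltas: List[Blocks] = []  # one set of changed blocks per snapshot

    def snapshot(self, volume: Blocks) -> int:
        """Record only blocks that differ from the previous snapshot; return its id."""
        previous = self.restore(len(self.deltas) - 1) if self.deltas else {}
        delta = {n: b for n, b in volume.items() if previous.get(n) != b}
        self.deltas.append(delta)
        return len(self.deltas) - 1

    def restore(self, snapshot_id: int) -> Blocks:
        """Rebuild the full volume image by replaying deltas up to snapshot_id."""
        image: Blocks = {}
        for delta in self.deltas[: snapshot_id + 1]:
            image.update(delta)
        return image


def demo() -> Tuple[int, int]:
    store = SnapshotStore()
    volume: Blocks = {0: b"aaaa", 1: b"bbbb", 2: b"cccc"}
    first = store.snapshot(volume)    # stores all three blocks
    volume[1] = b"BBBB"               # one block changes
    second = store.snapshot(volume)   # stores only the changed block
    return len(store.deltas[first]), len(store.deltas[second])
```

In this model the second snapshot holds a single block, yet restoring it still yields the complete, current volume.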

The RightScale Dashboard supports all the features of EBS and offers a number of additional features, such as configuring volumes to be attached automatically to servers when they launch and tracking the ancestry of a volume or snapshot. What does EBS enable? In short: traditional processing on large datasets and reliable storage for many servers. Let's look at these two areas one by one.

Amazon Web Services are designed for scale. EC2, S3, SQS, and SDB are ideally suited for building large systems that process huge data volumes. The catch has been that they are geared towards modern service-oriented systems that use a non-relational database like Amazon SDB and thrive on large numbers of simple servers (EC2). Business users have more traditional applications, such as relational databases, that require large datasets stored in a file system with a POSIX interface. While an EC2 X-large instance comes with about 1.4TB of local disk space, that space is difficult to use in a production system. Populating the disk with data at boot time can take hours, and backups, replication and restoring the data after an instance failure are all sore points. For up to 100GB the timescales are workable, but beyond that it gets difficult.
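A quick back-of-the-envelope calculation shows why dataset size matters here. The sketch below assumes a sustained load rate of 50 MB/s, which is a made-up figure for illustration and not a measured EC2 or S3 number; the function name is likewise hypothetical.

```python
def load_time_hours(dataset_gb: float, throughput_mb_per_s: float) -> float:
    """Hours needed to copy dataset_gb at a sustained throughput (1 GB = 1000 MB)."""
    if throughput_mb_per_s <= 0:
        raise ValueError("throughput must be positive")
    seconds = dataset_gb * 1000 / throughput_mb_per_s
    return seconds / 3600


ASSUMED_MB_PER_S = 50.0  # assumption for illustration, not a measured figure

for size_gb in (100, 1400):  # 100GB and roughly 1.4TB, the sizes named above
    hours = load_time_hours(size_gb, ASSUMED_MB_PER_S)
    print(f"{size_gb} GB at {ASSUMED_MB_PER_S} MB/s: {hours:.1f} h")
```

Under that assumed rate, 100GB loads in a little over half an hour, while filling the full 1.4TB takes close to eight hours. A different rate changes the numbers but not the roughly 14x gap between the two cases.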

More Stories By Thorsten von Eicken

Thorsten von Eicken is the CTO and a founder of RightScale and is responsible for the overall technology direction of the RightScale Cloud Management Platform. Previously, he was founder and chief architect at Expertcity (acquired by Citrix Online), where he directed the architecture of the company's online services, including the popular GoToMeeting service. von Eicken also managed the Expertcity/Citrix data center operations, where he acquired deep knowledge in deploying and running secure, scalable online services. He was a professor of Computer Science at Cornell University and received his Ph.D. from the University of California, Berkeley.

Most Recent Comments
Jeremy Geelan 08/21/08 02:03:47 PM EDT

Dr von Eicken will be giving a technical session at SYS-CON's "Cloud Computing Expo" (November 19-21, 2008) - a major adjunct to the 4th International Virtualization Conference & Expo being held at The Fairmont Hotel in San Jose, CA - in which he will distill the unique characteristics of clouds and describe how to best think about deployments in the clouds.
