Welcome!

@CloudExpo Authors: Liz McMillan, Zakia Bouachraoui, Yeshim Deniz, Pat Romanski, Elizabeth White

Related Topics: @CloudExpo

@CloudExpo: Blog Feed Post

Cloud Storage for Your File Server

How much bandwidth you need to run file server with cloud storage?

As I was explaining Gladinet CloudAFS to a group of IT people as a file server with cloud storage attached, one question I got was:

How much bandwidth you need to run file server with cloud storage?

It is very obvious where the concern is coming from. If I had an on-premise file server, with cloud storage attached as tier 2 storage, will the slowness of the Internet (as compared to LAN) slow down the file server activities?

Rest assured, Gladinet CloudAFS takes care of this. Specifically, it takes care of two issues.

A. Slow Internet to the Cloud Storage – that is why it is tier2 storage.

B. There is a glitch, Internet is down.

The secret to both maintain local LAN performance and leverage Cloud Storage is to have a tier 1 storage locally on the hard drive (or NAS or SAN). When people read and write to the file server, it is writing to and reading from the tier 1 storage, thus gives you an impression of local storage because it is indeed local storage.

Gladinet CloudAFS will then schedule transfer to and from the tier 2 storage asynchronously.  You can schedule the transfer to be 12 hours later than the tier 1 storage hit so it can happen at night when nobody is working. If the Internet is down at a moment, the transfer will resume and retry later on.

Use Case 1 – File Server Backup

When your Internet is slow or the Internet is not quite reliable, you need to put cloud storage to a more secondary role, such as backup.

In this use case, you can configure Gladinet CloudAFS tier 1 storage to have the same size as tier2 cloud storage. This way, the read/write is always from local and you are guaranteed to have 100% up time and 100% local performance. The cloud storage is something nice to have just in case that your local server crash or out of capacity, you can get it back immediately by provision another CloudAFS server (5 minute install when OS is ready).

The benefit of this use case is that you can always quickly re-provision your file server in case something happens to it.

Use Case 2 – Cloud Storage Gateway

When your Internet is fast and the Internet is reliable, you can put cloud storage to a more prominent role, or on the other hand, put CloudAFS to a more secondary role such as a Cloud Storage Gateway with cache.

For example, if you file server is with Amazon EC2 and your storage is Amazon S3, you know you have fast and reliable connections in between, you have a fit for the Cloud Storage Gateway use case.

Another example, if you are hosting with Peer1 server and also using Peer1’s CloudOne storage, you have a fast and reliable connection in between too. This case fit you as well.

In this case, the local cache on the CloudAFS can be smaller than the actual cloud storage capacity.

When the cache is smaller, you will incur cache miss if a read to the CloudAFS server doesn’t already have the file locally. This case the file needs to be fetched from the cloud storage over the Internet. Since you have fast and reliable connection, this first time and one time cost is OK.

The benefit of this use case is that you can ship a lot of data over to cloud storage. For example, you can turn local backup software to cloud backup software by pointing the backup destination to CloudAFS.

CloudAFS Parameters

The two biggest parameter for CloudAFS are

A. How big the local cache is. The slower and less reliable the Internet is, the bigger the local cache you will need to assign.

B. Scheduled flush time. CloudAFS use a write-through cache with flush schedule. The slower the Internet you have, the more delay you will need to push the synchronization to night time.

At this time, CloudAFS supports a wide variety of cloud storage providers, including Amazon S3, Windows Azure, Rackspace Cloud Files, EMC Atmos, Mezeo, OpenStack, Nirvanix, AT&T Synaptic Storage, Peer 1 CloudOne and more.

Speed is an important factor when you apply cloud storage to file server. Visit other earlier posts about how to pick a cloud storage providers.

Read the original blog entry...

More Stories By Jerry Huang

Jerry Huang, an engineer and entrepreneur, founded Gladinet with his close friends and is pursuing interests in the cloud computing. He has published articles on the company blog as well as following up on the company twitter activities. He graduated from the University of Michigan in 1998 and has lived in West Palm Beach, Florida since.

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


CloudEXPO Stories
With more than 30 Kubernetes solutions in the marketplace, it's tempting to think Kubernetes and the vendor ecosystem has solved the problem of operationalizing containers at scale or of automatically managing the elasticity of the underlying infrastructure that these solutions need to be truly scalable. Far from it. There are at least six major pain points that companies experience when they try to deploy and run Kubernetes in their complex environments. In this presentation, the speaker will detail these pain points and explain how cloud can address them.
The deluge of IoT sensor data collected from connected devices and the powerful AI required to make that data actionable are giving rise to a hybrid ecosystem in which cloud, on-prem and edge processes become interweaved. Attendees will learn how emerging composable infrastructure solutions deliver the adaptive architecture needed to manage this new data reality. Machine learning algorithms can better anticipate data storms and automate resources to support surges, including fully scalable GPU-centric compute for the most data-intensive applications. Hyperconverged systems already in place can be revitalized with vendor-agnostic, PCIe-deployed, disaggregated approach to composable, maximizing the value of previous investments.
When building large, cloud-based applications that operate at a high scale, it's important to maintain a high availability and resilience to failures. In order to do that, you must be tolerant of failures, even in light of failures in other areas of your application. "Fly two mistakes high" is an old adage in the radio control airplane hobby. It means, fly high enough so that if you make a mistake, you can continue flying with room to still make mistakes. In his session at 18th Cloud Expo, Lee Atchison, Principal Cloud Architect and Advocate at New Relic, discussed how this same philosophy can be applied to highly scaled applications, and can dramatically increase your resilience to failure.
Machine learning has taken residence at our cities' cores and now we can finally have "smart cities." Cities are a collection of buildings made to provide the structure and safety necessary for people to function, create and survive. Buildings are a pool of ever-changing performance data from large automated systems such as heating and cooling to the people that live and work within them. Through machine learning, buildings can optimize performance, reduce costs, and improve occupant comfort by sharing information within the building and with outside city infrastructure via real time shared cloud capabilities.
As Cybric's Chief Technology Officer, Mike D. Kail is responsible for the strategic vision and technical direction of the platform. Prior to founding Cybric, Mike was Yahoo's CIO and SVP of Infrastructure, where he led the IT and Data Center functions for the company. He has more than 24 years of IT Operations experience with a focus on highly-scalable architectures.