Welcome!

@CloudExpo Authors: Elizabeth White, Liz McMillan, William Schmarzo, Yeshim Deniz, Harry Trott

Related Topics: Containers Expo Blog, Java IoT, Microservices Expo, Agile Computing, @CloudExpo, Apache

Containers Expo Blog: Blog Feed Post

Object Storage Not Yet Defined

Agreed that object storage platforms scale better than file systems & NAS

The ExecEvent Object Storage Summit earlier this month continued to generate buzz on the industry, which is very exciting. Amplidata was represented – in spirit – at the Summit by our partners Intel and Quantum; due to an insane travel and show schedule this fall that kept us from attending personally.  We’re grateful for the mention in Storage Switzerland’s sponsor briefing articles. Very cool! With all the great stuff that has been happening for Amplidata lately, including the awesome performance test results by Howard Marks, we felt a bit like we were missing our own birthday party. We’ll be there next time!

The event fostered a few “What is Object Storage?” posts from, amongst others, George Crump. Jim O’Reilly also posted a very interesting article, although I’m not sure if he was at the event. If he wasn’t, he should be next time!

Both articles add to the body of knowledge that is rapidly evolving on what object storage is, and why customers should adopt it – so, every article helps. With a topic as technical as object storage, it’s easy to evangelize with a deep technical dive.  But that misses the “elegant simplicity” point.  Hence we love George’s use of the car park analogy which we ourselves often embrace.  His article was a helpful at-a-glance overview.  On a more technical level, Jim’s explanation of such concepts as immutable blobs, “the original version is the only version”, objects still look like files etc. offer more on how object storage really works. George’s analysis on how “Objects are given unique ID numbers” is what’s missing in Jim’s article. I guess, what we’re saying is “read both articles.”

But read them critically, and you will see that we’re not there yet. As you can read in Jim’s article, the paradigm has been around much longer than many of us know and we’re not complete in defining the best use cases, implementations, architectures, etc. For example, I’m not at all sure about the reduced metadata George writes about. I believe that over time, as we start using richer applications, we will be storing more metadata, not less. To me, Jim’s statement “To be an object, a blob of data needs a much more detailed descriptor record than what file systems use.” is more accurate.

Both articles also cover the “why” of Object Storage. I’m not sure I see the use of Jim’s deduplication paragraph, and I think we are missing erasure coding as an alternative to RAID in his article (replication can be expensive too!). Jim accurately mentions that block storage was I/O focused, but omits the exceptional throughput performance some of the object stores deliver. A good thing is that Jim sees the scalability, flexibility and cost-saving opportunities. Finally, I very much like his use cases: Google Picasa, Amazon S3, Genome etc. and it is very interesting to read that Jim sees potential for object storage in the Big Data analytics space.

So back to George’s take on why we need object storage. Agreed that object storage platforms scale better than file systems & NAS but, again, not so much because of the metadata. File systems have different challenges, such as the granularity of the hardware, limitations on numbers of files or the number of levels in the hierarchy. Distributed file systems tried to solve some of these issues, but object storage is just a much simpler approach. Agreed that adding NAS heads is an expensive and not so great solution!

The second topic I thought was interesting was the issue of “bit rot”. Bit rot is a real problem and will lead to data loss with traditional storage technologies, but not every object store will solve that. How I understood it is that it is the underlying data protection scheme that solves the problem of bit rot, not necessarily Object Storage. Erasure Coding detects bit rot and prevents data loss.  I don’t think you could restore the content of an object using the identifier, but maybe there is some really cool technology out there that I don’t know of. As George wrote “The storage system does not need an elaborate RAID protection algorithm nor do its administrators need to suffer through long RAID rebuild cycles”, I think he actually alludes to Erasure Coding but didn’t want to go that deep in this article.

Another interesting point in George’s article is the issue with backups. Once you go into the petabyte range, it becomes very unwieldy to backup data. He mentions the backup window, but add to that the overhead cost. George promotes using the unique IDs to make sure “that there are always copies of each object available on-site and off-site.” Again with the proper underlying protection schemes (erasure coding) you can rule out backups altogether!

I’m sure both George and Jim will appreciate the feedback – I fully agree with the benefits object storage brings to track iterations of files and the paragraph on geo dispersion, which we have termed geo-spreading. Finally, I hope to read some more of George’s thoughts about how object storage can help to monetize archived data as that, to me, is a key argument for this new but then again not so new storage paradigm. This is obviously not the end of the discussion; a lot will and needs to be said about this new paradigm. I’m looking forward to attending the next Object Storage events…

Read the original blog entry...

More Stories By Tom Leyden

Tom Leyden is VP Product Marketing at Scality. Scality was founded in 2009 by a team of entrepreneurs and technologists. The idea wasn’t storage, per se. When the Scality team talked to the initial base of potential customers, the customers wanted a system that could “route” data to and from individual users in the most scalable, efficient way possible. And so began a non-traditional approach to building a storage system that no one had imagined before. No one thought an object store could have enough performance for all the files and attachments of millions of users. No one thought a system could remain up and running through software upgrades, hardware failures, capacity expansions, and even multiple hardware generations coexisting. And no one believed you could do all this and scale to petabytes of content and billions of objects in pure software.

@CloudExpo Stories
You know you need the cloud, but you’re hesitant to simply dump everything at Amazon since you know that not all workloads are suitable for cloud. You know that you want the kind of ease of use and scalability that you get with public cloud, but your applications are architected in a way that makes the public cloud a non-starter. You’re looking at private cloud solutions based on hyperconverged infrastructure, but you’re concerned with the limits inherent in those technologies.
SYS-CON Events announced today that Massive Networks will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Massive Networks mission is simple. To help your business operate seamlessly with fast, reliable, and secure internet and network solutions. Improve your customer's experience with outstanding connections to your cloud.
DevOps is under attack because developers don’t want to mess with infrastructure. They will happily own their code into production, but want to use platforms instead of raw automation. That’s changing the landscape that we understand as DevOps with both architecture concepts (CloudNative) and process redefinition (SRE). Rob Hirschfeld’s recent work in Kubernetes operations has led to the conclusion that containers and related platforms have changed the way we should be thinking about DevOps and...
Everything run by electricity will eventually be connected to the Internet. Get ahead of the Internet of Things revolution and join Akvelon expert and IoT industry leader, Sergey Grebnov, in his session at @ThingsExpo, for an educational dive into the world of managing your home, workplace and all the devices they contain with the power of machine-based AI and intelligent Bot services for a completely streamlined experience.
Because IoT devices are deployed in mission-critical environments more than ever before, it’s increasingly imperative they be truly smart. IoT sensors simply stockpiling data isn’t useful. IoT must be artificially and naturally intelligent in order to provide more value In his session at @ThingsExpo, John Crupi, Vice President and Engineering System Architect at Greenwave Systems, will discuss how IoT artificial intelligence (AI) can be carried out via edge analytics and machine learning techn...
FinTechs use the cloud to operate at the speed and scale of digital financial activity, but are often hindered by the complexity of managing security and compliance in the cloud. In his session at 20th Cloud Expo, Sesh Murthy, co-founder and CTO of Cloud Raxak, showed how proactive and automated cloud security enables FinTechs to leverage the cloud to achieve their business goals. Through business-driven cloud security, FinTechs can speed time-to-market, diminish risk and costs, maintain continu...
SYS-CON Events announced today that Datera, that offers a radically new data management architecture, has been named "Exhibitor" of SYS-CON's 21st International Cloud Expo ®, which will take place on Oct 31 - Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Datera is transforming the traditional datacenter model through modern cloud simplicity. The technology industry is at another major inflection point. The rise of mobile, the Internet of Things, data storage and Big...
Existing Big Data solutions are mainly focused on the discovery and analysis of data. The solutions are scalable and highly available but tedious when swapping in and swapping out occurs in disarray and thrashing takes place. The resolution for thrashing through machine learning algorithms and support nomenclature is through simple techniques. Organizations that have been collecting large customer data are increasingly seeing the need to use the data for swapping in and out and thrashing occurs ...
As many know, the first generation of Cloud Management Platform (CMP) solutions were designed for managing virtual infrastructure (IaaS) and traditional applications. But that’s no longer enough to satisfy evolving and complex business requirements. In his session at 21st Cloud Expo, Scott Davis, Embotics CTO, will explore how next-generation CMPs ensure organizations can manage cloud-native and microservice-based application architectures, while also facilitating agile DevOps methodology. He wi...
SYS-CON Events announced today that CA Technologies has been named "Platinum Sponsor" of SYS-CON's 21st International Cloud Expo®, which will take place October 31-November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. CA Technologies helps customers succeed in a future where every business - from apparel to energy - is being rewritten by software. From planning to development to management to security, CA creates software that fuels transformation for companies in the applic...
From 2013, NTT Communications has been providing cPaaS service, SkyWay. Its customer’s expectations for leveraging WebRTC technology are not only typical real-time communication use cases such as Web conference, remote education, but also IoT use cases such as remote camera monitoring, smart-glass, and robotic. Because of this, NTT Communications has numerous IoT business use-cases that its customers are developing on top of PaaS. WebRTC will lead IoT businesses to be more innovative and address...
An increasing number of companies are creating products that combine data with analytical capabilities. Running interactive queries on Big Data requires complex architectures to store and query data effectively, typically involving data streams, an choosing efficient file format/database and multiple independent systems that are tied together through custom-engineered pipelines. In his session at @BigDataExpo at @ThingsExpo, Tomer Levi, a senior software engineer at Intel’s Advanced Analytics ...
Blockchain is a shared, secure record of exchange that establishes trust, accountability and transparency across business networks. Supported by the Linux Foundation's open source, open-standards based Hyperledger Project, Blockchain has the potential to improve regulatory compliance, reduce cost as well as advance trade. Are you curious about how Blockchain is built for business? In her session at 21st Cloud Expo, René Bostic, Technical VP of the IBM Cloud Unit in North America, will discuss th...
yperConvergence came to market with the objective of being simple, flexible and to help drive down operating expenses. It reduced the footprint by bundling the compute/storage/network into one box. This brought a new set of challenges as the HyperConverged vendors are very focused on their own proprietary building blocks. If you want to scale in a certain way, let’s say you identified a need for more storage and want to add a device that is not sold by the HyperConverged vendor, forget about it....
SYS-CON Events announced today that App2Cloud will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct. 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. App2Cloud is an online Platform, specializing in migrating legacy applications to any Cloud Providers (AWS, Azure, Google Cloud).
While some vendors scramble to create and sell you a fancy solution for monitoring your spanking new Amazon Lambdas, hear how you can do it on the cheap using just built-in Java APIs yourself. By exploiting a little-known fact that Lambdas aren’t exactly single-threaded, you can effectively identify hot spots in your serverless code. In his session at @DevOpsSummit at 21st Cloud Expo, Dave Martin, Product owner at CA Technologies, will give a live demonstration and code walkthrough, showing how ...
SYS-CON Events announced today that MobiDev, a client-oriented software development company, will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place October 31-November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. MobiDev is a software company that develops and delivers turn-key mobile apps, websites, web services, and complex software systems for startups and enterprises. Since 2009 it has grown from a small group of passionate engineers and business...
Internet of @ThingsExpo, taking place October 31 - November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA, is co-located with 21st Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. The Internet of Things (IoT) is the most profound change in personal and enterprise IT since the creation of the Worldwide Web more than 20 years ago. All major researchers estimate there will be tens of billions devic...
SYS-CON Events announced today that Dasher Technologies will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 - Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Dasher Technologies, Inc. ® is a premier IT solution provider that delivers expert technical resources along with trusted account executives to architect and deliver complete IT solutions and services to help our clients execute their goals, plans and objectives. Since 1999, we'v...
SYS-CON Events announced today that Ayehu will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct. 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Ayehu provides IT Process Automation & Orchestration solutions for IT and Security professionals to identify and resolve critical incidents and enable rapid containment, eradication, and recovery from cyber security breaches. Ayehu provides customers greater control over IT infrastructure throu...