Welcome!

Cloud Expo Authors: Carmen Gonzalez, Scott Bampton, Gilad Parann-Nissany, Elizabeth White, Paige Leidig

Related Topics: Cloud Expo, Virtualization

Cloud Expo: Article

Five Capacity Management Challenges for Private Clouds

Capacity management enables cloud operators to maximize compute cycles delivered to a customer at the lowest possible cost

Organizations that are seeking to deploy cloud-based business models for their infrastructure face unique capacity management challenges. This article will review these challenges to enable cloud providers, either public or private, to avoid the pitfalls of improper capacity management.  While the article discusses both types of clouds, the needs of private cloud providers will be especially highlighted due to the unique challenges they face with this business model.

IT Becomes a Business within a Business
For years, we have been hearing "IT must behave more like a business." A hypervisor's ability to deliver utility computing moves this vision closer to reality. Using hypervisors to deploy an infrastructure cloud fundamentally changes the relationship between IT and their customers. Application portability combined with competing cloud offerings from companies like Amazon change the dynamic between application owners and corporate IT. If end users can't explicitly move their applications between cloud providers, they can at least compare prices and service levels between providers. Right or wrong, Amazon S3 becomes a measuring stick in price, performance and service for IT organizations.

To deliver an Amazon S3 experience requires significant retooling in IT processes. Capacity management is one of the areas requiring retooling. Virtualization alone causes changes to capacity management (for more information, see http://www.vkernel.com/solutions/capacity-planning). But virtualization deployed as an infrastructure cloud adds nuances to the capacity management problem. Capacity management for cloud providers is unique for the following five reasons:

  • Capacity monitoring in addition to planning
  • Chargeback is mandatory
  • Efficiency drives return on assets
  • Tenant reporting requirements are unique
  • Optimization is a value add

Variable Demand Drives Criticality of Capacity Monitoring
Cloud deployments of virtualization technology introduce many operational changes for IT administrators. The first is a change in the amount of control IT has over the loads deployed on their hardware. With cloud deployments, either public or private, end users deploy applications using self-service portals as they see fit, load them as they desire and consume resources at whatever pace they need.

Hence, unlike the careful P2V sizing process undertaken for the first wave of virtualization where applications are sized, scheduled and deployed in a methodical manner, clouds have no careful sizing or timing that the cloud operator is aware of. Applications of unknown sizes appear, consume resources, and may just as quickly disappear. Without adequate capacity, these applications will fail to perform to customer expectations. Without real-time capacity monitoring, application deployment can dramatically impact other applications.

Hence capacity planning, a well-thought-out process for making sure there is sufficient capacity en masse, must be supplemented with capacity monitoring. Capacity monitoring is a real-time process that takes raw performance and utilization data and transforms it into actionable information concerning system-level capacity requirements. Without capacity monitoring, system administrators are left to interpret real-time utilization metrics from individual virtual machines. VKernel's research has shown that properly monitoring capacity in real time involves collecting over 20 metrics per VM at least 10 times per hour, and keeping this information for at least 30 days. A 100 VM environment would require about 17 million data points to accurately monitor capacity in the environment. This capacity monitoring, however, is a must-have to augment standard capacity planning and prevent performance issues from impacting the cloud.

Chargeback Matters
For any cloud where resources can be deployed in a self-service fashion, charging back for resources becomes a necessity. Without a method to chargeback or show back, self-service clouds would quickly find themselves at capacity since resources are essentially free.

But chargeback is a tricky area. For commercial cloud and private cloud operators, charging back for allocated resources is fairly straightforward. But, since the private chargeback operator is simply shifting costs insides the company and not impacting the bottom line, the motivations for chargeback are different. The public cloud operator is indifferent to allocated resources and utilized resources. If the public cloud operator is charging for an allocated resource pool, they make money. In fact, the higher the ratio between allocated and utilized, the more over allocation of resources is possible and the higher the profit margins. For private cloud operators, however, the goal is to actually lower the costs for the company. Hence, the private cloud operator wants the allocated resource usage to be very close to actual usage to drive resource efficiency. Highlighting the difference between actual resource usage and allocated resource usage shows internal business units the amount of corporate resources they are wasting. This motivation can then be used to right size environments and reduce overall IT spend.

While chargeback is important, cloud operators need to be mindful of what they charge. For public operators, there are competitive pressures. For private operators, charging provides a way to directly compare internal IT costs vs. external costs such as Amazon.

Is a simple $/CPU comparison between internal clouds and Amazon a fair comparison? Does Amazon contain the same level of compliance? Of control? Is the company comfortable with data outside the company firewall?

More important, chargeback for private cloud operators is primarily a means to minimize the difference between allocated and utilized resources to drive up efficiencies and VM densities. Chargeback or showback becomes a control mechanism as opposed to an actual financial transfer mechanism. Hence the rate of chargeback is not as important as the difference between allocated and actual usage.

Even here, the challenges for private cloud operators are greater. Let's assume a private cloud operator hosts 200 internal customers. Assume each of these internal customers is wasting 50% of their resource allocation. On an individual basis, the absolute value of the wasted resources may be insignificant. But across all 200 customers, the magnitude of the IT spend could be quite large. The greater good theory for IT would require that IT actually reduce resource usage for all the internal customers to claim additional savings for the company despite what the internal customers want. Private cloud operators must operate not only for their internal customers' needs, but also for the company's needs.

Setting rates for chargeback is the final tricky area for cloud operators. For a public cloud operator, the rate needs to be competitive, provide some profit margin and match customer's value. Easy enough. But what about private cloud operators? Once again, being a private operator makes things difficult. What are the rates for chargeback for a private cloud operator? Is the goal to set rates to make an internal profit when 50% of the VMs slots are filled? 75%? But if the internal cost center is making a profit, is that the right thing to do from a budgeting standpoint? Is the goal cost reclamation or efficiency?

The net of this is that for cloud operators, chargeback is critical. For private cloud operators, chargeback's purpose needs to be clearly defined to align IT not only with their customer's goals, but also the broader corporate goals.

Capacity Planning Impacts Revenue and Cost
Customers expecting to use a cloud service have high expectations with regards to time to deploy a service. For public clouds, this expectation will be a nearly instant deployment after the service request. Private cloud operators may not have quite an instantaneous expectation for their customers. Either way, the "acceptable" wait time of several weeks to deploy a new server is gone. Immediate is the word, not days.

To enable this immediate capability, sufficient capacity must be on hand to deploy new virtual machines based on both steady state and unexpected increases in demands. To meet this accelerated time duration, a high degree of capacity planning must take place to predict future capacity needs ahead of demand and allow for the slower process of procuring and installing physical servers, networks and storage.

It's easy to meet rapid deployment expectations by over procuring hardware. The danger here is that over procurement impacts cash flows and profitability for a cloud. Having large amounts of depreciating assets sitting around is not a sound business strategy. If these assets are plugged in and configured, the added power costs worsen an already bad situation. Under procuring hardware is just as bad since cloud operators will be unable to meet customer needs should their systems not have available capacity. Hence the goal is to have a solid understanding of consumption and then apply a safety factor to allow for unexpected demand.

Understanding capacity needs across the entire IT infrastructure is important. But it's just as important to understand where there are available VM slots for the best placement of VMs from a performance and a capacity standpoint. Utilizing available VM slot reporting ensures the performance of the running VMs and increases the VM density per host, which is critical to achieving a high return on assets.

Capacity planning is critical to cloud operators to generate a high return on assets while also meeting customer demand for near instantaneous deployment requests.

Tenant Reporting
With a standard virtualized environment, the IT organization may report on environment status to a few senior IT leaders. Not so with an infrastructure cloud. For public clouds and private clouds, there is a greater expectation of visibility into the environment. Reporting for cloud tenants could involve availability, resource allocation, resource utilization, current charges, and pricing plan. The amount of information revealed to a customer could depend on the business philosophy or type of cloud. For a private cloud, revealing differences between allocated resources and utilized resources and the savings a customer could achieve by reducing their resources allocation makes sense for a company trying to save money. For a public cloud provider, however, suggesting ways to reduce resource allocations may not be in the provider's interest.

Beyond questions around what type of information to provide is the manner in which information is provided. For public cloud operators, online portals are most likely the reporting distribution mechanism of choice. For private clouds, however, information needs to flow seamlessly into the enterprises existing reporting infrastructure. This could involve connections with SharePoint, with IT service management frameworks, internal portals, or simple email distribution of reports on an ongoing basis.

Optimization Is a Value Add
Many times, when virtual machines are first deployed, they are over allocated CPU, memory and storage. In a cloud deployment, this over allocation does not cost the cloud operator revenue. In fact, just the opposite occurs. The cloud operator deploys the requested resources, but despite not being used, the cloud operator still collects the revenue. As discussed earlier, the cloud operator can decide whether to reveal this to the tenant or not.

While the cloud operator may not care about wasted resources to an application, the end customer does as does the firm's CFO. Decreasing requested CPU, memory and storage reduces monthly tenant costs. Hence, optimization becomes an optional value-add service for the provider to offer tenants to reduce spending. This would be the equivalent of your cell phone company contacting you to suggest a lower monthly plan. While it lowers the monthly revenue of the cloud operator, it dramatically improves customer loyalty.

Conclusion
Because of the increased need for chargeback, monitoring, capacity planning, and reporting, capacity management takes on added urgency for cloud operators. Capacity management performed correctly enables cloud operators to maximize compute cycles delivered to a customer at the lowest possible cost and, thus, capacity management is a key building block for any cloud implementation.

More Stories By Bryan Semple

Bryan Semple is Chief Marketing Officer at VKernel. A 15+ year high-tech veteran, he has spent the last 8 years working in server and storage companies focused on virtualization technologies. He comes to VKernel from NetApp where he was the general manager of the storage virtualization business unit. Under his leadership, the group experienced record growth, expanded engineering operations to India, and built global awareness for NetApp’s industry leading storage virtualization solutions.

Prior to NetApp, Bryan was VP of Marketing at Onaro where he established the company as a leader in storage management software and built the marketing processes that supported the company’s profitability and successful acquisition by NetApp in 2008. Before Onaro, he was the VP of Product Marketing and Strategy at server blade virtualization pioneer Egenera. At Egenera, he worked with early adopters of infrastructure and server virtualization technologies in the financial services industry as the company scaled from one to several hundred customers.

Bryan holds a BS in Systems Engineering from the US Naval Academy and an MBA from Stanford University.

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


@CloudExpo Stories
The only place to be June 9-11 is Cloud Expo & @ThingsExpo 2015 East at the Javits Center in New York City. Join us there as delegates from all over the world come to listen to and engage with speakers & sponsors from the leading Cloud Computing, IoT & Big Data companies. Cloud Expo & @ThingsExpo are the leading events covering the booming market of Cloud Computing, IoT & Big Data for the enterprise. Speakers from all over the world will be hand-picked for their ability to explore the economic...
As the Internet of Things unfolds, mobile and wearable devices are blurring the line between physical and digital, integrating ever more closely with our interests, our routines, our daily lives. Contextual computing and smart, sensor-equipped spaces bring the potential to walk through a world that recognizes us and responds accordingly. We become continuous transmitters and receivers of data. In his session at Internet of @ThingsExpo, Andrew Bolwell, Director of Innovation for HP’s Printing a...
SYS-CON Events announced today that Cloudian, Inc., the leading provider of hybrid cloud storage solutions, has been named “Bronze Sponsor” of SYS-CON's 15th International Cloud Expo®, which will take place on November 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA. Cloudian is a Foster City, Calif.-based software company specializing in cloud storage. Cloudian HyperStore® is an S3-compatible cloud object storage platform that enables service providers and enterprises to bui...
SYS-CON Events announced today that Gridstore™, the leader in software-defined storage (SDS) purpose-built for Windows Servers and Hyper-V, will exhibit at SYS-CON's 15th International Cloud Expo®, which will take place on November 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA. Gridstore™ is the leader in software-defined storage purpose built for virtualization that is designed to accelerate applications in virtualized environments. Using its patented Server-Side Virtual C...
SYS-CON Events announces a new pavilion on the Cloud Expo floor where WebRTC converges with the Internet of Things. Pavilion will showcase WebRTC and the Internet of Things. The Internet of Things (IoT) is the most profound change in personal and enterprise IT since the creation of the Worldwide Web more than 20 years ago. All major researchers estimate there will be tens of billions devices--computers, smartphones, tablets, and sensors – connected to the Internet by 2020. This number will con...
When an enterprise builds a hybrid IaaS cloud connecting its data center to one or more public clouds, security is often a major topic along with the other challenges involved. Security is closely intertwined with the networking choices made for the hybrid cloud. Traditional networking approaches for building a hybrid cloud try to kludge together the enterprise infrastructure with the public cloud. Consequently this approach requires risky, deep "surgery" including changes to firewalls, subnets...
SAP is delivering break-through innovation combined with fantastic user experience powered by the market-leading in-memory technology, SAP HANA. In his General Session at 15th Cloud Expo, Thorsten Leiduck, VP ISVs & Digital Commerce, SAP, will discuss how SAP and partners provide cloud and hybrid cloud solutions as well as real-time Big Data offerings that help companies of all sizes and industries run better. SAP launched an application challenge to award the most innovative SAP HANA and SAP ...
The Internet of Things (IoT) promises to evolve the way the world does business; however, understanding how to apply it to your company can be a mystery. Most people struggle with understanding the potential business uses or tend to get caught up in the technology, resulting in solutions that fail to meet even minimum business goals. In his session at Internet of @ThingsExpo, Jesse Shiah, CEO / President / Co-Founder of AgilePoint Inc., will show what is needed to leverage the IoT to transform...
SYS-CON Events announced today that AIC, a leading provider of OEM/ODM server and storage solutions, will exhibit at SYS-CON's 16th International Cloud Expo®, which will take place on June 9-11, 2015, at the Javits Center in New York City, NY. AIC is a leading provider of both standard OTS, off-the-shelf, and OEM/ODM server and storage solutions. With expert in-house design capabilities, validation, manufacturing and production, AIC's broad selection of products are highly flexible and are conf...
We are all here because we are sold on the transformative promise of The Cloud. But what good is all of this ephemeral, on-demand infrastructure if your usage doesn't actually improve the agility and speed of your business? How must Operations adapt in order to avoid stifling your Cloud initiative? In his session at DevOps Summit, Damon Edwards, co-founder and managing partner of the DTO Solutions, will highlight the successful organizational, process, and tooling patterns of high-performing c...
SYS-CON Events announced today that O'Reilly Media has been named “Media Sponsor” of SYS-CON's 15th International Cloud Expo®, which will take place on November 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA. O'Reilly Media spreads the knowledge of innovators through its books, online services, magazines, and conferences. Since 1978, O'Reilly Media has been a chronicler and catalyst of cutting-edge development, homing in on the technology trends that really matter and spurri...
SYS-CON Events announced today that Stratogent will exhibit at SYS-CON's 15th International Cloud Expo®, which will take place on November 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA. Stratogent is a custom managed services organization based in San Mateo, California. We design, implement, and support mission critical infrastructure 24x7 on premises, in datacenters and in the Cloud. Since 2005, we have acted as an extension of internal IT teams, achieving a customer reten...
SYS-CON Events announced today that Aria Systems, the recurring revenue expert, has been named "Bronze Sponsor" of SYS-CON's 15th International Cloud Expo®, which will take place on November 4-6, 2014, at the Santa Clara Convention Center in Santa Clara, CA. Aria Systems helps leading businesses connect their customers with the products and services they love. Industry leaders like Pitney Bowes, Experian, AAA NCNU, VMware, HootSuite and many others choose Aria to power their recurring revenue bu...
The Transparent Cloud-computing Consortium (abbreviation: T-Cloud Consortium) will conduct research activities into changes in the computing model as a result of collaboration between "device" and "cloud" and the creation of new value and markets through organic data processing High speed and high quality networks, and dramatic improvements in computer processing capabilities, have greatly changed the nature of applications and made the storing and processing of data on the network commonplace. ...
Seagate has a strong track record of collaborating with others to develop better cloud solutions. The Seagate Cloud Builder Alliance program, for example, leverages the company’s knowledge of storage and cloud-optimized solutions to give cloud service providers the customized, flexible and scalable server and storage solutions to meet the high levels of service their customers demand. Seagate also is a member of the OpenStack Foundation and Open Compute Project to help define and promote open-so...
The Internet of Things (IoT) is going to require a new way of thinking and of developing software for speed, security and innovation. This requires IT leaders to balance business as usual while anticipating for the next market and technology trends. Cloud provides the right IT asset portfolio to help today’s IT leaders manage the old and prepare for the new. Today the cloud conversation is evolving from private and public to hybrid. This session will provide use cases and insights to reinforce t...
What process has your provider undertaken to ensure that the cloud tenant will receive predictable performance and service? What was involved in the planning? Who owns and operates the data center? What technology is being used? How is it being supported? In his session at 14th Cloud Expo, Dave Weisbrot, Cloud Business Manager for QTS, will provide the attendees a look into what it takes to stand up and stand behind a highly available certified cloud IaaS.
I'll be hosting an SAP HANA Cloud webinar at 11am eastern time, Wednesday, October 29. You can sign up now. Featured speakers will be Allan Adler, Managing Partner, Channel Cloud Consulting, and Thorsten Leiduck, VP ISVs & Digital Commerce, SAP. Attendees will learn about • Cloud economics, hybrid cloud strategy, market size and opportunity • Introduction to SAP HANA Cloud Platform and how to: - Build new next-generation applications - Extend on-premise solutions non-disruptively throu...
SYS-CON Events announced today that Gigaom Research has been named "Media Sponsor" of SYS-CON's 15th International Cloud Expo®, which will take place on November 4-6, 2014, at the Santa Clara Convention Center in Santa Clara, CA. Ashar Baig, Research Director, Cloud, at Gigaom Research, will also lead a Power Panel on the topic "Choosing the Right Cloud Option." Gigaom Research provides timely, in-depth analysis of emerging technologies for individual and corporate subscribers. Gigaom Research'...
How does one bridge the gap between traditional enterprise storage infrastructures and the private, hybrid, and public cloud? In his session at 15th Cloud Expo, Dan Pollack, Chief Architect of Storage Operations at AOL Inc., will exam the workload differences and required changes to reuse existing knowledge and components when building and using a cloud infrastructure. He will look into the operational considerations, tool requirements, and behavioral changes required for private cloud storage...