Welcome!

@CloudExpo Authors: Liz McMillan, Zakia Bouachraoui, Elizabeth White, Pat Romanski, Carmen Gonzalez

Related Topics: @CloudExpo, Microservices Expo, Containers Expo Blog, Agile Computing, @DXWorldExpo, @DevOpsSummit

@CloudExpo: Article

Black Friday Business Rescue | @CloudExpo #APM #BigData #Microservices

Black Friday and Cyber Monday are the most important days in the retailer’s year

Last-Minute Black Friday Business Rescue and Cyber Monday Readiness

In order to be ready for the Christmas season, online retailers typically bring their shops into shape right before Black Friday. Together with Cyber Monday this is the most important day in the retailer's year.

Stilnest.com (@Stilnest) is a publishing house for designer jewelry, running their online shop on Magento. While the guys at Stilnest did a good job of preparing their environment, the interest in their products and, therefore, the traffic on their site, was much higher than expected. The shop even went down after one of the YouTube stars released a new video showing off her new jewelry line powered by Stilnest.

In this classical "War Room" situation I worked together with Stilnest to find the root cause(s). Good news is that we worked out a solution and brought the shop online again - just in time for Black Friday. This blog summarizes the technical details on what went wrong in their case. I hope you find this useful and may start checking your own environment - the holiday shopping season is not yet over:

  • Fixed environmental settings: Processes per CPU core
  • Optimized number of database statements per request
  • Speed up database connection
  • Optimized third-party modules (Magento marketplace)

A Quick Overview of Their Environment
The shop runs in AWS. A load balancer distributes the incoming requests to different servers, organized in an auto scaling group. Each server runs one instance of Nginx and three instances of PHP-FPM. Another server running Varnish is used for caching, while the PHP processes connect to a MySQL database, hosted by AWS.

What They Did to Prepare for Black Friday
First thing you want to confirm is that the servers running Nginx and PHP are not overloaded. The general rule for Nginx is one worker process per CPU core. In our setting we are using EC2 instances with four cores and start 1 Nginx worker per instance. The other three cores are reserved for PHP. In PHP there is no limitation on CPU cores; you can run as many child processes as your memory allows. Check the memory consumption of your application and specify the number of child processes accordingly.

Then ensure your auto scaling group in EC2 is configured properly. Several metrics can be perfectly used for this purpose, like CPU and memory consumption. But there are more relevant metrics that you should consider:

  • page load time
  • request rates
  • concurrent users
  • user experience

APM tools like Dynatrace provide these data in a most convenient way to use them in the EC2 auto scaling criteria.

These charts demonstrate the scaled environment under different load conditions:

Web requests over time

Server instances at low load

Click here for the full article

More Stories By Harald Zeitlhofer

Harald Zeitlhofer has 15+ years of experience as an architect and developer of enterprise ERP solutions and web applications with a main focus on efficient and performant business processes, usability and application design. As a Technology Strategist in Dynatrace's Centre of Excellence team he influences the Dynatrace product strategy by working closely with customers and driving their performance management and improvement at the front line. He is a frequent speaker at conferences and meetup groups around the world. Follow him @HZeitlhofer

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


CloudEXPO Stories
ScaleMP is presenting at CloudEXPO 2019, held June 24-26 in Santa Clara, and we’d love to see you there. At the conference, we’ll demonstrate how ScaleMP is solving one of the most vexing challenges for cloud — memory cost and limit of scale — and how our innovative vSMP MemoryONE solution provides affordable larger server memory for the private and public cloud. Please visit us at Booth No. 519 to connect with our experts and learn more about vSMP MemoryONE and how it is already serving some of the world’s largest data centers. Click here to schedule a meeting with our experts and executives.
At CloudEXPO Silicon Valley, June 24-26, 2019, Digital Transformation (DX) is a major focus with expanded DevOpsSUMMIT and FinTechEXPO programs within the DXWorldEXPO agenda. Successful transformation requires a laser focus on being data-driven and on using all the tools available that enable transformation if they plan to survive over the long term. A total of 88% of Fortune 500 companies from a generation ago are now out of business. Only 12% still survive. Similar percentages are found throughout enterprises of all sizes.
When you're operating multiple services in production, building out forensics tools such as monitoring and observability becomes essential. Unfortunately, it is a real challenge balancing priorities between building new features and tools to help pinpoint root causes. Linkerd provides many of the tools you need to tame the chaos of operating microservices in a cloud native world. Because Linkerd is a transparent proxy that runs alongside your application, there are no code changes required. It even comes with Prometheus to store the metrics for you and pre-built Grafana dashboards to show exactly what is important for your services - success rate, latency, and throughput.
In his general session at 21st Cloud Expo, Greg Dumas, Calligo’s Vice President and G.M. of US operations, discussed the new Global Data Protection Regulation and how Calligo can help business stay compliant in digitally globalized world. Greg Dumas is Calligo's Vice President and G.M. of US operations. Calligo is an established service provider that provides an innovative platform for trusted cloud solutions. Calligo’s customers are typically most concerned about GDPR compliance, application performance guarantees & data privacy.
Modern software design has fundamentally changed how we manage applications, causing many to turn to containers as the new virtual machine for resource management. As container adoption grows beyond stateless applications to stateful workloads, the need for persistent storage is foundational - something customers routinely cite as a top pain point. In his session at @DevOpsSummit at 21st Cloud Expo, Bill Borsari, Head of Systems Engineering at Datera, explored how organizations can reap the benefits of the cloud without losing performance as containers become the new paradigm.