Welcome!

@CloudExpo Authors: Yeshim Deniz, Liz McMillan, Elizabeth White, Pat Romanski, Stackify Blog

Related Topics: @CloudExpo, Containers Expo Blog, @DevOpsSummit

@CloudExpo: Blog Feed Post

Docker Cloud Monitoring and Logging By @Sematext | @CloudExpo #Cloud #Containers

Docker Cloud is a container management service that supports multiple cloud providers

Docker Cloud Monitoring and Logging
By Stefan Thies

Docker Cloud is a hosted service for Docker Container Management, originally based on Tutum Cloud, which was acquired by Docker in October 2015. Sematext supported the deployment of Sematext Docker Agent on Tutum Cloud from the get-go, so naturally we were quick to add support for Docker Cloud as well.

What is Docker Cloud?
Docker Cloud is a container management service that supports multiple cloud providers such as Amazon, DigitalOcean, IBM Softlayer, MS Azure and Packet.net. This makes it much easier to switch Docker deployments to different cloud providers or use a mix of providers including on-premises nodes for hybrid cloud applications. The user interface in Docker Cloud makes it easy to manage nodes on all supported cloud platforms and is able to deploy application stacks in containers, defined in a "Stack YAML" file. This Stack files are very similar to Docker Compose files, but with additional options, e.g. to define deployment strategies for the containers. The graphical user interface helps to view and modify container configurations.

Docker Cloud Metrics & Logs
Once containers are deployed you can get a very basic real-time log stream view per container (see below). This is helpful for a quick glance at the most recent logs of a specific container.

Real-time log view in Docker Cloud

Real-time log view in Docker Cloud

There are currently no Docker metrics exposed anywhere in Docker Cloud, though that will surely be added with time. Docker Cloud does an excellent job for the "Build, Ship and Run" containers paradigm. But if you've ever built a production system you know there is more to it. There is this little wrinkle called Operations. So let's talk about the more realistic scenario - "Build, Ship, Run and Monitor"

Thanks to the Docker API, it is possible to add this functionality to Docker Cloud. Sematext Docker Agent is a small container that collects all Docker metrics, all app and Docker logs and all Docker events from Docker Cloud and together with SPM for Performance Monitoring and Logsene for Log Management and Analytics it provides advanced Performance Monitoring and Log Management functionality for stacks deployed in Docker Cloud:

  1. Detailed Metrics with a long retention time. Having detailed metrics helps optimize resource usage of applications. Detailed metrics let you set application-specific alerts for any critical resources your applications depends on. Metrics are aggregated for all hosts, images and containers and filterable by hosts, images, and containers. This lets you drill down from a cluster view down to a single container while troubleshooting or simply understand operations details. Long retention times for metrics make it possible to compare resources before and after different deployments and releases or troubleshoot problems that appear only when a service has been running over several days or weeks!
  2. Full-text search, filtering, and analytics across all containers. Logs are collected, parsed and shipped by Sematext Docker Agent. The integrated charting functions in Logsene and integrations for Kibana and Grafana make it easy to analyze logs collected in Docker Cloud. In short, you can use Logsene as a "super grep" for your Docker and application logs, but also as amuch more affordable Splunk or any other BI tool, or a managed Elastic stack (aka ELK).
  3. Long retention time for logs, metrics and events. Comparing metrics and logs during deployments or watching the performance under different workloads requires one to store logs and metrics for a reasonable time. We have seen cases where memory leaks started to get serious after a few weeks of stable operations, although initially they were not detected. In such a case all context information like logs, events and metrics could be very valuable in identifying the root cause of such problems.
  4. Tracking of all Docker Events. Tracking of all Docker Events gives you a clear view of your containers' life cycle. For example, by collecting Events you gain insight into what happens with your containers during (re)deployments or the re-scheduling of containers to different nodes. Some containers might be configured for automatic restarts and the events could indicate if container processes crash frequently. In case of out-of-memory events, it might be wise to modify the memory limits or check with the developers, why this event happened.
  5. Anomaly detection and alerts for all logs and metrics. Who wants to watch metrics and logs all day long? Not me! Let the monitoring system watch outliers for metrics or query your logs! Anomaly detection can help reduce the noise and alert fatigue often caused by classic threshold-based alerts. Even log-alerting is possible with Logsene e.g. to detect anomalies in the log frequency of a specific query. For example, a search for "error" in the system might normally return a dozen non-critical errors, which could be ignored. A growth in the log frequency of error messages indicates that something might be going wrong. Another type of alerts is the Heartbeat alert for all cluster nodes. Disk Space alerts are very useful for Docker nodes, because Docker images might consume a lot of disk space. Docker Cloud runs some cleanup agents to remove unused containers and images; nevertheless the default disk-space alert created by SPM gives you an early warning before the capacity limit is reached.

Here's a short video about Log Management and Monitoring for Docker. It gives a general overview of Monitoring and Log Management in Docker context, plus shows you how to use SPM and Logsene as a single pane of glass for your Docker metrics and logs.

Having all this operational insights, and having it in a single pane of glass makes everyone's work (and that means life, too) simpler. We all want that, no? With that in mind, we've made sure the Sematext Docker Agent setup is super quick and easy:

  1. Get a free account at apps.sematext.com, if you don't have one already
  2. Create an SPM App of type "Docker" to obtain the SPM Application Token and/or
    Create a Logsene App to obtain the Logsene Application Token
  3. Click the "Deploy to Cloud" button in Sematext UI and copy the generated token into the Stackfile text field in Docker Cloud

    Create SPM app and deploy to Docker Cloud

    Create SPM app and deploy to Docker Cloud

  4. As soon you click "Create and deploy" in Docker Cloud the Sematext Docker Agent will be pulled from Docker Hub and it will start on all nodes managed by Docker Cloud. A few seconds later you should see Events, Logs & Metrics in SPM & Logsene.

Docker Cloud Metrics Overview in SPM

Docker Events and Metrics in SPM

Structured Docker Logs in Logsene / Kibana discover view

Everything mentioned above could take you just 10-15 minutes, meaning that in 10-15 you could be looking at charts with all your Docker operations data in one place, accessible by your whole team!

If you have feedback for monitoring and logging on Docker Cloud get in touch with us via @sematext or email us at [email protected] - we love to talk about monitoring and logging and appreciate user feedback that helps us improve our services and make them better serve your needs. If you want to try SPM or Logsene, start here.

Read the original blog entry...

More Stories By Sematext Blog

Sematext is a globally distributed organization that builds innovative Cloud and On Premises solutions for performance monitoring, alerting and anomaly detection (SPM), log management and analytics (Logsene), and search analytics (SSA). We also provide Search and Big Data consulting services and offer 24/7 production support for Solr and Elasticsearch.

@CloudExpo Stories
SYS-CON Events announced today that Grape Up will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct. 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Grape Up is a software company specializing in cloud native application development and professional services related to Cloud Foundry PaaS. With five expert teams that operate in various sectors of the market across the U.S. and Europe, Grape Up works with a variety of customers from emergi...
As DevOps methodologies expand their reach across the enterprise, organizations face the daunting challenge of adapting related cloud strategies to ensure optimal alignment, from managing complexity to ensuring proper governance. How can culture, automation, legacy apps and even budget be reexamined to enable this ongoing shift within the modern software factory?
SYS-CON Events announced today that Elastifile will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 - Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Elastifile Cloud File System (ECFS) is software-defined data infrastructure designed for seamless and efficient management of dynamic workloads across heterogeneous environments. Elastifile provides the architecture needed to optimize your hybrid cloud environment, by facilitating efficient...
SYS-CON Events announced today that Grape Up will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct. 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Grape Up is a software company specializing in cloud native application development and professional services related to Cloud Foundry PaaS. With five expert teams that operate in various sectors of the market across the U.S. and Europe, Grape Up works with a variety of customers from emergi...
@DevOpsSummit at Cloud Expo taking place Oct 31 - Nov 2, 2017, at the Santa Clara Convention Center, Santa Clara, CA, is co-located with the 21st International Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. The widespread success of cloud computing is driving the DevOps revolution in enterprise IT. Now as never before, development teams must communicate and collaborate in a dynamic, 24/7/365 environment. There is ...
SYS-CON Events announced today that Golden Gate University will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Since 1901, non-profit Golden Gate University (GGU) has been helping adults achieve their professional goals by providing high quality, practice-based undergraduate and graduate educational programs in law, taxation, business and related professions. Many of its courses are taug...
DevOps at Cloud Expo, taking place October 31 - November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA, is co-located with 21st Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. The widespread success of cloud computing is driving the DevOps revolution in enterprise IT. Now as never before, development teams must communicate and collaborate in a dynamic, 24/7/365 environment. There is no time to w...
When shopping for a new data processing platform for IoT solutions, many development teams want to be able to test-drive options before making a choice. Yet when evaluating an IoT solution, it’s simply not feasible to do so at scale with physical devices. Building a sensor simulator is the next best choice; however, generating a realistic simulation at very high TPS with ease of configurability is a formidable challenge. When dealing with multiple application or transport protocols, you would be...
With Cloud Foundry you can easily deploy and use apps utilizing websocket technology, but not everybody realizes that scaling them out is not that trivial. In his session at 21st Cloud Expo, Roman Swoszowski, CTO and VP, Cloud Foundry Services, at Grape Up, will show you an example of how to deal with this issue. He will demonstrate a cloud-native Spring Boot app running in Cloud Foundry and communicating with clients over websocket protocol that can be easily scaled horizontally and coordinate...
yperConvergence came to market with the objective of being simple, flexible and to help drive down operating expenses. It reduced the footprint by bundling the compute/storage/network into one box. This brought a new set of challenges as the HyperConverged vendors are very focused on their own proprietary building blocks. If you want to scale in a certain way, let’s say you identified a need for more storage and want to add a device that is not sold by the HyperConverged vendor, forget about it....
21st International Cloud Expo, taking place October 31 - November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA, will feature technical sessions from a rock star conference faculty and the leading industry players in the world. Cloud computing is now being embraced by a majority of enterprises of all sizes. Yesterday's debate about public vs. private has transformed into the reality of hybrid cloud: a recent survey shows that 74% of enterprises have a hybrid cloud strategy. Me...
DevOps at Cloud Expo, taking place October 31 - November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA, is co-located with 21st Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. The widespread success of cloud computing is driving the DevOps revolution in enterprise IT. Now as never before, development teams must communicate and collaborate in a dynamic, 24/7/365 environment. There is no time to w...
Vulnerability management is vital for large companies that need to secure containers across thousands of hosts, but many struggle to understand how exposed they are when they discover a new high security vulnerability. In his session at 21st Cloud Expo, John Morello, CTO of Twistlock, will address this pressing concern by introducing the concept of the “Vulnerability Risk Tree API,” which brings all the data together in a simple REST endpoint, allowing companies to easily grasp the severity of t...
Recently, WebRTC has a lot of eyes from market. The use cases of WebRTC are expanding - video chat, online education, online health care etc. Not only for human-to-human communication, but also IoT use cases such as machine to human use cases can be seen recently. One of the typical use-case is remote camera monitoring. With WebRTC, people can have interoperability and flexibility for deploying monitoring service. However, the benefit of WebRTC for IoT is not only its convenience and interopera...
In his session at 20th Cloud Expo, Scott Davis, CTO of Embotics, discussed how automation can provide the dynamic management required to cost-effectively deliver microservices and container solutions at scale. He also discussed how flexible automation is the key to effectively bridging and seamlessly coordinating both IT and developer needs for component orchestration across disparate clouds – an increasingly important requirement at today’s multi-cloud enterprise.
Connecting to major cloud service providers is becoming central to doing business. But your cloud provider’s performance is only as good as your connectivity solution. Massive Networks will place you in the driver's seat by exposing how you can extend your LAN from any location to include any cloud platform through an advanced high-performance connection that is secure and dedicated to your business-critical data. In his session at 21st Cloud Expo, Paul Mako, CEO & CIO of Massive Networks, wil...
WebRTC is great technology to build your own communication tools. It will be even more exciting experience it with advanced devices, such as a 360 Camera, 360 microphone, and a depth sensor camera. In his session at @ThingsExpo, Masashi Ganeko, a manager at INFOCOM Corporation, will introduce two experimental projects from his team and what they learned from them. "Shotoku Tamago" uses the robot audition software HARK to track speakers in 360 video of a remote party. "Virtual Teleport" uses a...
Any startup has to have a clear go –to-market strategy from the beginning. Similarly, any data science project has to have a go to production strategy from its first days, so it could go beyond proof-of-concept. Machine learning and artificial intelligence in production would result in hundreds of training pipelines and machine learning models that are continuously revised by teams of data scientists and seamlessly connected with web applications for tenants and users.
When shopping for a new data processing platform for IoT solutions, many development teams want to be able to test-drive options before making a choice. Yet when evaluating an IoT solution, it’s simply not feasible to do so at scale with physical devices. Building a sensor simulator is the next best choice; however, generating a realistic simulation at very high TPS with ease of configurability is a formidable challenge. When dealing with multiple application or transport protocols, you would be...
IT organizations are moving to the cloud in hopes to approve efficiency, increase agility and save money. Migrating workloads might seem like a simple task, but what many businesses don’t realize is that application migration criteria differs across organizations, making it difficult for architects to arrive at an accurate TCO number. In his session at 21st Cloud Expo, Joe Kinsella, CTO of CloudHealth Technologies, will offer a systematic approach to understanding the TCO of a cloud application...