Welcome!

@CloudExpo Authors: Yeshim Deniz, Elizabeth White, Liz McMillan, Zakia Bouachraoui, Pat Romanski

Related Topics: @CloudExpo, Microservices Expo, @DXWorldExpo

@CloudExpo: Article

Cloud Native Applications | @CloudExpo #BigData #DataLake #Microservices

There are two important such design patterns that are emerging in the cloud era - microservices and data lake

Cloud Native Applications
As the cloud becomes more of the norm as part of enterprise computing, enterprises now have to deal with the issue of how to ensure that applications effectively use the attributes of cloud. There are monolithic applications from the previous era that are continuing to be migrated to the cloud using a lift and shift approach. With minimal changes, they do benefit from certain attributes of cloud like availability and management, but there is also a new set of application architecture emerging, namely the ‘cloud native applications.'

A cloud native application is designed to take the best advantage of its deployment to a cloud platform. By adopting cloud native applications they improve their agility in the way they build, deploy and manage their applications.

The design patterns of the existing application architecture don't fully support the concept of cloud native applications and require new set of design patterns. There are two important such design patterns that are emerging in the cloud era.

  1. Microservices
  2. Data Lake

While both represent two different layers of architecture, there are some ways both are interrelated as explained below.

Microservices
A lot of information is available about microservices. Over the years multi-tiered and tightly coupled applications have grown and are typically called as "monolithic" applications. However, these "monolithic" applications are not able to take full advantage of cloud platforms and it's difficult to scale them at a component level. Another issue is that these applications tend to have single point of failures due to tight coupling in nature. Microservices address this issue with the design of self-contained services that can be deployed, versioned and scaled independently with the rest of the components of the application. A cloud native approach to application design warrant a microservices design approach where by decoupled federation of services makes up an application. Microservices also provide well-defined interfaces based on industry standard protocols so that they can interface with each other.

Data Lake
In the era of Big Data, a data lake is an enterprise-wide repository of data of any size, type and format. Typically data lake repositories are built on a Hadoop HDFS based file system.

  • Unlike a typical data warehouse or an operational data store, data store does not require data validation and cleansing before it gets loaded.
  • A data lake does not require Schema On Write, but rather go by Schema On Read. However the data producing applications can use their own schema while pushing data to the data lake.
  • A data lakecannot use the traditional JOINS to relate data, but depend on complex processing capability to provide results at low latency.

Issues with Microservices and Usage of DataLake
While there are advantages in using a microservices pattern from a cloud native applications, it does have its limitations. As per the definition of microservices, they have to be self-contained, which means that each microservice has to have a separate data store of its own and can't use a shared database as is typically done in monolithic applications.

Having separate databases per microservice typically makes the data integration and application-wide data access across microservices very difficult. Here is where the concept of data lake comes in handy.

By ensuring that the microservices are managing their own data, if they also write their persistent data to an enterprise-wide data lake, then the data lake can be utilized for centralized data access and integration needs.

The following diagram shows how microservices and data lake are related from enterprise application design perspective.

Microservices and DataLake Support in Major Cloud Platforms
All major cloud platforms started supporting both microservices and data lake as part of their offerings so that the enterprises can build cloud native applications. For example in Microsoft Azure the following support is supported.

  • Service Fabric enables you to build and manage scalable and reliable applications composed of microservices running at a Service Fabric cluster.
  • Azure Service Fabric offers two high-level frameworks for building services: the Reliable Services API and the Reliable Actors API.

From a data lake perspective, Microsoft Azure provides the following services.

  • Azure Data Lake includes all the capabilities required to make it easy for developers, data scientists and analysts to store data of any size, shape and speed and do all types of processing and analytics across platforms
  • With Data Lake Analytics, use U-SQL, a query language which blends the declarative nature of SQL with the expressive power of C#.

It may require separate write-ups to go into detail about these two services. But the point is that cloud providers support the design patterns for cloud native applications and organizations can appropriately use them.

More Stories By Srinivasan Sundara Rajan

Highly passionate about utilizing Digital Technologies to enable next generation enterprise. Believes in enterprise transformation through the Natives (Cloud Native & Mobile Native).

CloudEXPO Stories
Even if your IT and support staff are well versed in agility and cloud technologies, it can be an uphill battle to establish a DevOps style culture - one where continuous improvement of both products and service delivery is expected and respected and all departments work together throughout a client or service engagement. As a service-oriented provider of cloud and data center technology, Green House Data sought to create more of a culture of innovation and continuous improvement, from our helpdesk on to our product development and cloud service teams. Learn how the Chief Executive team helped guide managers and staff towards this goal with metrics to measure progress, staff hiring or realignment, and new technologies and certifications.
Technology has changed tremendously in the last 20 years. From onion architectures to APIs to microservices to cloud and containers, the technology artifacts shipped by teams has changed. And that's not all - roles have changed too. Functional silos have been replaced by cross-functional teams, the skill sets people need to have has been redefined and the tools and approaches for how software is developed and delivered has transformed. When we move from highly defined rigid roles and systems to more fluid ones, we gain agility at the cost of control. But where do we want to keep control? How do we take advantage of all these new changes without losing the ability to efficiently develop and ship great software? And how should program and project managers adapt?
When Enterprises started adopting Hadoop-based Big Data environments over the last ten years, they were mainly on-premise deployments. Organizations would spin up and manage large Hadoop clusters, where they would funnel exabytes or petabytes of unstructured data.However, over the last few years the economics of maintaining this enormous infrastructure compared with the elastic scalability of viable cloud options has changed this equation. The growth of cloud storage, cloud-managed big data environments, and cloud data warehouses like Snowflake, Redshift, BigQuery and Azure SQL DW, have given the cloud its own gravity - pulling data from existing environments. In this presentation we will discuss this transition, describe the challenges and solutions for creating the data flows necessary to move to cloud analytics, and provide real-world use-cases and benefits obtained through adop...
Docker and Kubernetes are key elements of modern cloud native deployment automations. After building your microservices, common practice is to create docker images and create YAML files to automate the deployment with Docker and Kubernetes. Writing these YAMLs, Dockerfile descriptors are really painful and error prone.Ballerina is a new cloud-native programing language which understands the architecture around it - the compiler is environment aware of microservices directly deployable into infrastructures like Docker and Kubernetes.
Your applications have evolved, your computing needs are changing, and your servers have become more and more dense. But your data center hasn't changed so you can't get the benefits of cheaper, better, smaller, faster... until now. Colovore is Silicon Valley's premier provider of high-density colocation solutions that are a perfect fit for companies operating modern, high-performance hardware. No other Bay Area colo provider can match our density, operating efficiency, and ease of scalability.