Welcome!

Cloud Expo Authors: Yeshim Deniz, Elizabeth White, Aria Blog, Roger Strukhoff, Pat Romanski

Related Topics: Cloud Expo, Java, SOA & WOA, Virtualization, GovIT

Cloud Expo: Blog Feed Post

MaaS – The Solution to Design, Map, Integrate and Publish Open Data

Data models can be shared, off-line tested and verified to define data designing requirements, data topology, performance, place

Open Data is data that can be freely used, reused and redistributed by anyone – subject only, at the most, to the requirement for attributes and sharealikes (Open Software Service Definition – OSSD). As a consequence, Open Data should create value and might have a positive impact in many different areas such as government (tax money expenditure), health (medical research, hospital acceptance by pathology), quality of life (air breathed in our city, pollution) or might influence public decisions like investments, public economy and expenditure. We are talking about services, so open data are services needed to connect the community with the public bodies. However, the required open data should be part of a design and then integrated, mapped, updated and published in a form, which is easy to use. MaaS is the Open Data driver and enables Open Data portability into the Cloud.

Introduction
Data models used as a service mainly provide the following topics:

  • Implementing and sharing data structure models;
  • Verifying data model properties according to private and public cloud requirements;
  • Designing and testing new query types. Specific query classes need to support heterogeneous data;
  • Designing of the data storage model. The model should enable query processing directly against databases to ensure privacy and secure changes from data updates and review;
  • Modeling data to predict usage “early”;
  • Portability, a central property when data is shared among fields of application;
  • Sharing, redistribution and participation of data among datasets and applications.

As a consequence, the data should be available as a whole and at a reasonable fee, preferably by finding, navigating and downloading over the Cloud. It should also be available in a usable and changeable form. This means modeling Open Data and then using the models to map location and usage, configuration, integration and changes along the Open Data lifecycle.

What is MaaS
Data models can be shared, off-line tested and verified to define data designing requirements, data topology, performance, placement and deployment. This means models themselves can be supplied as a service to allow providers to verify how and where data has to be designed to meet the Cloud service’s requisites: this is MaaS. As a consequence by using MaaS, Open Data designers can verify “on-premise” how and why datasets meet Open Data requirements. With this approach, Open Data models can be tuned on real usage and then mapped “on-premise” to the public body’s service. Further, MaaS inherits all the defined service’s properties and so the data model can be reused, shared and classified for new Open Data design and publication.

Open Data implementation is MaaS (Model as a Service) driven
Open Data is completely supported by data modeling and then MaaS completely supports Open Data. MaaS should be the first practice, helping to tune analysis and Open Data design. Furthermore, data models govern design, deployment, storage, changes, resources allocation, hence MaaS supports:

  • Applying Best Practice for Open Data design;
  • Classifying Open Data field of application;
  • Designing Open Data taxonomy and integration;
  • Guiding Open Data implementation;
  • Documenting data maturity and evolution by applying DaaS lifecycle.

Accordingly, Maas provides “on-premise” properties supporting Open Data design and publication:

  1. AnalysisWhat data are you planning to make open? When working with MaaS, a data model is used to perform data analysis. This means the Open Data designer might return to this step to correct, update and improve the incoming analysis: he always works on an “on-premise” data model. Analysis performed by model helps in identifying data integration and interoperability. The latter assists in choosing what data has to be published and in defining open datasets;
  2. DesignDuring the analysis step, the design is carried out too. The design can be changed and traced along the Open Data lifecycle. Remember that with MaaS the model is a service, and the data opened offers the designed service;
  3. Data securityData security becomes the key property to rule data access and navigation. MaaS plays a crucial role in data security: in fact, the models contain all the infrastructure properties and include information to classify accesses, classes of users, perimeters and risk mitigation assets. Models are the central way to enable data protection within the Open Data device;
  4. Participation - Because the goal is “everyone must be able to use Open Data”, participation is comprehensive of people and groups without any discrimination or restriction. Models contain data access rules and accreditations (open licensing).
  5. Mapping – The MaaS mapping property is important because many people can obtain the data after long navigation and several “bridges” connecting different fields of applications. Looking at this aspect, MaaS helps the Open Data designer to define the best initial “route” between transformation and aggregation linking different areas. Then continually engaging citizens, developers, sector’s expert, managers … helps in modifying the model to better update and scale Open Data contents: the easier it is for outsiders to discover data, the faster new and useful Open Data services will be built.
  6. OntologyDefining metadata vocabulary for describing ontologies. Starting from standard naming definition, data models provide grouping and reorganizing vocabulary for further metadata re-use, integration, maintenance, mapping and versioning;
  7. Portability – Models contain all the properties belonging to data in order that MaaS can enable Open Data service’s portability to the Cloud. The model is portable by definition and it can be generated to different database and infrastructures;
  8. Availability – The DaaS lifecycle assures structure validation in terms of MaaS accessibility;
  9. Reuse and distribution – Open Data can include merging with additional datasets belonging to other fields of application (for example, medical research vs. air pollution). Open Data built by MaaS has this advantage. Merging open datasets means merging models by comparing and synchronizing, old and new versions, if needed;
  10. Change Management and History – Data models are organized in libraries to preserve Open Data changes and history. Changes are traced and maintained to restore, if necessary, model and/or datasets;
  11. Redesign – Redesigning Open Data, means redesigning the model it belongs to: the  model drives the history of the changes;
  12. Fast BI – Publishing Open Data is an action strictly related to the BI process. Redesigning and publishing Open Data are two automated steps starting from the design of the data model and from its successive updates.

Conclusion
MaaS is the emerging solution for Open Data implementation. Open Data is public and private accessible data, designed to connect the social community with the public bodies. This data should be made available without restriction although it is placed under security and open licensing. In addition, Open Data is always up-to-date and transformation and aggregation have to be simple and time saving for inesperienced users. To achieve these goals, the Open Data service has to be model driven designed and providing data integration, interoperability, mapping, portability, availability, security, distribution, all properties assured by applying MaaS.

References
[1] N. Piscopo - ERwin® in the Cloud: How Data Modeling Supports Database as a Service (DaaS) Implementations
[2] N. Piscopo - CA ERwin® Data Modeler’s Role in the Relational Cloud
[3] N. Piscopo - DaaS Contract templates: main constraints and examples, in press
[4] D. Burbank, S. Hoberman - Data Modeling Made Simple with CA ERwin® Data Modeler r8
[7] N. Piscopo – Best Practices for Moving to the Cloud using Data Models in theDaaS Life Cycle
[8] N. Piscopo – Using CA ERwin® Data Modeler and Microsoft SQL Azure to Move Data to the Cloud within the DaaS Life Cycle
[9] The Open Software Service Definition (OSSD) at opendefinition.org

Read the original blog entry...

More Stories By Cloud Ventures

The Cloud Ventures Network is an expert community of leading Cloud pioneers. Follow our best practice blogs at http://CloudBestPractices.net

Cloud Expo Latest Stories
Hardware will never be more valuable than on the day it hits your loading dock. Each day new servers are not deployed to production the business is losing money. While Moore’s Law is typically cited to explain the exponential density growth of chips, a critical consequence of this is rapid depreciation of servers. The hardware for clustered systems (e.g., Hadoop, OpenStack) tends to be significant capital expenses. In his session at 15th Cloud Expo, Mason Katz, CTO and co-founder of StackIQ, to discuss how infrastructure teams should be aware of the capitalization and depreciation model of these expenses to fully understand when and where automation is critical.
Over the last few years the healthcare ecosystem has revolved around innovations in Electronic Health Record (HER) based systems. This evolution has helped us achieve much desired interoperability. Now the focus is shifting to other equally important aspects – scalability and performance. While applying cloud computing environments to the EHR systems, a special consideration needs to be given to the cloud enablement of Veterans Health Information Systems and Technology Architecture (VistA), i.e., the largest single medical system in the United States.
In his session at 15th Cloud Expo, Mark Hinkle, Senior Director, Open Source Solutions at Citrix Systems Inc., will provide overview of the open source software that can be used to deploy and manage a cloud computing environment. He will include information on storage, networking(e.g., OpenDaylight) and compute virtualization (Xen, KVM, LXC) and the orchestration(Apache CloudStack, OpenStack) of the three to build their own cloud services. Speaker Bio: Mark Hinkle is the Senior Director, Open Source Solutions, at Citrix Systems Inc. He joined Citrix as a result of their July 2011 acquisition of Cloud.com where he was their Vice President of Community. He is currently responsible for Citrix open source efforts around the open source cloud computing platform, Apache CloudStack and the Xen Hypervisor. Previously he was the VP of Community at Zenoss Inc., a producer of the open source application, server, and network management software, where he grew the Zenoss Core project to over 10...
Most of today’s hardware manufacturers are building servers with at least one SATA Port, but not every systems engineer utilizes them. This is considered a loss in the game of maximizing potential storage space in a fixed unit. The SATADOM Series was created by Innodisk as a high-performance, small form factor boot drive with low power consumption to be plugged into the unused SATA port on your server board as an alternative to hard drive or USB boot-up. Built for 1U systems, this powerful device is smaller than a one dollar coin, and frees up otherwise dead space on your motherboard. To meet the requirements of tomorrow’s cloud hardware, Innodisk invested internal R&D resources to develop our SATA III series of products. The SATA III SATADOM boasts 500/180MBs R/W Speeds respectively, or double R/W Speed of SATA II products.
14th International Cloud Expo, held on June 10–12, 2014 at the Javits Center in New York City, featured three content-packed days with a rich array of sessions about the business and technical value of cloud computing, Internet of Things, Big Data, and DevOps led by exceptional speakers from every sector of the IT ecosystem. The Cloud Expo series is the fastest-growing Enterprise IT event in the past 10 years, devoted to every aspect of delivering massively scalable enterprise IT as a service.
As more applications and services move "to the cloud" (public or on-premise) cloud environments are increasingly adopting and building out traditional enterprise features. This in turn is enabling and encouraging cloud adoption from enterprise users. In many ways the definition is blurring as features like continuous operation, geo-distribution or on-demand capacity become the norm. NuoDB is involved in both building enterprise software and using enterprise cloud capabilities. In his session at 15th Cloud Expo, Seth Proctor, CTO at NuoDB, Inc., will discuss the experiences from building, deploying and using enterprise services and suggest some ways to approach moving enterprise applications into a cloud model.
Until recently, many organizations required specialized departments to perform mapping and geospatial analysis, and they used Esri on-premise solutions for that work. In his session at 15th Cloud Expo, Dave Peters, author of the Esri Press book Building a GIS, System Architecture Design Strategies for Managers, will discuss how Esri has successfully included the cloud as a fully integrated SaaS expansion of the ArcGIS mapping platform. Organizations that have incorporated Esri cloud-based applications and content within their business models are reaping huge benefits by directly leveraging cloud-based mapping and analysis capabilities within their existing enterprise investments. The ArcGIS mapping platform includes cloud-based content management and information resources to more widely, efficiently, and affordably deliver real-time actionable information and analysis capabilities to your organization.
Almost everyone sees the potential of Internet of Things but how can businesses truly unlock that potential. The key will be in the ability to discover business insight in the midst of an ocean of Big Data generated from billions of embedded devices via Systems of Discover. Businesses will also need to ensure that they can sustain that insight by leveraging the cloud for global reach, scale and elasticity. In his session at Internet of @ThingsExpo, Mac Devine, Distinguished Engineer at IBM, will discuss bringing these three elements together via Systems of Discover.
Cloud and Big Data present unique dilemmas: embracing the benefits of these new technologies while maintaining the security of your organization’s assets. When an outside party owns, controls and manages your infrastructure and computational resources, how can you be assured that sensitive data remains private and secure? How do you best protect data in mixed use cloud and big data infrastructure sets? Can you still satisfy the full range of reporting, compliance and regulatory requirements? In his session at 15th Cloud Expo, Derek Tumulak, Vice President of Product Management at Vormetric, will discuss how to address data security in cloud and Big Data environments so that your organization isn’t next week’s data breach headline.
The cloud is everywhere and growing, and with it SaaS has become an accepted means for software delivery. SaaS is more than just a technology, it is a thriving business model estimated to be worth around $53 billion dollars by 2015, according to IDC. The question is – how do you build and scale a profitable SaaS business model? In his session at 15th Cloud Expo, Jason Cumberland, Vice President, SaaS Solutions at Dimension Data, will give the audience an understanding of common mistakes businesses make when transitioning to SaaS; how to avoid them; and how to build a profitable and scalable SaaS business.
SYS-CON Events announced today that Gridstore™, the leader in software-defined storage (SDS) purpose-built for Windows Servers and Hyper-V, will exhibit at SYS-CON's 15th International Cloud Expo®, which will take place on November 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA. Gridstore™ is the leader in software-defined storage purpose built for virtualization that is designed to accelerate applications in virtualized environments. Using its patented Server-Side Virtual Controller™ Technology (SVCT) to eliminate the I/O blender effect and accelerate applications Gridstore delivers vmOptimized™ Storage that self-optimizes to each application or VM across both virtual and physical environments. Leveraging a grid architecture, Gridstore delivers the first end-to-end storage QoS to ensure the most important App or VM performance is never compromised. The storage grid, that uses Gridstore’s performance optimized nodes or capacity optimized nodes, starts with as few a...
SYS-CON Events announced today that Solgenia, the global market leader in Cloud Collaboration and Cloud Infrastructure software solutions, will exhibit at SYS-CON's 15th International Cloud Expo®, which will take place on November 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA. Solgenia is the global market leader in Cloud Collaboration and Cloud Infrastructure software solutions. Designed to “Bridge the Gap” between personal and professional social, mobile and cloud user experiences, our solutions help large and medium-sized organizations dramatically improve productivity, reduce collaboration costs, and increase the overall enterprise value by bringing collaboration and infrastructure solutions to the cloud.
Cloud computing started a technology revolution; now DevOps is driving that revolution forward. By enabling new approaches to service delivery, cloud and DevOps together are delivering even greater speed, agility, and efficiency. No wonder leading innovators are adopting DevOps and cloud together! In his session at DevOps Summit, Andi Mann, Vice President of Strategic Solutions at CA Technologies, will explore the synergies in these two approaches, with practical tips, techniques, research data, war stories, case studies, and recommendations.
Enterprises require the performance, agility and on-demand access of the public cloud, and the management, security and compatibility of the private cloud. The solution? In his session at 15th Cloud Expo, Simone Brunozzi, VP and Chief Technologist(global role) for VMware, will explore how to unlock the power of the hybrid cloud and the steps to get there. He'll discuss the challenges that conventional approaches to both public and private cloud computing, and outline the tough decisions that must be made to accelerate the journey to the hybrid cloud. As part of the transition, an Infrastructure-as-a-Service model will enable enterprise IT to build services beyond their data center while owning what gets moved, when to move it, and for how long. IT can then move forward on what matters most to the organization that it supports – availability, agility and efficiency.
Every healthy ecosystem is diverse. This is especially true in cloud ecosystems, where portability and interoperability are more important than old enterprise models of proprietary ownership. In his session at 15th Cloud Expo, Mark Baker, Server Product Manager at Canonical/Ubuntu, will discuss how single vendors used to take the lead in creating and delivering technology, but in a cloud economy, where users want tools of their preference, when and where they need them, it makes no sense.