By David Smith | September 19, 2012 11:00 AM EDT
This guest post is by Alex Guazzelli, VP of Analytics at Zementis Inc. -- ed.
PMML, the Predictive Model Markup Language, is the de facto standard for
representing predictive analytics and data mining models. Because PMML is an
open, vendor-neutral format, a predictive solution can be moved from one system
to another without running into proprietary issues and incompatibilities.
Companies around the globe use PMML to put their predictive solutions to work
right away. With PMML, there is no need for custom coding: you can move your
solution from the scientist’s desktop, where it was built, to the production
environment, where it is operationally deployed. Companies also use PMML as the
common language between service providers and external vendors. In this way, it
defines a single and clear process for the exchange of predictive solutions. It
becomes the bridge not only between data analysis, model building, and
deployment systems, but also between all the people and teams involved in the
analytical process. This matters because PMML is also used to disseminate
knowledge and best practices, and to ensure transparency.
All the top analytical tools, commercial and open-source, support PMML. The
language itself has reached a high level of maturity and refinement. PMML 4.1,
its latest version, lets predictive solutions be represented in an open and
standard way. With PMML, you can represent a myriad of pre- and post-processing
steps in addition to the predictive modeling techniques themselves. PMML 4.1
allows multiple models (model composition, chaining, segmentation, and
ensembles, which include random forest models) to be represented by a single,
concise language element. It also allows model outputs to be transformed into
business decisions. A PMML file can therefore represent the entire solution,
from raw data to business decision, with one or multiple predictive models.
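To make the idea of a model as an open, readable document more concrete, here is a minimal sketch in Python. The PMML document below is hand-written for illustration (the field names `x` and `y` and the coefficients are assumptions, not taken from any real model), and the helper functions `field_names` and `score` are hypothetical names that handle only a single regression table with numeric predictors. It is not how ADAPA or any other scoring engine is implemented.

```python
import xml.etree.ElementTree as ET

# Hypothetical, hand-written PMML 4.1 document encoding y = 1.5 + 2.0 * x.
PMML_DOC = """<PMML version="4.1" xmlns="http://www.dmg.org/PMML-4_1">
  <Header description="Illustrative linear model"/>
  <DataDictionary numberOfFields="2">
    <DataField name="x" optype="continuous" dataType="double"/>
    <DataField name="y" optype="continuous" dataType="double"/>
  </DataDictionary>
  <RegressionModel functionName="regression">
    <MiningSchema>
      <MiningField name="x"/>
      <MiningField name="y" usageType="predicted"/>
    </MiningSchema>
    <RegressionTable intercept="1.5">
      <NumericPredictor name="x" coefficient="2.0"/>
    </RegressionTable>
  </RegressionModel>
</PMML>"""

# Namespace prefix mapping for the PMML 4.1 schema.
NS = {"p": "http://www.dmg.org/PMML-4_1"}


def field_names(root):
    """Return the field names declared in the DataDictionary."""
    return [f.get("name") for f in root.findall("p:DataDictionary/p:DataField", NS)]


def score(root, record):
    """Evaluate the single RegressionTable for one input record (a dict)."""
    table = root.find("p:RegressionModel/p:RegressionTable", NS)
    result = float(table.get("intercept"))
    for term in table.findall("p:NumericPredictor", NS):
        # Each predictor contributes coefficient * input value.
        result += float(term.get("coefficient")) * record[term.get("name")]
    return result


root = ET.fromstring(PMML_DOC)
print(field_names(root))          # the declared inputs and target
print(score(root, {"x": 3.0}))    # 1.5 + 2.0 * 3.0
```

Everything the scorer needs (field declarations, the intercept, the coefficients) comes from the document itself, which is the property that lets a model be moved between tools.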
The availability of a standard such as PMML, combined with scoring solutions in
the cloud, for Hadoop, and in-database, makes it possible for predictive
analytics to fulfill its promise and crack the big data code. Zementis, Inc. has
been at the forefront of PMML-based scoring, first through its ADAPA Scoring
Engine, which is available for on-site deployment or as a service in the cloud
(Amazon and IBM), and lately through its Universal PMML Plug-in, which is
offered for a range of databases and for Hadoop. Zementis has partnered with
Revolution Analytics so that predictive solutions built in R can benefit from
the vast scoring infrastructure already in place. I am proud to be associated
with Zementis and excited to be part of an ever-growing PMML community.
A PMML package for R that exports all kinds of predictive
models is available directly from CRAN.
Traditionally, the PMML Package offered support for the
following data mining algorithms:
- ksvm (kernlab): Support Vector Machines
- nnet: Neural Networks
- rpart: C&RT Decision Trees
- lm & glm (stats): Linear and Binary Logistic Regression Models
- arules: Association Rules
- kmeans and hclust: Clustering Models
Recently, it has been expanded to support:
- multinom (nnet): Multinomial Logistic Regression Models
- glm (stats): Generalized Linear Models for classification and regression with a wide variety of link functions
- randomForest: Random Forest Models for classification and regression
- rsf (randomSurvivalForest): Random Survival Forest Models
This expansion is still ongoing as the R community implements support for other
packages and techniques. For more on the PMML package, please take a look at
the paper we published with Graham Williams from Togaware in “The R Journal”,
titled “PMML: An Open Standard for Sharing Models”.
There may be quite a few reasons for you to move your predictive solution from
R to an independent deployment platform. Among them, you may want parallel
execution on big data or real-time scoring for applications such as fraud
detection or recommender systems. With PMML you can move your model to the
cloud or inside the database for scoring, or even have it executed on Hadoop.
It is really up to you! On top of that, PMML allows for side-by-side deployment
of predictive assets from R as well as other commercial data mining tools,
supporting a multi-vendor environment as well as platform-independent
deployment.
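As a rough illustration of scoring a model outside the tool that built it, the sketch below walks a small PMML decision tree in Python. The tree is hand-written for this example: the field `amount`, the threshold of 1000, and the labels `review` and `legit` are all made up, and `matches` and `classify` are hypothetical helper names. The traversal is deliberately simplified (it supports only `True` and ordered `SimplePredicate` tests, and falls back to the current node's score when no child matches), so treat it as a sketch under those assumptions rather than a complete PMML consumer.

```python
import operator
import xml.etree.ElementTree as ET

# Hypothetical PMML TreeModel; field name, threshold, and labels are made up.
TREE_DOC = """<PMML version="4.1" xmlns="http://www.dmg.org/PMML-4_1">
  <Header description="Illustrative decision tree"/>
  <DataDictionary numberOfFields="2">
    <DataField name="amount" optype="continuous" dataType="double"/>
    <DataField name="label" optype="categorical" dataType="string"/>
  </DataDictionary>
  <TreeModel functionName="classification">
    <MiningSchema>
      <MiningField name="amount"/>
      <MiningField name="label" usageType="predicted"/>
    </MiningSchema>
    <Node id="1" score="legit">
      <True/>
      <Node id="2" score="review">
        <SimplePredicate field="amount" operator="greaterThan" value="1000"/>
      </Node>
      <Node id="3" score="legit">
        <SimplePredicate field="amount" operator="lessOrEqual" value="1000"/>
      </Node>
    </Node>
  </TreeModel>
</PMML>"""

NS = "{http://www.dmg.org/PMML-4_1}"

# Only the ordered comparison operators are handled in this sketch.
OPS = {
    "lessThan": operator.lt,
    "lessOrEqual": operator.le,
    "greaterThan": operator.gt,
    "greaterOrEqual": operator.ge,
}


def matches(node, record):
    """Return True if the node's predicate holds for the record."""
    if node.find(NS + "True") is not None:
        return True
    pred = node.find(NS + "SimplePredicate")
    compare = OPS[pred.get("operator")]
    return compare(record[pred.get("field")], float(pred.get("value")))


def classify(node, record):
    """Descend into the first matching child; otherwise use this node's score."""
    for child in node.findall(NS + "Node"):
        if matches(child, record):
            return classify(child, record)
    return node.get("score")


root_node = ET.fromstring(TREE_DOC).find(NS + "TreeModel/" + NS + "Node")
print(classify(root_node, {"amount": 2500.0}))
print(classify(root_node, {"amount": 40.0}))
```

The scorer knows nothing about the tool that produced the tree, which is what makes side-by-side deployment of models from different vendors plausible.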
More and more companies and individuals are using the PMML
standard for the obvious benefits it provides, putting their predictive
solutions on the fast track. With PMML, the speed of predictive solutions can
be on par with the speed of business.
Dr. Alex Guazzelli is the VP of Analytics
at Zementis Inc. where he is responsible for developing core technology and
predictive solutions under ADAPA, a PMML-based decisioning platform. With more
than 20 years of experience in predictive analytics, Dr. Guazzelli holds a PhD
in Computer Science from the University of Southern California and has co-authored
the book PMML in Action: Unleashing the Power of Open Standards for Data Mining
and Predictive Analytics, now in its second edition (paperback and Kindle). You
can follow him at @DrAlexGuazzelli.
Copyright © 2012 SYS-CON Media, Inc. — All Rights Reserved.
Syndicated stories and blog feeds, all rights reserved by the author.
More Stories By David Smith
David Smith is Vice President of Marketing and Community at Revolution Analytics. He has a long history with the R and statistics communities. After graduating with a degree in Statistics from the University of Adelaide, South Australia, he spent four years researching statistical methodology at Lancaster University in the United Kingdom, where he also developed a number of packages for the S-PLUS statistical modeling environment. He continued his association with S-PLUS at Insightful (now TIBCO Spotfire), overseeing the product management of S-PLUS and other statistical and data mining products. David Smith is the co-author (with Bill Venables) of the popular tutorial manual, An Introduction to R, and one of the originating developers of the ESS: Emacs Speaks Statistics project. Today, he leads marketing for REvolution R, supports R communities worldwide, and is responsible for the Revolutions blog. Prior to joining Revolution Analytics, he served as vice president of product management at Zynchros, Inc. Follow him on Twitter at @RevoDavid.