Welcome!

@CloudExpo Authors: William Schmarzo, Elizabeth White, Mauro Carniel, John Worthington, Pat Romanski

Related Topics: @CloudExpo, Linux Containers

@CloudExpo: Blog Feed Post

SQL Data Services: Your Database in the Cloud

This will really make sharing of data in the cloud so much easier

One thing in the Microsoft cloud I find really interesting is SQL Data Services and Huron/Data Hub - SQL cloud sync service, one of the “cloud” offerings I believe has lots of potential and will really make sharing of data in the cloud so much easier.

I had the pleasure to sit down and talk about this subject with Liam Cavanagh, Sr. Program Manager at Microsoft, with the SDS/Huron team, and get some insights about the current state and the future of this remarkable new technology. In this article I’ll talk about SQL Data Services, and I’ll follow up with one about Data Hub/Huron.

SQL Data Services is at the core, nothing more than a (Microsoft SQL) database-as-a-service offering from Microsoft, part of the Azure Services Platform. First thing you’ll find about SQL Data Services is that “is just SQL” (at least that’s how Microsoft is advertising it). And it is. You’re able to change your connection string from your local database to your cloud database and you can access the “cloud” SQL. You can use SQL Studio to run queries, create tables, everything (oh well, almost) you do locally. First version of SQL Data Services will support: tables, indexes, views, stored procedures, triggers, constraints, table variables, session temp tables etc. It will not support: distributed transactions or queries, CLR, Service Broker, Spatial, physical server or catalog DDL and views. Also, reporting services, Business Intelligence  services, will be available sometimes in the future. So far there’s no information for when some of the features not included in the first version will be available.

The initial commercial release will have some limitations on database size, most likely it will be around 10 GB. The limitation might be lifted on future releases, but for now will be there to stay. This limitation is mainly because Microsoft feels that this is a good size they can easily manage in the background: backups, moving the database from a server to another server, data recovery, etc. You can have as many databases you want, and let’s be honest, 10 GB is a lot of data to store.

Other limitation will have to do with the duration of transactions and resource load on the server hosting your data. Keep in mind that your data will be living on servers in Microsoft’s data centers, with data from other customers. Microsoft makes sure your data is secure (I’m sure we’ll see some guarantees in the SLA), but in order to maintain good multi-tenant practices it will have to throttle or otherwise make sure that all the databases on the server get enough resources to function properly. One of the techniques used is moving more active databases from a loaded server to an idle server.

Like with any other database, corruption of data can happen in the cloud database as well. Microsoft has mechanisms in place to recover from data corruption (mainly by keeping database replicas on multiple servers), however, they don’t provide any user level backup of the database (at least in the first version). As we’ve seen in some of the PDC 2008 presentations, in the future we will probably see database backup/restore and geo-replication (synchronous – replica set spans datacenters and asynchronous – independent replica sets in different datacenters).

There’s no surprise on how concurrency is handled in the cloud database, SDS has the same mechanism like any SQL Server. SQL Server supports optimistic (time-stamps or value comparisons) or pessimistic concurrency models. The presence of the “cloud” doesn’t change the model at all. If you’re really curious about the subject, here’s a link to some information about SQL Server 2008 Concurrency which essentially deals with how the SQL Server handles locking.

By having the database in the cloud, there’s going to be a latency when accessing it from your premises. Microsoft recommends running your applications that are using the database in the cloud on the Azure Platform, so the latency is minimal. When you deploy an application on Windows Azure and provision an SDS server, the two are going to be co-located, to provide low latency between the application and the data.

You will find out rather quickly that there’s no web based administration tool for managing your database in the cloud, but most probably some kind of web admin tool (Microsoft or third party) will be available in the near future.

The exact billing model is not yet available. However, we know from Nigel Ellis (the person responsible for the design, development, and release of SQL Data Services) that customers will be charged for the physical database size including all data and indexes defined.

What is SDS offering more than other SQL hosting services? High availability - your data is guaranteed, is available all the time. If you’re hosting SQL, in order to have high availability, you need to probably have two servers (mirrored) in case one goes down, the other one can take over. Also, SDS solution seems to be cost effective, since you pay just for what you’re using.

Initially SDS was built to use SOAP and REST protocols to access the data. With the switch to being a full relational database in the cloud, SDS is now using Tabular Data Stream (TDS) protocol, an application layer protocol used to transfer data between a database server and a client, initially developed by Sybase Inc. for their Sybase SQL Server relational database engine in 1984, and later by Microsoft in Microsoft SQL Server. There are already lots of drivers already implemented for this protocol: ODBC, OLEDB, ADO .NET, ODBC driver for PHP stack, you can access it from ruby, from linux using the Open TDS driver.

Of course, it will take some time for the platform to mature. It is the goal of this first version to address the needs of 95% or more web and departmental applications.

The SQL Data Services Community Technology Preview (CTP) will be available soon. You can join the mailing list in order to receive an e-mail notification when it will become available.

Related posts:

Read the original blog entry...

More Stories By Alin Irimie

Alin Irimie is a software engineer - architect, designer, and developer with over 10 years experience in various languages and technologies. Currently he is Messaging Security Manager at Sunbelt Software, a security company. He is also the CTO of RADSense Software, a software consulting company. He has expertise in Microsoft technologies such as .NET Framework, ASP.NET, AJAX, SQL Server, C#, C++, Ruby On Rails, Cloud computing (Amazon and Windows Azure),and he also blogs about cloud technologies here.

@CloudExpo Stories
In his session at 21st Cloud Expo, Carl J. Levine, Senior Technical Evangelist for NS1, will objectively discuss how DNS is used to solve Digital Transformation challenges in large SaaS applications, CDNs, AdTech platforms, and other demanding use cases. Carl J. Levine is the Senior Technical Evangelist for NS1. A veteran of the Internet Infrastructure space, he has over a decade of experience with startups, networking protocols and Internet infrastructure, combined with the unique ability to it...
"MobiDev is a software development company and we do complex, custom software development for everybody from entrepreneurs to large enterprises," explained Alan Winters, U.S. Head of Business Development at MobiDev, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
Vulnerability management is vital for large companies that need to secure containers across thousands of hosts, but many struggle to understand how exposed they are when they discover a new high security vulnerability. In his session at 21st Cloud Expo, John Morello, CTO of Twistlock, addressed this pressing concern by introducing the concept of the “Vulnerability Risk Tree API,” which brings all the data together in a simple REST endpoint, allowing companies to easily grasp the severity of the ...
Agile has finally jumped the technology shark, expanding outside the software world. Enterprises are now increasingly adopting Agile practices across their organizations in order to successfully navigate the disruptive waters that threaten to drown them. In our quest for establishing change as a core competency in our organizations, this business-centric notion of Agile is an essential component of Agile Digital Transformation. In the years since the publication of the Agile Manifesto, the conn...
In his session at 21st Cloud Expo, James Henry, Co-CEO/CTO of Calgary Scientific Inc., introduced you to the challenges, solutions and benefits of training AI systems to solve visual problems with an emphasis on improving AIs with continuous training in the field. He explored applications in several industries and discussed technologies that allow the deployment of advanced visualization solutions to the cloud.
Enterprises are adopting Kubernetes to accelerate the development and the delivery of cloud-native applications. However, sharing a Kubernetes cluster between members of the same team can be challenging. And, sharing clusters across multiple teams is even harder. Kubernetes offers several constructs to help implement segmentation and isolation. However, these primitives can be complex to understand and apply. As a result, it’s becoming common for enterprises to end up with several clusters. Thi...
While some developers care passionately about how data centers and clouds are architected, for most, it is only the end result that matters. To the majority of companies, technology exists to solve a business problem, and only delivers value when it is solving that problem. 2017 brings the mainstream adoption of containers for production workloads. In his session at 21st Cloud Expo, Ben McCormack, VP of Operations at Evernote, discussed how data centers of the future will be managed, how the p...
"NetApp is known as a data management leader but we do a lot more than just data management on-prem with the data centers of our customers. We're also big in the hybrid cloud," explained Wes Talbert, Principal Architect at NetApp, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
"Space Monkey by Vivent Smart Home is a product that is a distributed cloud-based edge storage network. Vivent Smart Home, our parent company, is a smart home provider that places a lot of hard drives across homes in North America," explained JT Olds, Director of Engineering, and Brandon Crowfeather, Product Manager, at Vivint Smart Home, in this SYS-CON.tv interview at @ThingsExpo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
The question before companies today is not whether to become intelligent, it’s a question of how and how fast. The key is to adopt and deploy an intelligent application strategy while simultaneously preparing to scale that intelligence. In her session at 21st Cloud Expo, Sangeeta Chakraborty, Chief Customer Officer at Ayasdi, provided a tactical framework to become a truly intelligent enterprise, including how to identify the right applications for AI, how to build a Center of Excellence to oper...
"IBM is really all in on blockchain. We take a look at sort of the history of blockchain ledger technologies. It started out with bitcoin, Ethereum, and IBM evaluated these particular blockchain technologies and found they were anonymous and permissionless and that many companies were looking for permissioned blockchain," stated René Bostic, Technical VP of the IBM Cloud Unit in North America, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Conventi...
Coca-Cola’s Google powered digital signage system lays the groundwork for a more valuable connection between Coke and its customers. Digital signs pair software with high-resolution displays so that a message can be changed instantly based on what the operator wants to communicate or sell. In their Day 3 Keynote at 21st Cloud Expo, Greg Chambers, Global Group Director, Digital Innovation, Coca-Cola, and Vidya Nagarajan, a Senior Product Manager at Google, discussed how from store operations and ...
"Infoblox does DNS, DHCP and IP address management for not only enterprise networks but cloud networks as well. Customers are looking for a single platform that can extend not only in their private enterprise environment but private cloud, public cloud, tracking all the IP space and everything that is going on in that environment," explained Steve Salo, Principal Systems Engineer at Infoblox, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Conventio...
"We're focused on how to get some of the attributes that you would expect from an Amazon, Azure, Google, and doing that on-prem. We believe today that you can actually get those types of things done with certain architectures available in the market today," explained Steve Conner, VP of Sales at Cloudistics, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
Large industrial manufacturing organizations are adopting the agile principles of cloud software companies. The industrial manufacturing development process has not scaled over time. Now that design CAD teams are geographically distributed, centralizing their work is key. With large multi-gigabyte projects, outdated tools have stifled industrial team agility, time-to-market milestones, and impacted P&L stakeholders.
"ZeroStack is a startup in Silicon Valley. We're solving a very interesting problem around bringing public cloud convenience with private cloud control for enterprises and mid-size companies," explained Kamesh Pemmaraju, VP of Product Management at ZeroStack, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
"Codigm is based on the cloud and we are here to explore marketing opportunities in America. Our mission is to make an ecosystem of the SW environment that anyone can understand, learn, teach, and develop the SW on the cloud," explained Sung Tae Ryu, CEO of Codigm, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
Data scientists must access high-performance computing resources across a wide-area network. To achieve cloud-based HPC visualization, researchers must transfer datasets and visualization results efficiently. HPC clusters now compute GPU-accelerated visualization in the cloud cluster. To efficiently display results remotely, a high-performance, low-latency protocol transfers the display from the cluster to a remote desktop. Further, tools to easily mount remote datasets and efficiently transfer...
High-velocity engineering teams are applying not only continuous delivery processes, but also lessons in experimentation from established leaders like Amazon, Netflix, and Facebook. These companies have made experimentation a foundation for their release processes, allowing them to try out major feature releases and redesigns within smaller groups before making them broadly available. In his session at 21st Cloud Expo, Brian Lucas, Senior Staff Engineer at Optimizely, discussed how by using ne...
Gemini is Yahoo’s native and search advertising platform. To ensure the quality of a complex distributed system that spans multiple products and components and across various desktop websites and mobile app and web experiences – both Yahoo owned and operated and third-party syndication (supply), with complex interaction with more than a billion users and numerous advertisers globally (demand) – it becomes imperative to automate a set of end-to-end tests 24x7 to detect bugs and regression. In th...