@CloudExpo: Article

HBase Big Data on Amazon Web Services

Use Toad for Cloud for Easy Random Access to Hadoop

Hadoop is designed to store extremely large volumes of data. HBase, an open source NoSQL data store, makes it possible to randomly access such large data sets. HBase is included in Cloudera's Hadoop distribution.

One of the major obstacles to wider adoption of NoSQL databases is the lack of query languages, i.e., the lack of comprehensive non-programmatic interfaces to the data inside a NoSQL data store. We expect NoSQL databases to come up with such query languages in the near future. In the meantime, Quest's Toad for Cloud fills this gap and makes it easy to access NoSQL, cloud, and relational data sources through a single interface. You can use a familiar SQL interface and issue DML commands (SELECT, INSERT, UPDATE, DELETE) against HBase/Hadoop, Cassandra, and other NoSQL and cloud sources.

It is straightforward to start the HBase service from Cloudera Manager's main Service panel.

We can now start Toad for Cloud, map a new data source named ETLData, and provide the connection parameters for our HBase data store. Our HBase Stargate (REST) server is ec2-107-21-36-222.compute-1.amazonaws.com, an Amazon Web Services virtual server.
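Before pointing a client tool at Stargate, it can help to confirm that the REST endpoint answers. The following is a minimal sketch, not part of the original walkthrough: the host name is the one given above, but the port (8080, the stock HBase REST default) is an assumption, and the helper names `stargate_url`, `table_names`, and `fetch_json` are hypothetical. It assumes the gateway returns its table list as JSON of the form `{"table": [{"name": ...}]}`.

```python
import json
import urllib.request
from typing import List
from urllib.parse import quote

# Host name taken from the article. The port is an assumption: 8080 is the
# stock HBase REST default, and a managed cluster may expose another one.
STARGATE_HOST = "ec2-107-21-36-222.compute-1.amazonaws.com"
STARGATE_PORT = 8080


def stargate_url(*segments: str, host: str = STARGATE_HOST,
                 port: int = STARGATE_PORT) -> str:
    """Build a Stargate URL such as http://host:port/Customer/schema."""
    path = "/".join(quote(segment, safe="") for segment in segments)
    return f"http://{host}:{port}/{path}"


def table_names(table_list: dict) -> List[str]:
    """Extract table names from the JSON document returned by GET /."""
    return [entry["name"] for entry in table_list.get("table", [])]


def fetch_json(url: str, timeout: float = 10.0) -> dict:
    """Issue a GET request asking the gateway for a JSON response."""
    request = urllib.request.Request(url, headers={"Accept": "application/json"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

With a reachable server, `table_names(fetch_json(stargate_url()))` would be expected to include `Customer`, and `stargate_url("Customer", "schema")` gives the address of that table's schema.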

We can now see the HBase tables we previously created through the HBase shell (in our case, the table is named Customer).

Since Toad SQL is an abstraction layer on top of HBase, it needs to map each HBase table to a table of its own. Toad does this automatically: it correctly recognizes that our Customer table has a single column family with two columns, Name and Surname.
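To make the idea of this mapping concrete, here is a rough sketch of how rows returned by the HBase REST gateway could be flattened into relational-style records. It illustrates the concept only and is not how Toad implements it. It assumes the gateway's JSON row format, in which row keys, column names (`family:qualifier`), and values arrive base64-encoded. The column family name `info`, the `row_key` column, and the sample values are made up for illustration, since the article does not state them.

```python
import base64
from typing import Dict, List


def _decode(text: str) -> str:
    """Decode a base64 string as delivered in a REST gateway JSON response."""
    return base64.b64decode(text).decode("utf-8")


def _encode(text: str) -> str:
    """Encode a plain string the way the gateway would (used for sample data)."""
    return base64.b64encode(text.encode("utf-8")).decode("ascii")


def flatten_rows(payload: dict, key_column: str = "row_key") -> List[Dict[str, str]]:
    """Turn {"Row": [{"key": ..., "Cell": [...]}]} into one dict per row.

    Each cell's "family:qualifier" name is reduced to its qualifier, which
    is unambiguous only because the table has a single column family.
    """
    records = []
    for row in payload.get("Row", []):
        record = {key_column: _decode(row["key"])}
        for cell in row.get("Cell", []):
            _family, _, qualifier = _decode(cell["column"]).partition(":")
            record[qualifier] = _decode(cell["$"])
        records.append(record)
    return records


# Made-up sample row; "info" is a hypothetical column family name.
SAMPLE = {
    "Row": [{
        "key": _encode("1"),
        "Cell": [
            {"column": _encode("info:Name"), "timestamp": 1, "$": _encode("Ada")},
            {"column": _encode("info:Surname"), "timestamp": 1, "$": _encode("Lovelace")},
        ],
    }]
}
```

Calling `flatten_rows(SAMPLE)` yields a single record with the keys `row_key`, `Name`, and `Surname`, which is the shape a SQL layer needs in order to expose the table as rows and columns.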

We can now issue familiar SQL statements to query or modify data.
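The statements in question are ordinary DML. As a sketch of their shape, the example below runs INSERT, UPDATE, DELETE, and SELECT against a `Customer` table with `Name` and `Surname` columns. It uses Python's built-in `sqlite3` module purely as a local stand-in so that it can run anywhere; it does not connect to Toad or HBase, and the `row_key` column and the sample values are assumptions made for illustration.

```python
import sqlite3
from typing import List, Tuple


def demo_dml() -> List[Tuple[str, str, str]]:
    """Run the four DML statement types against an in-memory stand-in table."""
    conn = sqlite3.connect(":memory:")
    try:
        # Stand-in for the mapped Customer table (row key plus two columns).
        conn.execute(
            "CREATE TABLE Customer (row_key TEXT PRIMARY KEY, Name TEXT, Surname TEXT)"
        )
        insert = "INSERT INTO Customer (row_key, Name, Surname) VALUES (?, ?, ?)"
        conn.execute(insert, ("1", "Ada", "Lovelace"))
        conn.execute(insert, ("2", "Alan", "Turing"))

        # Modify one row and remove another.
        conn.execute("UPDATE Customer SET Surname = ? WHERE row_key = ?", ("King", "1"))
        conn.execute("DELETE FROM Customer WHERE row_key = ?", ("2",))

        # Query what is left.
        cursor = conn.execute(
            "SELECT row_key, Name, Surname FROM Customer ORDER BY row_key"
        )
        return cursor.fetchall()
    finally:
        conn.close()
```

Against the real data source, the same kinds of statements would be typed into Toad's SQL editor; the exact dialect details Toad accepts for HBase are not covered here.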

More Stories By Ranko Mosic

Ranko Mosic, BScEng, specializes in Big Data and data architecture consulting services (database/data architecture, machine learning). His clients are in the finance, retail, and telecommunications industries. Ranko welcomes inquiries about his availability for consulting engagements and can be reached at 408-757-0053 or [email protected]
