Welcome!

@CloudExpo Authors: Liz McMillan, Zakia Bouachraoui, Dana Gardner, Yeshim Deniz, Elizabeth White

Related Topics: @CloudExpo, Microservices Expo

@CloudExpo: Article

HBase Big Data on Amazon Web Services

Use Toad for Cloud for Easy Random Access to Hadoop

Hadoop is designed to store extremely large volumes of data. HBase, an open source NoSQL data store, makes it possible to randomly access such large data sets. HBase is included in Cloudera's Hadoop distribution.

One of the major obstacles to a wider adoption of NoSQL databases is the lack of query languages, i.e., lack of comprehensive non-programmatic interfaces to data inside NoSQL data store. We expect NoSQL databases to come up with such query languages in near future. In meantime, Quest's Toad for Cloud fills this gap and makes it easy to seamlessly access NoSQL, Cloud and relational data sources via a single interface. You can use a familiar SQL interface and issue DML ( SELECT, INSERT, UPDATE, DELETE) commands to access HBase/Hadoop, Cassandra and other NoSQL and Cloud sources.

It is straightforward to start HBase service from Cloudera Manager's main Service panel:

We can now start Toad for Cloud and map a new Data Source named ETLData and provide connection parameters to our HBase Data Store. Our HBase Stargate ( REST ) server name is ec2-107-21-36-222.compute-1.amazonaws.com ( Amazon Web Services virtual server ):

We are now able to see HBase tables we previously created via HBase shell interface ( in our case table name is Customer ):

Since Toad SQL is an abstraction layer on top of HBase, it needs to map HBase table to its own table. Toad does it automatically for us - it will correctly recognize that our Customer table has a single column family with two columns - Name and Surname:

We can now issue familiar SQL statements to query or modify data:

More Stories By Ranko Mosic

Ranko Mosic, BScEng, is specializing in Big Data/Data Architecture consulting services ( database/data architecture, machine learning ). His clients are in finance, retail, telecommunications industries. Ranko is welcoming inquiries about his availability for consulting engagements and can be reached at 408-757-0053 or [email protected]

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


CloudEXPO Stories
"There is a huge interest in Kubernetes. People are now starting to use Kubernetes and implement it," stated Sebastian Scheele, co-founder of Loodse, in this SYS-CON.tv interview at DevOps at 19th Cloud Expo, held November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA.
Is advanced scheduling in Kubernetes achievable?Yes, however, how do you properly accommodate every real-life scenario that a Kubernetes user might encounter? How do you leverage advanced scheduling techniques to shape and describe each scenario in easy-to-use rules and configurations? In his session at @DevOpsSummit at 21st Cloud Expo, Oleg Chunikhin, CTO at Kublr, answered these questions and demonstrated techniques for implementing advanced scheduling. For example, using spot instances and cost-effective resources on AWS, coupled with the ability to deliver a minimum set of functionalities that cover the majority of needs – without configuration complexity.
At CloudEXPO Silicon Valley, June 24-26, 2019, Digital Transformation (DX) is a major focus with expanded DevOpsSUMMIT and FinTechEXPO programs within the DXWorldEXPO agenda. Successful transformation requires a laser focus on being data-driven and on using all the tools available that enable transformation if they plan to survive over the long term. A total of 88% of Fortune 500 companies from a generation ago are now out of business. Only 12% still survive. Similar percentages are found throughout enterprises of all sizes.
Here to help unpack insights into the new era of using containers to gain ease with multi-cloud deployments are our panelists: Matt Baldwin, Founder and CEO at StackPointCloud, based in Seattle; Nic Jackson, Developer Advocate at HashiCorp, based in San Francisco, and Reynold Harbin, Director of Product Marketing at DigitalOcean, based in New York. The discussion is moderated by Dana Gardner, principal analyst at Interarbor Solutions.
Atmosera delivers modern cloud services that maximize the advantages of cloud-based infrastructures. Offering private, hybrid, and public cloud solutions, Atmosera works closely with customers to engineer, deploy, and operate cloud architectures with advanced services that deliver strategic business outcomes. Atmosera's expertise simplifies the process of cloud transformation and our 20+ years of experience managing complex IT environments provides our customers with the confidence and trust that they are being taken care of.