Any App. Any Server. Any Cloud.

Adine Deford

Subscribe to Adine Deford: eMailAlertsEmail Alerts
Get Adine Deford: homepageHomepage mobileMobile rssRSS facebookFacebook twitterTwitter linkedinLinkedIn


Related Topics: SOA & WOA Magazine, Amazon Cloud Journal, Big Data on Ulitzer

Article

HBase Big Data on Amazon Web Services

Use Toad for Cloud for Easy Random Access to Hadoop

Hadoop is designed to store extremely large volumes of data. HBase, an open source NoSQL data store, makes it possible to randomly access such large data sets. HBase is included in Cloudera's Hadoop distribution.

One of the major obstacles to a wider adoption of NoSQL databases is the lack of query languages, i.e., lack of comprehensive non-programmatic interfaces to data inside NoSQL data store. We expect NoSQL databases to come up with such query languages in near future. In meantime, Quest's Toad for Cloud fills this gap and makes it easy to seamlessly access NoSQL, Cloud and relational data sources via a single interface. You can use a familiar SQL interface and issue DML ( SELECT, INSERT, UPDATE, DELETE) commands to access HBase/Hadoop, Cassandra and other NoSQL and Cloud sources.

It is straightforward to start HBase service from Cloudera Manager's main Service panel:

We can now start Toad for Cloud and map a new Data Source named ETLData and provide connection parameters to our HBase Data Store. Our HBase Stargate ( REST ) server name is ec2-107-21-36-222.compute-1.amazonaws.com ( Amazon Web Services virtual server ):

We are now able to see HBase tables we previously created via HBase shell interface ( in our case table name is Customer ):

Since Toad SQL is an abstraction layer on top of HBase, it needs to map HBase table to its own table. Toad does it automatically for us - it will correctly recognize that our Customer table has a single column family with two columns - Name and Surname:

We can now issue familiar SQL statements to query or modify data:

More Stories By Ranko Mosic

Ranko Mosic, BScEng, is specializing in Big Data/Data Architecture consulting services ( database/data architecture, machine learning ). His clients are in finance, retail, telecommunications industries. Ranko is welcoming inquiries about his availability for consulting engagements and can be reached at 408-757-0053 or ranko.mosic@gmail.com

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.