What is HBase used for?

 
Joe Harry
Ranch Hand
Posts: 10128
I will probably be using HBase for one of my projects, and I'm starting to explore what HBase is all about. While following the article on setting up a standalone HBase installation on my machine, I could not understand why I need hbase-XXX-hadoop2.tar.gz. Why do I need the Hadoop2 version of HBase? What is the relation between Hadoop and HBase?

https://hbase.apache.org/book/quickstart.html
 
Joe Harry
Ranch Hand
Posts: 10128
While I was looking for some good video tutorials, I came across this course:

http://bigdatauniversity.com/bdu-wp/bdu-course/using-hbase-for-real-time-access-to-your-big-data-version-2/

Did anyone take this? Could you kindly share your experience?
 
Akhilesh Trivedi
Ranch Hand
Posts: 1608

HBase is an open source implementation of Google’s BigTable architecture. Similar to traditional relational database management systems (RDBMSs), data in HBase is organized in tables. Unlike RDBMSs, however, HBase supports a very loose schema definition, and does not provide any joins, query language, or SQL.

-Source : "Professional Hadoop Solutions" - Wrox Press.


I would also be interested to know whether HBase is based on HDFS. How does it differ from HDFS? Maybe HDFS is just a file system and HBase is a data system that leverages HDFS.
 
Mayaaa Singh
Greenhorn
Posts: 3
HBase is a non-relational, column-oriented, distributed database that runs on top of HDFS. It is an open source NoSQL database in which data is stored in rows and columns. A cell is the intersection of a row and a column.

Versioning tracks changes to a cell and makes it possible to retrieve any stored version of its contents. This is one of the main differences between HBase tables and RDBMS tables.

Each cell value includes a “version” attribute, which is nothing more than a timestamp identifying that version of the cell. Each value in the map is an uninterpreted array of bytes.

The map is indexed by a row key, a column key, and a timestamp. An HBase table is therefore a sparse, distributed, persistent, multidimensional sorted map, and it is designed to be highly scalable.
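To make the data model above a bit more concrete, here is a rough in-memory sketch in Python of a map indexed by row key, column key, and timestamp, with byte-array values. It is only a toy illustration and not the HBase client API: the class name `TinyVersionedTable`, the `max_versions` and `as_of` parameters, and the `cf:name` column are all made up for this example, and the "latest version at or before `as_of`" lookup is a simplification chosen for the sketch.

```python
from collections import defaultdict


class TinyVersionedTable:
    """Toy model of a (row key, column key, timestamp) -> bytes map.

    Hypothetical helper for illustration only; not part of HBase.
    """

    def __init__(self, max_versions=3):
        # max_versions is an assumption of this sketch, not an HBase setting.
        self.max_versions = max_versions
        # row key -> column key -> list of (timestamp, value), newest first
        self._rows = defaultdict(dict)

    def put(self, row, column, value, timestamp):
        versions = self._rows[row].setdefault(column, [])
        # A put with an existing timestamp replaces that version.
        versions[:] = [(ts, v) for ts, v in versions if ts != timestamp]
        versions.append((timestamp, value))
        versions.sort(key=lambda tv: tv[0], reverse=True)
        # Keep only the newest max_versions entries.
        del versions[self.max_versions:]

    def get(self, row, column, as_of=None):
        # Return the newest value, or the newest one with timestamp <= as_of.
        for ts, value in self._rows.get(row, {}).get(column, []):
            if as_of is None or ts <= as_of:
                return value
        return None  # sparse: a missing cell simply has no entry

    def scan(self):
        # Rows come back sorted by row key, columns sorted within each row.
        for row in sorted(self._rows):
            for column in sorted(self._rows[row]):
                yield row, column, self._rows[row][column][0][1]


table = TinyVersionedTable()
table.put(b"row1", b"cf:name", b"Joe", timestamp=1)
table.put(b"row1", b"cf:name", b"Joseph", timestamp=2)
table.put(b"row0", b"cf:name", b"Akhilesh", timestamp=5)

newest = table.get(b"row1", b"cf:name")            # newest version
older = table.get(b"row1", b"cf:name", as_of=1)    # version as of timestamp 1
missing = table.get(b"row2", b"cf:name")           # cell that was never written
scanned_rows = [row for row, _, _ in table.scan()]
```

Keys and values are plain `bytes` here because the store does not interpret them, and the scan returns rows in sorted key order, which is what the "sorted map" part of the description refers to.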

HBase real-world use cases:

  • Medical
  • Sports
  • Web



