In this post, we will discuss all the Hive data types, with examples for each. Hive supports most of the primitive data types supported by many relational databases, and any that are missing are being added in successive releases. Hive Data Types With Examples: Hive […]

Hive Data Types With Examples

In this post, we will discuss Hive table commands with examples. This post can be treated as a sequel to the previous post, Hive Database Commands. Hive Table Creation Commands: Introduction to Hive Tables. In Hive, tables are collections of homogeneous data records that share the same schema for […]

Hive Table Creation Commands

In this post, we will discuss the Hive database commands (Create/Alter/Use/Drop Database) with examples for each statement. All these commands and their options are taken from the hive-0.14.0 release documentation, so using them with all the options described below requires at least the hive-0.14.0 release. Hive Database Commands […]

Hive Database Commands

In this post, we will give a basic introduction to the QlikView BI tool and to QlikView integration with Hadoop Hive. We will use Cloudera Hive and its JDBC drivers/connectors to connect with QlikView, and we will retrieve a sample table from a Cloudera Hadoop Hive database. QlikView Overview: What […]

QlikView Integration with Hadoop

In this post, we will describe the process of creating a custom UDF in Hive. Though Hive provides many generic UDFs (user-defined functions), we might sometimes need to write custom UDFs to meet our requirements. In this post, we will discuss one of the […]

Creating Custom UDF in Hive – Auto Increment Column in ...

In this post, we will discuss the configuration required for Hive connectivity with Hunk, the Hadoop flavor of Splunk, the well-known visualization tool. Splunk Overview: Splunk captures, indexes, and correlates real-time data in a searchable repository from which it can generate graphs, reports, dashboards, and visualizations. Splunk released a product […]

Hive Connectivity With Hunk (Splunk)

In this post, we will discuss Hive integration with the Tez framework, that is, enabling Tez for Hive queries. We will also run sample Hive queries on both the MapReduce and Tez frameworks and evaluate the performance difference between them. Tez Advantages: Tez offers a customizable […]

Hive on Tez – Hive Integration with Tez

Apache Tez Overview: What is Apache Tez? Apache Tez is another execution framework project from the Apache Software Foundation, built on top of Hadoop YARN. It is considered a more flexible and powerful successor to the MapReduce framework. Apache Tez Features: Tez provides a performance gain over MapReduce […]

Apache Tez – Successor of Mapreduce Framework

In this post, we will discuss the setup needed for HBase integration with Hive. We will test this integration by creating some test HBase tables from the Hive shell, populating them from another Hive table, and finally verifying the contents in the HBase table. […]

HBase Integration with Hive

In our previous post, we discussed Hive CLI commands. We now continue the same topic with Hive interactive shell commands and a few examples of these options. Hive Interactive Shell Commands: By default, Hive enters interactive shell mode if we do not […]

Hive Interactive Shell Commands

In our previous posts, we covered the Hive overview and Hive architecture. We will now discuss the default service in Hive, the Hive command line interface, and the Hive CLI commands. Ways to Interact with Hive: CLI, the command-line interface; Karmasphere (commercial product); Cloudera’s open source Hue (https://git […]

Hive CLI Commands

In this post, we will discuss the differences between Java and Hive with the help of a word count example. We will examine the word count algorithm first using the Java MapReduce API and then using Hive. The following Java implementation is included in the Apache Hadoop distribution. For implementing the Word […]

Java vs Hive