Hive provides a SQL-like query language on HDFS (Hadoop Distributed File System)

CK3224 
Created at Mar 14, 2012 12:27:17 


Hive defines a simple SQL-like query language, called QL, that enables users familiar with SQL to query the data. At the same time, this language allows programmers who are familiar with the MapReduce framework to plug in their custom mappers and reducers to perform more sophisticated analysis that may not be supported by the built-in capabilities of the language. QL can also be extended with custom scalar functions (UDFs), aggregations (UDAFs), and table functions (UDTFs).


The Hive Query Language provides the following features:

Basic SQL

  • From clause subquery
  • ANSI JOIN (equi-join only)
  • Multi-table Insert
  • Multi group-by
  • Sampling
  • Objects traversal

Extensibility

  • Pluggable MapReduce scripts in the language of your choice using TRANSFORM (Syntax changing soon!!)
  • Pluggable User Defined Functions
  • Pluggable User Defined Types
  • Pluggable SerDes to read different kinds of data formats
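
As a rough illustration of the TRANSFORM item above, here is a minimal sketch of the kind of script Hive can stream rows through. Hive passes each row to the script as one tab-separated line on standard input and reads tab-separated lines back from standard output. The function name `transform_rows`, the two-column layout (pageid, userid), and the lowercasing rule are assumptions made for this example only, not part of Hive.

```python
def transform_rows(lines):
    """Yield one tab-separated output row per tab-separated input row.

    Assumes exactly two columns per row: pageid and userid (hypothetical layout).
    """
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # skip blank lines
        pageid, userid = line.split("\t")
        # Example-only rule: trim whitespace and lowercase the user id.
        yield f"{pageid}\t{userid.strip().lower()}"


# Demo on in-memory sample rows; when run by Hive, iterate over sys.stdin instead.
sample = ["p1\tAlice\n", "p1\t BOB \n", "p2\talice\n"]
for row in transform_rows(sample):
    print(row)
```

Saved as a file (say `normalize.py`, a hypothetical name) and changed to read from `sys.stdin`, a script like this would typically be registered with `ADD FILE` and invoked with something along the lines of `SELECT TRANSFORM(pageid, userid) USING 'python normalize.py' AS (pageid, userid) FROM page_view`. The exact syntax depends on the Hive version.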

The following is an example of the Hive query language. Its syntax closely resembles standard SQL, although Hive supports only a subset of it (for example, equi-joins only).

SELECT pageid, COUNT(DISTINCT userid)
FROM page_view GROUP BY pageid

This is almost the same as the SQL used with a conventional relational database (RDB). It is a major strength of Hive, because programmers with RDB experience can start writing queries with little retraining.
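
To make the similarity concrete, the sketch below runs the same statement unchanged against an in-memory SQLite database. SQLite is only a stand-in for an ordinary relational database here, not Hive, and the sample rows are made up for illustration.

```python
import sqlite3

# In-memory relational database standing in for a conventional RDB (not Hive).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE page_view (pageid TEXT, userid TEXT)")

# Hypothetical sample data: user u1 views the "home" page twice.
conn.executemany(
    "INSERT INTO page_view VALUES (?, ?)",
    [("home", "u1"), ("home", "u2"), ("home", "u1"), ("about", "u3")],
)

# The query text is the same as in the Hive example above.
query = """
SELECT pageid, COUNT(DISTINCT userid)
FROM page_view GROUP BY pageid
"""
result = dict(conn.execute(query).fetchall())
print(result)
```

Each page is mapped to the number of distinct users who viewed it, so the repeated view by `u1` is counted once. In Hive the same statement would be compiled into MapReduce jobs over files in HDFS rather than executed by a database engine.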

Hive does not require that the data it reads or writes be in a "Hive format"; there is no such thing. Hive works equally well on Thrift, control-delimited, or your own specialized data formats. See File Format and SerDe in the Developer Guide for details.
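
As a conceptual sketch of what reading a control-delimited format involves, the code below splits a line on Ctrl-A (`\x01`), which is Hive's default field delimiter for text files, and maps the values to column names. This is an illustration only: real SerDes are implemented against Hive's Java interfaces, and the function names and column names here are hypothetical.

```python
FIELD_DELIM = "\x01"  # Ctrl-A, Hive's default field delimiter for text files


def deserialize(line, columns):
    """Turn one control-delimited line into a column-name -> value mapping."""
    values = line.rstrip("\n").split(FIELD_DELIM)
    return dict(zip(columns, values))


def serialize(row, columns):
    """Turn a row mapping back into one control-delimited line (no newline)."""
    return FIELD_DELIM.join(str(row[c]) for c in columns)


# Hypothetical two-column layout matching the page_view example.
columns = ["pageid", "userid"]
row = deserialize("home\x01u1\n", columns)
print(row)
print(repr(serialize(row, columns)))
```

Because the table definition, not the file, decides how bytes become columns, a different format can be supported by swapping in a different pair of read and write routines.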

Hive is not designed for OLTP workloads and does not offer real-time queries or row-level updates. It is best used for batch jobs over large sets of append-only data (like web logs). What Hive values most are scalability (scale out with more machines added dynamically to the Hadoop cluster), extensibility (with MapReduce framework and UDF/UDAF/UDTF), fault-tolerance, and loose-coupling with its input formats.


[Figure: Data model for Hive]

References

https://cwiki.apache.org/confluence/display/Hive/Home

Hive, ApacheCon 2008, New Orleans, LA (Ashish Thusoo, Facebook)


