APACHE HIVE
(Apache Hadoop Sub Project)


Agenda:
• Story – Making of Apache Hive
• What is Apache Hive
• Physical Layout
• Hive CLI
• HiveQL
Can Elephants Fly?




Concern: Can Hadoop be used more efficiently and productively by developers?

                 © 2012 Sabre Holdings Pvt. Ltd. | All rights reserved   3
Thinking…?
Step 1. Give him Wings




Mr. Hadoop energizing himself.




Thinking…?
Step 2. Pray to Gravity

Thanks to gravity, the sky never fell down on us ;)
But wait, 2012 is not over yet. Keep praying.

Mr. Hadoop enjoying his first air ride.

“God did not create the universe, gravity did” - Stephen Hawking

Upshot of the down-fall




Victims

Mr. Hadoop – The Flying Elephant


Blame Gravity! The Fall will have a huge impact.




Saving Life…
Step 1. Shrink


BEFORE -




          ACME Elephant Shrinker


AFTER -


Saving Life…
Step 2. Genetic Engineering & a bit of magic

BEFORE: Mr. Hadoop
AFTER: Ms. Hive

Injecting Insecto-receptors



Behind the scenes…?




Hive was initially developed by Facebook.


• Hive is a data warehouse infrastructure built on top of Hadoop.
• It supports analysis of large datasets stored in Hadoop-compatible file systems such as HDFS and the Amazon S3 file system.
• It provides an SQL-like query language called HiveQL.
• To accelerate queries, it provides indexing.


• Warehouse directory in HDFS
  ▪ /user/hive/warehouse
• Tables ~ subdirectories of the warehouse directory
• Partitions ~ subdirectories of the corresponding table directory
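
As a rough illustration of this layout, the sketch below builds the HDFS path for a table and for one of its partitions. The helper names and the sample table `page_views` are hypothetical. The `key=value` naming of partition subdirectories is Hive's usual convention, and the warehouse root is the default path listed above.

```python
import posixpath

# Default warehouse root from the slide; a real cluster may configure another one.
WAREHOUSE_DIR = "/user/hive/warehouse"


def table_dir(table):
    """Directory that holds the data of a table (hypothetical helper)."""
    return posixpath.join(WAREHOUSE_DIR, table)


def partition_dir(table, partition_spec):
    """Directory of one partition; partition_spec is a list of (key, value) pairs."""
    parts = [f"{key}={value}" for key, value in partition_spec]
    return posixpath.join(table_dir(table), *parts)


print(table_dir("page_views"))
print(partition_dir("page_views", [("dt", "2012-01-01"), ("country", "US")]))
```

Each additional partition column adds one more directory level below the table directory.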




• Hive queries are implicitly converted to MapReduce code by the Hive engine.
• The compiler translates each query into a directed acyclic graph of MapReduce jobs.
• These MapReduce jobs are submitted to Hadoop for execution.
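
To make the idea of a job graph concrete, here is a toy sketch; it is not Hive's actual planner. It assumes a hypothetical query that needs three MapReduce jobs (a join, then a GROUP BY, then an ORDER BY) and uses Python's standard `graphlib` module (Python 3.9+) to derive an order in which the jobs could be submitted. The job names are made up for illustration.

```python
from graphlib import TopologicalSorter

# Hypothetical plan: each job maps to the set of jobs it depends on.
plan = {
    "job1_join": set(),
    "job2_group_by": {"job1_join"},
    "job3_order_by": {"job2_group_by"},
}


def submission_order(dependencies):
    """Return the jobs in an order that respects every dependency edge."""
    return list(TopologicalSorter(dependencies).static_order())


print(submission_order(plan))
```

Because the graph is acyclic, such an order always exists; a job is submitted only after the jobs it reads from have finished.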



• The /user/hive directory is created automatically the first time a Hive session is started.
• The /user/hive/warehouse directory must be accessible to all users.
  ▪ hadoop dfs -chmod -R 1777 /user/hive/warehouse
• Setting the sticky bit is recommended if the Hadoop version installed on the cluster supports it.
• The /tmp directory should also be made a sticky directory.
  ▪ hadoop dfs -chmod -R 1777 /tmp
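
The mode 1777 used above combines full read, write and execute access for everyone (777) with the sticky bit (the leading 1), which on a directory stops ordinary users from deleting or renaming files they do not own. A small sketch using Python's standard `stat` module decodes the mode; `describe_mode` is a hypothetical helper, and nothing here touches HDFS.

```python
import stat


def describe_mode(mode):
    """Decode a directory permission mode such as 0o1777."""
    return {
        # S_ISVTX is the sticky bit.
        "sticky": bool(mode & stat.S_ISVTX),
        # filemode() renders the mode the way `ls -l` would for a directory.
        "symbolic": stat.filemode(stat.S_IFDIR | mode),
    }


print(describe_mode(0o1777))
print(describe_mode(0o777))
```

The trailing `t` in the symbolic form is how a listing shows that the sticky bit is set.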

• The Hive CLI (command-line interface) is invoked with the hive command.
  ▪ % hive




• DML
  ▪ SELECT
• DDL
  ▪ SHOW TABLES
  ▪ CREATE TABLE
  ▪ ALTER TABLE
  ▪ DROP TABLE




• Normal tables are created under the warehouse directory (the source data is moved into the warehouse).
• Normal tables are directly visible when browsing the HDFS warehouse directory.
• On dropping a normal table, both the source data and the table metadata are deleted.
• External tables read directly from HDFS files.
• External tables are not visible in the warehouse directory.
• On dropping an external table, only the metadata is deleted; the source data is kept.
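
The difference in drop behaviour can be sketched with a toy in-memory model. This is not Hive code: `ToyWarehouse`, its methods, and the table names and paths are all hypothetical, and the model only tracks which locations still hold data after a drop.

```python
class ToyWarehouse:
    """Toy model of a metastore plus the data locations it points at."""

    WAREHOUSE_DIR = "/user/hive/warehouse"

    def __init__(self):
        self.metastore = {}  # table name -> (data location, is_external)
        self.files = set()   # locations that currently hold data

    def create_table(self, name, location=None, external=False):
        # A normal table without an explicit location lives under the warehouse directory.
        if location is None:
            location = f"{self.WAREHOUSE_DIR}/{name}"
        self.metastore[name] = (location, external)
        self.files.add(location)

    def drop_table(self, name):
        # Metadata is always removed; data is removed only for normal tables.
        location, external = self.metastore.pop(name)
        if not external:
            self.files.discard(location)


wh = ToyWarehouse()
wh.create_table("sales")
wh.create_table("raw_logs", location="/data/raw_logs", external=True)
wh.drop_table("sales")
wh.drop_table("raw_logs")
print(sorted(wh.files))
```

After both drops the metastore is empty, but the external table's data location is still present.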

• HiveQL supports joins only on equality conditions. Complex boolean expressions and inequality conditions are not supported.
• More than two tables can be joined.
• The number of MapReduce jobs (n) generated for a join depends on the columns used in the join conditions.
  ▪ If each table is joined on the same column in every join condition, then n = 1.
  ▪ Otherwise, n > 1.
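
The rule for n can be sketched as a small check, under the assumption that each join condition is written as a pair of (table, column) references. The function name and the sample tables `a`, `b`, `c` are hypothetical; the sketch only decides between n = 1 and n > 1, as the slide does, and does not compute the exact job count.

```python
from collections import defaultdict


def joins_in_single_job(conditions):
    """True if every table uses one and the same column across all join conditions."""
    columns_by_table = defaultdict(set)
    for left, right in conditions:
        for table, column in (left, right):
            columns_by_table[table].add(column)
    return all(len(columns) == 1 for columns in columns_by_table.values())


# a JOIN b ON a.key = b.key1 JOIN c ON c.key = b.key1  (b always joins on key1)
same_column = [(("a", "key"), ("b", "key1")), (("c", "key"), ("b", "key1"))]

# a JOIN b ON a.key = b.key1 JOIN c ON c.key = b.key2  (b joins on key1 and key2)
mixed_columns = [(("a", "key"), ("b", "key1")), (("c", "key"), ("b", "key2"))]

print(joins_in_single_job(same_column))
print(joins_in_single_job(mixed_columns))
```

In the first case table `b` is always joined on `key1`, so one job suffices; in the second, `b` is joined on two different columns, so more than one job is needed.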


• HiveQL does not follow the SQL-92 standard.
• Missing or limited features:
  ▪ No materialized views
  ▪ No transaction support
  ▪ Limited subquery support




Hadoop – Entering into the new world!




Reach me




Tapan Avasthi
Associate Software Developer Intern, Travelocity Global
tapan.avasthi@travelocity.com
tapan.k.avasthi@gmail.com


