Which of the following is NOT a main component of Hive architecture?
Think about which component is specific to Hadoop HDFS rather than Hive.
NameNode is part of Hadoop HDFS, not Hive. Hive architecture mainly includes Metastore, Driver, Compiler, Execution Engine, and CLI/Thrift Server.
What is the output of the following Hive query execution flow print statements?
print('Start Query Compilation')
print('Semantic Analysis')
print('Query Optimization')
print('Execution Plan Generation')
print('Query Execution')
print('Fetch Results')Recall the typical steps in Hive query processing from compilation to execution.
The print statements show the full sequence of Hive query execution steps in order.
Given the Hive Metastore stores metadata in a relational database, what would be the output of this SQL query on the Metastore database?
SELECT tbl_name FROM TBLS WHERE db_id = 5;
Assuming the database with id 5 has tables: 'sales', 'customers', 'products'.
Think about what tables belong to a database in Hive Metastore.
The query returns all table names for the database with id 5, which includes all three tables.
Which option best describes the error when Hive query execution fails due to missing metadata?
org.apache.hadoop.hive.ql.metadata.HiveException: MetaException(message:NoSuchObjectException: Table not found)
Focus on the error message about 'NoSuchObjectException'.
This error means Hive cannot find the table metadata in the Metastore, so the table likely does not exist or was dropped.
You want to optimize a Hive query that processes large data by choosing the right execution engine. Which engine should you select for faster execution on a Hadoop cluster with YARN?
Consider which execution engine is designed to improve performance over MapReduce in Hive.
Tez is an optimized execution engine for Hive that runs on YARN and improves speed compared to MapReduce. Spark is also fast but requires additional setup.