How will you find the cpu utilization on linux machine?
Big Data Engineer Interview Questions
1,784 big data engineer interview questions shared by candidates
Where data get stored when map reduce job gets executed?
How will you find which data node is failed?
DNS, WLAN, HDFS, Spark, NoSQL, Linux
DNS in detail, TTL, how'd u tshoot a webpage not found, ip addressing, number of available addresses in a mask, lots and lots of scenario based behavioral questions
Basic questions related to big data but in depth like working of components i.e.what is namenode and if the replication factor is 3 how will you figure out what is happening inside the hdfs when you are trying to write something in hdfs etc.
Coding: Sqoop Job to import data into table Streaming data to Kafka and spark to read and replicate data to hive and exposing it using impala Several Linux questions
What is the answer to this question 1+1=
Solve some logic problem in a limited time.
--Round 1 Questions on Resume --Round 2 1. why hive column should not be added like alter table 2. what are narrow transformations 3. what are stages and what are types of stages? 4. Default how many stages in job 5. deploy the code in production 6. comparision of oozie and jeninks 7. Can you connect spark sql to RDBMS 8. spark and yarn architecture 9.Word count in file using accumulator 10.SQL question using analytical function ---Round 3 1.what is Object 2.what is inheritance 3.what is polymorphisim 4.How to read a binary file, like zip, gzip 5.How to read all files in zipped file 6.How to get filelist from hadoop location in spark 7.How to read a file of different size effecienty files are 1MB, 16MB, etc.. 8.How to create a dataframe of Map type 9.What is broadcast, accumulator 10.What is cache and repartition 11.Difference between cache and broadcast
Viewing 1121 - 1130 interview questions