Spark, Hive, Hadoop questions
Big Data Internship Interview Questions
1,784 big data internship interview questions shared by candidates
Repartiton and colaesce, why we are going for shuffuling if any image processed data are incoming what you choose to process many spark sql questions
what are transformation and action you have used in spark
Spark optimization techniques. Spark memory management.
Very generic questions which do not have any specific or single answer like - how would you optimize a hive query? how would you optimize a spark job?
There is two table. one is emp_details another is employee salary Find emp who has salary more than 8000 and who has worked in multiple project. 2. Find prime num. between 1 to 50 in Python/java. 3. Execution of spark job 4.Big-data-ques
First Round Experience He asked me 2 SQL Questions(Questions were based on Join and group byAlso asked to write same code in pyspark) and few basic spark questions like cache and persist, reparation and coalesc, RDD, Dataframe and dataset difference. I cleared first round and HR again scheduled second round Second Round: 2 SQL questions which was based on joins and union, union and union all difference, RDD vs Dataframe vs Dataset, Repartition and coalesce,Spark Architecture, Hive and other project related and basic questions. I cleared second round as well then HR scheduled next round which is HR round. HR round: Introduce yourself, What I know about Impetus, Why changing job and walked me through company info and CTC breakdown. She has some budget constraints or approval constraints so not sure why they called and conducted this whole process.
General and scenario based hadoop and spark questions
Aggregate cumulative statistics with a 1h moving window
Related Hadoop
Viewing 1761 - 1770 interview questions