There were questions like "what technologies would you choose for your next project if you had $1m" without specifying any project-related info/requirements or even the underlying data for which you should pick the tools.
Data Engineer Interview Questions
20,267 data engineer interview questions shared by candidates
ADF: scenario based Pyspark: Coalesce vs repartition wide vs narrow transformation spark architecture one dataset to apply pivot transformation SQL: two questions (department wise highest salary, SQL question using REPLACE function)
How do you manage your daily tasks during the pandemic?
You are given a sorted array with repeated numbers. [1,1,1,3,3,3,3,3,4,5,6,6,6] Your task is to return the array by not repeating any number more than twice. And the array count. (In place) Output : [1,1,3,3,4,5,6,6]
Give an example of when you have used data to improve a business process or technical system.
About project , architectures ,some basics like partition,bucketing,RDD, Data frames, DAG execution engine, why from Hive to Spark SQL, difference between RDD, DataFrames, Datasets, how to make joins between data frames, what to do in spark job if our infrastructure is limited.
- What is the difference between shallow and deep copy in Python?
General purpose Azure Storage types ?
What was my previous experience.
Architecture and ETL process of my previous employment with an example of End to end ETL flow. Couple of questions on AWS services. Learning spirit Differences between batch processing and stream processing. Questions on Kappa architecture Many more relavent to my previous experience mentioned in CV
Viewing 1761 - 1770 interview questions