Data Engineer Interview Questions

20,267 data engineer interview questions shared by candidates

There were questions like "what technologies would you choose for your next project if you had $1m" without specifying any project-related info/requirements or even the underlying data for which you should pick the tools.
avatar

Data Engineer

Interviewed at Semrush

4
Mar 29, 2023

There were questions like "what technologies would you choose for your next project if you had $1m" without specifying any project-related info/requirements or even the underlying data for which you should pick the tools.

ADF: scenario based Pyspark: Coalesce vs repartition wide vs narrow transformation spark architecture one dataset to apply pivot transformation SQL: two questions (department wise highest salary, SQL question using REPLACE function)
avatar

Data Engineer

Interviewed at ValueMomentum

3.4
Jun 14, 2023

ADF: scenario based Pyspark: Coalesce vs repartition wide vs narrow transformation spark architecture one dataset to apply pivot transformation SQL: two questions (department wise highest salary, SQL question using REPLACE function)

You are given a sorted array with repeated numbers. [1,1,1,3,3,3,3,3,4,5,6,6,6] Your task is to return the array by not repeating any number more than twice. And the array count. (In place) Output : [1,1,3,3,4,5,6,6]
avatar

Senior Data Engineer

Interviewed at Delivery Hero

3.5
Nov 10, 2021

You are given a sorted array with repeated numbers. [1,1,1,3,3,3,3,3,4,5,6,6,6] Your task is to return the array by not repeating any number more than twice. And the array count. (In place) Output : [1,1,3,3,4,5,6,6]

About project , architectures ,some basics like partition,bucketing,RDD, Data frames, DAG execution engine, why from Hive to Spark SQL, difference between RDD, DataFrames, Datasets, how to make joins between data frames, what to do in spark job if our infrastructure is limited.
avatar

Data Engineer

Interviewed at Capgemini

4.2
Nov 30, 2020

About project , architectures ,some basics like partition,bucketing,RDD, Data frames, DAG execution engine, why from Hive to Spark SQL, difference between RDD, DataFrames, Datasets, how to make joins between data frames, what to do in spark job if our infrastructure is limited.

Architecture and ETL process of my previous employment with an example of End to end ETL flow. Couple of questions on AWS services. Learning spirit Differences between batch processing and stream processing. Questions on Kappa architecture Many more relavent to my previous experience mentioned in CV
avatar

Data Engineer

Interviewed at AO

3.1
Mar 24, 2021

Architecture and ETL process of my previous employment with an example of End to end ETL flow. Couple of questions on AWS services. Learning spirit Differences between batch processing and stream processing. Questions on Kappa architecture Many more relavent to my previous experience mentioned in CV

Viewing 1761 - 1770 interview questions

Glassdoor has 20,267 interview questions and reports from Data engineer interviews. Prepare for your interview. Get hired. Love your job.