Overview of the architecture of a previous data engineering project
Data Engineer Interview Questions
20,237 data engineer interview questions shared by candidates
Asked about how PySpark internally works basically the architecture of PySpark.
Asked me to model a program in scala
Intermediate SQL queries based on the take-home assignment. I was also asked a window function query.
What is a window function? Joins vs subqueries. Outer vs inner joins. Infrastructure based questions.
Spark and hive optimization techniques, partitioning and bucketing concepts, small programs in pyspark
1. How would you determine the frequency of each unique element in a list? 2. A problem that can be solved using a SQL window function, and explain your approach?
How does memory management work in Python?
Explain the join and union and describe the differences between them.
Synchronization of two different data frame and asking the complexity of the algorithm. Domain interview: bunch of pyspark related questions and SQL simple query Second programming interview: Find if a path exists in a Graph
Viewing 1171 - 1180 interview questions