Let's assume you have 100 partitions in Spark, and by mistake, you used Coalesce(200) to increase partitions to 200. What will happen?
Data Engineer Interview Questions
20,165 data engineer interview questions shared by candidates
How many joins are there? Please explain each
sql and python
You are given an integer list coins representing coins of different denominations and an integer amount representing a total amount of money. Return the fewest number of coins that you need to make up that amount. If that amount of money cannot be made up by any combination of the coins, return {}. You may assume that you have an infinite number of each kind of coin. Example 1: Input: coins = [1, 5, 6, 9, 15], amount = 31 Output: {15:2, 1:1} Example 2: Input: coins = [1, 5, 6, 9, 15], amount = 100 Output: {15:6, 9:1, 1:1}
If you do a xor b xor a, what is the end result?
Typical data engineering questions
The gave me driver data, passenger data and booking data. You will be asked to run some aggregation on this and publish the results.
1. Imagine a table keeping track of all the rides a driver undertakes in a given day. How do you query that table to find out the time when he did not have any rides. So the table may have a record when he picks up his first ride and then he picks another and then drops both of them. After a while he picks up a few more which again is recorded in the table. So with all that how do you find the data when he does not have any rides
# Q2: Same as Q1, but now assume there can be duplicates. The output should not have duplicates # Ex: # l1 = [1,2,3,4,5,5,5] # l2 = [1,5,7,11,100] # result = [1,5]
What percent of all products in the grocery chain's catalog are both low fat and recyclable?
Viewing 181 - 190 interview questions