How would you design an ETL pipeline to handle real-time data ingestion from multiple sources (e.g., web servers, mobile apps) and ensure data quality and consistency before loading it into a data warehouse?
Data Engineer Interview Questions
20,177 data engineer interview questions shared by candidates
Unfortunately unable to share because of NDA
Find the Hamming distance of these two lists
SQL Question asking for a percentage of the selected data.
Same as mentioned in other responses here.
What are the top five (ranked in decreasing order) single-channel media types that correspond to the most money the grocery chain had spent on its promotional campaigns? media_type can contain mutliple values seperated by a comma, so single channel is when media_type only has one value.
python:1. find 's' in mississippi 2. uncommon words 3. replace none with next element in the list. SQL: 2 questions on case, 1 on order-limit/rank/top(whatever you want to use),percentage/ratio calculations/joins
Glassdoor questions should be enough.
python data structures such as sets, dictionary, list, string, etc
How did you collect requirements for the new application?
Viewing 531 - 540 interview questions