Site Reliability Engineer Interview Questions

2,635 site reliability engineer interview questions shared by candidates

code in the language of your choice: A site has 4,320,000 users. The users are sequentially numbered -- each user has a user_id between 1 and 4320000. You want to slowly roll out a new feature to all users, at a consistent rate of X users per second. The roll-out should start at noon today, beginning with the newest user (id 4320000). The rollout should last two days, so user_id 1 gets the feature around noon two days from now. $Write a function to determine whether the feature is currently enabled for a given user_id: can_use_new_feature(int user_id) -> bool.
avatar

Director of Site Reliability Engineering

Interviewed at Tumblr

3.9
Apr 2, 2014

code in the language of your choice: A site has 4,320,000 users. The users are sequentially numbered -- each user has a user_id between 1 and 4320000. You want to slowly roll out a new feature to all users, at a consistent rate of X users per second. The roll-out should start at noon today, beginning with the newest user (id 4320000). The rollout should last two days, so user_id 1 gets the feature around noon two days from now. $Write a function to determine whether the feature is currently enabled for a given user_id: can_use_new_feature(int user_id) -> bool.

1. Explain in details what is happening when you type curl https://... 2. Copy a huge set of files over 10000 nodes, keep the files updated, check the consistency 3. How to organize a monitoring in a n-node cluster running a web application
avatar

Site Reliability Engineer

Interviewed at LinkedIn

3.8
Jan 28, 2020

1. Explain in details what is happening when you type curl https://... 2. Copy a huge set of files over 10000 nodes, keep the files updated, check the consistency 3. How to organize a monitoring in a n-node cluster running a web application

Asked me how the kernel new to connect to a remote machine. Wasn't too sure if they were asking the lower level c calls, general OS theory, or just basic files / networking components (routing table). Also asked basic monitoring questions like how to monitor query times (bonked on this, just couldn't think well being nervous)
avatar

Site Reliability Engineer

Interviewed at LinkedIn

3.8
Apr 4, 2013

Asked me how the kernel new to connect to a remote machine. Wasn't too sure if they were asking the lower level c calls, general OS theory, or just basic files / networking components (routing table). Also asked basic monitoring questions like how to monitor query times (bonked on this, just couldn't think well being nervous)

There was a paper presented to you with a number of nagios alerts and you had to rate them in the order you would approach fixing them. For example, one of them was a production host being 100% offline. Another was an environment alert about an entire cab that was overheating. Another was the tablet vip being down, another was a load average for the main website being really high. There were also a number of them that were QPS (queries per sec) related and included several security related alerts like XSS QPS and failed logins QPS
avatar

Site Reliability Engineer

Interviewed at LinkedIn

3.8
Jul 15, 2014

There was a paper presented to you with a number of nagios alerts and you had to rate them in the order you would approach fixing them. For example, one of them was a production host being 100% offline. Another was an environment alert about an entire cab that was overheating. Another was the tablet vip being down, another was a load average for the main website being really high. There were also a number of them that were QPS (queries per sec) related and included several security related alerts like XSS QPS and failed logins QPS

Viewing 41 - 50 interview questions

Glassdoor has 2,635 interview questions and reports from Site reliability engineer interviews. Prepare for your interview. Get hired. Love your job.