Site Reliability Engineer

Container Solutions

Full time

Nov 4

This job is no longer accepting applications.

About the role

We are looking for Site Reliability Engineers based in Europe to join our new Customer Reliability Engineering (CRE) team. As part of the team you will have responsibility for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of our customers’ applications and infrastructure. We are building a remote-first team across multiple time zones with the goal of eventually enabling a follow-the-sun on-call rotation.

Core Responsibilities

➤ Be part of a fully remote team across multiple continents and time zones

➤ Regularly engage with customers to consult and share information

➤ Develop simple, sustainable, and repeatable solutions and processes

➤ Participate in your team’s effort to continuously improve our customers’ production environments

➤ Own your team's tech and tools stack and contribute to the relevant open-source projects

➤ Design, analyse, and troubleshoot large-scale distributed systems

➤ Participate in your team’s on-call rotation

➤ Create and refine documentation and processes

➤ Automate almost all the things

➤ Provide ideas for future roadmap items, based on customer, operational, and/or organisational needs

➤ Learn and share by being part of the Cloud Native community through open-source tooling and processes, writing blog posts, and giving meetup or conference talks

What does it take?

Please make sure you meet these requirements before applying, as we will be checking for all of them in our hiring process. 

Must-have requirements

➤ A strong engineering or operations background, and a commitment to develop continuously in both disciplines

➤ At least 5 years' experience working in a related field

➤ A strong understanding and knowledge of the following:

 — Kubernetes API, core principles and components

 — Linux networking and security related to containers

 — Distributed systems and common distributed system failure modes

➤ Proven production experience with at least one of the following:

 — Common CI/CD systems such as GitHub Actions, Jenkins, GitLab CI, etc.

 — Major cloud service providers such as Amazon Web Services (AWS), Google Cloud Platform (GCP), or Microsoft Azure

 — Modern infrastructure automation or configuration management systems such as Terraform, Puppet, Ansible, SaltStack, or Chef

 — Container platforms such as Kubernetes

 — Programming languages such as Go, Python, Rust, C, or equivalent

➤ Ability to contribute to polyglot code bases

➤ Experience working with distributed architectures, e.g. microservices or service-oriented architectures

➤ Experience operating and maintaining production systems on Linux in a public cloud

➤ Ability to work effectively in a globally distributed team

➤ An urge to collaborate and communicate asynchronously

➤ An analytical mind; debugging and problem-solving skills are paramount

➤ Attention to detail and excellent communication skills, both written and verbal

➤ If you have read this far, please include a cover letter under Additional Information when you apply for the role

➤ Ability to work on your own as well as part of a team

➤ Flexibility to learn from and work with different technical environments and teams

➤ Comfortable with working on-call (as part of a regular 24x7 schedule)

Nice-to-have requirements

➤ Bachelor's degree in computer science, engineering, math, or a relevant field

➤ Experience being part of an on-call schedule

➤ Experience working 100% remotely

➤ Experience implementing monitoring solutions

➤ Operations experience with a production user-facing application

➤ Experience developing a Kubernetes controller, operator, or other platform component

➤ A background in writing reliable software and/or automation tooling  

If you are selected for this role and come to work for us on an employee basis, you can count on the following:

➤ A competitive compensation package 

➤ Possibility of employee ownership and equity-based compensation

➤ 25 days of paid leave annually 

➤ Company-wide mental health days off 

➤ ‘No Meeting’ Wednesdays

➤ Access to leadership development programmes, coaching and mentoring

➤ Access to our in-house psychologists

The selection process

Stage 1:

➤ CV sift based on our core requirements for this role.

➤ Phone call with one of our recruiters to verify your details. 

➤ Skills Assessment: You will be asked to complete online abstract reasoning aptitude tests. We will only take the highest performers to the next stages.

➤ Tech Challenge: You will receive instructions for a technical assessment, which will be evaluated by one of our current CRE team members. 

Stage 2:

➤ Personality Profile Assessment: You will complete an online personality assessment and undertake an interview based on your profile via Google Hangouts.

Stage 3:

➤ Final Behavioural and Situational Interview with two members of our engineering team.

We are closing this advert on 20 November 2020. Applications received after this date will not be considered.

Container Solutions is proud to be committed to a diverse and inclusive workplace. Container Solutions is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or any other legally protected status. For individuals with disabilities who would like to request an accommodation, please let a member of our Talent Team know.


Container Solutions

We bring culture, strategy, and technology together - to make sure your Cloud Native Transformation is done right.