Skip to main content
Saved

Site Reliability Engineer



Job Details

We are looking for you, if you:

  • have experience in operating system administration (Windows),
  • know key cloud proconcepts you can describe cloud-native
  • understand and have knowledge about other stack layers – Network, Virtualization, Middleware, Databases (MS SQL),
  • have good understanding of programming (preferred languages: Python, PoweShell, Golang),
  • know how to use IaC/orchestration/automation tooling like Azure Pipelines, Ansible, Terraform,
  • can identify and automate infrastructural management tasks using best infra-as-code practice,
  • know key reliability engineering framework practices, consumer engineering idea and acronyms like SLI, MTTR and BCM are not just a couple random letters glued together.

You'll get extra points:

  • you value your time and don’t log in to host to run commands – Infra as a Code is your creed.
  • you do not like solving Incidents you prevent them from happening.
  • you like to always be step ahead and use new technologies.

English level: B2

    As the Site Reliability Engineering Department, we focus on four key topics:

    • Run & Change
    • Enablement
    • Rapid Response
    • Education

    Your responsibilities:

    • Implementation of reliability across global platforms & services, global supporting tooling and entities:
      • Operating in strong cooperation with involved Enterprise Architects, other SREs & DevOps engineers,
      • Implementing observability measures via respective tooling of our critical business services,
      • Identifying service level objectives with associated indicators
      • Look for and elimination of manual and repetitive task (commonly known as toil
      • Planning and evaluating new releases of features within infrastructure environment (release trains)
    • Later on, focus will also be on other practices e.g.
      • Mature major incident management process (major incident mgt, problem mgt, post-mortem & root-cause analysis)
      • Mature capacity planning & forecasting practice
      • Mature reliability reporting
      • Introduction of Error budgeting
      • Knowledge management about spreading “reliability by design” concept and execution of all required reliability practices

    Information about the squad:

    We are a Team of Infra admins who got tired of manual work and decided to move to Infra as a Code approach. We want to prevent, not repair and make our system Reliable. Taking best approach from Google and Microsoft we want to create Culture of SRE Engineering with focus on Design, Run Enable, Rapid Response, Educate and Review. Are you up for the challenge?

    Your place of work Explore the area

    Questions? Just ask
    ING Recruitment team

    Apply now

    ING’s vision is to unlock our people’s full potential through our inclusive culture where everyone has the opportunity to develop and have impact for our customers and society. To achieve this vision, our policies support diversity, equity, and inclusion. As an equal opportunity employer, we do not tolerate discrimination of any kind with regard to age, gender, gender identity, cultural background, experience, religion, race, ethnicity, disability, family responsibilities, sexual orientation, social origin, or any other status protected by applicable law. If you require any assistance or if we can accommodate you in any way when participating in our application and/or interview process, please email the recruiting contact listed for the relevant position. We will be happy to work with you to ensure a fair and accessible process. Read more about our commitment to diversity, inclusion and belonging here.

    More for you

    No jobs viewed

    No jobs saved

    The latest jobs straight to your inbox

    Interested In

    By submitting your information, you acknowledge that you have read our privacy policy and consent to receive email communication from ING.