Skip to main content
Gespeichert

Site Reliability Engineer - Retail & Banking Technology



Ich möchte mich bewerben

Discover ING Hubs Romania

ING Hubs Romania offers 130 services in software development, data management, non-financial risk & compliance, audit, and retail operations to 24 ING units worldwide, with the help of over 1700 high-performing engineers, risk, and operations professionals.

We started out in 2015 as ING’s software development hub – a distinct entity from ING Bank Romania – then steadily expanded our range to include more services and competencies.

Now we provide borderless services with bank-wide capabilities and operate from two locations: Bucharest and Cluj-Napoca.

Our tech capabilities remain the core of our business, with more than 1500 colleagues active in Data Management, Touchpoint Channels & Integration, Core Banking, and Global Products.

We enjoy a flexible way of working and a highly collaborative environment, where fair and constructive feedback is encouraged.

For us, impact isn't a perk. It's the driver of our work.

We are guided and rewarded by a shared desire to make the world a better place, one innovative solution at a time. Our colleagues make it their job to do impactful things and they love doing it in good company. Do you?

The Mission:

The R&BT Site Reliability Engineering (SRE) team is a multidisciplinary team of senior engineers with proven track records in development and operations across applications and infrastructure. The primary goal is to continuously and structurally improve the reliability and maintainability of the IT environments involved with the R&BT Platforms, delivered and managed from different (international) ING domains.

  • Objective: Site Reliability Engineering (SRE) enhances the reliability and scalability of BTP platform services through collaborative efforts, prioritizing availability, performance, efficiency, and observability.
  • Measurement: SRE targets increased MTBF, decreased MTTR, and minimized operational toil.
  • Approach: This is facilitated by automation, standardized procedures, and the adoption of SRE best practices.
  • Cultivate a Reliability Mindset: The aim is to foster a culture of reliability throughout the BTP organization, encouraging proactive behaviors and attitudes.

Your day to day

  • Ensure Service Level Objective (SLO) levels are set and met
  • Optimize our Observability tooling like Grafana dashboards
  • Report on GSRE targets and KPIs
  • Do yearly Well Architected Reviews and observability Assessments for all critical components
  • Drive Always Available mindset and behavior within the R&BT organization. Be able to recognize shortcomings in knowledge and expertise, and deliver the necessary resources, skills, guidance and training to DevOps teams where needed.
  • Define and enhance standards for logging monitoring and alerting, and actively monitor end to end platform performance through white and black box monitoring tools.
  • Improve incident response practices and be actively engaged in incident response of escalated and critical incidents. On call duty is currently not part of the job, but should not be an objection if and when required.
  • Participate in Root Cause Analysis. Prioritize and implement the RCA recommendations through improvement plans with the responsible Squads / DevOps teams
  • Track and trace actions out of post mortems and Emirs
  • Drive Continuous improvement on all services in the R&BT Platforms through analysis of the current level of service, functional and technical setup, code, dev/ops practices and the underlying causes of incidents, underperformance, etc.
  • Roll out new resilience features trough the organization
  • Setting up and maintaining automatic reporting and feedback loops
  • Contribute to automating Build, Test and Deployment practices through the CI/CD pipeline
  • Contribute to tuning application resources and updating high available deployment patterns of (mostly) container and VM based environments.
  • Initiate and contribute to new SRE initiatives like AI Ops, Chaos Engineering, migrations to Public Cloud, and Error Budgeting
  • Participate and initiate experiments with new tools and concepts, and evaluate its value against set goals

What you’ll bring to the team:

You are an enthusiastic Software and/or Reliability Engineer with a focus on creating amazing solutions and frameworks. You have solid technical knowledge, and use that to formulate solutions, support and coach other engineers. You have a passion for highly resilient and reliable software and really hate repetitive manual tasks preventing you to do really cool stuff! You are able to inspire squads to spread the SRE mind-set. You are enthusiastic about transferring your knowledge to others within your team, but also with all DevOps teams in the R&BT Organization and the rest of ING. 

Background:

Operations expert: 5+ years of experience working using Agile DevOps principles

Solid understanding how technology setup and ITSM processes relate to service level objectives like Availability (time based, successful call rate, response times), MTTR, and MTBF.

Good understanding of microservices architecture and related high availability / resilience patterns and experience building systems with multiple layers of redundancy to withstand failures in software, hardware, network infrastructure.

Proven experience:

  • working as a Site Reliability Engineer or DevOps engineer
  • scripting in at least one of the following: Ruby, Python, Bash, PowerShell,
  • set up Build and Deployment pipelines in Azure DevOps (ADO)
  • set up white-box monitoring and able to formulate meaningful metrics for monitoring and reporting: Grafana, TraceING.
  • eliminate toil through automation and process optimization
  • Able to coordinate/lead incident response and Post mortem / root cause analysis activities
  • Understanding of IT Service Management processes (ING Global Way of Working) and the way the relate to SRE objectives
  • God understanding of Public Cloud concepts

Prior work experience with tools:

  • CI/CD Pipeline: OnePipeline / Azure Devops / Kingsroad
  • Cloud computing and container orchestration: Linux VM’s and Kubernetes container platforms. Knowledge of OpenShift + AKS and related certifications are a pre.
  • Touchpoint service mesh and SDK/Merak
  • logging/monitoring/alerting: Kafka, ELK, Prometheus, and IAT. Experience with black box monitoring tools like Rigor/Splunk and AI Ops tools like Loom is a pre.
  • Backlog management: Azure Boards
  • ITSM: SNOW

The ideal candidate has:

  • A Bachelor or Master’s degree in computer science or related field
  • Experience coaching and training DevOps engineers on technical subjects
  • Previous experience as a consumer of R&BT Platforms, preferably Touchpoint Platform
  • Understanding of the ING application risk journey

If you want to deep dive into the processing of personal data conducted by ING Hubs Romania during the recruitment process and your rights related to it, read the privacy notices on our website (make sure to scroll until you reach the Data Protection section/ Candidates tab). 

Ich möchte mich bewerben
Your place of work Explore the area

Questions? Just ask
ING Recruitment team

Ich möchte mich bewerben

Bei der ING möchten wir, dass jeder sein volles Potenzial entfalten kann. Wir schaffen eine Kultur, in der jeder wachsen und einen Unterschied machen kann – für unsere Kundinnen und Kunden und die Gesellschaft. Diversität, Gleichberechtigung und Inklusion sind für uns selbstverständlich. Wir tolerieren keine Form der Diskriminierung, sei es aufgrund von Alter, Geschlecht, Geschlechtsidentität, kulturellem Hintergrund, Erfahrung, Religion, Herkunft, Behinderung, familiären Verpflichtungen, sexueller Orientierung oder anderen Gründen. Wenn Ihr während des Bewerbungsprozesses oder der Vorstellungsgespräche Unterstützung benötigt, kontaktiert uns bitte über die E-Mail-Adresse in der Stellenanzeige. Wir helfen gerne, um einen fairen und einfachen Ablauf sicherzustellen. Hier erfahrt Ihr mehr über unser Engagement für Diversity, Inclusion & Belonging.

Mehr erfahren

The latest jobs straight to your inbox

Interested In

By submitting your information, you acknowledge that you have read our privacy policy and consent to receive email communication from ING.