About The Role
We are looking for an experienced and passionate
Senior Site Reliability Engineer (SRE) to join our Group Technology team at RHB Banking Group. In this role, you will drive the SRE practice and deliver a high level of system and infrastructure resiliency that meets business and regulatory requirements. This position also requires strong software engineering skills to automate manual processes and identify potential issues with applications before they impact operations.
What You Will Be Doing
- Drive consistent SRE practices across application, infrastructure, and IT security teams.
- Set up and operationalize SRE teams for specific areas within Group Technology.
- Provide coaching and guidance to SRE engineers and support teams to ensure consistent execution of SRE principles.
- Contribute to the development and documentation of SRE best practices and procedures.
- Take ownership of application monitoring tools such as Dynatrace, and collaborate with vendors to drive consistent adoption across teams.
- Design, develop, and deploy automation scripts and tools to monitor, manage, and optimize systems.
- Analyze system metrics and logs to identify potential issues and improvement areas.
- Build internal expertise in observability and train teams on the use of monitoring tools.
- Support deep analysis and troubleshooting of technical issues in critical and high-availability systems.
- Advocate a culture of system resiliency and ensure non-functional requirements are met throughout project delivery.
- Continuously enhance the SRE framework in line with evolving business and technology needs.
- Collaborate closely with application, infrastructure, and IT security teams to build strong reliability practices and partnerships.
What We're Looking For
- Master's or Degree in Computer Science, IT, or related discipline.
- 8–10
- years of experience in IT system development and implementation within the Financial Services Industry.
- 3–5 years of experience in system architecture and design.
- Proficiency in programming languages such as Java, .NET C#, Python, Bash, or PowerShell.
- Strong understanding of databases (MSSQL, Oracle, NoSQL) and mainframe technologies (z/OS, CICS).
- Experience designing and delivering non-functional requirements (high availability, disaster recovery, backup, etc.).
- Solid grasp of SRE principles, including SLOs, SLIs, automation, and system observability.
- Strong analytical, problem-solving, and communication skills.
- A collaborative mindset with the ability to influence and drive cultural change across teams.
What We Offer
At RHB Banking Group, we foster a collaborative and forward-thinking work environment that values innovation, continuous learning, and teamwork. You'll have the opportunity to lead reliability initiatives that shape the future of our technology infrastructure, backed by competitive remuneration and ongoing professional development.