Get To Know Our Company
GX Bank Berhad - the Grab-led Digital Bank - is the FIRST digital bank in Malaysia, approved by BNM to commence operations. We aim to leverage technology and innovation to serve the financial needs of the unserved and underserved individuals, and micro and small medium enterprises.
We are driven by our shared purpose and passion to bring positive transformation to the banking industry, starting with solutions that address the financial struggles of Malaysians and businesses.
We are seeking a driven and motivated individual to join our Engineering team for our new Digibank initiative. This role will be based in Malaysia.
The Day-to-Day Activities
- Lead and contribute to projects across teams, taking ownership from design through to implementation and rollout, with minimal guidance.
- Proactively identify and troubleshoot issues across the infrastructure stack and application codebase to ensure system reliability and performance.
- Contribute to the design and improvement of automated infrastructure, aligned with Infrastructure-as-Code (IaC) principles.
- Drive operational excellence by identifying recurring issues and implementing automation to eliminate them.
- Collaborate with engineering teams to enhance system reliability, scalability, and performance.
- Mentor junior and mid-level engineers, foster a culture of quality and accountability, and support the growth of the team through knowledge sharing and best practices.
- Define, implement, and optimise SRE best practices, policies, and procedures to ensure high availability, scalability, and performance of critical systems.
- Act as a technical subject matter expert, providing guidance and expertise to the team and across departments on complex SRE challenges.
- Drive incident management and post-mortem processes, ensuring root cause analysis and proactive measures to prevent future occurrences.
- Participate in on-call rotation to ensure maximum service availability.
The Must Haves
- Strong knowledge of cloud infrastructure across AWS, GCP, and Azure, along with container orchestration technologies such as Kubernetes and Docker. Any relevant AWS certifications will be a plus.
- Hands-on experience with Infrastructure as Code (IaC) tools including Terraform, CloudFormation, and Ansible.
- Familiar with observability tools such as Datadog, CloudWatch, Prometheus or ELK stack for effective monitoring and logging.
- Solid understanding of networking fundamentals and internet protocols (TCP/IP, HTTP/S, DNS); experience with service mesh technologies (e.g., Istio, Consul, Linkerd) is a plus.
- Consistently applies a strong security-first mindset across all tasks and responsibilities
- Takes initiative, demonstrates a strong sense of responsibility for system reliability, and sees issues through to resolution.
- Clear, concise, and effective communication skills to explain complex technical issues to diverse audiences (technical and non-technical).
- Excellent diagnostic skills to identify, analyse, and resolve complex technical issues under pressure.
- Eagerness to learn new technologies, tools, and processes, and adapt to evolving technical landscapes.
- Ability to anticipate potential issues, identify areas for improvement, and implement preventative measures rather than just reacting.