
Search by job, company or skills

About PayNet: At PayNet, your work doesn't just move money; it moves a nation.
We make every payment count toward Malaysians shared prosperity by powering the platforms millions use every day, from DuitNow and FPX to MyDebit and JomPAY. Our systems keep Malaysia's digital economy running securely, seamlessly, and inclusively, whether you're tapping, transferring, paying bills, or expanding a business.
If you're excited about creating impact at a national scale and shaping how Malaysia pays, connects, and progresses, you'll fit right in.
About the Technology Division: Building the backbone of Malaysia's payment infrastructure. Led by Chief Technology Officer, Technology designs, evolves, and secures the always-on architecture that powers trusted, scalable digital payment services for the nation.
Summary of the role:
As a Data Resiliency Engineer, you will play a critical role in ensuring the stability, reliability, and continuous improvement of our data platform. You will be responsible for analyzing incidents, identifying root causes, and implementing effective solutions to deliver a seamless user experience. Leveraging your expertise in data ecosystem monitoring, incident management, and data quality optimization, you will help uphold high standards of operational excellence. You will utilize advanced monitoring and diagnostic tools (such as Datadog and other relevant platforms) to proactively detect, investigate, and resolve issues. In addition, you will serve as a subject matter expert for data-related inquiries and actively drive data quality initiatives across the organization. You will collaborate closely with development teams to ensure that payment reporting systems—both for external participants and internal stakeholders—are accurate, reliable, and aligned with evolving business requirements, supporting continuous improvement in data-driven operations. Beyond production support, you will also contribute to pre-production readiness processes and participate in major maintenance activities, including product deployments, onboarding preparations, and disaster recovery simulation exercises
Key Responsibilities:
What will make you successful
You will be successful by combining strong expertise in data platform operations, monitoring, and incident management with a disciplined approach to root cause analysis and continuous improvement. The ability to proactively detect, diagnose, and resolve data issues using advanced observability tools, while consistently upholding high standards of data quality and reliability, is critical. Success in this role also depends on close collaboration with development and business teams to ensure payment reporting systems are accurate, resilient, and aligned with evolving requirements. A mindset focused on operational excellence, production readiness, and resilience, supported by active participation in deployments, maintenance activities, and disaster recovery exercises, will enable you to deliver dependable, high‑quality data services at scale.
Must-have:
Data Operations & Production Application Maintenance Experience
Familiarity with Modern Data Lake Data Pipeline Architecture
Advantage to have:
Demonstrates strong knowledge in data engineering concepts and best practices.
Experienced in operating and managing data tools on Kubernetes platforms, including Amazon EKS.
Hands-on experience in data pipeline operations, as well as software incident and problem management.
Strong experience with AWS data services, including Amazon S3, AWS Glue, Amazon Athena, Amazon QuickSight, and AWS Lambda.
Job ID: 146121241