Search by job, company or skills

Virtual Calibre MSC Sdn Bhd

Site Reliability Engineer

4-10 Years
Save
new job description bg glownew job description bg glownew job description bg svg
  • Posted 10 hours ago
  • Be among the first 10 applicants
Early Applicant
Quick Apply

Job Description

Job Summary

We are looking for an experienced and highly motivated Site Reliability Engineer (SRE) to support and enhance the reliability, scalability, and operational efficiency of enterprise digital platforms and infrastructure environments.

The ideal candidate will possess strong hands-on experience in CI/CD, containerised microservices environments, infrastructure monitoring, automation, and production support. This role requires a proactive individual who can work closely with development, infrastructure, and vendor teams to ensure high system availability and operational excellence.

Key Responsibilities

  • Maintain, monitor, and improve overall system reliability, availability, and performance.
  • Support digital service infrastructure reliability and operational stability.
  • Perform proactive monitoring, troubleshooting, and issue resolution for production environments.
  • Participate in on-call support rotation for SRE-related incidents and critical production issues.
  • Manage and support CI/CD pipelines and deployment processes using GitLab and equivalent deployment orchestration tools.
  • Automate deployment and operational workflows to improve efficiency and reduce manual intervention.
  • Manage containerised microservices environments using Docker and Podman.
  • Perform microservices deployment, scaling, startup, shutdown, and maintenance activities.
  • Support container orchestration and service reliability initiatives.
  • Implement and manage monitoring and observability tools such as Prometheus and Grafana.
  • Monitor APIs, application health, system metrics, and infrastructure performance.
  • Refresh, restart, or optimise APIs and services when required.
  • Perform disk usage monitoring, cleanup, and housekeeping activities.
  • Ensure infrastructure environments remain optimised, secure, and stable.
  • Support patching, maintenance, and operational improvements.
  • Participate in Business Continuity and Disaster Recovery (BCDR) planning and execution activities.
  • Support incident response, root cause analysis (RCA), and post-incident reporting.
  • Ensure operational readiness and recovery processes are maintained.
  • Coordinate and work closely with external vendors for infrastructure-related tasks and support activities.
  • Collaborate with development, infrastructure, DevOps, and business teams to resolve operational challenges.

Job Requirements

  • Minimum 4–8 years of experience in Site Reliability Engineering, DevOps, Infrastructure Operations, or related roles.
  • Proven experience supporting enterprise production environments and mission-critical applications.
  • Strong understanding of CI/CD concepts and automation, infrastructure reliability and monitoring, and Linux/Unix environments.
  • Hands-on experience with GitLab, Docker, Podman, Prometheus, and Grafana.
  • Experience managing microservices environments and API monitoring.
  • Experience in production support, incident management, and on-call support environments.
  • Proven ability in system troubleshooting and root cause analysis.
  • Familiarity with cloud platforms (AWS / Azure / GCP).
  • Exposure to Kubernetes / OpenShift container orchestration platforms.
  • Experience with scripting or automation tools and Infrastructure as Code (IaC).
  • Good communication and stakeholder management skills.
  • Ability to work effectively under pressure in production environments.
  • Strong ownership mindset, operational discipline, and a proactive approach to reliability.
  • Team player with the ability to collaborate across technical teams.

Nice to Have

  • Bachelor's Degree in Computer Science, Information Technology, Engineering, or a related field.
  • Relevant certifications in Kubernetes, AWS / Azure, DevOps / SRE, or Linux Administration.
  • Exposure to Infrastructure as Code tools such as Terraform or Ansible.

More Info

Job Type:
Function:

About Company

Virtual Calibre Group is a JAST Company (Japan System Techniques Ltd) which is listed in Tokyo Stock Exchange Board 1 with its operation’s spreading across Japan, China, Thailand and Singapore with revenue exceeding RM 850 Million in FY 2018.

With the acquisition, Virtual Calibre will be expanding its SAP Consulting Services in the ASEAN region as one of global delivery centers, supporting Malaysian and other ASEAN Clients. We provide services across various industry sectors. Our offshore delivery approach is flexible and is customized to the specific needs
of our clients. We are one of the fastest growing SAP provider in the region.

These key competitive advantages are the core foundation on which our SAP Consulting practice is built upon

OUTREACH: We have delivered SAP projects both locally and in the ASEAN regions as well as the Middle East.
FOCUS: We are specialists in key SAP Solution such as SAP S/4 Hana and SAP ECC 6.0
VALUE: We place high priority on customer relationships, trust and integrity
EXPOSURE: We have over 19 years of SAP industry experience
COST: We are confident that we offer among the most competitive rates in the market
TALENT: We have an excellent lineup of SAP Consultants locally as well as access to an extensive pool of SAP resources all over the world.

Our Services in SAP S/4 Hana and SAP ECC 6.0 are:

- Implementation
- Production Support
- Remote Consulting
- Talents

Please visit our website for more information: www.virtualcalibre.com

Job ID: 147222459

User Avatar
0 Active Jobs

Similar Jobs

Malaysia, Kuala Lumpur

Skills:

Aws LambdaCloudformationGpoPowerShellRoutingPrometheusDnsAWS CloudWatchGrafanaDatadogJenkinsTerraformAnsibleNetworking ConceptsPythonSysinternalsWindows infrastructure applicationsRegistry KeyWindows AD PoliciesWindows Server core services

Malaysia, Kuala Lumpur

Skills:

AWSPrometheusRedisMySQLZabbixPythonAzureGrafanaShellNginxLinux Operating SystemGo