Job Description
Title: GPU / Data Center and Hardware/Infrastructure Monitoring & Support Specialist
Function: IT Operations / Infrastructure Support
Level: Entry to Mid-Level
Salary Range: RM3,000-RM7,500 (multiple roles available, covering monitoring, on-site support, and L1 to L2 support)
Work Location: On-site
Role Overview
The Data Center & Infrastructure Support Specialist is responsible for maintaining the reliability, availability, and day-to-day operations of enterprise IT infrastructure within data center and customer environments. This role involves hands-on technical support, infrastructure monitoring, incident management, and coordination with vendors to ensure stable and efficient system operations.
Key Responsibilities
- Monitor and assess infrastructure performance, identifying anomalies and escalating issues when required
- Provide first- and second-level technical support for server, network, and connectivity issues across CPU and GPU platforms
- Perform physical infrastructure tasks including rack installation, hardware replacement, system decommissioning, and structured cabling
- Coordinate with hardware vendors and third-party service providers for on-site maintenance and repair activities
- Conduct routine site inspections, equipment verification, and preventive maintenance checks
- Log, prioritize, and manage incidents and service requests through the service management system
- Track incidents through to resolution, ensuring proper documentation, root-cause analysis, and closure compliance
- Prepare operational reports and contribute to technical documentation and internal knowledge bases
- Support continuous improvement initiatives, including process refinement and standard operating procedure updates
- Assist with departmental tasks and perform other operational duties as assigned
- Carry out hands-on data center activities, including handling and installation of physical equipment weighing up to approximately 25 kg when required
- In-country travel (0-25%) may be required
Candidate Requirements
- Diploma or Bachelor's degree in Information Technology, Engineering, Computer Science, or a related field
- 1-2 years of experience in infrastructure support, IT operations, monitoring, or technical support roles
- Strong interest in enterprise infrastructure, data center operations, and large-scale IT environments
- Willingness to work on-site at customer locations or data centers, including shift or rotational schedules
- Self-driven team player with a practical, hands-on approach to problem solving
- Ability to troubleshoot issues methodically and communicate clearly with technical and non-technical stakeholders
- Comfortable working independently while adhering to defined operational procedures
- Fresh graduates with relevant internships, lab exposure, or project experience are encouraged to apply
- Prior exposure to monitoring tools or IT service management platforms is an advantage
- Familiarity with tools such as Nagios, Zabbix, Grafana, Prometheus, SolarWinds, or PRTG is a plus
- Mandarin language proficiency is an added advantage but not mandatory