We are seeking a highly skilled and customer-focused Site Reliability Engineer (SRE) with deep expertise in VMware technologies and a strong background in system administration and automation. The ideal candidate will play a key role in maintaining and optimizing infrastructure reliability, scalability, and performance, while collaborating with cross-functional teams to deliver seamless support and continuous improvement.
1. Principal Responsibilities
- Administer and support VMware environments, including VCF, VCD, NSX, ESXi, vCenter, vSAN, vRA/vRO, and Tanzu.
- Design, implement, and maintain automation scripts and tools to improve system reliability and operational efficiency.
- Provide expert-level technical support and troubleshooting for infrastructure-related issues.
- Collaborate with development and operations teams to integrate CI/CD pipelines and DevOps practices.
- Ensure system security, compliance, and performance through proactive monitoring and maintenance.
- Document procedures, configurations, and best practices for internal knowledge sharing.
2. Experience
- Proven experience as a Senior Systems Engineer or Cloud Administrator in enterprise environments.
- Strong background in customer-facing support roles with a focus on reliability and service excellence.
- Hands-on experience managing multiple VMware products such as vCenter, vSphere, NSX, and vSAN.
- Solid understanding of Linux system administration and networking fundamentals.
- Familiarity with DevOps methodologies and tools, including CI/CD pipelines and infrastructure as code.
3. Tools & Systems
- VMware Suite: VCF, VCD, NSX, ESXi, vCenter, vSAN, vRA/vRO, Tanzu
- Automation & Configuration Management: Ansible
- Operating Systems: Linux (RHEL, Ubuntu, etc.)
- DevOps Tools: CI/CD platforms (Jenkins, GitLab CI, etc.)
- Networking & Security: Firewalls, VPNs, VLANs, IDS/IPS