About the role:
Platform Reliability Engineer (PRE) is responsible for engineering, operating, and maintaining GEL's internal container platform and its supporting infrastructure, with a strong focus on reliability, resiliency, and security. As a Senior PRE within GEL's Infrastructure team, you will play a pivotal role in designing, building, and operating distributed container hosting solutions using Broadcom's Tanzu product.
Our Requirements:
- Working experience as a Platform Reliability Engineer or strong working experience as a Site Reliability Engineer in a cloud operating environment. Candidates with excellent DevOps experience will be considered.
- Strong experience in managing Tanzu Application Service and Kubernetes clusters.
- Good working knowledge of DevOps pipeline and automation tools (E.g. Selenium, SOAPUI, Bamboo, Jenkins, Ansible, Maven, Github, Bitbucket, Nexus, Jira, Confluence etc).
- Strong technical and business acumen with the ability to lead a small technical team.
- Experience with infrastructure-as-code, server templating, orchestration, configuration management and provisioning tools is advantageous e.g. Terraform, Chef, Docker, Packer, Kubernetes.
- Must code, debug and optimize code and automate repetitive tasks.
- Systematic problem-solving approach, coupled with effective communication skills and a sense of ownership and drive.
Required Experience:
- Experience: 6+ years of hands-on experience managing Application Service and Kubernetes clusters.
- Bachelor's or master's degree in information technology.
- Experienced in one or more of the following: C, C++, Java, Python, Go, Perl or Ruby.
- Strong experience in a Continuous Integration/Continuous Delivery (CI/CD) environment with strong appreciation of change/version control process and methodologies
- Strong experience in dealing with platform upgrades, patching and buildpack management
- Strong experience in troubleshooting network related issues
- Good working knowledge of NSX-T solution and its integration with various Tanzu suite of products
- Soft Skills: Excellent communication and collaboration skills with a strong analytical and troubleshooting mindset.
- Candidate should be open to take up on call support on rotation basis
- Candidate should be willing to work in shifts