Principle responsibilities & duties:
- Design, implement, and maintain Kubernetes clusters to orchestrate containerized applications efficiently.
- Develop and maintain CI/CD pipelines using tools like GitLab CI/CD, Jenkins, or similar, to automate build, test, and deployment processes.
- Create and manage infrastructure as code (IaC) using OpenTofu/Terraform to provision and manage cloud resources on AWS.
- Automate repetitive tasks and workflows using scripting languages like Bash or Python.
- Collaborate with development teams to optimize application performance, reliability, and scalability.
- Monitor and troubleshoot production systems, ensuring high availability, performance, and security.
- Implement and maintain logging, monitoring, and alerting systems to ensure timely detection and resolution of issues.
- Stay updated with industry best practices, emerging technologies, and trends in DevOps and cloud computing.
Minimum job requirement (Education & Experience):
- Bachelor's Degree in Computer Science, Engineering, or related field.
- Min 3 years of experience working in a DevOps or Site Reliability Engineering role.
- Strong knowledge and hands-on experience with Kubernetes for container orchestration.
- Proficiency in building and maintaining CI/CD pipelines using tools like GitLab CI/CD, Jenkins, or similar.
- Experience with infrastructure as code (IaC) using Terraform or similar tools.