Job Description:
- Be responsible for the reliability and uptime of Cloud solutions and services, at a level appropriate to users' needs.
- Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
- Maintain and improve services once they are live by measuring and monitoring availability, latency and overall system health.
- Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
- Gauge the effectiveness and efficiency of existing systems and infrastructure; implement strategies for improving or further leveraging these systems within a geoscience workflow.
- Collaborate with network and security staff to ensure smooth, secure and reliable operation of application software and systems.
- Develop, implement and document best-practice policies and procedures for new projects or initiatives.
- Use the service management systems, ensuring that best practices and lessons learned are made available to the wider technical community.
- Engage in incident response and blameless postmortems.
Minimum Education & Experience Requirements:
- Bachelor's degree or above, majoring in Computer Science, Information Technology or an equivalent field.
- Basic understanding of virtualization, cloud computing, containerization, and orchestration technologies (e.g., VMware, Azure, Kubernetes).
- Fast learner and good team player.
- Proactive and self-driven.
- Language Skills (Written/Oral): English