
YASH Technologies

AI Training Environment Developer

  • Posted 20 hours ago

Job Description

Role

  • Manage the global AI training environment (servers and data center), which dynamically allocates compute and GPU resources based on model training requirements.
  • Work on the MLOps platform for model tracking, cataloging, and deployment using tools such as MLflow and KServe.
  • Develop dashboards that display the servers and environment.
  • Administer AWS cloud and on-premises systems, including usage tracking and billing through a chargeback model.
  • Provide technical support for servers.

Mandatory skills

  • Unix/Linux and Windows Server administration experience in data center and server environments.
  • Familiarity with containerized environments: Kubernetes, Docker, and Rancher.
  • Experience with virtual machines (VMs), containerized systems, and cloud infrastructure basics (AWS).
  • Scripting experience in a server administration role.

Good to have

  • Python, SQL, NodeJS, and web services design/development.
  • Messaging services such as RabbitMQ and Kafka.

Level of experience required

  • 3 - 7 years

Working hours

  • Normal working hours, but night calls must be attended as the project requires (global project with a US team).

More Info


Job ID: 139502357