Microsoft

System Level Test Engineer

  • Posted 15 days ago

Job Description

Overview

Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsoft's expanding Cloud Infrastructure and is responsible for powering Microsoft's Intelligent Cloud mission. SCHIE delivers the core infrastructure and foundational technologies for more than 200 Microsoft online businesses globally, including Bing, MSN, Office 365, Xbox Live, Teams, OneDrive, and the Microsoft Azure platform, through our server and data center infrastructure, security and compliance, operations, globalization, and manageability solutions. Our focus is on smart growth, high efficiency, and delivering a trusted experience to customers and partners worldwide, and we are looking for passionate engineers to help achieve that mission.

The Compute Silicon & Manufacturing Engineering (CSME) organization within SCHIE is responsible for the design, development, manufacturing, and packaging of Microsoft's state-of-the-art computer chips, notably the Azure Cobalt. Our solutions provide a sustainable strategic advantage to Microsoft and enable our customers to achieve more.

As Microsoft's cloud business continues to grow, the ability to deploy new offerings and hardware infrastructure on time, in high volume, with high quality, and at the lowest cost is of paramount importance. To achieve this goal, the Silicon, Manufacturing, and Packaging Engineering (SMPE) team is instrumental in defining and delivering operational measures of success for hardware manufacturing and in improving the planning process, quality, delivery, scale, and sustainability of Microsoft cloud hardware. We are looking for seasoned engineers with a passion for customer-focused solutions and the insight and industry knowledge to envision and implement future technical solutions that will manage and optimize the Cloud infrastructure.

We are looking for a System Level Test Engineer with deep technical expertise in HPC/AI systems to join the team and drive system-scale validation, manufacturing readiness, and post-deployment quality for Microsoft's custom silicon platforms.

#SCHIE #CSME

Responsibilities

As a System Level Test Engineer, you will own system-level test strategy, execution, and diagnostics for complex CPU, GPU, and accelerator-based platforms across the high-volume manufacturing and datacenter deployment lifecycle. As a part of this role, you will be expected to support our team by:

  • Developing the system-level test (SLT) platform, strategy, requirements, tools, and methodologies across hardware and software. Programming knowledge (C# or Python) is a must.
  • Collaborating with team members and partner teams to develop and integrate hardware, firmware, content, and automation solutions for the SLT platform.
  • Generating and leveraging correlation, characterization, and performance data to optimize outgoing quality and power/performance.
  • Managing high-volume manufacturing at the OSAT for SLT, and improving the stability, test time, and yield of SLT at the OSAT.
  • Participating in RMA (feedback from system/rack failures) checkout, debug, and dispositioning. This includes system-level failure triage and debug across silicon, hardware, and software, as well as identifying test coverage improvements at the ATE or SLT socket to close the RMA gap.
  • Characterizing and correlating product power/performance from system to ATE.
  • Defining, guiding, and contributing to the development of the software automation infrastructure for SLT and system-to-tester correlation activities.
  • Managing multiple development activities across a variety of product groups.

Qualifications

Position Requirements:

  • Strong technical background in HPC, AI, GPU, and CPU architectures
  • Proven experience in system-level validation and system-scale testing
  • Hands-on experience with high-volume manufacturing, system-level test, and ATE-based functional testing
  • Experience with production RMA execution, failure analysis, and dispositioning
  • Understanding of system bring-up, power, memory, IO, interconnects, and firmware interactions
  • Demonstrated ability to perform cross-domain root cause analysis and drive issues to closure
  • Experience supporting datacenter-class systems or large-scale deployments
  • Familiarity with advanced packaging, chiplet-based designs, and heterogeneous integration
  • Experience correlating lab, manufacturing, and in-field failures, with a strong data analysis background
  • Strong scripting or automation background (Python, C/C++, or equivalent)

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Job ID: 143324785