AMD Instinct Power Management Validation Staff Engineer

Jun 28, 2024
Not specified,
... Not specified
... Intermediate
Full time
... Office work


WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. 

AMD together we advance_




THE ROLE:

AMD is looking for a highly talented engineer to join our AMD Instinct Engineering team. Technical lead responsible for AMD Instinct Power Management system debug/validation. This individual will be primarily responsible for interfacing with silicon, firmware, software, and platform design groups for PM validation and debug/verify issues in time to meet product development or sustaining milestones.

 

KEY RESPONSIBILITIES:

  • Debug AI and HPC systems with an emphasis on platform-level debug.
  • Knowledge and utilization of standard tools, debuggers, programmers, and emulators.
  • Familiarity with firmware development and firmware debugging.
  • Possess a strong understanding of BIOS, OS, and driver interactions at the system level, as well as a proficient understanding of x86 CPU architecture, GPU architecture and functionality.
  • Understand AI/HPC industry-standard buses, such as HBM, PCIe, and other high-speed IO protocols, in addition to detailed knowledge of high-speed digital design and signal integrity.
  • Provide root cause analysis and guidance to internal and customer design teams to help resolve issues.
  • Support projects independently and handle all issues, collaborating effectively with internal and external teams.
  • A self-starter capable of dealing with a high level of ambiguity.
  • Work with cross-functional teams to improve post-silicon validation test strategy, methodology, and process.
  • Conduct thorough validation of power management features, including but not limited to power states, power management algorithms, thermal management, and power delivery.
  • Analyze firmware code, power consumption data, and performance metrics to identify inefficiencies and areas for improvement.
  • Develop and implement automated scripts to streamline validation, data collection, and ensure consistent and repeatable validation processes.

PREFERRED EXPERIENCE:

  • Experience with SoC architecture and microprocessor cores.
  • Understanding of modern x86 microprocessor architecture and AI/HPC platform architecture is highly desired.
  • Experience developing validation methodologies and infrastructure.
  • Experience in developing and executing validation plans.
  • Participation in silicon bring up and debug, supporting internal engineering teams.
  • Able to execute and drive the success of programs with multiple projects simultaneously.
  • Debugging skills at both SoC and system levels.
  • Familiarity with programming (C/C++) / scripting languages (Python, Perl).
  • Working knowledge of Server OSes (Linux, Windows), including CPUs, memory, storage and peripheral devices.
  • Self-starting team player with excellent communication skills who can work with minimal guidance.
  • Forward thinker who drives improvement in the development process, code architecture and fosters a spirit of innovation and continuous improvement.
  • Strong verbal and written English communication skills for documenting test results, writing reports, and communicating effectively with cross-functional teams.
  • Working knowledge of lab equipment such as oscilloscopes, logic analyzers and protocol analyzers.
  • AI/HPC software stacks, such as ROCm, TensorFlow or Pytorch is a plus with regards to AI application. 
  • Soft skills:

    • Attention to detail, methodical approach to testing and debugging, along with strong analytical and problem-solving skills.
    • Strong desire to learn and grow with a continuous improvement mindset.
    • Self-organizing with the ability to multitask based on priorities.
    • Strong interpersonal skills to work effectively with various teams and stakeholders.

ACADEMIC CREDENTIALS:

  • Bachelor’s or Master’s degree in Electrical Engineering, Computer Engineering, Computer Science, Mechatronics Engineering or a related field.

LOCATION:

Penang, Malaysia

 

#LI-SONG

#LI-Hybrid

 




Benefits offered are described:  AMD benefits at a glance.

 

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.   We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

Benefits offered are described:  AMD benefits at a glance.

 

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.   We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

THE ROLE:

AMD is looking for a highly talented engineer to join our AMD Instinct Engineering team. Technical lead responsible for AMD Instinct Power Management system debug/validation. This individual will be primarily responsible for interfacing with silicon, firmware, software, and platform design groups for PM validation and debug/verify issues in time to meet product development or sustaining milestones.

 

KEY RESPONSIBILITIES:

  • Debug AI and HPC systems with an emphasis on platform-level debug.
  • Knowledge and utilization of standard tools, debuggers, programmers, and emulators.
  • Familiarity with firmware development and firmware debugging.
  • Possess a strong understanding of BIOS, OS, and driver interactions at the system level, as well as a proficient understanding of x86 CPU architecture, GPU architecture and functionality.
  • Understand AI/HPC industry-standard buses, such as HBM, PCIe, and other high-speed IO protocols, in addition to detailed knowledge of high-speed digital design and signal integrity.
  • Provide root cause analysis and guidance to internal and customer design teams to help resolve issues.
  • Support projects independently and handle all issues, collaborating effectively with internal and external teams.
  • A self-starter capable of dealing with a high level of ambiguity.
  • Work with cross-functional teams to improve post-silicon validation test strategy, methodology, and process.
  • Conduct thorough validation of power management features, including but not limited to power states, power management algorithms, thermal management, and power delivery.
  • Analyze firmware code, power consumption data, and performance metrics to identify inefficiencies and areas for improvement.
  • Develop and implement automated scripts to streamline validation, data collection, and ensure consistent and repeatable validation processes.

PREFERRED EXPERIENCE:

  • Experience with SoC architecture and microprocessor cores.
  • Understanding of modern x86 microprocessor architecture and AI/HPC platform architecture is highly desired.
  • Experience developing validation methodologies and infrastructure.
  • Experience in developing and executing validation plans.
  • Participation in silicon bring up and debug, supporting internal engineering teams.
  • Able to execute and drive the success of programs with multiple projects simultaneously.
  • Debugging skills at both SoC and system levels.
  • Familiarity with programming (C/C++) / scripting languages (Python, Perl).
  • Working knowledge of Server OSes (Linux, Windows), including CPUs, memory, storage and peripheral devices.
  • Self-starting team player with excellent communication skills who can work with minimal guidance.
  • Forward thinker who drives improvement in the development process, code architecture and fosters a spirit of innovation and continuous improvement.
  • Strong verbal and written English communication skills for documenting test results, writing reports, and communicating effectively with cross-functional teams.
  • Working knowledge of lab equipment such as oscilloscopes, logic analyzers and protocol analyzers.
  • AI/HPC software stacks, such as ROCm, TensorFlow or Pytorch is a plus with regards to AI application. 
  • Soft skills:

    • Attention to detail, methodical approach to testing and debugging, along with strong analytical and problem-solving skills.
    • Strong desire to learn and grow with a continuous improvement mindset.
    • Self-organizing with the ability to multitask based on priorities.
    • Strong interpersonal skills to work effectively with various teams and stakeholders.

ACADEMIC CREDENTIALS:

  • Bachelor’s or Master’s degree in Electrical Engineering, Computer Engineering, Computer Science, Mechatronics Engineering or a related field.

LOCATION:

Penang, Malaysia

 

#LI-SONG

#LI-Hybrid

 

COMPANY JOBS
1155 available jobs
WEBSITE