AI/HPC Power Performance Attainment Engineer

Jul 03, 2024
Not specified,
... Not specified
... Intermediate
Full time
... Office work


WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. 

AMD together we advance_




The AMD Data Center Power and Performance Systems Engineering Team is a cutting-edge hardware-based lab team engaged on an array of leading technologies found in today’s leading Data Center Products. 

This team is essential to the success of AMD as a growing company.  It is a very exciting environment with a friendly, collaborative, and highly skilled staff and management that will be there to guide you throughout your stay at AMD. 

 

THE ROLE: 

The successful candidate will assume responsibility for post-silicon activities related to performance characterization and optimization of AMD Datacenter productsThis specific role is targeted to the Power focused domain covering such areas as di/dt, power characterization, power modeling, power performance optimization 

 

THE PERSON: 

Must be a self-starter, effective communicator and able to independently drive tasks to completionThe candidate must be willing to work with the North American Power Attainment team for training and collaboration on key tasks and programs 

 

KEY RESPONSIBILITIES: 

  • Become a key stakeholder in product performance validation process 
  • Develop and execute characterization test plans for Datacenter GPUs 
  • Optimize power and performance features for AI, Machine learning & High performance computing 
  • Analyze and debug interactions between various power management features 
  • Develop and execute performance validation test plans for HPC/ML frameworks 
  • Configure and setup test and customer based ML/AI Datacenter GPU systems for data collection, experiments and post-silicon activities 
  • Support prototyping experiments for new GPU features that impact performance and power 
  • Troubleshoot system-level issues that may occur in test environments and platforms 
  • Proactively driving continuous improvement for post-silicon power and performance activities 
  • Participating in the product definition process 
  • Actively participate in analysis of post silicon performance and power data collected to insure integrity of results and to provide summary and conclusions of results 
  • Learn and Execute Power Attainment test plans in post-silicon time periods in support of Data Center GPU product roadmap 
  • Proactively driving continuous improvement for post-silicon power attainment activities 
  • Participate in development of automation environment in developing scripts automating workloads, enhancing capabilities of execution capabilities in Linus, Python and other support software support tools 
  • Hands-on experience with computers, systems or data center hardware for practical knowledge with hardware applicable to servers, data centers or thermal equipement on occasion 

 

PREFERRED EXPERIENCE: 

  • Excellent grasp of computer organization/architecture
  • Knowledge of NUMA, Cache Coherency, PCIe
  • Extensive experience in platform optimization. Solid knowledge of Computer I/O.
  • Experience with tools for performance analysis
  • Strong programming skills, experience in Python preferred
  • Desirable to be proficient in Linux command line environment and Shell scripting
  • Deep knowledge of power management techniques like deep sleep and clock gating
  • Experience with virtualization(ex. KVM)
  • Experience with container technologies(ex. Docker)
  • Strong analytical and problem-solving skills with a key attention to detail
  • Experience in data analysis, summarization, and presentation 
  • Excellent presentation and communication skills

ACADEMIC CREDENTIALS: 

  • BS degree in Electronics/Computer Engineering or Computer Science with emphasis on computer architecture and workload analysis, MS preferred 
  • 5+ years’ experience preferred.  

LOCATION: 

Penang Malaysia 

 

#LI-SH2

#LI-Hybrid




Benefits offered are described:  AMD benefits at a glance.

 

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.   We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

Benefits offered are described:  AMD benefits at a glance.

 

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.   We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

The AMD Data Center Power and Performance Systems Engineering Team is a cutting-edge hardware-based lab team engaged on an array of leading technologies found in today’s leading Data Center Products. 

This team is essential to the success of AMD as a growing company.  It is a very exciting environment with a friendly, collaborative, and highly skilled staff and management that will be there to guide you throughout your stay at AMD. 

 

THE ROLE: 

The successful candidate will assume responsibility for post-silicon activities related to performance characterization and optimization of AMD Datacenter productsThis specific role is targeted to the Power focused domain covering such areas as di/dt, power characterization, power modeling, power performance optimization 

 

THE PERSON: 

Must be a self-starter, effective communicator and able to independently drive tasks to completionThe candidate must be willing to work with the North American Power Attainment team for training and collaboration on key tasks and programs 

 

KEY RESPONSIBILITIES: 

  • Become a key stakeholder in product performance validation process 
  • Develop and execute characterization test plans for Datacenter GPUs 
  • Optimize power and performance features for AI, Machine learning & High performance computing 
  • Analyze and debug interactions between various power management features 
  • Develop and execute performance validation test plans for HPC/ML frameworks 
  • Configure and setup test and customer based ML/AI Datacenter GPU systems for data collection, experiments and post-silicon activities 
  • Support prototyping experiments for new GPU features that impact performance and power 
  • Troubleshoot system-level issues that may occur in test environments and platforms 
  • Proactively driving continuous improvement for post-silicon power and performance activities 
  • Participating in the product definition process 
  • Actively participate in analysis of post silicon performance and power data collected to insure integrity of results and to provide summary and conclusions of results 
  • Learn and Execute Power Attainment test plans in post-silicon time periods in support of Data Center GPU product roadmap 
  • Proactively driving continuous improvement for post-silicon power attainment activities 
  • Participate in development of automation environment in developing scripts automating workloads, enhancing capabilities of execution capabilities in Linus, Python and other support software support tools 
  • Hands-on experience with computers, systems or data center hardware for practical knowledge with hardware applicable to servers, data centers or thermal equipement on occasion 

 

PREFERRED EXPERIENCE: 

  • Excellent grasp of computer organization/architecture
  • Knowledge of NUMA, Cache Coherency, PCIe
  • Extensive experience in platform optimization. Solid knowledge of Computer I/O.
  • Experience with tools for performance analysis
  • Strong programming skills, experience in Python preferred
  • Desirable to be proficient in Linux command line environment and Shell scripting
  • Deep knowledge of power management techniques like deep sleep and clock gating
  • Experience with virtualization(ex. KVM)
  • Experience with container technologies(ex. Docker)
  • Strong analytical and problem-solving skills with a key attention to detail
  • Experience in data analysis, summarization, and presentation 
  • Excellent presentation and communication skills

ACADEMIC CREDENTIALS: 

  • BS degree in Electronics/Computer Engineering or Computer Science with emphasis on computer architecture and workload analysis, MS preferred 
  • 5+ years’ experience preferred.  

LOCATION: 

Penang Malaysia 

 

#LI-SH2

#LI-Hybrid

COMPANY JOBS
1111 available jobs
WEBSITE