AI/HPC Performance and Characterization Sr.Engineer

Jul 03, 2024
Not specified,
... Not specified
... Senior
Full time
... Office work


WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. 

AMD together we advance_




The AMD Data Center Power and Performance Systems Engineering Team is a cutting-edge hardware-based lab team engaged on an array of leading technologies found in today’s leading Data Center Products.  


This team is essential to the success of AMD as a growing company.  It is a very exciting environment with a friendly, collaborative, and highly skilled staff and management that will be there to guide you throughout your stay at AMD.

 

THE ROLE: 

The successful candidate will assume responsibility for post-silicon activities related to performance characterization and optimization of AMD Datacenter products 

  

THE PERSON: 

Must be a self-starter, effective communicator and able to independently drive tasks to completion 

  

KEY RESPONSIBILITIES: 

  • Become a key stakeholder in product performance validation process 
  • Develop and execute performance validation test plans for HPC/ML frameworks 
  • Evaluate performance bottlenecks and come up with ways to improve product performance in various product sub-domains Power management, SW, Compute, Memory Subsystem, IO, and Networking
  • Generate consistent performance metrics based on industry standards and develop frameworks, needed scripts for collecting and reporting metrics
  • Work actively on creating and maintaining micro benchmarks for performance evaluation in SRIOV and Bare Metal environments.
  • Characterize critical product KPIs to ensure product meets pre-silicon targets
  • Provide feedback to next generation products through performance data and debug lessons learned   
  • Debugging and troubleshooting system-level issues that may occur in test and customer platforms 
  • Proactively driving continuous improvement for post-silicon performance activities 

PREFERRED EXPERIENCE: 

  • Excellent grasp of computer organization/architecture
  • Detailed knowledge of NUMA, Cache Coherency, PCIe
  • Extensive experience in platform optimization. Solid knowledge of Computer I/O.
  • Experience with tools for performance analysis
  • Strong programming skills, experience in Python preferred 
  • Desirable to be proficient in Linux command line environment and Shell scripting  
  • Deep knowledge of power management techniques like deep sleep and clock gating
  • Experience with virtualization(ex. KVM)
  • Experience with container technologies(ex. Docker)
  • Strong analytical and problem-solving skills with a key attention to detail 
  • Excellent presentation and communication skills 

ACADEMIC CREDENTIALS: 

BS degree in Electronics/Computer Engineering or Computer Science with emphasis on computer architecture and workload analysis, MS preferred 

5+ years’ experience preferred. 

 

LOCATION:

Penang, Malaysia

 

#LI-SH2

#LI-Hybrid




Benefits offered are described:  AMD benefits at a glance.

 

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.   We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

Benefits offered are described:  AMD benefits at a glance.

 

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.   We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

The AMD Data Center Power and Performance Systems Engineering Team is a cutting-edge hardware-based lab team engaged on an array of leading technologies found in today’s leading Data Center Products.  


This team is essential to the success of AMD as a growing company.  It is a very exciting environment with a friendly, collaborative, and highly skilled staff and management that will be there to guide you throughout your stay at AMD.

 

THE ROLE: 

The successful candidate will assume responsibility for post-silicon activities related to performance characterization and optimization of AMD Datacenter products 

  

THE PERSON: 

Must be a self-starter, effective communicator and able to independently drive tasks to completion 

  

KEY RESPONSIBILITIES: 

  • Become a key stakeholder in product performance validation process 
  • Develop and execute performance validation test plans for HPC/ML frameworks 
  • Evaluate performance bottlenecks and come up with ways to improve product performance in various product sub-domains Power management, SW, Compute, Memory Subsystem, IO, and Networking
  • Generate consistent performance metrics based on industry standards and develop frameworks, needed scripts for collecting and reporting metrics
  • Work actively on creating and maintaining micro benchmarks for performance evaluation in SRIOV and Bare Metal environments.
  • Characterize critical product KPIs to ensure product meets pre-silicon targets
  • Provide feedback to next generation products through performance data and debug lessons learned   
  • Debugging and troubleshooting system-level issues that may occur in test and customer platforms 
  • Proactively driving continuous improvement for post-silicon performance activities 

PREFERRED EXPERIENCE: 

  • Excellent grasp of computer organization/architecture
  • Detailed knowledge of NUMA, Cache Coherency, PCIe
  • Extensive experience in platform optimization. Solid knowledge of Computer I/O.
  • Experience with tools for performance analysis
  • Strong programming skills, experience in Python preferred 
  • Desirable to be proficient in Linux command line environment and Shell scripting  
  • Deep knowledge of power management techniques like deep sleep and clock gating
  • Experience with virtualization(ex. KVM)
  • Experience with container technologies(ex. Docker)
  • Strong analytical and problem-solving skills with a key attention to detail 
  • Excellent presentation and communication skills 

ACADEMIC CREDENTIALS: 

BS degree in Electronics/Computer Engineering or Computer Science with emphasis on computer architecture and workload analysis, MS preferred 

5+ years’ experience preferred. 

 

LOCATION:

Penang, Malaysia

 

#LI-SH2

#LI-Hybrid

COMPANY JOBS
1189 available jobs
WEBSITE