Power Attainment Engineer - Data Center GPU

Sep 07, 2024
Austin, United States
... Not specified
... Intermediate
Full time
... Office work


WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. 

AMD together we advance_




Power Attainment Engineer - Data Center GPU

                                                                                                                

THE TEAM:

The AMD Data Center Power and Performance Systems Engineering Team is a cutting-edge hardware-based lab team engaged on an array of leading technologies found in today’s leading Data Center Products.

This team is essential to the success of AMD as a growing company.  It is a very exciting environment with a friendly, collaborative, and highly skilled staff and management that will be there to guide you throughout your stay at AMD.

 

THE ROLE:

The successful candidate will assume responsibility for mostly post-silicon activities related to power attainment and optimization of AMD Datacenter products.  Power Attainment within AMD’s datacenter GPU organization cover the development of automation and software infrastructure, design model correlation, insuring production readiness and power feature tuning. 

 

THE PERSON:

Must be a self-starter, effective communicator, strong team collaborator and able to independently drive tasks to completion.  The candidate must be willing to work with collocated team with focus on power management as well as key software infrastructure necessary for power attainment activities. 

 

KEY RESPONSIBILITIES:

  • Actively participate in analysis of post silicon performance and power data collected to ensure integrity of results and to provide summary and conclusions of results
  • Learn and Execute Power Attainment test plans in post-silicon time periods in support of Data Center GPU product roadmap
  • Proactively driving continuous improvement for post-silicon power attainment activities
  • Participate in development of automation environment in developing scripts automating workloads, enhancing capabilities of execution capabilities in Linux, Python and other support software support tools
  • Hands-on experience locally or remotely with computers, systems or data center hardware for practical knowledge with hardware applicable to servers, data centers or thermal equipment as a means to accomplish power attainment work
  • Develop and execute characterization test plans for Datacenter GPUs related to Power attainment and feature tuning for performance optimization
  • Analyzing data from workload or execution output datalogs using excel or analysis tools manually or developed automation
  • Optimize power and performance features for AI, Machine learning & High performance computing
  • Work in a fast paced constrained environment  
  • Become a key stakeholder in product performance validation process
  • Analyze and debug interactions between various power management features
  • Develop and execute performance validation test plans for HPC/ML frameworks
  • Configure and setup test and customer based ML/AI Datacenter GPU systems for data collection, experiments and post-silicon activities
  • Work in Windows and Linux environments
  • Support prototyping experiments for new GPU features that impact performance and power
  • Troubleshoot system-level issues that may occur in test environments and platforms
  • Proactively driving continuous improvement for post-silicon power and performance activities

PREFERRED EXPERIENCE:

  • Experience in datacenter environment preferred
  • Excellent grasp of computer organization/architecture and power management
  • Knowledge in power limited performance methodologies and control theory
  • Knowledge in memory partitioning and access
  • Extensive experience in platform optimization. Solid knowledge of Computer I/O.
  • Experience with tools for performance analysis
  • Strong programming skills, experience in Python preferred
  • Desirable to be proficient in Linux command line environment and Shell scripting
  • Deep knowledge of power management techniques like deep sleep and clock gating
  • Experience with container technologies(ex. Docker)
  • Strong analytical and problem-solving skills with a key attention to detail
  • Experience in data analysis, summarization, and presentation
  • Excellent presentation and communication skills
  • Experience in debug and lab tools such as oscilloscopes, DAQs, power measurement capabilities

ACADEMIC CREDENTIALS:

  • Bachelors or Masters in Computer Engineering, Electrical Engineering, or Computer Science with emphasis on computer architecture and workload analysis
  • 7+ years’ experience preferred.

LOCATION:

Austin, Texas

 

#LI-SL2

 




At AMD, your base pay is one part of your total rewards package.  Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan. You’ll also be eligible for competitive benefits described in more detail here.

 

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.   We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

At AMD, your base pay is one part of your total rewards package.  Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan. You’ll also be eligible for competitive benefits described in more detail here.

 

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.   We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

Power Attainment Engineer - Data Center GPU

                                                                                                                

THE TEAM:

The AMD Data Center Power and Performance Systems Engineering Team is a cutting-edge hardware-based lab team engaged on an array of leading technologies found in today’s leading Data Center Products.

This team is essential to the success of AMD as a growing company.  It is a very exciting environment with a friendly, collaborative, and highly skilled staff and management that will be there to guide you throughout your stay at AMD.

 

THE ROLE:

The successful candidate will assume responsibility for mostly post-silicon activities related to power attainment and optimization of AMD Datacenter products.  Power Attainment within AMD’s datacenter GPU organization cover the development of automation and software infrastructure, design model correlation, insuring production readiness and power feature tuning. 

 

THE PERSON:

Must be a self-starter, effective communicator, strong team collaborator and able to independently drive tasks to completion.  The candidate must be willing to work with collocated team with focus on power management as well as key software infrastructure necessary for power attainment activities. 

 

KEY RESPONSIBILITIES:

  • Actively participate in analysis of post silicon performance and power data collected to ensure integrity of results and to provide summary and conclusions of results
  • Learn and Execute Power Attainment test plans in post-silicon time periods in support of Data Center GPU product roadmap
  • Proactively driving continuous improvement for post-silicon power attainment activities
  • Participate in development of automation environment in developing scripts automating workloads, enhancing capabilities of execution capabilities in Linux, Python and other support software support tools
  • Hands-on experience locally or remotely with computers, systems or data center hardware for practical knowledge with hardware applicable to servers, data centers or thermal equipment as a means to accomplish power attainment work
  • Develop and execute characterization test plans for Datacenter GPUs related to Power attainment and feature tuning for performance optimization
  • Analyzing data from workload or execution output datalogs using excel or analysis tools manually or developed automation
  • Optimize power and performance features for AI, Machine learning & High performance computing
  • Work in a fast paced constrained environment  
  • Become a key stakeholder in product performance validation process
  • Analyze and debug interactions between various power management features
  • Develop and execute performance validation test plans for HPC/ML frameworks
  • Configure and setup test and customer based ML/AI Datacenter GPU systems for data collection, experiments and post-silicon activities
  • Work in Windows and Linux environments
  • Support prototyping experiments for new GPU features that impact performance and power
  • Troubleshoot system-level issues that may occur in test environments and platforms
  • Proactively driving continuous improvement for post-silicon power and performance activities

PREFERRED EXPERIENCE:

  • Experience in datacenter environment preferred
  • Excellent grasp of computer organization/architecture and power management
  • Knowledge in power limited performance methodologies and control theory
  • Knowledge in memory partitioning and access
  • Extensive experience in platform optimization. Solid knowledge of Computer I/O.
  • Experience with tools for performance analysis
  • Strong programming skills, experience in Python preferred
  • Desirable to be proficient in Linux command line environment and Shell scripting
  • Deep knowledge of power management techniques like deep sleep and clock gating
  • Experience with container technologies(ex. Docker)
  • Strong analytical and problem-solving skills with a key attention to detail
  • Experience in data analysis, summarization, and presentation
  • Excellent presentation and communication skills
  • Experience in debug and lab tools such as oscilloscopes, DAQs, power measurement capabilities

ACADEMIC CREDENTIALS:

  • Bachelors or Masters in Computer Engineering, Electrical Engineering, or Computer Science with emphasis on computer architecture and workload analysis
  • 7+ years’ experience preferred.

LOCATION:

Austin, Texas

 

#LI-SL2

 

COMPANY JOBS
1180 available jobs
WEBSITE