WHAT YOU DO AT AMD CHANGES EVERYTHING
We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.
AMD together we advance_
Datacenter GPU Applications Stress Engineer
THE ROLE:
The Datacenter Accelerated Computing Validation Team is looking for dynamic and energetic engineers to join our growing team. As a key contributor to the validation of AMD’s GPU based datacenter accelerators, you will work in cross-functional teams to deliver industry leading products for Artificial Intelligence (AI), Machine Learning (ML), and High-Performance Computing (HPC) applications. Specifically, this role focuses on enabling, characterizing, and deploying these critical applications and workloads on targeted HW platforms to validate and stress AMD’s leading datacenter GPUs.
THE PERSON:
We are looking for someone who:
- Has strong analytical thinking and problem solving skills with excellent attention to details
- Must be a team player but also be able to work efficiently with minimal supervision
- Has a strong interest in GPU hardware and deep knowledge on AI, ML, and HPC applications workloads
- Is very familiar with Linux and Linux systems programming
- Must have strong communication and collaboration skills
- Must be a self-starter and be able to independently drive tasks to completion
KEY RESPONSIBILITIES:
As a DC GPU Applications Stress Engineer, you will partner with GPU HW and SW teams to create, enable, and characterize AI, ML, and HPC applications/workloads on targeted HW platforms to validate and stress AMD’s datacenter GPUs.
Responsibilities include:
- Identify AI, ML and HPC workloads needed to validate / stress AMD DC GPUs
- Collaborate with SW and HW teams to create and enable these workloads in various targeted platforms
- Create test plans using these applications to validate and stress AMD DC GPUs
- Hands-on characterization of these workloads and debug issues found during test plan execution
- Automate and deploy these applications for broad adoption e by cross-functional teams
PREFERRED EXPERIENCE:
- Experience with industry standard benchmarks, AI/ML/HPC applications
- Understanding of Linux OS, shell scription and controlling of processes
- History of applied Python development skills with focus on object oriented and adherence to best practices
- Experience using Linux package managers and other provisioning methods such as Ansible or Packer
- Power user of docker or other containers
- Understanding of GPU programming, Parallel compute, and ML frameworks such as Pytorch is an asset
- Must have strong analytical skills for test creation and debug
- Hands-on experience in datacenter system architecture an asset
ACADEMIC CREDENTIALS:
- Bachelor or master's degree in Electrical/Computer Engineering, Mathematics, Computer Science or an equivalent preferred
LOCATION:
Austin, Texas
#LI-SL2
At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan. You’ll also be eligible for competitive benefits described in more detail here.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.
At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan. You’ll also be eligible for competitive benefits described in more detail here.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.
Datacenter GPU Applications Stress Engineer
THE ROLE:
The Datacenter Accelerated Computing Validation Team is looking for dynamic and energetic engineers to join our growing team. As a key contributor to the validation of AMD’s GPU based datacenter accelerators, you will work in cross-functional teams to deliver industry leading products for Artificial Intelligence (AI), Machine Learning (ML), and High-Performance Computing (HPC) applications. Specifically, this role focuses on enabling, characterizing, and deploying these critical applications and workloads on targeted HW platforms to validate and stress AMD’s leading datacenter GPUs.
THE PERSON:
We are looking for someone who:
- Has strong analytical thinking and problem solving skills with excellent attention to details
- Must be a team player but also be able to work efficiently with minimal supervision
- Has a strong interest in GPU hardware and deep knowledge on AI, ML, and HPC applications workloads
- Is very familiar with Linux and Linux systems programming
- Must have strong communication and collaboration skills
- Must be a self-starter and be able to independently drive tasks to completion
KEY RESPONSIBILITIES:
As a DC GPU Applications Stress Engineer, you will partner with GPU HW and SW teams to create, enable, and characterize AI, ML, and HPC applications/workloads on targeted HW platforms to validate and stress AMD’s datacenter GPUs.
Responsibilities include:
- Identify AI, ML and HPC workloads needed to validate / stress AMD DC GPUs
- Collaborate with SW and HW teams to create and enable these workloads in various targeted platforms
- Create test plans using these applications to validate and stress AMD DC GPUs
- Hands-on characterization of these workloads and debug issues found during test plan execution
- Automate and deploy these applications for broad adoption e by cross-functional teams
PREFERRED EXPERIENCE:
- Experience with industry standard benchmarks, AI/ML/HPC applications
- Understanding of Linux OS, shell scription and controlling of processes
- History of applied Python development skills with focus on object oriented and adherence to best practices
- Experience using Linux package managers and other provisioning methods such as Ansible or Packer
- Power user of docker or other containers
- Understanding of GPU programming, Parallel compute, and ML frameworks such as Pytorch is an asset
- Must have strong analytical skills for test creation and debug
- Hands-on experience in datacenter system architecture an asset
ACADEMIC CREDENTIALS:
- Bachelor or master's degree in Electrical/Computer Engineering, Mathematics, Computer Science or an equivalent preferred
LOCATION:
Austin, Texas
#LI-SL2