Product Application Engineer

Dec 18, 2024
Santa Clara, Cuba
... Not specified
... Intermediate
Full time
... Office work


WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. 

AMD together we advance_




Product Applications Engineer-Data Center GPU

 

THE TEAM:

AMD's Data Center GPU organization is transforming the industry with our AI based Graphic Processors. Our primary objective is to design exceptional products that drive the evolution of computing experiences, serving as the cornerstone for enterprise Data Centers, (AI) Artificial Intelligence, HPC and Embedded systems. If this resonates with you, come and joining our Data Center GPU organization where we are building amazing AI powered products with amazing people.

 

THE ROLE:

The Datacenter GPU Product Applications Engineer is a key technical lead, responsible for the technical execution of AMD's Datacenter graphics hardware/software subsystem projects for AMD OEM partners and enterprise commercial end-customers. This position offers a unique opportunity to apply your strong HPC, datacenter, graphics, compute, AI / Machine Learning as well as program management skills, to collaboratively work with customers that use AMD Instinct™ Accelerators.

 

THE PERSON:

An engineer, computational scientist, or physicist with experience in multiple scientific computing domains and experience with high performance computing and AI/ML settings. Must be self-motivated and possess the ability to work well within a team environment. 

 

KEY RESPONSIBILITIES:

  • Resolve technical issues for customers that use AMD Instinct™ products.
  • Assist development teams to root cause hardware / software technical issues and help to drive them to closure in a timely manner during the entire product lifecycle (i.e. from initial hardware bring-up through product end-of-life).
  • Provide technical guidance and information to our customers in support of their server graphics and compute projects for HPC workloads.
  • Mentor more junior members of the technical staff.
  • Own the customer technical relationship and technical requirements.
  • Provide technical guidance to internal teams based on customer feedback.
  • Partner with program manager on project schedules, maintain action items tracker, ensure deliverables are met, provide project status updates to customers and AMD management.
  • Qualify and assess new software functionality to ensure customer compatibility.

 

PREFERRED EXPERIENCE:

  • Experience in a datacenter customer support role.
  • Expert Linux knowledge; install-setup, usage, debug.
  • Familiar with datacenter GPU software stack such as AMD ROCm™ or Nvidia CUDA
  • Software programming and scripting proficiency (C++, Shell script, Python, Fortran)
  • Experience with a converged HPC/AI application is a plus
  • Broad experience creating, adapting, and running workloads with widely used HPC applications is a plus
  • Familiarity with installation and setup of various HPC applications is a plusIn-depth knowledge of software development practices including debug, test, revision control, documentation, and bug tracking
  • Knowledge of server architecture and functionality, including server remote management, network topologies, graphics software and hardware sub-systems
  • Familiarity with distributed model training via NCCL/RCCL, MPI, or similar network technologies
  • Experience in implementing and optimizing parallel methods on GPU accelerators in distributed memory systems with MPI, CUDA, HIP, OpenMP, etc.
  • Understanding of site reliability engineering best practices.
  • Experience with build system tools including Make, CMake, autoconf, and autotools
  • Strong debug, problem solving, and analysis.

 

ACADEMIC CREDENTIALS AND EXPERIENCE:

  • Master’s or PhD in Computer Science, Computational Physics, Engineering or related subjects, or equivalent experience desired
  • Advanced candidates bring an additional 3-5 years of relevant industry experience

 

LOCATION:

  • Santa Clara, CA or some remote locations.

#LI-EV1

#LI-HYBRID




At AMD, your base pay is one part of your total rewards package.  Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan. You’ll also be eligible for competitive benefits described in more detail here.

 

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.   We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

At AMD, your base pay is one part of your total rewards package.  Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan. You’ll also be eligible for competitive benefits described in more detail here.

 

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.   We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

Product Applications Engineer-Data Center GPU

 

THE TEAM:

AMD's Data Center GPU organization is transforming the industry with our AI based Graphic Processors. Our primary objective is to design exceptional products that drive the evolution of computing experiences, serving as the cornerstone for enterprise Data Centers, (AI) Artificial Intelligence, HPC and Embedded systems. If this resonates with you, come and joining our Data Center GPU organization where we are building amazing AI powered products with amazing people.

 

THE ROLE:

The Datacenter GPU Product Applications Engineer is a key technical lead, responsible for the technical execution of AMD's Datacenter graphics hardware/software subsystem projects for AMD OEM partners and enterprise commercial end-customers. This position offers a unique opportunity to apply your strong HPC, datacenter, graphics, compute, AI / Machine Learning as well as program management skills, to collaboratively work with customers that use AMD Instinct™ Accelerators.

 

THE PERSON:

An engineer, computational scientist, or physicist with experience in multiple scientific computing domains and experience with high performance computing and AI/ML settings. Must be self-motivated and possess the ability to work well within a team environment. 

 

KEY RESPONSIBILITIES:

  • Resolve technical issues for customers that use AMD Instinct™ products.
  • Assist development teams to root cause hardware / software technical issues and help to drive them to closure in a timely manner during the entire product lifecycle (i.e. from initial hardware bring-up through product end-of-life).
  • Provide technical guidance and information to our customers in support of their server graphics and compute projects for HPC workloads.
  • Mentor more junior members of the technical staff.
  • Own the customer technical relationship and technical requirements.
  • Provide technical guidance to internal teams based on customer feedback.
  • Partner with program manager on project schedules, maintain action items tracker, ensure deliverables are met, provide project status updates to customers and AMD management.
  • Qualify and assess new software functionality to ensure customer compatibility.

 

PREFERRED EXPERIENCE:

  • Experience in a datacenter customer support role.
  • Expert Linux knowledge; install-setup, usage, debug.
  • Familiar with datacenter GPU software stack such as AMD ROCm™ or Nvidia CUDA
  • Software programming and scripting proficiency (C++, Shell script, Python, Fortran)
  • Experience with a converged HPC/AI application is a plus
  • Broad experience creating, adapting, and running workloads with widely used HPC applications is a plus
  • Familiarity with installation and setup of various HPC applications is a plusIn-depth knowledge of software development practices including debug, test, revision control, documentation, and bug tracking
  • Knowledge of server architecture and functionality, including server remote management, network topologies, graphics software and hardware sub-systems
  • Familiarity with distributed model training via NCCL/RCCL, MPI, or similar network technologies
  • Experience in implementing and optimizing parallel methods on GPU accelerators in distributed memory systems with MPI, CUDA, HIP, OpenMP, etc.
  • Understanding of site reliability engineering best practices.
  • Experience with build system tools including Make, CMake, autoconf, and autotools
  • Strong debug, problem solving, and analysis.

 

ACADEMIC CREDENTIALS AND EXPERIENCE:

  • Master’s or PhD in Computer Science, Computational Physics, Engineering or related subjects, or equivalent experience desired
  • Advanced candidates bring an additional 3-5 years of relevant industry experience

 

LOCATION:

  • Santa Clara, CA or some remote locations.

#LI-EV1

#LI-HYBRID

COMPANY JOBS
842 available jobs
WEBSITE