AI Application Engineer

May 30, 2024
Boston, United States
... Not specified
... Intermediate
Full time
... Office work


WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. 

AMD together we advance_




THE ROLE:

 

PLEASE NOTE- this role is not considered an entry level role, and interns and/or new graduates with advanced degrees will not be considered. This role requires significant professional engineering working experience combined with an advanced technical  degree.

 

We are seeking an experienced AI Application Engineer to join the HPC Centre of Excellence (HPC CoE). The role broadly involves;

Working with customers and partners in support of RFP-driven requests. Providing hands-on support to customers and partners to enable their applications to run on AMD hardware and provide expected performance.

Broaden engineering investigations to understand both performance and characteristic performance across popular and customer-specific training and inference workloads. Understand competitive positioning.

Creating a body of technical documentation of AI performance on AMD hardware to support the Field Application team, partners, and customers.

This is a ‘hand-on’ technical role and we are looking for an individual with an established background in AI who is already familiar in executing and tuning training and inference workloads. In addition to the technical aspect, you will also need to be able to create and deliver presentations and training both remotely and in person to our customers and partners; therefore this will require some global travel.

AMD continues to ramp its in-house AI expertise, and as such this role provides an opportunity for the candidate to grow within the role while creating a significant impact on the wider business, with visibility to a senior level.

The role is initially advertised as located in North America, but it is also open to suitable candidates already based within Europe.

 

THE PERSON:

Does this sound familiar? We'd love to talk!

  • Track record working within AI. You are currently in a role undertaking AI inference or training as a key part of your function.
  • Demonstrable hands-on expertise working with popular AI frameworks.
  • Strong positive can-do attitude willing to do what is necessary and lead others in the wider FAE team by example. Available to help colleagues.
  • Skilled in independently prioritizing opportunities to deliver results on time.
  • Excellent verbal and written communication skills
  • Open to travel both domestic and international, approximately 20%.
  • Fluent in English and already have the right to remain in the USA/Europe

 

KEY RESPONSIBILITIES:

  • Support winning new AI business. Enabling customers to execute their AI workloads on AMD GPUs and CPUs (principally ‘Instinct’ and ‘EPYC’ respectively). Supporting partners in RFP responses by testing requested workloads.
  • Engineering: execute popular and customer-driven AI inference and training workloads, generate results and create a characteristic understanding of AI performance on AMD hardware. Understand how system and software choices affect performance. Compare performance to our competition.
  • Run training and inference performance investigations using common frameworks (Pytorch, Tensorflow, JAX) and repositories (MLperf, Hugging Face etc.)
  • Build a body of documentation for internal and external dissemination: AMD-internal guides, whitepapers, tuning guides, training collateral.
  • Liaise and advise customers and partners through Proof of Concepts, presentations, and training.
  • Assisting or lead effort to port applications, e.g. to different frameworks, change elements within the software
  • Creating scripts and tools to enable a ‘fast start’ for customers and developers.
  • Active engagement across AMD teams: GPU Business Unit, Engineering, Architecture, Platform, Software, and Product Development teams providing feedback and leadership from the field on requirements.
  • Assist in creating TCO models to assist pricing with bid desk.
  • Technically owning and resolving customer and partner issues. Submitting JIRA tickets and driving resolution.
  • Automating repeatable procedures.

 

PREFERRED EXPERIENCE:

  • Demonstrated experience with training and inference workloads on GPUs
  • Executing applications in common frameworks (Pytorch, Tensorflow, Jax).
  • Any experience with popular AI repositories (e.g. HuggingFace, MLperf) and understanding derived performance/functionality differences between them
  • Proven understanding of baseline testing of popular AI applications (e.g. Resnet, BERT, DLRM, etc) and other real world codes such as Large Language Models, image detection, NLP, recommendation engines etc
  • Understanding system level hardware design and its impact on performance
  • Understand how the software stack affects performance: frameworks, precision, compilers, libraries, and other accompanying middleware applications.
  • Customer-facing experience. Able to write technical documents and communicate at an appropriate level depending on the audience.
  • Some Linux administration; understanding setup for HPC/AI middleware.

 

Nice to Haves:

  • Hands-on AI experience within automotive, finance, enterprise, defense verticals
  • Programming experience with any of HIP, CUDA, Python, C/C++, Fortran, OpenACC, OpenMP, pSTL
  • Understanding impact of inter-node network choices on performance at scale. Creating performance projections for applications
  • Deep Neural Networks and their design for different Machine Learning cases.
  • Any experience understanding/inspecting/writing assembly
  • Understanding of memory and cache hierarchy and methods to query performance/latency at each level. Inspecting and dataflow down to the register-level
  • Government level security clearance

 

ACADEMIC CREDENTIALS:

  • Bachelors' Degree in a technical field (Computer Science, Electrical Engineering, Physics, Mathematics) preferred

 

LOCATION:

Any US city near major airport in Eastern or Central US Time Zone

 

 

 

#LI-KH1




At AMD, your base pay is one part of your total rewards package.  Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan. You’ll also be eligible for competitive benefits described in more detail here.

 

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.   We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

At AMD, your base pay is one part of your total rewards package.  Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan. You’ll also be eligible for competitive benefits described in more detail here.

 

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.   We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

THE ROLE:

 

PLEASE NOTE- this role is not considered an entry level role, and interns and/or new graduates with advanced degrees will not be considered. This role requires significant professional engineering working experience combined with an advanced technical  degree.

 

We are seeking an experienced AI Application Engineer to join the HPC Centre of Excellence (HPC CoE). The role broadly involves;

Working with customers and partners in support of RFP-driven requests. Providing hands-on support to customers and partners to enable their applications to run on AMD hardware and provide expected performance.

Broaden engineering investigations to understand both performance and characteristic performance across popular and customer-specific training and inference workloads. Understand competitive positioning.

Creating a body of technical documentation of AI performance on AMD hardware to support the Field Application team, partners, and customers.

This is a ‘hand-on’ technical role and we are looking for an individual with an established background in AI who is already familiar in executing and tuning training and inference workloads. In addition to the technical aspect, you will also need to be able to create and deliver presentations and training both remotely and in person to our customers and partners; therefore this will require some global travel.

AMD continues to ramp its in-house AI expertise, and as such this role provides an opportunity for the candidate to grow within the role while creating a significant impact on the wider business, with visibility to a senior level.

The role is initially advertised as located in North America, but it is also open to suitable candidates already based within Europe.

 

THE PERSON:

Does this sound familiar? We'd love to talk!

  • Track record working within AI. You are currently in a role undertaking AI inference or training as a key part of your function.
  • Demonstrable hands-on expertise working with popular AI frameworks.
  • Strong positive can-do attitude willing to do what is necessary and lead others in the wider FAE team by example. Available to help colleagues.
  • Skilled in independently prioritizing opportunities to deliver results on time.
  • Excellent verbal and written communication skills
  • Open to travel both domestic and international, approximately 20%.
  • Fluent in English and already have the right to remain in the USA/Europe

 

KEY RESPONSIBILITIES:

  • Support winning new AI business. Enabling customers to execute their AI workloads on AMD GPUs and CPUs (principally ‘Instinct’ and ‘EPYC’ respectively). Supporting partners in RFP responses by testing requested workloads.
  • Engineering: execute popular and customer-driven AI inference and training workloads, generate results and create a characteristic understanding of AI performance on AMD hardware. Understand how system and software choices affect performance. Compare performance to our competition.
  • Run training and inference performance investigations using common frameworks (Pytorch, Tensorflow, JAX) and repositories (MLperf, Hugging Face etc.)
  • Build a body of documentation for internal and external dissemination: AMD-internal guides, whitepapers, tuning guides, training collateral.
  • Liaise and advise customers and partners through Proof of Concepts, presentations, and training.
  • Assisting or lead effort to port applications, e.g. to different frameworks, change elements within the software
  • Creating scripts and tools to enable a ‘fast start’ for customers and developers.
  • Active engagement across AMD teams: GPU Business Unit, Engineering, Architecture, Platform, Software, and Product Development teams providing feedback and leadership from the field on requirements.
  • Assist in creating TCO models to assist pricing with bid desk.
  • Technically owning and resolving customer and partner issues. Submitting JIRA tickets and driving resolution.
  • Automating repeatable procedures.

 

PREFERRED EXPERIENCE:

  • Demonstrated experience with training and inference workloads on GPUs
  • Executing applications in common frameworks (Pytorch, Tensorflow, Jax).
  • Any experience with popular AI repositories (e.g. HuggingFace, MLperf) and understanding derived performance/functionality differences between them
  • Proven understanding of baseline testing of popular AI applications (e.g. Resnet, BERT, DLRM, etc) and other real world codes such as Large Language Models, image detection, NLP, recommendation engines etc
  • Understanding system level hardware design and its impact on performance
  • Understand how the software stack affects performance: frameworks, precision, compilers, libraries, and other accompanying middleware applications.
  • Customer-facing experience. Able to write technical documents and communicate at an appropriate level depending on the audience.
  • Some Linux administration; understanding setup for HPC/AI middleware.

 

Nice to Haves:

  • Hands-on AI experience within automotive, finance, enterprise, defense verticals
  • Programming experience with any of HIP, CUDA, Python, C/C++, Fortran, OpenACC, OpenMP, pSTL
  • Understanding impact of inter-node network choices on performance at scale. Creating performance projections for applications
  • Deep Neural Networks and their design for different Machine Learning cases.
  • Any experience understanding/inspecting/writing assembly
  • Understanding of memory and cache hierarchy and methods to query performance/latency at each level. Inspecting and dataflow down to the register-level
  • Government level security clearance

 

ACADEMIC CREDENTIALS:

  • Bachelors' Degree in a technical field (Computer Science, Electrical Engineering, Physics, Mathematics) preferred

 

LOCATION:

Any US city near major airport in Eastern or Central US Time Zone

 

 

 

#LI-KH1

COMPANY JOBS
1111 available jobs
WEBSITE