WHAT YOU DO AT AMD CHANGES EVERYTHING
We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.
AMD together we advance_
THE ROLE:
We are looking for a dynamic, energetic candidate to join our growing team in AI Group. In this role, the individual will be responsible architecting and defining AI workload models, dataflow, defining block level and system level performance of Neural Processing Unit (NPU), NPU network performance modeling, and performance bottleneck analysis on pre/post silicon platforms. As a member of our dynamic team, you will have the opportunity to shape the future of AI model development
THE PERSON:
We are looking for a candidate who possesses strong engineering skills to tackle complex challenges on AI model development work.
You should have experience in optimizing and accelerating CNN/Generative AI models. Person needs excellent cross team collaboration skills to succeed in this role.
Strong experience in developing ML compiler for efficient network mapping on NPU
Work with cross-functional teams to optimize various parts of the SW stack – AI Compiler, AI frameworks, device drivers, and firmware.
Bring up emerging ML models based on CNN, transformers and characterize performance.
Work on quantization, sparsity, and architecture search methods to optimize and enhance the performance, efficiency, and accuracy of Generative AI models.
Collaborate closely with software engineers, data scientists, and researchers to integrate AI models into software applications and platforms.
KEY RESPONSIBILITIES:
Research, design, and implement novel methods for efficient CNN, GEN AI models.
Model optimization method design including quantization, sparsity, NAS, etc.
Collaborate with other team members and teams.
Collaborate with compiler team to develop optimization strategies for the compiler.
PREFERRED EXPERIENCE:
Experience with deep learning framework, e.g., Pytorch/ONNX/TensorFlow.
Experience on model compression, quantization, and end-to-end inference optimization.
Strong coding skills in C/C++, Python required.
Experience with any of the following also a plus: LLMs, stable diffusion, NeRF, or text-to-video generation.
Solid knowledge of AI and ML concepts and techniques. Practical experience applying these concepts to solve real-world problems in the context of research or work experience.
Understanding the performance implications on AI acceleration of different compute, memory, and communication configurations and hardware and software implementation choices.
Developing and optimizing code for VLIW processors. Analyzing code for high performance CONV, GEMM and non-linear operators Deep understanding of AI frameworks, preferably ONNX.
ACADEMIC CREDENTIALS:
BS or MS with 7+ years of industry experience
LOCATION:
San Jose, CA
#LI-RF1
#LI-HYBRID
At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan. You’ll also be eligible for competitive benefits described in more detail here.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.
At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan. You’ll also be eligible for competitive benefits described in more detail here.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.
THE ROLE:
We are looking for a dynamic, energetic candidate to join our growing team in AI Group. In this role, the individual will be responsible architecting and defining AI workload models, dataflow, defining block level and system level performance of Neural Processing Unit (NPU), NPU network performance modeling, and performance bottleneck analysis on pre/post silicon platforms. As a member of our dynamic team, you will have the opportunity to shape the future of AI model development
THE PERSON:
We are looking for a candidate who possesses strong engineering skills to tackle complex challenges on AI model development work.
You should have experience in optimizing and accelerating CNN/Generative AI models. Person needs excellent cross team collaboration skills to succeed in this role.
Strong experience in developing ML compiler for efficient network mapping on NPU
Work with cross-functional teams to optimize various parts of the SW stack – AI Compiler, AI frameworks, device drivers, and firmware.
Bring up emerging ML models based on CNN, transformers and characterize performance.
Work on quantization, sparsity, and architecture search methods to optimize and enhance the performance, efficiency, and accuracy of Generative AI models.
Collaborate closely with software engineers, data scientists, and researchers to integrate AI models into software applications and platforms.
KEY RESPONSIBILITIES:
Research, design, and implement novel methods for efficient CNN, GEN AI models.
Model optimization method design including quantization, sparsity, NAS, etc.
Collaborate with other team members and teams.
Collaborate with compiler team to develop optimization strategies for the compiler.
PREFERRED EXPERIENCE:
Experience with deep learning framework, e.g., Pytorch/ONNX/TensorFlow.
Experience on model compression, quantization, and end-to-end inference optimization.
Strong coding skills in C/C++, Python required.
Experience with any of the following also a plus: LLMs, stable diffusion, NeRF, or text-to-video generation.
Solid knowledge of AI and ML concepts and techniques. Practical experience applying these concepts to solve real-world problems in the context of research or work experience.
Understanding the performance implications on AI acceleration of different compute, memory, and communication configurations and hardware and software implementation choices.
Developing and optimizing code for VLIW processors. Analyzing code for high performance CONV, GEMM and non-linear operators Deep understanding of AI frameworks, preferably ONNX.
ACADEMIC CREDENTIALS:
BS or MS with 7+ years of industry experience
LOCATION:
San Jose, CA
#LI-RF1
#LI-HYBRID