Cluster Network Architect 

Oct 30, 2024
Austin, United States
Not specified
Intermediate
Full time
Office work


WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. 

AMD together we advance_




 

THE ROLE: 

We are looking for a dynamic, energetic Lead / Principal Cluster Network Architect to join our growing team. As a key contributor to the success of AMD’s products, you will be part of a leading team that drives and improves AMD’s ability to deliver the highest-quality, industry-leading technologies to market. AMD’s Systems Design Engineering team fosters and encourages continuous technical innovation to showcase successes as well as facilitate continuous career development.

 

THE PERSON: 

The Cluster Network Architect plays a critical role in shaping the future of AI/ML training and inferencing systems as they move into the Ethernet era. This individual will collaborate with a broad range of internal and external partners, including NIC, Switch, and Software Enablement teams, to integrate state-of-the-art technology solutions that make Ethernet a viable network technology for the GPU-to-GPU communication required during AI inferencing and training.

 

KEY RESPONSIBILITIES: 

  • Design state-of-the-art cluster network architectures for large AI/ML training and inferencing systems that can be optimized for hyperscale deployments

  • Engage with AMD’s customer base while aligning system and networking architecture

  • Standardize network architectures and best practices for GPU-to-GPU communication in deep learning and AI workloads using InfiniBand and Ethernet technologies

  • Co-design new Ethernet technology with AMD partner companies to build the next generation of AI cluster networks

  • Pioneer system and container networking strategies to facilitate seamless operation and scaling of AI clusters

  • Develop scalable AI/ML training and inferencing communication network reference architectures for each generation of AMD AI/ML products

  • Serve as chief network engineer on projects supporting partner OEM co-design of AI/ML clusters

  • Participate in the design phase of each AMD AI/ML GPU generation by developing cluster communication network architectures and requirements

  • Collaborate across AMD internal and external partner teams to improve communication performance for AMD AI/ML clusters

 

PREFERRED EXPERIENCE: 

  • In-depth knowledge of and experience with network topologies such as Clos and Fat Tree, and technologies including InfiniBand, RDMA, RoCE, NVLink, and PCIe

  • Expertise in network security, automation, and visualization, along with a solid understanding of the OSI network model and the TCP/IP suite

  • Professional certifications such as Cisco CCNA, CCNP, CCIE, CompTIA Network+, and Arista ACE are highly regarded

  • Extensive real-world experience designing hyperscale Ethernet networks

  • Expertise in the TCP/IP protocol suite and its applications

  • Strong analytical and problem-solving skills and pronounced attention to detail

  • Must be a self-starter able to independently drive tasks to completion

 

ACADEMIC CREDENTIALS: 

  • Bachelor’s or Master’s degree in electrical or computer engineering

 

#LI-RW1

#LI-HYBRID




At AMD, your base pay is one part of your total rewards package.  Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan. You’ll also be eligible for competitive benefits described in more detail here.

 

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.   We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

