Machine Learning Kernel Performance Engineer Job at Annapurna Labs Inc., Cupertino, CA

  • Annapurna Labs Inc.
  • Cupertino, CA

Job Description

Salary: $151,300 - $261,500 per year

Requirements:

  • A minimum of 5 years of professional software development experience (non-internship)
  • At least 5 years of programming experience in at least one software programming language
  • 5 years of leadership in design or architecture of new and existing systems, including design patterns, reliability, and scaling
  • 5 years of experience with the complete software development life cycle, covering coding standards, code reviews, source control management, build processes, testing, and operations
  • Experience in mentoring, leading a technical team, or serving as a tech lead
Preferred Qualifications:
  • Bachelor's degree in computer science or a related field
  • 6 years of total software development experience
  • Expertise in accelerator architectures for machine learning (ML) or high-performance computing (HPC), such as GPUs, CPUs, FPGAs, or specialized architectures
  • Experience in optimizing GPU kernels and GPGPU computing, including with CUDA, NKI, Triton, OpenCL, SYCL, or ROCm
  • Proven experience with NVIDIA PTX and/or AMD GPU ISA
  • Familiarity with developing high-performance libraries for HPC applications
  • Proficient in low-level performance optimization for GPUs
  • Experience in LLVM/MLIR backend development for GPUs
  • Knowledge of ML frameworks (PyTorch, TensorFlow) and their GPU backends
  • Experience with parallel programming and optimization techniques
  • Understanding of GPU memory hierarchies and optimization strategies
Responsibilities:
  • As a member of the Annapurna Labs team at Amazon Web Services (AWS), you will help develop AWS Neuron, the software development kit designed to accelerate deep learning and GenAI workloads on AWS's custom machine learning accelerators, Inferentia and Trainium. Working within the Acceleration Kernel Library team, you will focus on maximizing performance for AWS's ML accelerators. Your duties will include designing and implementing high-performance compute kernels for ML operations using the Neuron architecture and programming models, analyzing and optimizing kernel-level performance across generations of Neuron hardware, and conducting detailed performance analysis with profiling tools to identify and resolve bottlenecks. You will collaborate closely with other teams to develop innovative kernel optimization techniques and will work directly with customers to optimize their ML models on AWS accelerators.
Technologies:
  • AI
  • AWS
  • Backend
  • CUDA
  • Hardware
  • Support
  • LLVM
  • Machine Learning
  • PyTorch
  • TensorFlow
  • Web
  • Cloud
  • Architect
  • Backbone
  • Flow
  • GitHub

More:

We value diverse experiences and encourage individuals with varied backgrounds to apply, even if they do not meet all the qualifications outlined. At Amazon Web Services (AWS), we strive to maintain an inclusive team culture that celebrates our differences. We prioritize work-life balance and support mentorship and career growth for our team members. Our environment fosters knowledge sharing and gives everyone the opportunity to work on innovative projects at the forefront of machine learning and high-performance computing. We offer flexible working hours and a commitment to personal and professional well-being. If you need accommodations during the application and hiring process, resources are available to support you. We are looking for passionate individuals who are ready to help shape the future of AI acceleration technology with us.

Last updated: week 52 of 2025

Job Tags

Full time, Internship
