Sr. Embedded Software Development Engineer, Machine Learning Accelerators

Do you have experience writing software for embedded systems or custom hardware? Have you dabbled in RTL development for FPGAs or ASIC? Are you familiar with machine learning and looking to apply your embedded programming experience to execute ML models on bare metal hardware?Custom Amazon designed silicon chips live at the heart of AWS Machine Learning servers, enabling faster, more capable, and more accurate machine learning for our customers. We’re looking for skilled software engineers to scale the team that develops the embedded software stack critical to the functionality of these bleeding-edge system-on-chips (SoCs). Your software will directly drive the execution and management of hardware accelerated neural network models deep within the SOC's Neuron Cores.You'll also work closely with our architecture and design teams to drive hardware/software co-design, developing both firmware and custom hardware that enables ML within our accelerator chips. Our team's charter is to make deep learning pervasive for everyday software developers and to democratize access to industry leading infrastructure - you'll be enabling that vision from the ground up.We invite you visit the link below for a glimpse inside our labs to see exactly the incredible technology and people you will work with at Annapurna Labs!https://www.aboutamazon.com/news/aws/take-a-look-inside-the-lab-where-aws-makes-custom-chipsThis is a fast-paced, challenging position, where you'll work with thought-leaders in multiple technology areas. You'll have high standards for yourself and everyone you work with, and you'll be constantly looking for ways to improve our products' performance, quality, and cost. We're searching for individuals who want to reach beyond what is possible today and change an industry.You'll learn about the inner workings of ML and our accelerators as part of your onboarding, so no in-depth ML knowledge is required for this role, but any ML background you have will be helpful.Key job responsibilities- Software / hardware architecture and co-design- Facilitate and lead embedded systems architecture / design sessions- Drives embedded systems deliverables across cross-functional teams and stakeholders- Embedded software development, testing, debug, and performance improvements- Test suite and infrastructure development- Developing software which can be maintained, improved upon, documented, tested, and reused- Close collaboration with RTL designers, design verification engineers, and other software teamsBASIC QUALIFICATIONS- Bachelor's degree in computer science or equivalent- 6+ years of non-internship professional software development experience- 5+ years of programming using a modern programming language such as Java, C++, or C#, including object-oriented design experience- 4+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience- Experience in embedded development in C/C++- Experience in computer architecture ...

MLA IP Design Verification Engineer, Annapurna Labs

Amazon Web Services provides a highly reliable, scalable, low-cost infrastructure platform in the cloud that powers hundreds of thousands of businesses in 190 countries around the world.We are seeking an experienced Design Verification Engineers to build the next generation of our cloud server platforms. Our success depends on our world-class infrastructure; we’re handling massive scale and rapid integration of emergent technologies.As a member of the Cloud-Scale Machine Learning Acceleration team you’ll be responsible for the design and validation of machine learning hardware in our data centers.Some of your responsibilities will include verifying/validating that our hardware and software solutions achieve their desired functionality, developing and executing multi-faceted verification/validation plans, and measuring the teams progress towards our ambitious customer metrics.Key job responsibilitiesSome of your responsibilities will include verifying/validating that our hardware and software solutions achieve their desired functionality, developing and executing multi-faceted verification/validation plans, and measuring the teams progress towards our ambitious customer metrics.About the teamnclusive Team CultureHere at AWS, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences. Amazon’s culture of inclusion is reinforced within our 16 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust. Work/Life BalanceOur team puts a high value on work-life balance. It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility in working hours and encourage you to find your own balance between your work and personal lives. Mentorship & Career GrowthOur team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge sharing and mentorship. We care about your career growth and strive to assign projects based on what will help each team member develop into a better-rounded professional and enable them to take on more complex tasks in the future.BASIC QUALIFICATIONSBS Degree or Higher in EE or CS or CE.3+ years of design verification experience using System Verilog and UVM3+ years of experience in testbench development including: stimulus, checkers, assertions and coverage ...

Software Development Engineer, Annapurna Labs, Trainium Collectives

We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental operations that enable AI to scale across multiple accelerators & servers. Most of our stack is C/C++ and relatively low level, so solid knowledge of Linux, kernels, and performant code is important. Experience with embedded systems is valued, and experience with high-speed networking or HPC interconnects is valued highly.If you like solving hard problems, want to work with HPC and ML customers, iterate fast and deliver meaningful solutions at scale, then come join us! This truly is a role on the forefront of AI/ML, you’ll be working on features for the largest clusters, with the largest customers, for the largest AI models.The org you would be joining is Annapurna Labs, an integral part of AWS and develops hardware and software components that are critical building blocks for EC2 infrastructure. Every instance in EC2 is running some type of hardware designed in Annapurna Labs. We specialize in designing software, systems and chips that optimize the AWS customer experience. A day in the lifeAnnapurna Labs, a crucial part of AWS, is responsible for developing hardware and software components for EC2 infrastructure. Our team focuses on building networking solutions that for Machine Learning (ML) and High-Performance Computing (HPC) workloads on AWS.We have mixed discipline orgs, you’d be working side by side with infrastructure experts, hardware engineers, RTL engineers, scientists & architects. Our workforce spans the globe and is truly international, you’ll find yourself working side by side with individuals from numerous countries. We take mentorship seriously, you can both expect senior mentorship and will be expected to mentor new and junior engineers. The pace is fast as we work on the latest advancements of AI/ML, but we take the time to bond as a team and enjoy the successes. We offer flexibility in working hours, and respect WLB as a core org tenet. The team enjoys working with numerous principal-level engineers and closely with directors, career growth opportunities are certainly available. This is a role where you will always be encouraged to keep learning, the AI/ML field is fast moving and constantly evolving.About the teamOur team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.Diverse ExperiencesAWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.About AWSAmazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.Inclusive Team CultureHere at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.Work/Life BalanceWe value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.Mentorship & Career GrowthWe’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.BASIC QUALIFICATIONS- 3+ years of non-internship professional software development experience- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience- Experience programming with at least one software programming language ...

Sr Software Engineer, Graviton Software, Annapurna Labs

The AWS Graviton Software team is looking for Software Engineers to develop tools to drive the optimization of open source and internal applications. Annapurna Labs, part of AWS, designed Graviton as a strategic initiative to improve how software works at Amazon scale. Graviton is Arm-based CPU that delivers better performance, lower price and lower carbon footprint over comparable x86-based instances. Key job responsibilitiesAs a Graviton Software Developer, you will:- Build software framework for tools to analyze performance of hardware and software components.- Leverage existing perf tools like sysstat, sysctl, perf, etc.- Automate the collection and analysis of processor, OS and workload performance data.- Help external AWS customers and various internal AWS services like AWS Lambda, Elastic Map Reduce, ElastiCache, and RDS to troubleshoot bottlenecks and to optimize the architecture, algorithms, and deployment on Graviton.- Work on Linux and other open source code, improving it and contributing the changes back to the community.- Play an instrumental role in driving the AWS roadmap to deliver cost-effective and performant computing systems.- Use and further develop your deep knowledge in areas including design, implementation, and data analysis.- Have the opportunity to lead the innovation and deliver software that powers the world largest cloud provider.- If you are already an open source developer or passionate about it, you will be able to continue your passion and contribute back to the community across all those projects. A day in the lifeHere at AWS, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences. Amazon’s culture of inclusion is reinforced within our 16 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust.Work/Life BalanceOur team puts a high value on work-life balance. It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility in working hours and encourage you to find your own balance between your work and personal lives.Mentorship & Career GrowthOur team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge sharing and mentorship. We care about your career growth and strive to assign projects based on what will help each team member develop into a better-rounded professional and enable them to take on more complex tasks in the future.About the teamThe Graviton Software organization ports, optimizes, and develops software to drive down the cost of adoption and operation for the AWS Graviton instances. We pro-actively improve and upstream open source software, including Linux kernel, operating system, compilers, libraries, and applications. We help internal and external customers to troubleshoot and fix performance bottlenecks that prevent them from using Graviton. We develop tools to automate most of the heavy lifting and maintain publicly available documentation: Graviton Developer Guide on GitHubBASIC QUALIFICATIONS- 5+ years of non-internship professional software development experience- 5+ years of programming with at least one software programming language experience- 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience- 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience ...

Sr. Software Engineer- AI/ML, AWS Neuron Distributed Training

AWS Utility Computing (UC) provides product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for their cloud services. Additionally, this role may involve exposure to and experience with Amazon's growing suite of generative AI services and other cutting-edge cloud computing offerings across the AWS portfolio.Annapurna Labs (our organization within AWS UC) designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago—even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machinelearning accelerators and the Trn1 and Inf1 servers that use them. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like GPT2, GPT3 and beyond, as well as stable diffusion, Vision Transformers and many more. The ML Apps team works side by side with chip architects, compiler engineers and runtime engineers to create , build and tune distributed training solutions with Trn1. Experience training these large models using Python is a must. FSDP, Deepspeed and other distributed training libraries are central to this and extending all of this for the Neuron based system is key.Key job responsibilitiesThis role will help lead the efforts building distributed training and inference support into Pytorch, Tensorflow, Jax using XLA and the Neuron compiler and runtime stacks. This role will help tune these models to ensure highest performance and maximize the efficiency of them running on the customer AWS Trainium and Inferentia silicon and the TRn1 , Inf1 servers. Strong software development and ML knowledge are both critical to this role.About the teamAbout the teamOur team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.Diverse ExperiencesAWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.About AWSAmazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.Inclusive Team CultureHere at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.Work/Life BalanceWe value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.Mentorship & Career GrowthWe’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.BASIC QUALIFICATIONS- - 5+ years of non-internship professional software development experience- - 5+ years of programming with at least one software programming language experience- - 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience- - 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience- - Experience as a mentor, tech lead or leading an engineering team ...

2025 ASIC RTL Engineer Intern, Annapurna Labs

Amazon Web Services (AWS) internships are full-time (40 hours/week) for 12 consecutive weeks during summer. By applying to this position, your application will be considered for all locations we hire for in the United States.In Annapurna Labs we are at the forefront of hardware co-design not just in Amazon Web Services (AWS) but across the industry. The work we do is cutting-edge and internet-scale while also being deeply important to our customers. We design and build every component of our hardware and software to come together into products that our customers use for accelerated computing through Machine Learning acceleration and FPGA acceleration. If you are interested in "building a complete product" from inception to delighted customers, Annapurna is a fantastic choice.If this sounds exciting to you - come build the future with us!Responsibilities: • Participate in logic design activities as part of Amazon's machine learning custom silicon solutions• Work with physical design teams to achieve performance and area requirements.• Develop a deep understanding of the end customer requirements, including software applications, use models, system architecture and the SoC architecture/micro-architecture of our solutions• Develop and execute design automation mechanisms and flowsMentorship & Career GrowthOur team is dedicated to supporting new team members in an environment that celebrates knowledge sharing and mentorship. Projects and tasks are assigned in a way that leverages your strengths and helps you further develop your skillset.Inclusive Team CultureHere at AWS, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences.Work/Life HarmonyOur team puts a high value on work-life harmony. It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility and encourage you to find your own balance between your work and personal lives.BASIC QUALIFICATIONS- Enrolled in a Bachelors’ degree program or higher in Electrical Engineering, Computer Engineering, or a related field with a graduation conferral date between December 2025 and September 2026- Programming experience in C/C++- Programming experience in System Verilog or Verilog ...

Systems Development Engineer, Annapurna Labs, ML Accelerator Systems

The Machine Learning Platform Software Team is looking for a Systems Development Engineer who wants to develop industry leading acceleration platforms with an affinity towards efficient, robust, and highly available systems.AWS Utility Computing (UC) provides product innovations that continue to set AWS’s services and features apart in the industry. Annapurna Labs (our organization within AWS UC) designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago—even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.Key job responsibilitiesYou will develop software systems to measure and evaluate machine learning accelerator firmware performance, automate failure detection and remediation, and automate validation of device metrics.- Evaluate and optimize firmware performance- Develop testing frameworks to validate firmware- Develop software that initializes and monitors system health- Build data collection and aggregation systems at AWS scale- Build error detection and recovery mitigation systems at AWS scaleA day in the lifeThe team is focused on our organization's ability to scale. Automation, software best practices, and good architectural abstractions are key to this endeavor.You will have the opportunity to develop software and automation systems in a highly cross-functional environment, working side by side with software and hardware teams to optimize customer experience. You will be responsible for building scalable software systems that can be tested throughout the stages of product development including manufacturing and production. You will leverage automation, continuous integration, and fleet metrics to deploy and monitor your changes.About the teamOur team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.Diverse ExperiencesAWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.About AWSAmazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.Inclusive Team CultureHere at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.Work/Life BalanceWe value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Mentorship & Career GrowthWe’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. BASIC QUALIFICATIONS- 2+ years of non-internship professional software development experience- 1+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience- Knowledge of systems engineering fundamentals (networking, storage, operating systems)- Experience programming with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby ...

Executive Assistant, Annapurna Labs, AWS

Annapurna Labs, Inc, the silicon innovation arm of Amazon Web Services, is seeking a dedicated, multitasking and customer-obsessed individual to support our growing U.S. organization. This role requires superior attention to detail, the ability to meet tight deadlines, great organizational skills, and the ability to juggle multiple critical requests. A high level of integrity and discretion in handling confidential information, and professionalism in dealing with senior professionals inside and outside the company is important. The successful candidate will possess the ability to handle complex scheduling of meetings, travel, events, and tasks across multiple groups with little to no guidance, and with the ability to operate successfully in ambiguity.Working in a highly ambiguous environment, you will demonstrate initiative, complete tasks and projects quickly, and proactively take action without having to know the total picture. You will learn organizational structures and objectives of the teams you support, as well as build a strong internal network. You will apply a superior attention to detail, great organizational skills, and the ability to meet tight deadlines and juggle multiple critical requests while maintaining flexibility, a sense of humor, and most importantly, grace under pressure. Over time, you will build deep and trusting professional relationships with the leaders and their teams, which will blossom into strong partnerships that provide opportunity for special projects with increased responsibility and long-term career growth.Key job responsibilities• Supporting the leadership team across sites• Proactively coordinate and engage with leadership in planning and organizing events, activities and calendars• Manage complex calendar and scheduling requirements; ability to work with other assistants• Act as a liaison for the team across Annapurna sites, especial attention to time-zones and global holidays• Drive key team activities, such as staff meeting agendas, all-hands meetings, and customer engagements• Manage group logistics, travel, and expense reportingAbout the teamDiverse ExperiencesAmazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.Why AWSAmazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating - that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.Work/Life BalanceWe value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.Inclusive Team CultureHere at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.Mentorship and Career GrowthWe’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.BASIC QUALIFICATIONS- 5+ years of senior level leadership support experience- Experience with executive level calendar management- Experience with domestic or international travel coordination ...

Software Engineer - AI/ML, AWS Neuron Distributed Training - Multimodal

AWS Utility Computing (UC) provides product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for their cloud services. Additionally, this role may involve exposure to and experience with Amazon's growing suite of generative AI services and other cutting-edge cloud computing offerings across the AWS portfolio. Annapurna Labs (our organization within AWS UC) designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago—even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.AWS Neuron is the complete software stack for the AWS Inferentia (Inf1/Inf2) and Trainium (Trn1), our cloud-scale Machine Learning accelerators. This role is for a machine learning engineer in the Distribute Training team for AWS Neuron, responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive-scale Large Language Models (LLM) such as GPT and Llama, as well as Stable Diffusion, Vision Transformers (ViT) and many more.The ML Distributed Training team works side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed training solutions with Trainium instances. Experience with training these large models using Python is a must. FSDP (Fully-Sharded Data Parallel), Deepspeed and other distributed training libraries are central to this and extending all of this for the Neuron based system is key.Key job responsibilitiesYou will help lead the efforts building distributed training support into Pytorch, Tensorflow using XLA and the Neuron compiler and runtime stacks. You will help tune these models to ensure highest performance and maximize the efficiency of them running on the custom AWS Trainium and Inferentia silicon and the Trn1, Inf1/2 servers. Strong software development and Machine Learning knowledge are both critical to this role.About the teamOur team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future. Diverse Experiences AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. About AWS Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. Inclusive Team Culture Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness. Work/Life Balance We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Mentorship & Career Growth We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.BASIC QUALIFICATIONS- Bachelor's degree in computer science or equivalent- 3+ years of non-internship professional software development experience- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience- Experience programming with at least one software programming language- Experience in machine learning, data mining, information retrieval, statistics or natural language processing ...

Sr. Physical Design Methodology Engineer, Annapurna Labs

AWS Utility Computing (UC) provides product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for their cloud services. Additionally, this role may involve exposure to and experience with Amazon's growing suite of generative AI services and other cutting-edge cloud computing offerings across the AWS portfolio.Annapurna Labs (our organization within AWS UC) designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago—even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.Amazon Web Services provides a highly reliable, scalable, low-cost infrastructure platform in the cloud that powers hundreds of thousands of businesses in 190 countries around the world. We have data center locations in the U.S., Europe, Singapore, and Japan, and customers across all industries.Custom SoCs (System on Chip) live at the heart of AWS Machine Learning servers. As a member of the Cloud-Scale Machine Learning Acceleration team you’ll be responsible for the design and optimization of hardware in our data centers including AWS Inferentia, Trainium Systems (our custom designed machine learning inference and training datacenter servers). Our success depends on our world-class server infrastructure; we’re handling massive scale and rapid integration of emergent technologies. We’re looking for an ASIC Physical Design Methodology Engineer to help us trail-blaze new technologies and architectures, while ensuring high design quality and making the right trade-offs.Key job responsibilitiesDefine, develop and deploy innovative physical design methodologies (RTL2GDS) and CAD flows for ML Accelerator chips in advanced nodesDrive improvement in RTL2GDS flows/methodology for PPA and TAT improvementsCreate Dashboard and Central reports for project tracking and visualizing QoR/statsFine tune cloud infrastructure to improve turnaround times for physical design work.Interface directly with RTL, Physical Design, Package Design, DFT and other teams to improve methodologies and efficiencies and drive efforts to resolution.Work with EDA tool vendors to evaluate new methods, solve bugs, improve usability, etc.Drive setting up RTL2GDS flows for new nodes, run regressions, quality assurance checksAbout the teamAbout the teamOur team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.Diverse ExperiencesAWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.About AWSAmazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.Inclusive Team CultureHere at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.Work/Life BalanceWe value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.Mentorship & Career GrowthWe’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.BASIC QUALIFICATIONS- BS + 10yrs or MS + 7yrs in EE/CS- 5+ years of experience in developing physical design methodology or CAD flows in synthesis, PNR, and sign-off areas for advanced technology nodes. - Proficient in programming/scripting languages (Perl, Python, C++)- Solid understanding of ASIC physical design, and methodologies including synthesis, place and route, STA, IR, formal and physical verification. - Demonstrated level of expertise in PD tools such as Innovus, ICC2, Fusion Compiler, STA, and Sign-Off. - Proven track record of delivering metric driven PPA flow development and support. ...

Software Dev Engineer - Embedded, Runtime, Storage, System & Performance , Annapurna Labs

Are you excited about Machine Learning, chip acceleration, compilers, storage, systems or EC2? Are you passionate about delivering high quality services that affect hundreds of thousands of users? We are the dubbed the "secret sauce" behind AWS's success with development centers in the U.S. and Israel, Annarpuna is at the forefront of innovation by combining cloud scale with the world’s most talented engineers.We hire for multiple disciplines Software and Hardware engineers including but not limited to: compiler engineer, machine learning engineer, runtime engineer, performance engineer and ML chip accelerator, ASIC, physical designs. Because of our teams’ breadth of talent, we’ve been able to improve AWS cloud infrastructure in networking and security with products such as AWS Nitro, Enhanced Network Adapter (ENA), and Elastic Fabric Adapter (EFA), in compute with AWS Graviton and F1 EC2 Instances, in machine learning with AWS Neuron, Inferentia and Trainium ML Accelerators, and in storage with scalable NVMe.Key job responsibilities- Innovating and delivering creative SW Designs to develop new services, solve operational problems, drive improvements in developer velocity, or positively impact operational safety- Writing requirements capturing documents, design documents, integration test plans, and deployment plans- Communicating status and progress of deliverables to schedule, and sharing learnings/ innovations with your team and stakeholdersBASIC QUALIFICATIONS- Currently enrolled in, or completed a Bachelor’s degree program or higher in Computer Science, Computer Engineering, Electrical Engineering or related field- To qualify, applicants should have earned a Bachelor’s or Master’s degree between May 2023 to September 2025. Possible start dates for this role are between January 2025 to October 2025.- Programming experience in internship or coursework with programming language such as Python and/or C or C++.Candidates with strong interests and academic qualifications/research focus in two of the following:• Distributed systems, algorithms (MPI, NCCL, or similar)• Operating System - Linux system programming/services• Computer architecture• System Development• Complexity analysis ...

Senior SoC Functional Modeling Engineer, AWS Machine Learning Accelerators

Custom SoCs (System on Chips) are the brains behind AWS’s Machine Learning servers. Our team builds C++ & SystemC functional models of these custom-designed accelerator SoCs for use by AWS internal teams. We’re looking for a Senior SoC Modeling Engineer to join the team and deliver new functional models, infrastructure, and tooling for our customers.As part of the ML accelerator modeling team, you will:- Develop and own SoC functional models end-to-end, including model architecture, integration with other model or infrastructure components, testing, and debug- Work closely with architecture, RTL design, design verification, emulation, and software teams to build, debug, and deploy your models- Innovate on the tooling you provide to customers, making it easier for them to use our SoC models- Drive model and modeling infrastructure performance improvements to help our models scale- Develop software which can be maintained, improved upon, documented, tested, and reusedAnnapurna Labs, our organization within AWS, designs and deploys some of the largest custom silicon in the world, with many subsystems that must all be modeled and tested with high quality. Our SoC model is a critical piece of software used in both our SoC development process and by our partner software teams. You’ll collaborate with many internal customers who depend on your models to be effective themselves, and you'll work closely with these teams to push the boundaries of how we're using modeling to build successful products.You will thrive in this role if you:- Are an expert in functional modeling for SoCs, ASICs, TPUs, GPUs, or CPUs- Are comfortable modeling in C++ or SystemC, and familiar with Python- Enjoy learning new technologies, building software at scale, moving fast, and working closely with colleagues as part of a small team within a large organization- Want to jump into an ML-aligned role, or get deeper into the details of ML at the hardware/system-levelAlthough we are building machine learning chips, no machine learning background is needed for this role. This role spans modeling of the ML and management regions of our chips, and you’ll dip your toes into both. You’ll be able to ramp up on ML as part of this role, and any ML knowledge that’s required can be learned on-the-job.This role can be based in either Cupertino, CA or Austin, TX. The broader team is split between the two sites, with a slight preference for CA, due to colocation with more customer teams.We're changing an industry. We're searching for individuals who are ready for this challenge, who want to reach beyond what is possible today. Come join us and build the future of machine learning!A day in the lifeA few videos help explain what the Annapurna Labs ML team is working on:- https://youtu.be/4nfkonjjICo?si=nKhM1Wv4108mnIOa- https://youtu.be/n38WDflRbjQ?si=SZEV5i_5du1jKYP-About the teamAWS Utility Computing (UC) provides product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for their cloud services. Additionally, this role may involve exposure to and experience with Amazon's growing suite of generative AI services and other cutting-edge cloud computing offerings across the AWS portfolio.Annapurna Labs (our organization within AWS UC) designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago—even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.Diverse ExperiencesAWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. About AWSAmazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.Inclusive Team CultureHere at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.Work/Life BalanceWe value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Mentorship & Career GrowthWe’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. BASIC QUALIFICATIONS- 6+ years of non-internship professional experience writing functional or performance models- Experience programming with C++ and/or SystemC- Familiarity with SoC, CPU, GPU, and/or ASIC architecture and micro-architecture ...

Sr. Software Development Manager - ML Compiler, AWS Neuron, Annapurna Labs

The Ideal Candidate: Experience in developing programming languages and related compilers and tools. Plus any experience in ML workloads and Kernel languages.The Product: AWS Machine Learning accelerators are at the forefront of AWS innovation. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in the cloud. Trainium will deliver the best-in-class ML training performance with the most teraflops (TFLOPS) of compute power for ML in the cloud. This is all enabled by a new software stack, the AWS Neuron Software Development Kit (SDK), which includes an ML compiler, runtime and natively integrates into popular ML frameworks, such as PyTorch, TensorFlow and MxNet. The Neuron SDK optimizes performance of complex neural net models executed on AWS Inferentia and Trainium. AWS Neuron is used at scale with customers and partners like PyTorch, Epic Games, Snap, AirBnB, Autodesk, Amazon Alexa, Amazon Rekognition and more customers in various other segments.The Team: The Amazon Annapurna Labs team is responsible for building innovation in silicon and software for AWS customers. We are at the forefront of innovation by combining cloud scale with the world’s most talented engineers. Our team covers multiple disciplines including silicon engineering, hardware design and verification, software and operations. With such breadth of talent, there's opportunity to learn all of the time. We operate in spaces that are very large, yet our teams remain small and agile. There is no blueprint. We're inventing. We're experimenting. When you couple that with the ability to work on so many different products and services, it's a very unique learning culture.Learn more about Our History:https://www.amazon.science/how-silicon-innovation-became-the-secret-sauce-behind-awss-successYou: As a Senior Manager on the AWS Neuron team, you'll be leading a team of compiler engineers through developing, deploying, and scaling a compiler targeting AWS Inferentia and Trainium. You'll need to be technically capable, credible and curious in your own right as a trusted AWS Neuron Manager, innovating on behalf of our customers. You’ll leverage your vision and technical communication skills as a hands-on partner to AWS ML services teams, to be involved in pre-silicon design, bring new products/optimizations/features to market, and many other exciting projects to ensure the Neuron SDK exceeds our customers' needs of high performance, low cost, and ease of use.You will have deep knowledge of resource management, scheduling, code generation, optimization, and new instruction architectures including CPU, NPU, GPU and novel forms of compute.Explore the Product:https://awsdocs-neuron.readthedocs-hosted.com/en/latest/neuron-guide/neuron-cc/index.htmlhttps://github.com/aws/aws-neuron-sdkhttps://aws.amazon.com/machine-learning/neuron/https://aws.amazon.com/machine-learning/neuron/About the teamInclusive Team CultureHere at Annapurna Labs, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences. Amazon’s culture of inclusion is reinforced within our 14 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust.Work/Life BalanceOur team puts a high value on work-life balance. It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility in working hours and encourage you to find your own balance between your work and personal lives.Mentorship & Career GrowthOur team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring. We care about your career growth and strive to assign projects based on what will help each team member develop into a better-rounded engineer and enable them to take on more complex tasks in the future. BASIC QUALIFICATIONS- - 6+ years of engineering team management experience- - 8+ years of working directly within engineering teams experience- - 6+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience- - Experience partnering with product or program management teams- - Excellent software design fundamentals, knowledge of software engineering principles, and a deep understanding of compilers (resource management, instruction scheduling, code generation, and compute graph optimization ...

Manufacturing Platform Engineer - Software Development, Annapurna Labs

Annapurna Labs (our organization within AWS UC) designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago—even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.Annapurna Labs is currently seeking highly experienced Software Development Engineers to enable automation systems and develop test infrastructure for manufacturing environments globally. As a member of the Machine Learning Acceleration team, you will develop and maintain critical test infrastructure for our ML acceleration hardware, ensuring quality and reliability across our manufacturing operations.You will work on developing at-scale software solutions to manage the manufacturing environments at board and server level test. You will work together with other engineering teams to unify testing solutions between manufacturing and data center operations groups. You will develop and maintain the test deployment and distribution systems to ensure that our manufacturing partners have access to appropriate versions of our software as soon as it's available. You will respond to new issues raised by our manufacturing partners, analyze logs and failures, and then develop and deploy solutions to those issues. You will develop documentation as well as testing and debug procedures for our manufacturing partners to follow.Key job responsibilities- Develop, validate and deploy test infrastructure mechanisms into manufacturing environments globally- Design and manage scaled fleets of custom test equipment, ensuring optimal performance and maintenance- Architect and develop data processing solutions using internal cloud services to collect, analyze, and manage manufacturing test data- Drive manufacturing test infrastructure unification efforts between lab and production environments- Collaborate with cross-functional teams to implement robust automation solutionsA day in the lifeAbout the teamOur team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.Diverse ExperiencesAWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.About AWSAmazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.Inclusive Team CultureHere at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences, inspire us to never stop embracing our uniqueness.Work/Life BalanceWe value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.Mentorship & Career GrowthWe’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.BASIC QUALIFICATIONS- Bachelor's degree in Computer Science, Electrical Engineering, or related technical field- 5+ years of experience in software development using languages such as Python, C++, or Java- Experience developing and maintaining large-scale software systems- Strong understanding of data structures, algorithms, and software design patterns- Experience with Linux/Unix operating systems and shell scripting- Familiarity with version control systems (e.g., Git) and CI/CD pipelines- Experience with database systems and SQL- Strong problem-solving skills and ability to debug complex systems ...

ASIC Design Engineer, Cloud-Scale Machine Learning Acceleration team

Utility Computing (UC)AWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Internet of Things (Iot), Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for customers who require specialized security solutions for their cloud services.Annapurna Labs (our organization within AWS UC) designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago—even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.About AWSAmazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.Diverse ExperiencesAWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Work/Life BalanceWe value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Inclusive Team CultureHere at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences, inspire us to never stop embracing our uniqueness.Mentorship & Career GrowthWe’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. Custom SoCs (System on Chip) live at the heart of AWS Machine Learning servers. As a member of the Cloud-Scale Machine Learning Acceleration team you’ll be responsible for the design and optimization of hardware in our data centers including AWS Inferentia, our custom designed machine learning inference datacenter server. Our success depends on our world-class server infrastructure; we’re handling massive scale and rapid integration of emergent technologies. We’re looking for an ASIC Design Eengineer to help us trail-blaze new technologies and architectures, while ensuring high design quality and making the right trade-offs.Key job responsibilities- integrate multiple subsystems into top level SOC, ensure correct clock/reset/functional/DFT signal routing- As a key member of the ASIC design team, you will implement and deliver high performance, area and power efficient RTL to achieve design targets and specifications.- Analyze design, microarchitecture or architecture to make trade-offs based on features, power, performance or area requirements.- Develop micro-architecture, implement SystemVerilog RTL, and deliver synthesis/timing clean design with constraints.- Perform lint and clock domain crossing quality checks on the design.- Work with with architects, other designers, verification teams, pre- and post-silicon validation teams, synthesis, timing and back-end teams to accomplish your tasks.You will thrive in this role if you:- Are familiar with scripting in Python- Are proficient with assertions- Have good debug skills to analyze RTL test failures- Have a "Learn and Be Curious" mindsetAbout the teamCustom SoCs (System on Chip) live at the heart of AWS Machine Learning servers. As a member of the Cloud-Scale Machine Learning Acceleration team you’ll be responsible for the design and optimization of hardware in our data centers including AWS Inferentia, our custom designed machine learning inference datacenter server. Our success depends on our world-class server infrastructure; we’re handling massive scale and rapid integration of emergent technologies. We’re looking for an ASIC Design Eengineer to help us trail-blaze new technologies and architectures, while ensuring high design quality and making the right trade-offs.Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.BASIC QUALIFICATIONS- B.S. in Electrical Engineering or related technical field- 5+ years in RTL design for SOC- 5+ years in VLSI engineering- 5+ years with code quality tools including: Spyglass, LINT, or CDC ...

Sr. Software Development Engineer, Annapurna Labs

Annapurna Labs (our organization within AWS UC) designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago—even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.The AWS Cloud Storage offers a complete range of hardware and software for customers to store, access, govern, and analyze their data, reducing costs, increasing agility, and accelerating innovation.AWS Cloud Storage team is hiring firmware engineers with a background in NVMe memory devices to solve our customers toughest problems.As a firmware engineer on the AWS Cloud Storage team, you will be a thought leader at the forefront of consumer storage and networking solutions. You should feel equally comfortable in server and embedded environments, possess a deep understanding of computer architecture, Linux OS, and programming sophisticated embedded devices.Every day you will be working alongside brilliant engineers and leaders who obsess about performance, availability, scalability and durability of customer data, with the ambitious goal of improving AWS' industry-leading product.Key job responsibilities- Research, design, implement Firmware to support NVMe subsystem, DMA and Crypto through specialized HW units in Nitro Cards.- Debug complex, system-level, multi-component issues across multiple layers from kernel to application- Profile system performance activity and drive optimizations across our software stack- Deliver production-quality code and support its operation in the production environmentAbout the teamOur team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.Diverse ExperiencesAWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.About AWSAmazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.Inclusive Team CultureHere at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.Work/Life BalanceWe value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.Mentorship & Career GrowthWe’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.BASIC QUALIFICATIONS- Experience as a mentor, tech lead or leading an engineering team- 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience- 5+ years of experience with programming language: C or C++- 5+ years of experience in embedded Linux systems or NVMe Subsystem ...

Software Engineer- AI/ML, AWS Neuron

AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machinelearning accelerators and the Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like GPT2, GPT3 and beyond, as well as stable diffusion, Vision Transformers and many more. The ML Apps team works side by side with chip architects, compiler engineers and runtime engineers to create , build and tune distributed training solutions with Trn1. Experience training these large models using Python is a must. FSDP, Deepspeed and other distributed training libraries are central to this and extending all of this for the Neuron based system is key.Key job responsibilitiesThis role will help lead the efforts building distributed training and inference support into Pytorch, Tensorflow, Jax using XLA and the Neuron compiler and runtime stacks. This role will help tune these models to ensure highest performance and maximize the efficiency of them running on the customer AWS Trainium and Inferentia silicon and the TRn1 , Inf1 servers. Strong software development and ML knowledge are both critical to this role.About the teamAbout UsInclusive Team CultureHere at AWS, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences. Amazon’s culture of inclusion is reinforced within our 16 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust.Work/Life BalanceOur team puts a high value on work-life balance. It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility in working hours and encourage you to find your own balance between your work and personal lives.Mentorship & Career GrowthOur team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge sharing and mentorship. We care about your career growth and strive to assign projects based on what will help each team member develop into a better-rounded professional and enable them to take on more complex tasks in the future.BASIC QUALIFICATIONS- 3+ years of non-internship professional software development experience- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience- Experience programming with at least one software programming language ...

MLA IP Design Verification Engineer , Annapurna Labs

Job summary Amazon Web Services provides a highly reliable, scalable, low-cost infrastructure platform in the cloud that powers hundreds of thousands of businesses in 190 countries around the world.We are seeking an experienced Design Verification Engineers to build the next generation of our cloud server platforms. Our success depends on our world-class infrastructure; we’re handling massive scale and rapid integration of emergent technologies.As a member of the Cloud-Scale Machine Learning Acceleration team you’ll be responsible for the design and validation of machine learning hardware in our data centers.Some of your responsibilities will include verifying/validating that our hardware and software solutions achieve their desired functionality, developing and executing multi-faceted verification/validation plans, and measuring the teams progress towards our ambitious customer metrics.Key job responsibilitiesSome of your responsibilities will include verifying/validating that our hardware and software solutions achieve their desired functionality, developing and executing multi-faceted verification/validation plans, and measuring the teams progress towards our ambitious customer metrics.About the teamInclusive Team CultureHere at AWS, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences. Amazon’s culture of inclusion is reinforced within our 16 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust. Work/Life BalanceOur team puts a high value on work-life balance. It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility in working hours and encourage you to find your own balance between your work and personal lives. Mentorship & Career GrowthOur team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge sharing and mentorship. We care about your career growth and strive to assign projects based on what will help each team member develop into a better-rounded professional and enable them to take on more complex tasks in the future.BASIC QUALIFICATIONS- BS in engineering- 5+ years of design verification experience using System Verilog and UVM- 5+ years in testbench development including: stimulus, checkers, assertions and coverage- 5+ years of verifying multiple levels of logic including: IP blocks and full SOC system testing- Experience with C/C++, Python, or Perl- Significant experience in IP level design verification preferably in the CPU/compute space. ...

Software Dev Engineer - Machine Learning Apps, Accelerator, Annapurna ML

By applying to this position, your application will be considered for all locations we hire for in the United States.Are you excited about Machine Learning, chip acceleration, compilers, storage, systems or EC2? Are you passionate about delivering high quality services that affect hundreds of thousands of users? We are the dubbed the "secret sauce" behind AWS's success with development centers in the Canada and Israel, Annarpuna is at the forefront of innovation by combining cloud scale with the world’s most talented engineers.The Annapurna team hires for multiple disciplines Software and Hardware engineers including but not limited to compiler engineer, machine learning engineer, runtime engineer, performance engineer and ML chip accelerator, ASIC, physical designs, SDE in Test. Because of our teams’ breadth of talent, we’ve been able to improve AWS cloud infrastructure in networking and security with products such as AWS Nitro, Enhanced Network Adapter (ENA), and Elastic Fabric Adapter (EFA), in compute with AWS Graviton and F1 EC2 Instances, in machine learning with AWS Neuron, Inferentia and Trainium ML Accelerators, and in storage with scalable NVMe.If this sounds exciting to you - come build the future with us!Key job responsibilitiesInnovating and delivering creative SW Designs to develop new services, solve operational problems, drive improvements in developer velocity, or positively impact operational safetyWriting requirements capturing documents, design documents, integration test plans, and deployment plansCommunicating status and progress of deliverables to schedule, and sharing learnings/ innovations with your team and stakeholdersBASIC QUALIFICATIONS- Currently enrolled in, or completed a Bachelor’s degree program or higher in Computer Science, Computer Engineering, Electrical Engineering or related field- To qualify, applicants should have earned a Bachelor’s or Master’s degree between May 2023 to September 2025. Possible start dates for this role are between January 2025 to October 2025.- Programming experience in internship or coursework with programming language such as Python and/or C or C++.Candidates with strong interests and academic qualifications/research focus in two of the following:- Distributed systems, algorithms (MPI, NCCL, or similar)- Operating System - Linux system programming/services- Computer architecture- System Development- Complexity analysis ...

Sr. Substrate CAD layout eng, Annapurna Labs

Amazon Web Services provides a highly reliable, scalable, low-cost infrastructure platform in the cloud that powers hundreds of thousands of businesses in 190 countries around the world. We have data center locations in the U.S., Europe, Singapore, and Japan, and customers across all industries.We are seeking experienced Senior Substrate CAD layout Engineer for the next generation of our ML ASIC. This is a fast-paced, intellectually challenging position, and you’ll work with thought leaders in multiple technology areas. You’ll have relentlessly high standards for yourself and everyone you work with, and you’ll be constantly looking for ways to improve your products performance, quality and cost.We’re changing an industry, and we want individuals who are ready for this challenge and want to reach beyond what is possible today.As a Substrate/PCB layout engineer you will participate in the definition and implementation of substrate and PCB boards. This individual will contribute in all phases of the product’s design, including placement, routing, constraint table management, and drafting. You must bring a combination of Substrate/PCB design, assembly, test and fabrication knowledge, cross team functional interdependence, and demonstrable design process experience to the position. In this role you will: Work with the Electrical Engineering hardware team to provide PCB board design services Work with fabrication vendors to supply data and seek fabrication design requirements Work with assembly vendors to supply data and seek assembly design requirements Deliver best in class Substrate/PCB design to high-volume manufacturing Assist as necessary in schematic and PCB footprint development Work with the company PLM system to write and release ECO’sBASIC QUALIFICATIONS B.S. in Electrical Engineering or related field 7+ years of experience of PCB layout design 7+ years of current experience using Cadence Allegro PCB Design/CIS 7+ years of current experience using Cadence SIP and APD 7+ years of experience with high speed and impedance circuits 7+ years of experience with compact, dual side assembly design Experience with HDI design Experience with BGA pitch of equal or less than 0.5mm ...