Sr. Software Development Manager - ML Compiler, AWS Neuron, Annapurna Labs

The Ideal Candidate: Experience developing programming languages and related compilers and tools, plus any experience with ML workloads and kernel languages.
The Product: AWS Machine Learning accelerators are at the forefront of AWS innovation. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in the cloud. Trainium will deliver the best-in-class ML training performance with the most teraflops (TFLOPS) of compute power for ML in the cloud. This is all enabled by a new software stack, the AWS Neuron Software Development Kit (SDK), which includes an ML compiler and runtime and natively integrates into popular ML frameworks such as PyTorch, TensorFlow and MXNet. The Neuron SDK optimizes performance of complex neural net models executed on AWS Inferentia and Trainium. AWS Neuron is used at scale with customers and partners like PyTorch, Epic Games, Snap, Airbnb, Autodesk, Amazon Alexa, Amazon Rekognition and more customers in various other segments.
The Team: The Amazon Annapurna Labs team is responsible for building innovation in silicon and software for AWS customers. We are at the forefront of innovation by combining cloud scale with the world’s most talented engineers. Our team covers multiple disciplines including silicon engineering, hardware design and verification, software and operations. With such breadth of talent, there's opportunity to learn all of the time. We operate in spaces that are very large, yet our teams remain small and agile. There is no blueprint. We're inventing. We're experimenting.
When you couple that with the ability to work on so many different products and services, it's a unique learning culture.
Learn more about Our History: https://www.amazon.science/how-silicon-innovation-became-the-secret-sauce-behind-awss-success
You: As a Senior Manager on the AWS Neuron team, you'll be leading a team of compiler engineers through developing, deploying, and scaling a compiler targeting AWS Inferentia and Trainium. You'll need to be technically capable, credible and curious in your own right as a trusted AWS Neuron Manager, innovating on behalf of our customers. You’ll leverage your vision and technical communication skills as a hands-on partner to AWS ML services teams, taking part in pre-silicon design, bringing new products, optimizations and features to market, and contributing to many other exciting projects to ensure the Neuron SDK exceeds our customers' needs for high performance, low cost, and ease of use. You will have deep knowledge of resource management, scheduling, code generation, optimization, and new instruction architectures including CPU, NPU, GPU and novel forms of compute.
Explore the Product:
https://awsdocs-neuron.readthedocs-hosted.com/en/latest/neuron-guide/neuron-cc/index.html
https://github.com/aws/aws-neuron-sdk
https://aws.amazon.com/machine-learning/neuron/
About the team
Inclusive Team Culture
Here at Annapurna Labs, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences. Amazon’s culture of inclusion is reinforced within our 14 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust.
Work/Life Balance
Our team puts a high value on work-life balance.
It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility in working hours and encourage you to find your own balance between your work and personal lives.
Mentorship & Career Growth
Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring. We care about your career growth and strive to assign projects based on what will help each team member develop into a better-rounded engineer and enable them to take on more complex tasks in the future.
BASIC QUALIFICATIONS
- 6+ years of engineering team management experience
- 8+ years of experience working directly within engineering teams
- 6+ years of experience designing or architecting (design patterns, reliability and scaling) new and existing systems
- Experience partnering with product or program management teams
- Excellent software design fundamentals, knowledge of software engineering principles, and a deep understanding of compilers (resource management, instruction scheduling, code generation, and compute graph optimization ...

MLA IP Design Verification Engineer, Annapurna Labs

Job summary
Amazon Web Services provides a highly reliable, scalable, low-cost infrastructure platform in the cloud that powers hundreds of thousands of businesses in 190 countries around the world.
We are seeking experienced Design Verification Engineers to build the next generation of our cloud server platforms. Our success depends on our world-class infrastructure; we’re handling massive scale and rapid integration of emergent technologies.
As a member of the Cloud-Scale Machine Learning Acceleration team you’ll be responsible for the design and validation of machine learning hardware in our data centers.
Key job responsibilities
Some of your responsibilities will include verifying/validating that our hardware and software solutions achieve their desired functionality, developing and executing multi-faceted verification/validation plans, and measuring the team's progress towards our ambitious customer metrics.
About the team
Inclusive Team Culture
Here at AWS, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences. Amazon’s culture of inclusion is reinforced within our 16 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust.
Work/Life Balance
Our team puts a high value on work-life balance.
It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility in working hours and encourage you to find your own balance between your work and personal lives.
Mentorship & Career Growth
Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge sharing and mentorship. We care about your career growth and strive to assign projects based on what will help each team member develop into a better-rounded professional and enable them to take on more complex tasks in the future.
BASIC QUALIFICATIONS
- BS in engineering
- 5+ years of design verification experience using SystemVerilog and UVM
- 5+ years in testbench development, including stimulus, checkers, assertions and coverage
- 5+ years of verifying multiple levels of logic, including IP blocks and full SoC system testing
- Experience with C/C++, Python, or Perl
- Significant experience in IP-level design verification, preferably in the CPU/compute space. ...

Runtime/Driver Software Development Engineer, Neuron Runtime

At AWS AI our vision is to make deep learning pervasive for everyday developers and to democratize access to cutting-edge infrastructure. To deliver on that vision, we’ve created innovative software and hardware solutions that make it possible.
AWS Neuron SDK is the complete software stack for the AWS Inferentia and Trainium machine learning accelerators designed by Annapurna Labs inside AWS. The Neuron SDK consists of the compiler, runtime, frameworks, and tooling customers need. It’s also preinstalled in AWS Deep Learning AMIs and Deep Learning Containers so customers can quickly get started with running high-performance and cost-effective inference and training.
This position is for a Software Engineer on the AWS Neuron SDK team with a deep background in Linux and device drivers. The role calls for deep knowledge of and experience with hardware-based accelerator platforms (such as encryption, TCP offload, storage accelerators and router/switch platforms) and with the complexity of the software layers that interact with and manage them. Experience implementing high-performance software drivers, needed to ensure high throughput from software to hardware, is also a necessity. This engineer will work with the team to define the key interfaces and will develop the runtime stack and driver that manage the accelerator.
This is a fast-paced, intellectually challenging position, where you’ll work with thought-leaders in multiple technology areas. You’ll have high standards for yourself and everyone you work with, and you’ll be constantly looking for ways to improve our products' performance, quality and cost.
We’re changing an industry, and we want individuals who are ready for this challenge and want to reach beyond what is possible today.
About the team
**Why AWS**
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform.
We pioneered cloud computing and never stopped innovating, which is why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
**Utility Computing (UC)**
AWS Utility Computing (UC) provides product innovations, from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2) to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Internet of Things (IoT), Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for their cloud services.
**Diverse Experiences**
Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
**Work/Life Balance**
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
**Inclusive Team Culture**
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empowers us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.
**Mentorship and Career Growth**
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer.
That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
BASIC QUALIFICATIONS
- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience programming with at least one software programming language ...

Sr. Software Development Engineer, ML Infrastructure Team

Want to help drive the success of Machine Learning technologies at AWS? Do you have the skills and motivation to build automation that supports the success of peer teams? We want to talk to you! We seek a Sr. Software Development Engineer for the Machine Learning (ML) Infrastructure team to build the tools that are used to guarantee top performance of AWS ML and High Performance Computing (HPC) technologies developed by our organization. Bring your exceptional knowledge of CI/CD automation, ML and HPC benchmarks and applications to bear on the cutting-edge software we develop. Join us as we expand the AWS offerings for AI, including Trainium, Graviton and the Elastic Fabric Adapter (EFA).
Key job responsibilities
Be the lead engineer on a team that builds and maintains the infrastructure that monitors and reports on functionality and performance of massive testing workloads run at scale. Use internal Amazon CI/CD tools, Linux, and public AWS products to automate the delivery of our software to customers, saving developer time. Write Python code that effortlessly spins up large clusters and runs benchmarks and applications for ML and HPC workloads. Use AWS Managed Grafana and Athena to digest the massive amount of performance data generated by these workloads and create dashboards for developers and stakeholders. Invent automatic mechanisms to alert developers to functional and performance regressions so they never reach customers. Manage the complexity of infrastructure that covers many instance types, software stacks, Linux operating systems and cutting-edge releases, and make it easy to evolve.
A day in the life
You use TypeScript and the CDK to ensure all infrastructure setup is defined as code (IaC), reviewed and committed to automated pipelines. You find innovative ways to schedule work using SLURM and Active Directory, supporting multiple teams of developers while keeping cluster costs down.
You write excellent documents that communicate clearly to peers, stakeholders, and leadership what the team is doing and the plan for future work. You draw on your experience as a software developer to mentor other engineers.
About the team
We are part of Annapurna Labs, a subsidiary within AWS that builds the software and hardware that make ML on EC2 work. Our organization is a dedicated group of innovators who have invented new networks, new silicon, and new software suites, and combined those to entice customers to move immense ML and HPC workloads to the cloud. The ML Infrastructure team is laser-focused on making AWS the best and most cost-effective place for customers to do AI at scale.
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
About AWS
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating, which is why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empowers us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture.
When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
BASIC QUALIFICATIONS
- 5+ years of non-internship professional software development experience
- 5+ years of experience leading design or architecture (design patterns, reliability and scaling) of new and existing systems
- 5+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
- Experience as a mentor, tech lead or leader of an engineering team
- 5+ years of experience coding in Python, TypeScript and CDK
- Experience developing highly automated CI/CD pipelines (Jenkins preferred)
- Proficiency working with Linux, ideally including containers
- Experience with clustered ML or HPC applications or benchmarks, especially using SLURM or in AWS ...

System Validation Engineer, System Validation Engineer

AWS Utility Computing (UC) provides product innovations, from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2) to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Internet of Things (IoT), Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for their cloud services.
Annapurna Labs (our organization within AWS UC) designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago, even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.
The validation team is at the forefront of the hardware lifecycle and supports the project at every step, from emulation, through initial bring-up and validation, to the integration of customer feedback into tools and workloads. We are looking for a Software Development Engineer II to join the validation team and ensure our newest hardware will delight customers!
Key job responsibilities
- Develop chip-level and system-level validation plans focused on customer needs
- Develop workloads and tools to enable validation, tested using emulation environments
- Execute validation plans, debug failures, and communicate with key project stakeholders
- Enable other teams to use workloads and tools for other parts of the project, such as characterization
- Teach and share debug strategies with the larger team
About the team
Our team is dedicated to supporting new members.
We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop their engineering expertise so they feel empowered to take on more complex tasks in the future.
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
About AWS
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating, which is why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empowers us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer.
That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
BASIC QUALIFICATIONS
- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience programming with at least one software programming language ...

HBM PHY Expert, Annapurna Labs

AWS Utility Computing (UC) provides product innovations, from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2) to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Internet of Things (IoT), Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for their cloud services.
Annapurna Labs (our organization within AWS UC) designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago, even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.
We are seeking an HBM/DDRx PHY expert to take a role in the definition, design and validation of AWS next-generation ML chips, cards and server integration. As a senior member of our platform development team, you will have the outstanding and meaningful opportunity to participate in the design and execution of all HBM, memory and SerDes topics, with the goal of creating customized platforms that fit within AWS datacenters' world-leading technology. The HBM Lead Engineer will need to work independently with vendors, understand the settings, write and modify tests, debug, and collect data in the fleet.
Key job responsibilities
As a senior member of the team, you will join a group of hardworking engineers to design and implement innovative next-generation machine learning chips and servers. In this position, you will make a real impact in a dynamic, technology-focused team.
Your work will impact the growing field of machine learning. You will collaborate with architects, design teams and software engineers to deliver the next-generation ML chip. In this position, you will have the opportunity to be responsible for IP integration, 2.5D design, bring-up, characterization and validation.
A day in the life
A day in the life of an ASIC Engineer on the AWS Organization team focuses on operational excellence, constructively identifying problems and proposing solutions. You help your team evolve by actively participating in the code review process, design discussions, team planning, and ticket/metric/COE reviews. ASIC Engineers will also mentor and help to develop others and interview for the team.
About the team
Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop their engineering expertise so they feel empowered to take on more complex tasks in the future.
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
About AWS
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating, which is why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious.
Our employee-led affinity groups foster a culture of inclusion that empowers us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
BASIC QUALIFICATIONS
- BS or MS in EE, ECE or CS
- 7+ years of experience in silicon development, with 3+ years in SoC/IO/subsystems
- Good understanding of DDR/HBM at the PHY and controller level
- Good knowledge of DDR/HBM training, timing parameters and/or controller features
- Drive the IP integration and design of silicon and 2.5D packaging
- Support the physical design team, review clocking and timing constraints
- Drive cross-functional triage effort on complex functional and performance issues
- Take the leadership role in post-silicon bring-up of IO and HBM/DDRx stack
- Define boot-up initialization, reset flow, training sequence
- Perform system-level debug and root-cause analysis through bring-up, characterization, validation and production phases
- Experience working with third-party IP and memory vendors ...

Senior Software Engineer - Hardware Abstraction Layer, Team Lead, AWS Machine Learning Accelerators

Custom silicon chips live at the heart of AWS machine learning servers (Trainium and Inferentia), and our team builds the backend software that manages these servers. We’re looking for someone to lead our system-on-chip hardware abstraction layer (SoC HAL) software team. You'll help us deliver at scale, as we build the next generations of SoC software. This is a hands-on, in-the-trenches software engineering leadership position.
As the team lead for SoC HAL SW, you will:
- Lead (both technically and as a people-manager) a small, strong team of ~5 developers
- Work with hardware designers to build HALs for newly developed SoC IPs
- Work with system software teams to solve SoC and system-level architectural issues, drive debug, architect the HAL itself, and innovate on cross-functional solutions
- Continuously test and deploy your software stack to multiple internal customers
- Refactor and maintain existing codebases throughout the device lifecycle
- Innovate on the tooling you provide to customers, making it easier for them to use and debug our SoCs
AWS's Annapurna Labs designs and deploys some of the largest custom silicon in the world, with many subsystems that all must be managed, tested, and monitored. The SoC HAL is a critical piece of the AWS infrastructure management software stack that ensures the chip is functional, performant, and secure.
You will thrive in this role if you:
- Enjoy building, managing, and leading small teams, or are looking to grow in this area
- Enjoy coaching and mentoring teammates both technically and on interpersonal topics
- Are proficient in C++ and familiar with Python
- Enjoy working with hardware-based systems, and diving into chip and system architecture
- Love solving complex system-level issues
- Know how to build effective abstractions over low-level SoC details
- Have strong opinions about software architecture, and are able to apply them effectively
- Are familiar with modular driver architectures (such as the Linux or Windows driver stacks)
- Enjoy learning new technologies, building software at scale, moving fast, and working closely with colleagues as part of a small, startup-like team within a large organization
Although we build and deploy ML chips, no ML background is needed for this role. Your team (and your software) won’t be doing ML. Our HAL lives at the lowest level of the backend AWS infrastructure responsible for managing our ML servers. You and your team will develop HALs for components used by machine learning, like PCIe and HBM, but won’t need to deeply understand ML yourselves.
This role can be based in either Cupertino, CA or Austin, TX. The team is split between the two sites, with no preference for one over the other.
This is a fast-paced role where you'll work with thought-leaders in multiple technology areas. You'll have high standards for yourself and everyone you work with, and you'll be constantly looking for ways to improve your software, as well as our products' overall performance, quality, and cost. We're changing an industry. We're searching for individuals who are ready for this challenge, who want to reach beyond what is possible today.
Come join us and build the future of machine learning!
A day in the life
A few videos help explain what the Annapurna Labs ML team is working on:
- https://youtu.be/4nfkonjjICo?si=nKhM1Wv4108mnIOa
- https://youtu.be/n38WDflRbjQ?si=SZEV5i_5du1jKYP-
About the team
AWS Utility Computing (UC) provides product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for their cloud services. Additionally, this role may involve exposure to and experience with Amazon's growing suite of generative AI services and other cutting-edge cloud computing offerings across the AWS portfolio.
Annapurna Labs (our organization within AWS UC) designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago, even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.
Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop their engineering expertise so they feel empowered to take on more complex tasks in the future.
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply.
If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. About AWSAmazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.Inclusive Team CultureHere at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.Work/Life BalanceWe value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Mentorship & Career GrowthWe’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional. 
BASIC QUALIFICATIONS- 6+ years of programming with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby experience- 6+ years of non-internship professional software development experience- 4+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience- Experience leading the design, build and deployment of complex and performant (reliable and scalable) software solutions in production- C++ development experience- Experience developing low-level software for hardware (SoC, ASIC, GPU, CPU, etc.) ...

ASIC Design Engineer, Cloud-Scale Machine Learning Acceleration team

Amazon Web Services provides a highly reliable, scalable, low-cost infrastructure platform in the cloud that powers hundreds of thousands of businesses in 190 countries around the world. We have data center locations in the U.S., Europe, Singapore, and Japan, and customers across all industries.
Custom SoCs (System on Chip) live at the heart of AWS Machine Learning servers. As a member of the Cloud-Scale Machine Learning Acceleration team you’ll be responsible for the design and optimization of hardware in our data centers including AWS Inferentia, our custom designed machine learning inference datacenter server. Our success depends on our world-class server infrastructure; we’re handling massive scale and rapid integration of emergent technologies. We’re looking for an ASIC Design Engineer to help us trail-blaze new technologies and architectures, while ensuring high design quality and making the right trade-offs.
Key job responsibilities
As an ASIC Design Engineer, you will:
• Develop and implement high-performance, area and power-efficient RTL designs to meet project specifications and targets
• Conduct in-depth analysis of designs, microarchitectures, and architectures to optimize trade-offs between features, power consumption, performance, and area requirements
• Create microarchitectures, implement SystemVerilog RTL, and deliver synthesis and timing-clean designs with appropriate constraints
• Execute lint and clock domain crossing quality checks to ensure design integrity
• Collaborate closely with cross-functional teams, including architects, fellow designers, verification specialists, pre- and post-silicon validation teams, and synthesis, timing, and back-end experts
The ideal candidate will have a strong background in ASIC design, proficiency in SystemVerilog, and excellent analytical and problem-solving skills. 
Experience with high-performance and power-efficient designs is highly desirable.You will thrive in this role if you:- Have a "Learn and Be Curious" mindset- Have familiarity with key components such as interconnects, DMAs, Memory sub-systems, accelerator engines, debug and system level architectures- Have a strong drive to innovate, explore new solutions, and contribute to the company's intellectual property through patentsAbout the teamInclusive Team CultureHere at AWS, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences. Amazon’s culture of inclusion is reinforced within our 16 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust. Work/Life BalanceOur team puts a high value on work-life balance. It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility in working hours and encourage you to find your own balance between your work and personal lives. Mentorship & Career GrowthOur team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge sharing and mentorship. 
We care about your career growth and strive to assign projects based on what will help each team member develop into a better-rounded professional and enable them to take on more complex tasks in the future.
We are open to hiring candidates to work out of one of the following locations: Cupertino, CA, USA | Austin, TX, USA
BASIC QUALIFICATIONS
- Bachelor's degree or equivalent
- 3+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience ...

Software Development Manager - Compiler, AWS Neuron, Annapurna Labs

The Product: AWS Machine Learning accelerators are at the forefront of AWS innovation. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in the cloud. Trainium will deliver the best-in-class ML training performance with the most teraflops (TFLOPS) of compute power for ML in the cloud. This is all enabled by a cutting-edge software stack, the AWS Neuron Software Development Kit (SDK), which includes an ML compiler and runtime, and natively integrates into popular ML frameworks, such as PyTorch, TensorFlow and MxNet. The Neuron SDK optimizes performance of complex neural net models executed on AWS Inferentia. AWS Neuron and Inferentia are used at scale with customers and partners like PyTorch, Epic Games, Snap, AirBnB, Autodesk, Amazon Alexa, Amazon Rekognition and more customers in various other segments.
The Team: The Amazon Annapurna Labs team is responsible for building innovation in silicon and software for AWS customers. We are at the forefront of innovation by combining cloud scale with the world’s most talented engineers. Our team covers multiple disciplines including silicon engineering, hardware design and verification, software and operations. With such breadth of talent, there's opportunity to learn all of the time. We operate in spaces that are very large, yet our teams remain small and agile. There is no blueprint. We're inventing. We're experimenting. When you couple that with the ability to work on so many different products and services, it's a very unique learning culture.
Learn more about Our History: https://www.amazon.science/how-silicon-innovation-became-the-secret-sauce-behind-awss-success
You: We are seeking a talented SW Engineering Manager with strong leadership/mentoring skills to join our Deep Learning Compiler Team. As a Manager III you will be leading a team of experienced compiler engineers developing compiler optimization algorithms and deploying, at scale, a new compiler targeting AWS custom hardware. 
You'll need to be technically capable, credible, and curious in your own right as a trusted AWS Neuron Manager, innovating on behalf of our customers. You’ll leverage your technical communication skills as a hands-on partner to AWS ML services teams, involved in pre-silicon design, bringing new products/features to market. As deep learning models become more versatile, using compiler technologies to achieve both high performance and high productivity becomes essential. Join the team to build the software that will boost the entire deep learning community. You will have deep knowledge of resource management, scheduling, code generation, optimization, and new instruction architectures including CPU, NPU, GPU and novel forms of compute.
Explore the Product:
https://awsdocs-neuron.readthedocs-hosted.com/en/latest/neuron-guide/neuron-cc/index.html
https://github.com/aws/aws-neuron-sdk
https://aws.amazon.com/machine-learning/neuron/
In order to be considered for this role, candidates must be currently located or willing to relocate to Cupertino (preferred), Seattle, or Austin.
About the team
Inclusive Team Culture
Here at Annapurna Labs, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences. Amazon’s culture of inclusion is reinforced within our 14 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust.
Work/Life Balance
Our team puts a high value on work-life balance. It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. 
We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility in working hours and encourage you to find your own balance between your work and personal lives.Mentorship & Career GrowthOur team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring. We care about your career growth and strive to assign projects based on what will help each team member develop into a better-rounded engineer and enable them to take on more complex tasks in the future. BASIC QUALIFICATIONS- 5+ years of engineering team management experience- 9+ years of working directly within engineering teams experience- 4+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience- Experience partnering with product or program management teams- Deep understanding of compilers (resource management, instruction scheduling, code generation, and compute graph optimization)- Strong software design fundamentals and excellent system-level coding skills with an emphasis on graph theory and performance techniques ...

Software Engineer II, Annapurna Labs ML Acceleration System Software

At Annapurna Labs we are at the forefront of hardware/software co-design not just in Amazon Web Services (AWS) but across the industry. The Machine Learning Server Software Team is looking for candidates interested in writing data-driven software for our Machine Learning servers. We help to maintain and manufacture the most advanced servers in the world. We touch technologies from the accelerator inner-workings to the I2C infrastructure pervasive in the server and everything in between. Our team does not work on machine learning algorithms, but rather on the physical systems (hardware) which execute and accelerate those machine learning algorithms. Data paths, I2C, accelerator inner-workings are our bread and butter. Come join our team.
Key job responsibilities
- Member of a team responsible for the software associated with server components and integration into EC2.
- Working with the MLA Hardware, Test and Manufacturing teams to create a coordinated software package to enable both qualification as well as rapid deployment of software.
- Developing software (C/C++, Python, Lua) which can be maintained, improved upon, documented, tested, and reused.
A day in the life
The MLA Systems Software team was formed to focus on server software primarily for debug, testing, qualification, and manufacturing. At a high level our goal is to find ways to help the organization scale through the use of software and automation.
About the team
Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.
Diverse Experiences
AWS values diverse experiences. 
Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.About AWSAmazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.Inclusive Team CultureHere at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences, inspire us to never stop embracing our uniqueness.Work/Life BalanceWe value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.Mentorship & Career GrowthWe’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.BASIC QUALIFICATIONS- 3+ years of non-internship professional software development experience- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience- Experience programming with at least one software programming language ...

ASIC Test Engineer, Annapurna Silicon Operations

AWS Annapurna Labs develops the silicon used in our most advanced machine learning accelerator servers, utilizing the latest process nodes and massively scaled server clusters to provide the best hardware platform to customers for both inference and training workloads. We are seeking a semiconductor test engineer with a background in classic ATE platforms to create a clean-running, extremely low DPPM product line forming the foundation of our servers. As our final product is a server, not just the silicon, you will find yourself stretching beyond structural testing and incorporating a lot more mission profile testing, utilizing a variety of external controlling ICs, porting server management and customer run-time software, including the use of virtualized machines and specific microcontrollers found in the end application. This particular role targets the High Bandwidth Memory (HBM) DRAM interface and memory stack testing, including training of the SoC PHY to DRAM base die, stretching beyond JEDEC specs to gain maximum bandwidth, with customized training, vendor-specific sequences, lane repair and optimization all featuring here. Your output will feature in ATE testing, mission profile testing on SLT and full mission profile in the server. You will understand the interface's various modes of calibration and training, and how to glean quality of connection from eye diagrams, timing margin, etc. You will also work with the vendors to ingest their production testing patterns and convert/augment them to run on the HBMs in final package test using a variety of test interfaces. You will also have full access to all devices in the fleet at any given time and have the opportunity to engage in lifetime test and repair strategies as these devices age and cycle. As such, a curious mindset and an urge to learn system architecture will greatly benefit you in this role. We are a nurturing and supportive group that will ensure you have access to all tools, code and mentorship needed to become 
effective in this exciting new way of working!
Key job responsibilities
- Hardware development for probe, final test, system level test
- Software development for each insert
- General tool development support
A day in the life
As you design and code solutions to help our team drive efficiencies in software architecture, you’ll create metrics, implement automation and other improvements, and resolve the root cause of software defects. You’ll also:
- Build high-impact solutions to deliver to our large customer base.
- Participate in design discussions, code review, and communicate with internal and external stakeholders.
- Work cross-functionally to help drive business decisions with your technical input.
- Work in a startup-like development environment, where you’re always working on the most important stuff.
About the team
AWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Internet of Things (IoT), Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for their cloud services.
Annapurna Labs (our organization within AWS UC) designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago—even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.
About AWS
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. 
We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.Diverse ExperiencesAWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.Work/Life BalanceWe value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.Inclusive Team CultureHere at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.Mentorship & Career GrowthWe’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. 
We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.
BASIC QUALIFICATIONS
- Bachelor's in Electrical Engineering or Computer Science
- 2+ years of experience in C/C++ and/or Microsoft VBA/VBT and terminal shell environments
- 2+ years on Advantest 93K, Teradyne IGXL-based testers or other system level testers ...

Sr. Tech Product Manager - Neuron Runtime and ML Infrastructure

AWS Neuron is looking for an experienced Technical Product Manager to define and drive product strategy for Neuron Runtime and ML Infrastructure integration. You will be part of the AWS Neuron Product Management team, driving innovation in machine learning acceleration software. AWS Neuron is the software stack for Trainium and Inferentia, the AWS Machine Learning chips, delivering best-in-class ML performance in the cloud. You will lead runtime and infrastructure requirements working backward from customer needs, drive performance and scalability features across Neuron Runtime and container ecosystems, enabling ML training and inference at scale, optimal orchestration and efficient resource management, efficient execution of machine learning models and integration with AWS services. This role will empower customers to successfully deploy and scale ML workloads on AWS Neuron through deep understanding of runtime systems, infrastructure design, and cloud service integration.The ideal candidate will have deep understanding of runtime systems, distributed computing, container orchestration, and ML infrastructure, with expertise in performance optimization, collective communication, Kubernetes ecosystems, Linux systems and enterprise distributions, and ML infrastructure deployment.Additionally, the ideal candidate should have:* Proven track record of driving product strategy and owning roadmap definition in complex technical environments* Experience delivering features with deep understanding of technical trade-offs and product implications* Strong ability to contribute to and influence engineering discussions around technology decisions and strategy* Demonstrated success in representing customer needs and driving alignment during executive-level prioritization* Track record of delivering results in fast-paced, ambiguous environments, particularly in early-stage programs* Experience with modern software development and collaborative open-source projectsAbout AWS 
Neuron: AWS Neuron is the software stack for Trainium and Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML inference performance at the lowest cost in the cloud to our AWS customers. Trainium is designed to deliver the best-in-class ML training performance at the lowest training cost in the cloud, and it’s all being enabled by AWS Neuron. Neuron is a software stack that includes an ML compiler and native integration into popular ML frameworks. Our products are being used at scale with external customers like Anthropic and Databricks as well as internal customers like Alexa, Amazon Bedrock, Amazon Robotics, Amazon Ads, Amazon Rekognition and many more.
Key job responsibilities
In this role, you will:
* Drive and execute product strategy and roadmap working backwards from customer requirements in collaboration with engineering technical leadership
* Assess technical implications of product architecture and optimization decisions
* Drive technical alignment across Neuron components, Neuron workflows and dependencies
* Work directly with software engineering teams to define and execute on new features
* Produce clear and concise documents such as PRFAQ and PRD documents
* Write user stories, and validate features meet developer needs
* Drive feature discussions with customers, engineering, and other stakeholders
* Anticipate bottlenecks, manage risk and escalations, balancing technical constraints
* Find opportunities to innovate on behalf of our customers, design features related to these opportunities, and always push to improve our product developer experience
* Build ecosystem partnerships and stay connected with industry trends
* Represent the product in relevant industry events
About the team
Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. 
We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.Diverse ExperiencesAWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.Inclusive Team CultureHere at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences, inspire us to never stop embracing our uniqueness. Work/Life BalanceWe value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.Mentorship & Career GrowthWe’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.About Amazon Annapurna Labs:Amazon Annapurna Labs team (our organization within AWS UC) is responsible for building innovation in silicon and software for our AWS customers. We are at the forefront of innovation by combining cloud scale with the world’s most talented engineers. Our team covers multiple disciplines including silicon engineering, hardware design, software and operations. 
Because of our team's breadth of talent, we have been able to improve AWS cloud infrastructure in high-performance machine learning with AWS Neuron, Inferentia and Trainium ML chips, in networking and security with products such as AWS Nitro, Enhanced Network Adapter (ENA), and Elastic Fabric Adapter (EFA), and in computing with AWS Graviton and F1 EC2 instances.
About AWS Utility Computing (UC): AWS Utility Computing (UC) provides product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for their cloud services. Additionally, this role may involve exposure to and experience with Amazon's growing suite of generative AI services and other cloud computing offerings across the AWS portfolio.
About AWS
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
BASIC QUALIFICATIONS
- Bachelor's degree in computer science, engineering, analytics, mathematics, statistics, IT or equivalent
- 10+ years of industry experience with at least 5+ years in Technical product management and 3+ years of software development
- Solid knowledge in container orchestration and Kubernetes
- Solid knowledge in computer architecture fundamentals and operating systems concepts
- Excellent written and verbal communication abilities ...

2025 Software Dev Engineer Intern - Machine Learning Apps, Accelerator, Annapurna ML

Amazon Web Services (AWS) internships are full-time (40 hours/week) for 12 consecutive weeks during summer. By applying to this position, your application will be considered for all locations we hire for in the United States.
Are you excited about Machine Learning, chip acceleration, compilers, storage, systems or EC2? Are you passionate about delivering high quality services that affect hundreds of thousands of users? We have been dubbed the "secret sauce" behind AWS's success, with development centers in the U.S. and Israel, and Amazon is at the forefront of innovation by combining cloud scale with the world’s most talented engineers.
We hire software and hardware engineers across multiple disciplines, including but not limited to: compiler engineer, machine learning engineer, runtime engineer, performance engineer, and ML chip accelerator, ASIC, and physical design. Because of our teams’ breadth of talent, we’ve been able to improve AWS cloud infrastructure in networking and security with products such as AWS Nitro, Enhanced Network Adapter (ENA), and Elastic Fabric Adapter (EFA), in compute with AWS Graviton and F1 EC2 Instances, in machine learning with AWS Neuron, Inferentia and Trainium ML Accelerators, and in storage with scalable NVMe.
If this sounds exciting to you - come build the future with us!
Key job responsibilities
• Innovating and delivering creative SW designs to develop new services, solve operational problems, drive improvements in developer velocity, or positively impact operational safety
• Writing requirements-capturing documents, design documents, integration test plans, and deployment plans
• Communicating status and progress of deliverables to schedule, and sharing learnings/innovations with your team and stakeholders
BASIC QUALIFICATIONS
- Currently enrolled in a Bachelor’s degree program or higher in Computer Science, Computer Engineering, Electrical Engineering, or related fields, with a graduation conferral date between December 2025 and 
September 2026
- Programming experience in internship or coursework with a programming language such as Python and/or C or C++.
- Candidates with strong interests and academic qualifications/research focus in two of the following:
  o Distributed systems
  o Machine Learning - Experience with XLA, TVM, MLIR, LLVM, deep learning models and algorithms
  o Container
  o Operating System - Linux system programming/services ...

Architecture Verification Engineer, Annapurna Labs Machine Learning Acceleration

Custom machine learning chips are at the heart of our Trainium machine learning instances, and we invite you to build them with us!
As an architecture verification engineer, you will be responsible for ensuring the functionality and performance of the SoC during the design process. You will work on a wide range of disciplines and components, such as verifying end-to-end functionality of the SoC, verifying the functionality and performance of different subsystems in the SoC and verifying interconnect and interface behaviors, among others.
Key job responsibilities
As an Architecture Verification Engineer you will:
* Gain a deep understanding of SoC architecture and micro-architecture
* Work with architecture, design, DV and SW teams to verify and integrate device components and functionality
* Design verification environments for both pre-silicon and post-silicon
* Build testing infrastructure, data analysis tools and debug tools
BASIC QUALIFICATIONS
* 2+ years of experience in software development / DV
* Proficiency in programming in C/C++, scripting in Bash/Python
* Proficiency in data structures and algorithms
* Knowledge of computer architecture
* Knowledge of HW/SW interfaces ...

Sr. Software Development Engineer, Annapurna Labs

Annapurna Labs (our organization within AWS UC) designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago, even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.
AWS Cloud Storage offers a complete range of hardware and software for customers to store, access, govern, and analyze their data, reducing costs, increasing agility, and accelerating innovation.
The AWS Cloud Storage team is hiring firmware engineers with a background in NVMe memory devices to solve our customers’ toughest problems.
As a firmware engineer on the AWS Cloud Storage team, you will be a thought leader at the forefront of consumer storage and networking solutions. You should feel equally comfortable in server and embedded environments, and possess a deep understanding of computer architecture, Linux OS, and programming sophisticated embedded devices.
Every day you will be working alongside brilliant engineers and leaders who obsess about performance, availability, scalability and durability of customer data, with the ambitious goal of improving AWS' industry-leading product.
Key job responsibilities
- Research, design, and implement firmware to support the NVMe subsystem, DMA and crypto through specialized HW units in Nitro Cards.
- Debug complex, system-level, multi-component issues across multiple layers from kernel to application
- Profile system performance activity and drive optimizations across our software stack
- Deliver production-quality code and support its operation in the production environment
About the team
Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop their engineering expertise so they feel empowered to take on more complex tasks in the future.
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
About AWS
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating, which is why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empowers us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
BASIC QUALIFICATIONS
- Experience as a mentor, tech lead or leading an engineering team
- 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- 5+ years of experience with programming language: C or C++
- 5+ years of experience in embedded Linux systems or NVMe Subsystem ...

Physical Design Methodology Engineer, Annapurna Labs

Amazon Web Services provides a highly reliable, scalable, low-cost infrastructure platform in the cloud that powers hundreds of thousands of businesses in 190 countries around the world. AWS has the broadest and deepest set of machine learning and AI services for our customers’ businesses. We are seeking an experienced Physical Design Engineer to build the next generation of our cloud server platforms. Our success depends on our world-class infrastructure; we’re handling massive scale and rapid integration of emergent technologies.
As a member of the Cloud-Scale Machine Learning Acceleration team you’ll be responsible for the design and optimization of hardware in our data centers, including technologies such as AWS Inferentia, a machine learning inference product designed to deliver high performance at low cost.
You’ll provide leadership in the application of new technologies to large scale deployments in a continuous effort to deliver a world-class customer experience. This is a fast-paced, intellectually challenging position, and you’ll work with thought-leaders in multiple technology areas. You’ll have relentlessly high standards for yourself and everyone you work with, and you’ll be constantly looking for ways to improve our products' performance, quality and cost. We’re changing an industry, and we want individuals who are ready for this challenge and want to reach beyond what is possible today.
Key job responsibilities
- Create and support innovative physical design methodology and CAD flows.
- Develop cloud infrastructure to support physical design work.
- Drive improvement in RTL2GDS flows/methodology for PPA and TAT improvement.
- Create dashboards/central reports for project tracking and visualizing QoR/stats
- Interface directly with RTL, Physical Design, Package Design, DFT and other teams to improve methodologies and efficiencies and drive efforts to resolution.
- Work with EDA tool vendors to evaluate new tools, solve bugs, improve usability, etc.
BASIC QUALIFICATIONS
- Bachelor’s or Master’s degree in EE, CE, or CS
- 3+ years in developing design methodology or CAD flows in synthesis, PNR, or sign-off areas for advanced technology nodes.
- Experience in writing production scripts for implementation and sign-off tools in TCL, Perl, and/or Python
- Solid understanding of ASIC physical design, physical design flows, and methodologies including synthesis, place and route, STA, formal verification.
- Proven track record of delivering metric driven PPA flow development and support. ...

Component Engineer, Annapurna MLA Hardware

Amazon Web Services provides a highly reliable, scalable, low-cost infrastructure platform in the cloud that powers hundreds of thousands of businesses in 190 countries around the world. We have data center locations in the U.S., Europe, Singapore, and Japan, and customers across all industries. We are seeking experienced Hardware Design Engineers to build the next generation of our cloud server infrastructure. Our success depends on our world-class server infrastructure; we’re handling massive scale and rapid integration of emergent technologies.
We are looking for a Component Engineer with strong skills and background in hardware design and firmware support. In this role you will work with the supply chain team, help with component selection, optimize cost and be responsible for system validation and integration of hardware in the AWS fleet through its entire life cycle.
You will work cross functionally with the Platform Design team, and additional teams across AWS to improve quality and reliability of products operating in the fleet.
We are looking for candidates who thrive in a fast-paced, startup-like environment and work independently to deliver multiple projects in parallel. To be successful you need to be highly motivated and detail-oriented while meeting the highest standards and time-to-market, cost and quality goals.
About the team
Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop their engineering expertise so they feel empowered to take on more complex tasks in the future.
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
About AWS
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating, which is why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empowers us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences, inspire us to never stop embracing our uniqueness.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
BASIC QUALIFICATIONS
* BS in Electrical Engineering, Computer Engineering, or a related field
* 5+ years of experience with component engineering
* 3+ years of experience with hardware/software integration for embedded systems or hardware development
* Experience with supply chain, vendors and working with external ODM
* Experience working in Linux environments
* Experience writing scripts using Perl/Python/Lua/Bash or another scripting language ...

System Development Engineer, Annapurna Labs, Machine Learning Accelerator Systems - Fleet Triage

Annapurna Labs was a startup company acquired by AWS in 2015, and is now fully integrated. If AWS is an infrastructure company, then think of Annapurna Labs as the infrastructure provider of AWS. Our org covers multiple disciplines including silicon engineering, hardware design and verification, software, and operations. AWS Nitro, ENA, EFA, Graviton and F1 EC2 Instances, AWS Neuron, Inferentia and Trainium ML Accelerators, and scalable NVMe storage are some of the products we have delivered over the last few years.
In Annapurna Labs we are at the forefront of hardware/software co-design not just in Amazon Web Services (AWS) but across the industry. The Machine Learning System Operations and Automation Team is looking for candidates interested in writing automation software for our "fleet" of Machine Learning servers deployed around the world. Do you like solving mysteries? Great! Figuring out what that light switch with no obvious function actually does? Me too! Are you wearing a smartwatch to monitor your sleep and activity over time to optimize your routines? You'll fit right in. Does the word exabyte excite you? Let's get to work. Our team writes truly massive scale autonomous software to monitor, optimize, and remediate hardware in the most advanced servers in the world. Come join us!
Key job responsibilities
- Member of a team responsible for system remediation, operational excellence, and customer experience on bleeding edge ML products
- Utilize data to root cause hardware failures and identify live trends on the most complex systems in AWS
- Implement and improve system level testing across the product lifecycle
- Develop software which can be maintained, improved upon, documented, tested, and reused
- Dive deep on issues at the intersection of hardware and software
A day in the life
The MLA Systems Fleet Triage team is responsible for identifying and responding to the most challenging hardware and software failures from ML optimized servers at scale. We work in tandem with hardware design, firmware, and validation teams to improve test coverage and detection in production environments.
As you design and code solutions to help our team drive efficiencies in software architecture, you’ll create metrics, implement automation and other improvements, and resolve the root cause of software defects. You’ll also:
- Build high-impact solutions to deliver to our large customer base.
- Participate in design discussions, code review, and communicate with internal and external stakeholders.
- Work cross-functionally to help drive business decisions with your technical input.
- Work in a startup-like development environment, where you’re always working on the most important stuff.
About the team
Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop their engineering expertise so they feel empowered to take on more complex tasks in the future.
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
About AWS
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating, which is why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empowers us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
BASIC QUALIFICATIONS
- 2+ years of non-internship professional software development experience
- 1+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience
- 3+ years of administrative experience in networking, storage systems, operating systems and hands-on systems engineering experience
- Knowledge of systems engineering fundamentals (networking, storage, operating systems)
- Experience programming with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby ...
