AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help.

You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.



Do you enjoy designing scalable, complex computing systems and solving difficult problems while driving influential change in a hyperscale environment? Are you curious about the systems used to run the largest cloud computing infrastructures in the world? Do you thrive in a fast-paced and ever-changing environment?

Our team designs, builds, and operates Amazon's fleet of complex computing systems supporting deep machine learning and mission-critical enterprise workloads. We solve systemic hardware issues, and we build hardware and software systems to detect and mitigate future recurrences so that our customers can experience the highest quality of service possible!

You will be responsible for owning the design and operations of a brand-new segment of servers for the AWS fleet. As end-to-end owners of the complex server fleet, our team works closely with partners to root cause failures and drive changes back into our current and future designs. Nothing is complete without closed-loop corrective actions that drive changes back into our development processes and behavior specifications.

As a member of the AWS Hardware Engineering Services organization, you will apply your technical experience and work with other subject matter experts in core component development, compute server development, networking development, custom hypervisor/virtualization development, and other teams. You will be responsible for hardware and systems that improve how we detect, root cause, and remediate issues. You will lead cross-functional investigations, define the changes needed to deliver results, and have direct exposure to internal and external AWS customers. Ideal candidates will have a background in server development, system design, root cause analysis, scoping complex issues, qualification, problem solving, and developing corrective actions.

Key job responsibilities
As a member of the Hardware Engineering Services team in this specific function, you will own and lead the design, development, and root cause analysis of a new segment of specialized servers.

You will work closely with our customers to understand their technical needs and business goals, leveraging your experience with server design and the knowledge of various teams to architect the solutions that we will deploy at scale.

To deliver your products you will work with an interdisciplinary team of component, firmware, test, qualification, and integration engineers, and lead our design and manufacturing partners to bring these servers to the data center. After launch you will oversee the fleet of servers you develop, monitoring their quality and how they are meeting the customer requirements.


A day in the life
Your day-to-day responsibilities will include interfacing with our internal and external customers to understand project requirements and facilitate system development on top of your server design. You will be responsible for learning the operational challenges of our existing fleet, with the goal of improving the current customer experience as well as developing improved systems for future designs. You will work directly with vendors and ODM/JDM design teams to develop and manufacture your product at scale.

About the team
The team comprises Hardware Design Engineers, System Design Engineers, Software Development Engineers, and Technical Program Managers, all with the common goal of delivering the best specialized server fleet possible to our customers.

*Why AWS*
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.

*Diverse Experiences*
Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.

*Work/Life Balance*
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.

*Inclusive Team Culture*
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empowers us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.

*Mentorship and Career Growth*
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

BASIC QUALIFICATIONS

- 5+ years of relevant work experience with complex server, storage or network server designs; working with interdisciplinary teams to execute product design from concept to production.
- 3+ years of advanced support experience managing customer escalations to quickly identify root cause and resolve issues in a large scale production environment (data center or similar).
- 3+ years of development experience in hardware/firmware. Proven ability to debug PCIe, memory, or storage subsystems.
- Demonstrated ability to engage technology developers in the industry to track and apply emerging technologies to innovative server designs. Experience working with interdisciplinary teams to execute product design from concept to production, spanning internal hardware, firmware, and software teams as well as external design partners.
- Developing functional specifications, design verification plans, functional test procedures and operational repair plans.
- Demonstrated ability to physically test hardware, either remotely or hands-on, in lab or customer environments.
- Developing rigorous testing practices and design process flows.
- Strong customer focus and demonstrated ability to improve products on behalf of the customer.
- Ability to resolve complex issues in creative, efficient, and effective ways.
- Ability to obtain and analyze large data sets for trends.

PREFERRED QUALIFICATIONS

- Master's/PhD degree with 10+ years of relevant experience root-causing hardware and firmware integration issues on complex server, storage, or network server designs.
- Deep technical background with Memory, Storage, and PCIe debugging.
- Technical background in server design or architecture, specifically having led the development of multiple complete servers.
- Experience implementing monitoring and alerting systems to quickly identify and categorize failures in production environments (through scripting, light coding).
- Experience leading cross-functional teams through investigation and corrective actions on integrated system problems potentially spanning signal integrity, power, mechanical, thermal, material, and firmware.
- Experience with BIOS/BMC/FW debugging.

Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.

Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $128,600/year in our lowest geographic market up to $213,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site.