The role requires broad engineering competence and in-depth technical knowledge of software development, DevOps, infrastructure tools, and distributed systems. The candidate should have demonstrated experience in planning, organizing, and executing software development projects in a dynamic environment. They should be able to independently design, develop, test, and deploy software, as well as clarify requirements and assist with estimates.
The successful candidate will have the unique opportunity to work closely with Security Engineers, Senior Software Engineers, Principal and Distinguished Engineers throughout AWS to define the technical roadmap and follow through with world-class execution.
The EC2 Dataplane LSE team owns and operates services and tools that are designed to detect and recover from host- and rack-level availability failures, and to restore EC2 instances to availability. They architect, develop, and operate highly available and resilient services that are critical to ensuring the highest availability of the broader EC2 service. The team is also closely involved in providing health monitoring capabilities for EC2's newest and upcoming ML infrastructure.
If the challenge of building the next generation of compute platform excites you, come join us to shape the future of compute services!
Key job responsibilities
* Design, develop, operate and own large-scale services, architecting them to scale
* Provide technical leadership and mentor junior engineers in the team
* Write high-quality code to develop new systems, and conduct deep design and code reviews
* Solve problems at their root, stepping back to understand the broader context and implementing fixes that keep the issue from recurring
* Share in an on-call rotation with your team
A day in the life
You will be working in a highly collaborative team environment that prioritizes developing and operating high-quality services. The team follows an Agile, Scrum-based process, which provides flexibility for each team member to demonstrate scope and impact. You will mentor the junior engineers and interns. The team generally values operational excellence, so defining the right metrics and providing useful insights into those metrics will be essential.
You'll participate in an on-call rotation with the team to resolve and mitigate production issues, and apply what you learn on call to continuously improve the operational posture of the team's services. When building new features, you will own the Application Security process and collaborate with the rest of the team to ensure operational readiness.
About the team
The EC2 Dataplane LSE team owns and operates services and tools that are designed to detect and recover from host- and rack-level availability failures, and to restore customer instances to availability. We architect, develop, and own tier-1 services that are highly available and resilient to failures, and these services are themselves critical to ensuring the highest availability of the broader EC2 service. We play a key role in providing health monitoring capabilities for EC2’s newest and upcoming ML infrastructure. We are also the first-responder team for detecting and recovering from large-scale events that impact EC2 compute resources.
BASIC QUALIFICATIONS
- 3+ years of non-internship professional software development experience
- 2+ years of non-internship experience in the design or architecture (design patterns, reliability, and scaling) of new and existing systems
- 3+ years of programming using a modern programming language such as Java, C++, or C#, including object-oriented design experience
- Knowledge of professional software engineering and best practices for the full software development life cycle, including coding standards, software architectures, code reviews, source control management, continuous deployments, testing, and operational excellence
PREFERRED QUALIFICATIONS
- 3+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
- Bachelor's degree in computer science or equivalent
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site.