Sr Data Engineer, Infra-Finance Business Intelligence & Transformations
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain, and we’re looking for talented people who want to help.

You’ll join a diverse team of software and hardware engineers, supply chain specialists, security experts, product and operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.

Do you want to make a major contribution to the future in the rapid-growth environment of cloud computing? Amazon Web Services is looking for a Sr Data Engineer to help build scalable and robust platforms that support the AWS Infrastructure Supply Chain Finance (SCF) organization. You will be part of the Finance Automation and Analytics team, working with product managers, business intelligence engineers, business analysts, and other data engineers to architect systems in the context of business outcomes.

You will have a passion for diving deep, a high level of customer focus, and a track record in process improvement. This role requires an individual with excellent analytical abilities, strong knowledge of data engineering solutions, and the ability to work with finance, quantitative, and business teams. You will lead multiple automation and controllership initiatives across the Finance org.
You will primarily use, but are not limited to, AWS solution stacks such as Redshift, S3, Glue, Lambda, SNS, SQS, CloudWatch, EC2, and Data Pipeline, along with reporting tools such as Tableau and Alteryx/KNIME, to implement solutions. You will be responsible for the full software development life cycle to build scalable applications and deploy them in the AWS Cloud.

Key job responsibilities
- Design and develop the pipelines required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL, Python, and AWS big data technologies.
- Oversee and continually improve production operations, including optimizing data delivery, redesigning infrastructure for greater scalability, code deployments, bug fixes, and overall release management and coordination.
- Establish and maintain best practices for the design, development, and support of data integration solutions, including documentation.
- Collaborate with Finance, Tax, Supply Chain, Procurement, and Engineering to capture requirements and deliver analytics solutions.
- Read, write, and debug data processing and orchestration code written in SQL/Python, following best coding standards (e.g. version-controlled and code-reviewed).
- Apply automation so that with every iteration on a problem, you build your solution for maximum scale and self-service use by stakeholders.
- Understand a broad range of Amazon’s data resources and know how, when, and which to use and which not to use.
- Participate in strategic and tactical planning discussions to interface with business customers, gathering requirements and delivering complete reporting solutions.

BASIC QUALIFICATIONS
- 7+ years of data engineering experience
- Experience with data modeling, warehousing and building ETL pipelines
- Experience with SQL
- Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS
- Experience mentoring team members on best practices
- Knowledge of distributed systems as it pertains to data storage and computing
- Experience building data products incrementally and integrating and managing data sets from multiple sources ...