System Development Engineer (Level 4), Intelligent Cloud Hosting
Who are we? We host Amazon’s websites. That includes all of Amazon’s global marketplaces and partner portals, and all of Amazon’s consumer experiences like Kindle, Alexa, Amazon Video, Mobile Application; Physical Stores, Luna.... Not only we help deliver the html that delights our customers, we make sure that all underlying transactions are processed error-free. This includes payments, tax, contracts, returns, inventory, and notifications; and brand new experiences that have never been done before.
In short, we perform cloud infrastructure management at scale.
We are a diverse international team made up of engineers, developers, administrators and managers. We are obsessed with operational quality, security, reducing cost, increasing availability and keeping the largest eCommerce infrastructure purring.
Our team has offices in Dublin, Seattle, New York, Sydney, Bangalore, and Hyderabad.
We are looking for Systems Development Engineers who are inventive, understands operational excellence, and can work to scale.
A big part of our job is creating and updating strategic SRE tool-kit built on AWS technology. We use these tools across Amazon to deploy, monitor and operate the hundreds of thousands of services that power our highly distributed architecture. Our tools include orchestration, predictive analytics, monitoring, problem diagnosis and automated repair. These let us deploy configurations that intelligently manage how traffic flows through our highly distributed architecture to our delighted customers.
Key job responsibilities
As a System Development Engineer, you will:
- Use technology to solve straightforward problems, seeking input and guidance from team members
- Create or have responsibility to improve or intent small tools or applications
- Participate in team design, scoping, process, and prioritization discussions
- Work effectively with customers and/or internal partners to understand business impacts and identify any opportunities/problems arising from technical decisions
- Invent, refine, and develop your solutions to ensure they are meeting customer needs and team goals
- Develop an understanding of the maintenance characteristics, runtime properties, and dependencies of your team’s infrastructure, including hardware platform, operating system, and build dependencies.
- Provide assistance to the team with troubleshooting, researching the root cause, and thoroughly resolving defects in the event of a problem