Software Engineer, Multi Region Platform
Job Description
Who we are
About Stripe
Stripe is a financial infrastructure platform for businesses. Millions of companies—from the world’s largest enterprises to the most ambitious startups—use Stripe to accept payments, grow their revenue, and accelerate new business opportunities. Our mission is to increase the GDP of the internet, and we have a staggering amount of work ahead. That means you have an unprecedented opportunity to put the global economy within everyone’s reach while doing the most important work of your career.
About the team
In this role, you will be joining the High Availability and Disaster Recovery team. At Stripe, availability is a core feature of our products. This team designs and builds new solutions to allow latency-critical, stateful applications to survive any type of disaster. We build distributed systems on top of unreliable architecture to provide highly available and resilient customer solutions. This team is creating greenfield solutions which will serve as the basis for Stripe’s architecture 5, 10, or 20 years into the future.
We're looking for an experienced, world-class distributed systems engineer to join us and support large, cross-organizational initiatives to improve the availability and uptime of all our products. In this role you’ll work across Stripe engineering to design, develop, and deploy practices, processes, and innovative infrastructure software that will be leveraged by the whole organization to ensure that our cloud services are Available by Design.
What you’ll do
You will help develop our global architecture by combining less-available components and data centers into a highly available and resilient whole. You will work on latency-critical solutions where every millisecond matters and data redundancy is a hard requirement. You will actively work with leaders and engineers across the company to understand and identify opportunities to improve Stripe’s reliability posture. You will learn quickly and work on a broad range of problems: one day you may be investigating Mongo write concerns, the next minimizing cross-region TLS handshakes, and after that developing new systems to automate disaster detection and failovers. Your work will enable Stripe to increase the GDP of the internet by providing uptime and data protection that have historically been impossible.
Responsibilities
- First and foremost, you will write code
- Scope, design, implement, and deploy robust distributed services, making appropriate tradeoffs between reliability, throughput, latency, resiliency, engineering velocity and cost
- Investigate issues in live, distributed production systems
- Innovate, design and implement new products and prototypes to improve service resiliency, engineering velocity and management at scale
- Work with teams across Stripe to help them understand multi-region architectures, high durability implementation patterns, and best practices
- Mentor and grow the next generation of technical leaders at Stripe
Who you are
We’re looking for someone who meets the minimum requirements to be considered for the role. If you meet these requirements, you are encouraged to apply. The preferred qualifications are a bonus, not a requirement.
Minimum requirements
- 8+ years of experience in software development
- Bachelor’s degree in Computer Science or related field or equivalent work experience
- A strong engineering background in building distributed systems at scale, with high reliability
- Experience developing, maintaining and debugging distributed systems
- Experience with operational excellence and a deep understanding of metrics, alarms and dashboards
- Hold yourself and others to a high bar when working with production systems
Preferred qualifications
- Experience managing multi-region deployments, ensuring data locality, and implementing robust disaster recovery strategies
- Experience with document databases like MongoDB
- Experience influencing, planning, scoping, and leading large projects across many teams
- Ability to thrive in a collaborative environment involving diverse stakeholders and subject matter experts
- Experience in a development team in AWS/Azure/OCI/GCP
Date Posted
05/20/2024