Senior Software Engineer, Site Reliability Engineering - Platform Infrastructure (Boston or Denver)
Company
Klaviyo
Location
Boston, MA
Type
Full Time
Job Description
At Klaviyo, we value the unique backgrounds, experiences and perspectives each Klaviyo (we call ourselves Klaviyos) brings to our workplace each and every day. We believe everyone deserves a fair shot at success and appreciate the experiences each person brings beyond the traditional job requirements. If you’re a close but not exact match with the description, we hope you’ll still consider applying. Want to learn more about life at Klaviyo? Visit careers.klaviyo.com to see how we empower creators to own their own destiny.
Check out this quick video by the hiring manager talking more about this role and what it's like to work at Klaviyo!
Engineers come to Klaviyo with experience in a variety of languages and from a number of disciplines. All engineers are expected to become extremely proficient in the technologies we use (not exhaustive):
- Python, Django, Celery
- MySQL, Cassandra, RabbitMQ, Redis, Pulsar
- React, HTML, JavaScript, Backbone.js
- Amazon Web Services (EC2, RDS, Aurora, etc.), Kubernetes on EKS
The SRE team builds foundational backend services as well as tooling and automation to allow product teams to release and scale their software reliably and predictably. SREs are team players who embed themselves within product teams as needed to advance the architecture and performance of software systems and train their peers in topics such as debugging distributed systems, building self-healing applications and eking out every drop of performance possible.
Internally, we call this role Senior Site Reliability Engineer on the Platform Infrastructure team. As a Senior Site Reliability Engineer you will own multiple foundational Klaviyo services and make a big impact on the productivity of our product engineering teams.
Mission and Vision of the Platform Infrastructure SRE Team
Vision: Offer a programmatically accessible catalog of durable, reliable, and easy to use and maintain infrastructure components enabling quality product delivery and maintenance.
Mission: Provide self-service tooling that enables use of our infrastructure components in a consolidated, consistent fashion via API-bound schemas and automation.
What You'll be Working With
- New Kubernetes infrastructure with ArgoCD - in the testing/iterating phase of the project, lots of teams to onboard, lots to learn and build out
- Abstractions to create ease of use for engineering teams
- Lots of collaboration opportunities both within and outside of SRE
- EC2-based infrastructure tooling - just starting to design and implement this, want to make it on par with what we do for Kubernetes
How You'll Make a Difference
- Ship foundational services to enable Klaviyo engineering to move faster with confidence
- Design and develop systems and processes that enable highly available & scalable systems
- Design, build and deliver software to dramatically improve the availability, scalability, latency, and efficiency of Klaviyo’s services
- Achieve break-throughs in systems throughput by identifying and eliminating bottlenecks
- Leverage technology such as Python, AWS, Django, Kubernetes, Bash, Terraform, MySQL, RabbitMQ, Redis, Cassandra, Postgresql to advance Klaviyo’s platform
- Champion best practices by actively collaborating with other teams in a culture that values whiteboarding and technical design review
- Contribute to the company as a subject matter expert in multiple areas, constantly pushing yourself to be a better engineer and to level up all of your peers within your team and within Klaviyo.
- Mentor and pair with other Klaviyo engineers to build better software by focusing on performance, self-healing system, configuration as code; defensive programming, application security, etc.
- Participate in periodic on call duties with a focus on solving issues when they are discovered, preventing recurrences and minimizing alert fatigueÂ
- Prototype and advocate for architectural improvements to achieve breakthrough results in Klaviyo systems’ operational scalability and reliability
- Work hand-in-hand with product-facing engineers to ship impactful code
- Perform quantitative investigation to understand and scale Klaviyo systems and manage the cross-functional effort to resolve scalability issues
- Produce and advocate for preventative, upstream solutions with internal stakeholders and external vendors and dependencies
- Confidently make informed, data-driven choices in a fast paced environment with competing priorities
Who You AreÂ
- Knowledge of Linux operating systems and computer networking
- Experience writing code in a programming language such as Python, Ruby, Go, etc.
- Experience administering cloud-based infrastructure (e.g. AWS)
- Ability to troubleshoot production issues related to computer infrastructure, configuration, monitoring, deployments, and continuous integration and delivery
- Ability and willingness to learn
- Ability to communicate clearly and mentor and coach others on a team
- Ability to participate in an on-call rotation
Get to Know Klaviyo
Klaviyo is a world-leading marketing automation platform dedicated to accelerating revenue and customer connection for online businesses. Klaviyo makes it easy to store, access, analyze and use transactional and behavioral data to power highly-targeted customer and prospect communications. The company's hybrid customer-data and marketing-platform model allows companies to grow by fostering direct relationships with customers, without giving up their valuable data to popular big-tech ad platforms. Over 265,000 innovative companies like Unilever, Custom Ink, Living Proof and Huckberry sell more with Klaviyo. Learn more at www.klaviyo.com.
If you are a California, Colorado, Rhode Island, Washington, New York City, or Jersey City resident and this role is a remote role, you can receive additional information about the compensation and benefits for this role, which we will provide upon request. Requests can be submitted here. Additional information regarding benefits can be found at klaviyorewards.com.
Klaviyo is committed to diversity and to a policy of equal employment opportunity and non-discrimination. We do not discriminate on the basis of race, color, religion, national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, gender identity, sexual orientation or any other characteristic protected by applicable law.
Date Posted
03/13/2023
Views
7
Similar Jobs
Senior Network Engineer - InterSystems
Views in the last 30 days - 0
InterSystems is seeking a Senior Network Engineer to support the deployment and maintenance of network infrastructure for their HealthShare and IRIS p...
View DetailsInformation Technology Intern (Summer 2025) - LineVision
Views in the last 30 days - 0
LineVision a rapidly growing climate tech company based in Boston MA is seeking an Information Technology Intern to deploy a new Modern Device Managem...
View DetailsPlatform Owner - Network Reliability - Takeda
Views in the last 30 days - 0
Takeda is seeking a Platform Owner for Network Reliability Engineering to join their Global Network Platform team The role involves developing framewo...
View DetailsIT Solution - Product Engineer - Takeda
Views in the last 30 days - 0
Takeda Development Center Americas Inc is seeking an IT Solution Product Engineer with a Bachelors degree in Engineering or a related field and 3 year...
View DetailsData Platform Engineer - GMSGQ - Takeda
Views in the last 30 days - 0
Takeda Pharmaceuticals USA is seeking a Data Platform Engineer GMSGQ for a fulltime position in Cambridge MA The role involves developing and maintain...
View DetailsSenior Software Engineer (Full Stack, Platform) - WHOOP
Views in the last 30 days - 0
WHOOP is seeking a Senior Software Engineer to join their Platform team in Boston MA The role involves driving largescale architecture projects collab...
View Details