Senior Site Reliability Engineer

ServiceTitan · Remote

Company: ServiceTitan
Location: Remote
Type: Full Time

Job Description

Ready to be a Titan?
At ServiceTitan, the SRE team engages with the entire lifecycle of software development, from ideation to operating predictably at scale. As an SRE at ServiceTitan, you will identify and build software to improve uptime, performance, and the overall customer experience. You will collaborate with architects and software engineers to deliver a highly available and highly automated infrastructure.
We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) with a strong background in managing and optimizing Microsoft SQL Server, Kubernetes, and cloud platforms such as Azure or AWS. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of our infrastructure, with a specific focus on SQL Server, container orchestration using Kubernetes, and cloud-based services. You will collaborate with cross-functional teams to implement best practices, automate processes, and proactively monitor and resolve issues to ensure smooth operations in a hybrid cloud environment.
What you'll do:
  • Manage and optimize Microsoft SQL Server databases and related systems to ensure high availability, performance, and scalability.
  • Design, implement, and maintain database architecture, including table structures, indexes, and schemas, while adhering to industry best practices.
  • Implement SQL Server HA/DR technologies, such as failover clustering, database mirroring, and Always On Availability Groups.
  • Monitor SQL Server performance, proactively identify and resolve bottlenecks, and pursue query tuning and other optimization opportunities.
  • Deploy, configure, and manage containerized applications using Kubernetes, ensuring high availability, scalability, and fault tolerance.
  • Collaborate with development teams to design and implement efficient and scalable database solutions for new and existing applications.
  • Implement and maintain infrastructure-as-code (IaC) using tools like Terraform or CloudFormation to provision and manage cloud resources.
  • Monitor and optimize cloud-based services (Azure, AWS) to ensure performance, cost-efficiency, and high availability.
  • Automate database management tasks, infrastructure provisioning, and deployment processes using scripting languages and automation tools.
  • Implement and maintain CI/CD pipelines for seamless application deployments and releases.
  • Monitor and respond to alerts and incidents related to SQL Server performance, Kubernetes clusters, and cloud infrastructure.
  • Conduct regular database capacity planning and optimization to ensure efficient resource utilization.
  • Collaborate with cross-functional teams, including software engineers, system administrators, and network engineers, to troubleshoot and resolve complex issues.
  • Develop and maintain documentation, including standard operating procedures, runbooks, and technical diagrams.
  • Stay updated with the latest trends, tools, and best practices in SQL Server, Kubernetes, and cloud technologies, and share knowledge with the team.

What you'll bring:
  • 3+ years of programming experience in .NET, Python, PowerShell, or Bash.
  • Experience in managing cloud infrastructure in AWS, Azure, or GCP is a big plus.
  • Experience maintaining services in Kubernetes environments is a plus.
  • BA/BS in Computer Science, Computer Engineering, or a related technical discipline, or equivalent industry experience.
  • Ability to craft beautiful infrastructure-as-code solutions.
  • Demonstrated sensitivity to operational concerns.
  • Demonstrated ability to debug code and troubleshoot outages.
  • Full-stack troubleshooting skills across all software layers are a big plus.
  • Superb communication skills, both written and verbal.
  • Passion for solving complex infrastructure challenges.
  • Excitement about delivering a reliable, high-quality product.
  • Highly motivated, smart, independent person who thrives in a fast-paced, innovative environment.
  • Intensely eager to meet the needs of our customers and deliver best-of-breed SaaS solutions.
  • Experience using telemetry to understand throughput, limitations, and constraints in a service.
  • Understanding of architectural patterns to improve uptime.
  • Able to monitor and improve site stability.
  • Passion for system, application and business metrics.

Be Human With Us:
Being human isn't about checking every box on a list. It's about the experiences we have, people we meet, and the perspectives we share. So, if you have the skills but are hesitant to apply because of your background, apply anyway. We need amazing people like you to help us challenge the conventional and think differently about the problems that we're solving. We're in this together. Come be human, with us.
What We Offer:
When you join our team, you're not just accepting a job. You're making a career move. Here's how we'll support you in doing some of the most impactful work of your career:
  • Flextime, recognition, and support for autonomous work: Flexible time off with ample learning and development opportunities to continue growing your career. We offer a comprehensive onboarding program, leadership training for Titans at all levels, and other programs and events. Great work is rewarded through Bonusly, peer-nominated awards, and more.
  • Holistic health and wellness benefits: Company-paid medical, dental, and vision (with 100% employer-paid options and 90% coverage for dependents), FSA and HSA, 401k match, and telehealth options including memberships to Headspace, Galileo, One Medical, Ginger, and more.
  • Support for Titans at all stages of life: Parental leave and support, up to $20k in adoption reimbursement, on-demand maternity support through Maven Maternity, free breast milk shipping through Maven Milk, pet insurance, legal advisory services, financial planning tools, and more.

At ServiceTitan, we celebrate individuality and uniqueness. We believe that the convergence of fresh perspectives and experiences from all walks of life is what makes our product and culture so great. We strongly encourage people from underrepresented groups to apply. We do not discriminate against employees based on race, color, religion, sex, national origin, gender identity or expression, age, disability, pregnancy (including childbirth, breastfeeding, or related medical condition), genetic information, protected military or veteran status, sexual orientation, or any other characteristic protected by applicable federal, state or local laws.
ServiceTitan is committed to fair and equitable compensation for all of our employees. We thoughtfully consider a wide range of factors when determining individual compensation. The expected salary range for this role is $137,000 to $196,000. Actual compensation for an individual may vary depending on skills, performance over time, qualifications, experience, and location. In addition to the base salary, the total compensation package also includes an annual bonus, equity, and a holistic suite of benefits.

Date Posted: 08/02/2023


