Senior Site Reliability Engineer

· Remote

Location

Remote

Type

Full Time

Job Description

Senior Site Reliability Engineer

Reposted 10 Hours Ago
Easy Apply
2 Locations
Remote or Hybrid
Senior level
Legal Tech • Real Estate • Security • Software • Cybersecurity • PropTech
CertifID helps to stop wire fraud and keep money out of the hands of criminals.
The Role
The Senior Site Reliability Engineer will enhance reliability in production SaaS systems implement AI agents improve observability and mentor junior engineers.
Summary Generated by Built In
Cybercrime is rising reaching record highs in 2024. According to the FBI's IC3 report total losses exceeded $16 billion. With investment fraud and BEC scams at the forefront the message is clear: the real estate sector remains a lucrative target for cybercriminals. At CertifID we take this threat seriously and provide a secure platform that verifies the identities of parties involved in transactions authenticates wire transfer instructions and detects potential fraud attempts. Our technology is designed to mitigate risks and ensure that every transaction is conducted with confidence and peace of mind.

We know we couldn’t take on this challenge without our incredible team. We have been recognized as one of the Best Startups to Work for in Austin made the Inc. 5000 list and won Best Culture by Purpose Jobs three years in a row. We are guided by our core values and our vision of a world without wire fraud. We offer a dynamic work environment where you can contribute to meaningful impact and be part of a team dedicated to enhancing security and fighting fraud.

We are seeking a Senior Site Reliability Engineer (Senior SRE) to drive reliability improvements across our production SaaS environment. You’ll play a critical role in building scalable infrastructure patterns advancing observability improving incident response and partnering with engineering teams to embed reliability into system design and delivery.
 
This role is ideal for an experienced Sr. SRE who enjoys solving complex operational problems building automation and mentoring others.

What You’ll Do

  • Reliability & Platform Operations: Own and improve the reliability availability and performance of production systems while defining and operationalizing SLIs/SLOs and error budgets.
  • AI Agent Enablement:  Design and implement autonomous and semi-autonomous AI agents for monitoring distributed systems and applications. Build agents capable of consuming multi-source observability data (metrics logs traces etc.).
  • Incident Response: Participate in and help lead an on-call rotation serving as an escalation point for major incidents and facilitating blameless postmortems.
  • Automation & Infrastructure: Build automated workflows to eliminate manual work and design/maintain Infrastructure-as-Code with Terraform.
  • Observability: Improve metrics logs traces and alerting using tools like Datadog or Prometheus to reduce noise and increase signal.
  • Collaboration & Mentorship: Partner with application teams to implement reliability best practices and mentor junior engineers to foster a culture of knowledge sharing.

Who You Are

  • Strategic Architect: You look beyond the "what" to understand the "why" providing insights that influence our GTM and technical direction.
  • Startup Veteran: You are comfortable moving fast and staying proactive in an environment where the playbook is still being written.
  • Relatable & Adaptable: You can navigate different personalities across the organization from high-energy sales teams to analytical engineering partners.
  • Lifelong Learner: You have a thirst for learning keeping up with emerging technologies and industry trends.

What We're Looking For

  • Experience: 5+ years in SRE DevOps Platform Engineering or Infrastructure Engineering.
  • Cloud Expertise: Proven experience supporting production SaaS systems in Azure (preferred) AWS or GCP.
  • Technical Stack: Strong Linux networking and distributed systems troubleshooting skills.
  • Containers: Strong experience with containers and orchestration (Kubernetes/EKS/AKS).
  • IaC & Tooling: Expertise with Infrastructure-as-Code (Terraform strongly preferred).
  • Programming: Strong scripting/programming skills in Python Go Bash or C#/.NET.
  • Observability: Hands-on experience with Datadog Prometheus/Grafana or OpenTelemetry.

What We Offer

  • Flexible vacation
  • 12 company-paid holidays
  • 10 paid sick days
  • No work on your birthday
  • Health dental and vision Insurance (including a $0 option)
  • 401(k) with matching and no waiting period
  • Equity
  • Life insurance
  • Generous parental paid leave
  • Wellness reimbursement of $300/year
  • Remote worker reimbursement of $300/year
  • Professional development reimbursement
  • Competitive pay
  • An award-winning culture

Not sure if you check all the boxes? Apply anyway! 

We know that great talent comes in many forms and we value potential just as much as experience. If you're excited about this role and believe you can grow into it we’d love to hear from you. We’re looking for people who are eager to learn adapt and solve challenges—so if that sounds like you don’t let a checklist hold you back!

Change doesn't happen overnight and the same goes for us here at CertifID. We evolve collectively and individually as we grow by leaning into the core values that define us. As we grow we embody GRIT—collectively and individually—to raise the bar and influence outcomes in everything we do. Guard the Customer - Raise the Bar - Influence Outcomes - Teamwork Wins

Top Skills

.Net
Aks
AWS
Azure
Bash
C#
Datadog
Eks
GCP
Go
Grafana
Kubernetes
Linux
Opentelemetry
Prometheus
Python
Terraform

What the Team is Saying

Will
Natalia
Lei
Boris
Joelle
Nick
Luis
Tyler
Claudia
Am I A Good Fit?
beta
Expert contributor network
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Austin TX
130 Employees
Year Founded: 2018

What We Do

CertifID helps to create a world without wire fraud. Started after our co-founder was hit by fraud – we’re the only company dedicated to fighting fraud for the real estate industry with an identity verification SaaS platform insurance and proven recovery services. CertifID helps safeguard billions of dollars every month from fraud and provides peace of mind with direct insurance coverage on every wire it protects.

Why Work With Us

CertifID is a mission-driven company where every team member is a frontline defender against fraud. We operate with GRIT a set of core values anchored by our commitment to Guard the Customer above all else.

Gallery

CertifID Teams

Team
Stopping Fraud Through Tech
About our Teams

CertifID Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Typical time on-site: Flexible
Austin TX
Grand Rapids MI
Learn more

Similar Jobs

CertifID

Staff Software Engineer

Legal Tech • Real Estate • Security • Software • Cybersecurity • PropTech
Easy Apply
Remote or Hybrid
2 Locations
130 Employees

CertifID

Customer Success Manager

Legal Tech • Real Estate • Security • Software • Cybersecurity • PropTech
Easy Apply
Remote or Hybrid
2 Locations
130 Employees

CertifID

Director/Senior Manager Marketing Operations

Legal Tech • Real Estate • Security • Software • Cybersecurity • PropTech
Easy Apply
Remote or Hybrid
2 Locations
130 Employees

CertifID

Senior Manager Sales Operations

Legal Tech • Real Estate • Security • Software • Cybersecurity • PropTech
Easy Apply
Remote or Hybrid
2 Locations
130 Employees
Apply Now

Date Posted

03/28/2026

Views

0

Back to Job Listings Add To Job List Company Profile View Company Reviews
Neutral
Subjectivity Score: 0
142,000+ Jobs Tracked
12,400+ Companies
1,930 Categories