Staff Site Reliability Engineer

Location: Remote

Type: Full Time

Job Description

2 Locations
In-Office or Remote
210K-275K Annually
Expert/Leader
Software
The Role
As a Staff Site Reliability Engineer, you'll lead reliability strategies, design scalable systems, improve observability, and mentor engineers to enhance system performance and resilience.
Summary Generated by Built In

About Us

Fieldguide is establishing a new state of trust for global commerce and capital markets by automating and streamlining the work of assurance and audit practitioners, specifically within cybersecurity, privacy, and financial audit. Put simply, we build software for the people who enable trust between businesses.

We’re based in San Francisco, CA, but built as a remote-first company that enables you to do your best work from anywhere. We're backed by top investors including Growth Equity at Goldman Sachs Alternatives, Bessemer Venture Partners, 8VC, Floodgate, Y Combinator, DNX Ventures, Global Founders Capital, Justin Kan, Elad Gil, and more.

We value diversity in backgrounds and in experiences. We need people from all backgrounds and walks of life to help build the future of audit and advisory. Fieldguide’s team is inclusive, driven, humble, and supportive. We are deliberate and self-reflective about the kind of team and culture that we are building, seeking teammates who are not only strong in their own aptitudes but also care deeply about supporting each other's growth.

As an early-stage startup employee, you’ll have the opportunity to build out the future of business trust. We make audit practitioners’ lives easier by eliminating up to 50% of their work and giving them better work-life balance. If you share our values and enthusiasm for building a great culture and product, you will find a home at Fieldguide.

About the Role

As a Staff Site Reliability Engineer (SRE) at Fieldguide, you will play a critical leadership role in defining and driving the reliability, scalability, and observability strategy across our platform. You will operate as a technical leader and force multiplier, influencing system design, reliability standards, and engineering practices across multiple teams.

This role goes beyond operating our internal systems. You will shape how reliability is engineered into our products from the ground up. You’ll lead cross-functional initiatives, establish best practices, and mentor engineers while ensuring our systems remain resilient, performant, and scalable as the company grows.

Our engineering hub is located in San Francisco. This role is open to remote candidates anywhere in the US; Bay Area-based employees will work in a hybrid setting.

What You’ll Do

  • Lead the design and evolution of highly scalable, fault-tolerant distributed systems across our cloud infrastructure.

  • Define and drive adoption of SLOs, SLIs, and error budgets across engineering teams.

  • Architect and continuously improve observability platforms (metrics, logging, tracing).

  • Own reliability strategy and roadmap, proactively identifying risks and driving long-term improvements.

  • Lead cross-team initiatives to improve system performance, scalability, and resilience.

  • Establish and enforce best practices for incident response, on-call, and operational excellence.

  • Drive root cause analysis and systemic improvements through blameless postmortems.

  • Champion automation and reduction of operational toil.

  • Guide capacity planning, load testing, and performance optimization efforts.

  • Design and validate disaster recovery, failover strategies, and resilience testing.

  • Mentor and coach engineers to elevate reliability engineering maturity.

  • Partner with Staff engineers across the organization to drive meaningful change.

  • Partner with leadership to align business goals with reliability investments.

Who You Are

  • 10+ years of experience in software engineering, with a focus on distributed systems and production infrastructure.

  • Extensive experience operating and scaling distributed systems in cloud environments, with a strong preference for AWS.

  • Deep expertise in system reliability, scalability, and performance engineering at scale.

  • Demonstrated experience implementing SLO-driven engineering practices and reliability frameworks.

  • Strong background building and owning observability ecosystems (e.g., Datadog, Prometheus, Grafana).

  • Proficiency with Infrastructure as Code tooling, particularly Terraform or equivalent.

  • Proven experience leading incident management, post-mortems, and production operations.

  • Strong software engineering fundamentals, with the ability to contribute to and review complex codebases.

  • Track record of technical leadership and cross-functional influence across engineering and product teams.

  • Ability to balance tactical short-term needs with strategic long-term architectural improvements.

  • Excellent written and verbal communication skills, with the ability to translate complex technical concepts for diverse audiences.

Bonus Points

  • Experience designing or operating multi-region and globally distributed systems.

  • Deep expertise in distributed tracing and performance analysis across complex service architectures.

  • Hands-on experience with database scalability and performance tuning at scale.

  • Familiarity with compliance-driven engineering environments (e.g., SOC 2, FedRAMP, or similar frameworks).

  • Experience applying chaos engineering practices to validate and improve system resilience.

  • Experience building or scaling an SRE function within a high-growth organization.

More about Fieldguide

Fieldguide is a values-based company. Our values are:

  • Fearless - Inspire & break down seemingly impossible walls.

  • Fast - Launch fast with excellence, iterate to perfection.

  • Lovable - Deliver happiness & 11-star experiences.

  • Owners - Execute & run the business with ownership.

  • Win-win - Create mutual value & earn trust for life.

  • Inclusive - Scale the best ideas with inclusive teams.

Some of our benefits include

  • Competitive compensation packages with meaningful ownership

  • Flexible PTO

  • 401k

  • Wellness benefits, including a bundle of free therapy sessions

  • Technology & Work from Home reimbursement

  • Flexible work schedules

Skills Required

  • 10+ years of experience in software engineering
  • Extensive experience operating and scaling distributed systems in cloud environments
  • Deep expertise in system reliability, scalability, and performance engineering
  • Experience implementing SLO-driven engineering practices
  • Strong background building observability ecosystems
  • Proficiency with Infrastructure as Code tooling, particularly Terraform
  • Experience leading incident management and post-mortems
  • Strong software engineering fundamentals
  • Track record of technical leadership and cross-functional influence
  • Excellent written and verbal communication skills

Fieldguide Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Fieldguide. It has not been reviewed or approved by Fieldguide.

  • Fair & Transparent Compensation: Pay is considered competitive for a venture-backed startup, with public postings including explicit ranges that clarify expectations. Self-reported compensation bands across multiple roles align with a generally strong total compensation mix.
  • Healthcare Strength: Health, dental, vision, mental health support, and related protections are offered, indicating broad baseline coverage. Employer-verified listings include HSA/FSA options, disability, and life insurance.
  • Leave & Time Off Breadth: Flexible or unlimited PTO is advertised, with additional paid time categories noted on public benefits grids. Flexible schedules and a distributed setup can facilitate taking time off.

The Company
HQ: San Francisco, California
62 Employees
Year Founded: 2020

What We Do

Fieldguide offers market-leading Artificial Intelligence and Cloud for Advisory and Audit firms. Built by former Big Four practitioners and veteran technology leaders, our platform digitizes the end-to-end engagement workflow on a single cloud-native platform. Fieldguide's AI Advisory & Audit Cloud is trusted by top CPA firms to unlock growth, increase margins, and delight clients. Fieldguide AI is award-winning, recognized by CPA Practice Advisor (3x Technology Innovation Award) and Accounting Today (2x Top New Product). Fieldguide is based in San Francisco and backed by top investors like 8VC, Floodgate, Y Combinator, Fourth Realm, Justin Kan, Eric Ries, and many more.

Date Posted

05/27/2026

© 2026 Job Transparency. All rights reserved.