Job Description
About Us
Fieldguide is establishing a new state of trust for global commerce and capital markets through automating and streamlining the work of assurance and audit practitioners specifically within cybersecurity privacy and financial audit. Put simply we build software for the people who enable trust between businesses.
We’re based in San Francisco CA but built as a remote-first company that enables you to do your best work from anywhere. We're backed by top investors including Growth Equity at Goldman Sachs Alternatives Bessemer Venture Partners 8VC Floodgate Y Combinator DNX Ventures Global Founders Capital Justin Kan Elad Gil and more.
We value diversity in backgrounds and in experiences. We need people from all backgrounds and walks of life to help build the future of audit and advisory. Fieldguide’s team is inclusive driven humble and supportive. We are deliberate and self-reflective about the kind of team and culture that we are building seeking teammates that are not only strong in their own aptitudes but care deeply about supporting each other's growth.
As an early stage start-up employee you’ll have the opportunity to build out the future of business trust. We make audit practitioners’ lives easier by eliminating up to 50% of their work and giving them better work-life balance. If you share our values and enthusiasm for building a great culture and product you will find a home at Fieldguide.
About the Role
As a Senior Site Reliability Engineer (SRE) at Fieldguide you will be responsible for ensuring the reliability scalability and observability of our production systems. You will apply software engineering principles to infrastructure and operations designing systems that are resilient highly available and capable of scaling with rapid growth.
You’ll work closely with product and platform engineering teams to define and implement reliability standards improve system performance and build robust observability practices. This role is central to maintaining a high level of trust in our systems by proactively identifying risks reducing toil through automation and driving operational excellence.
Our engineering hub is located in San Francisco. This role is open to remote candidates anywhere in the US; Bay Area-based employees will work in a hybrid setting.
What You’ll Do
Design and operate highly scalable fault-tolerant systems that support production workloads across a distributed cloud environment.
Define and implement Service Level Objectives (SLOs) Service Level Indicators (SLIs) and error budgets to guide reliability decisions.
Build and improve observability systems (metrics logs tracing) to provide deep visibility into system behavior and performance.
Lead efforts to improve system reliability and performance including capacity planning load testing and performance tuning.
Automate operational processes to reduce manual toil and improve system consistency and resilience.
Partner with engineering teams to design systems with reliability and scalability built in from the start.
Participate in and improve incident response on-call practices and post-incident reviews focusing on root cause analysis and systemic improvements.
Drive continuous improvement of system resilience including disaster recovery and chaos testing.
Establish best practices for monitoring alerting and incident management to ensure rapid detection and resolution of issues.
Advocate for reliability-focused engineering culture including blameless postmortems and operational excellence.
Who You Are
5+ years of experience in site reliability engineering infrastructure or a related software engineering discipline.
Strong experience operating and scaling distributed systems in cloud environments with AWS preferred.
Hands-on experience building and managing observability platforms (e.g. Datadog Prometheus Grafana CloudWatch).
Experience defining SLOs/SLIs and leveraging them to inform and drive engineering priorities.
Proficiency with Infrastructure as Code tooling particularly Terraform or equivalent.
Deep understanding of system performance reliability patterns and distributed system failure modes.
Experience supporting production systems through on-call rotations and incident response.
Proficiency in at least one programming or scripting language used for automation and tooling.
Strong communication and collaboration skills with the ability to work effectively across engineering and product teams.
Bonus Points
Experience implementing distributed tracing systems such as OpenTelemetry or similar frameworks.
Experience with capacity planning and performance benchmarking at scale.
Familiarity with database performance tuning and observability across high-traffic systems.
Exposure to regulated or compliance-heavy engineering environments (e.g. SOC 2 FedRAMP or equivalent frameworks).
Experience applying chaos engineering practices to proactively test and strengthen system resilience.
More about Fieldguide
Fieldguide is a values-based company. Our values are:
Fearless - Inspire & break down seemingly impossible walls.
Fast - Launch fast with excellence iterate to perfection.
Lovable - Deliver happiness & 11 star experiences.
Owners - Execute & run the business with ownership.
Win-win - Create mutual value & earn trust for life.
Inclusive - Scale the best ideas with inclusive teams.
Some of our benefits include
Competitive compensation packages with meaningful ownership
Flexible PTO
401k
Wellness benefits including a bundle of free therapy sessions
Technology & Work from Home reimbursement
Flexible work schedules
Skills Required
- 5+ years of experience in site reliability engineering infrastructure or related software engineering discipline
- Strong experience operating and scaling distributed systems in cloud environments with AWS preferred
- Hands-on experience building and managing observability platforms (e.g. Datadog Prometheus Grafana CloudWatch)
- Experience defining SLOs/SLIs and leveraging them to inform and drive engineering priorities
- Proficiency with Infrastructure as Code tooling particularly Terraform or equivalent
- Deep understanding of system performance reliability patterns and distributed system failure modes
- Experience supporting production systems through on-call rotations and incident response
- Proficiency in at least one programming or scripting language used for automation and tooling
Fieldguide Compensation & Benefits Highlights
The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Fieldguide and has not been reviewed or approved by Fieldguide.
- Fair & Transparent Compensation—Pay is considered competitive for a venture-backed startup with public postings including explicit ranges that clarify expectations. Self-reported compensation bands across multiple roles align with a generally strong total compensation mix.
- Healthcare Strength—Health dental vision mental health support and related protections are offered indicating broad baseline coverage. Employer-verified listings include HSA/FSA options disability and life insurance.
- Leave & Time Off Breadth—Flexible or unlimited PTO is advertised with additional paid time categories noted on public benefits grids. Flexible schedules and a distributed setup can facilitate taking time off.
Fieldguide Insights
What We Do
Fieldguide offers market-leading Artificial Intelligence and Cloud for Advisory and Audit firms. Built by former Big Four practitioners and veteran technology leaders our platform digitizes the end-to-end engagement workflow on a single cloud-native platform. Fieldguide's AI Advisory & Audit Cloud is trusted by top CPA firms to unlock growth increase margins and delight clients. Fieldguide AI is award winning being recognized by CPA Practice Advisor (3x Technology Innovation Award) and Accounting Today (2x Top New Product). Fieldguide is based in San Francisco and backed by top investors like 8VC Floodgate Y Combinator Fourth Realm Justin Kan Eric Ries and many more
Similar Jobs
Similar Companies Hiring
Explore More
Date Posted
05/27/2026
Views
0
Similar Jobs
Senior Manager, Technical Program Management - Capital One Software (Remote) -
Views in the last 30 days - 0
View Details