Site Reliability Engineer
Company
Focal Systems
Location
San Francisco, CA
Type
Full Time
Job Description
Location: San Francisco - hybrid (1-2 days per week)
Salary: $165-175k + stock
Company Description
Focal Systems is the industry leader in retail AI solutions. We are a Silicon Valley based startup that has more than doubled in size every year since inception. We are a Deep Learning first company. Our mission is to automate and optimize brick and mortar retail using deep learning computer vision. Focal Systems has been deployed at scale with the top retailers in the world. We are looking for smart, creative and passionate people who want to help build a great and enduring company and deploy Deep Learning to the world!
Mission of the role:
To enable us to scale from 200k to 1 million cameras
Job Summary
As a Sr. DevOps/Site Reliability Engineer (SRE) at our company, you will play a pivotal role in ensuring the smooth operation and continuous improvement of our infrastructure, deployment processes, and overall system reliability.
Responsibilities
- Set up and manage blue/green and canary deployments to ensure smooth launches without downtime.
- Operate multiple large GCP Kubernetes clusters and fine tune for reliability vs cost
- Manage the various distributed services of the company, ensuring to always provide graceful updates, comprehensive test coverage, tracking of logs, and 99.9% uptime
- Work with Backend, Frontend and Deep Learning teams and write infrastructure automation code for their needs
- Identify scalability bottlenecks through load testing and plan infrastructure architecture
- Create tools to provide transparency/ease of access into the company's rich datasets stored across varying geographic locations and data formats
- Design, build, and manage a robust Continuous Integration and Continuous Deployment (CI/CD) pipeline.
- Lead uptime improvement processes including: postmortem review, on-call setup.
Requirements
- Solid experience in an infrastructure or Site Reliability Engineer (SRE) role
- Hands-on experience with containerization (Docker) and orchestration platforms (Kubernetes) required
- Experience in cloud cost management
- Great understanding of SQL, networking, distributed systems, operating systems (debian) and software engineering practices
- Experience with messaging systems
- Terraform or other Infrastructure as Code automation solution
- Operating Relational SQL databases and Redis at terabyte scale.
- Proven experience with setting up monitoring/alerting and reliability engineering
- Scriptings skills in Python
Nice to have experience:
- GitOps
- Setting up automation for complex load testing scenarios
- Tuning Deep Learning pipelines with Python, Pytorch and Multiprocessing
- Backend programming with Python
Why Focal Systems
Strong Values and Mission - We are a tightly-knit team with an ambitious mission and a strong set of core values, which define our approach to business and have successfully guided us since inception.
Exceptional Team - We are a team of hard-working, fun-loving professionals from some of the most eminent universities, research labs, and tech companies of our time. We pride ourselves on recruiting exceptional individuals to help us redefine the state-of-the-art.
Outstanding Partners - We work with 10+ of the largest retailers in the world and have a world-class roster of investors, advisors and partners to support & advise us in our endeavors.
Benefits
We care deeply about the health, happiness, and wellbeing of all of our employees. We offer:
- Competitive Salary & Attractive Stock
- Paid Time Off
- Quarterly Team Retreats
- Education grants
Date Posted
11/08/2024
Views
0
Similar Jobs
Software Engineer, Data Platform (Lead) - Benchling
Views in the last 30 days - 0
Benchling a leading biotechnology company is seeking a Senior Software Engineer to design and implement scalable multitenant services and APIs The rol...
View DetailsSenior Product Manager, Dev Solutions - Atlassian
Views in the last 30 days - 0
Atlassian offers a remote position for a Product Manager in the Dev Solutions team The role involves collaborating with crossfunctional teams to lead ...
View DetailsTreasury Management Officer - Technology and Disruptive Commerce - JPMorganChase
Views in the last 30 days - 0
The job posting is for a Treasury Management Officer in Commercial Banking The role involves generating new treasury management business maintaining c...
View DetailsRelationship Executive, Middle Market Banking - Executive Director - JPMorganChase
Views in the last 30 days - 0
The job description is for a Relationship Executive role in the Middle Market Banking team The role involves building and retaining profitable relatio...
View DetailsInternal Audit & SOX Senior - Chime
Views in the last 30 days - 0
Chime is seeking a Senior Internal Audit and SOX professional to implement a worldclass SOX program and contribute to the broader internal audit funct...
View DetailsSMB Account Executive - Benchling
Views in the last 30 days - 0
Benchling a biotechnology company is seeking a motivated SMB Account Executive to drive new business and expand their customer base The role involves ...
View Details