Principal SRE (Site Reliability Engineer) - REMOTE / WFH
Job Description
IdentityNow is SailPoint's Identity as a Service (IDaaS) product, and the Principal Site Reliability Engineer will be a key player on our Reliability Engineering team servicing the IdentityNow product suite. We are looking for engineers with broad experience in building and running distributed systems at global scale. If you enjoy analyzing complicated problems, devising creative solutions, and collaborating across teams to build reliable, scalable, and impactful systems, come join our Reliability Engineering team. We are a team of people who write software to solve scalability, observability, security, reliability, and operability problems.
What You'll Make Happen
- Make it easy for everyone to create, consume, manage, and scale reliable cloud production services to achieve more
- Work on SailPoint IdentityNow services to design, develop, and improve end-to-end reliability and maintainability for all services
- Serve as a thought leader on reliability initiatives to shape the future of our Reliability Engineering team
- Consult with our service teams on reliability, including architectural decisions, systemic observability, application performance, and on-call health training
- Lead cross-team projects with developers to deliver high quality solutions from ideas to production code
- Formulate and drive cross-functional requirements working with Engineering, Product, Services, and other departments
- Author clean and thorough design documents and code that exemplify quality, simplicity, and maintainability
- Design high complexity systems that prioritize the customer experience and perspective
- Consistently create tools, train others on them, and drive their adoption to deliver insights and automation that simplify the complex and reduce toil
- Lead and mentor others toward excellence in design reviews, code, test cases, automation, observability, root cause analysis, and self-healing
- Take responsibility for the effectiveness of your whole team's impact, in addition to your own work
- Build your team's reputation and improve how it interacts with and develops colleagues outside the team
- Work with managers and directors to come up with strategies for better team and individual development
Requirements
- 10+ years of experience as an SRE supporting a 24x7 highly available production environment for a SaaS or cloud service provider
- 5+ years of experience leading design and implementation of solutions to improve availability and resiliency of software services
- Experience with cloud infrastructure environments, preferably AWS, and infrastructure as code
- Experience with containerization technology and/or Kubernetes
- Experience with release automation, system administration, and configuration management
- Experience with programming languages (Java, Python, Go, etc.) and a strong understanding of Linux, software development, systems, networking, and cloud concepts
- Strong interpersonal and teamwork skills, including the ability to set and enforce process and to influence engineers who are not direct reports
- Ability to operate in an agile, entrepreneurial start-up environment
Education
- Bachelor's degree in Computer Science or other technical discipline, or equivalent experience
SailPoint is an equal opportunity employer and we welcome everyone to our team. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status.
Date Posted
09/05/2022