Site Reliability Engineer
Job Description
Dark Wolf Solutions is looking for a Site Reliability Engineer (SRE) Subject Matter Expert (SME) to build and maintain infrastructure as code on large scale multi-site deployments. The SRE SME shall utilize their technical leadership experience to evaluate and assess new ways to scale platform capabilities. The SRE shall be able to automate workflows to help push the limit of the infrastructure and enable continuous delivery of capabilities onto a hybrid infrastructure. The engineer shall be able to troubleshoot issues until root causes are understood on high traffic production systems, participate in design and code review processes, interact with product owners to coordinate infrastructure changes and be responsible for identifying bottlenecks and improving performance of the platform.
Responsibilities:
- Collaborate cross-functionally with software developers, engineers, and operations teams.
- Monitor sites and software to make sure theyโre performing properly.ย
- Anticipates potential problems before they occur and provides dynamic solutions.ย
- Conduct post-incident reviews.ย
- Build sustainability by coding automation within a site infrastructure.
- Experience in Technical Customer Service, Customer Management, and experience in escalations may be required.
- Run our infrastructure with Chef, Ansible, Terraform, GitLab CI/CD, and Kubernetes.
- Design, build and maintain core infrastructure that enables GitLab scaling to support hundreds of thousands of concurrent users.
- Respond to incidents that impact platform availability and provide support for service engineers with customer incidents.
- Build monitoring that alerts on symptoms rather than on outages.
- Debug production issues across services and levels of the stack.
- Create and maintain documentation for actions and implementations to drive sustainability and then automation.
- Drive the strategic plan for PaaS infrastructure growth
Required Qualifications:
- 4+ years of experience developing production software leveraging modern languages (including: Java, Python, Go, NodeJS, etc.)
- 1+ years of experience developing containerized services deployed in production on orchestration platforms such as Kubernetes, Mesos, Swarm, etc.
- 3+ years of experience with agile and lean software development philosophies.
- 1+ years of experience working with relational and/or non-relational databases e.g. PostgreSQL, MySQL, MongoDB, Elasticsearch etc.
- 2+ years of demonstrated experience with modern version control systems such as Git, Subversion, Mercurial, etc.
- HS Diploma
- US Citizenship and clearable to a DoD Secret security clearance or higher.
- CompTIA Security+ CE or other DoD 8570 IAT II certification
- Bachelor Degree in Computer Science, Mathematics, or equivalent technical degree; or equivalent industry experience
The salary range for this position is $110,000 - $159,000, commensurate on experience.ย
We are proud to be an EEO/AA employer Minorities/Women/Veterans/Disabled and other protected categories.
In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification form upon hire.
ย
Date Posted
10/20/2022
Views
6
Similar Jobs
Software Engineer Networking Software and Services - xAI
Views in the last 30 days - 0
The text describes xAIs mission to develop AI systems for understanding the universe and advancing human knowledge It outlines a role involving networ...
View DetailsAssociate Technical Support Engineer - Recharge
Views in the last 30 days - 0
Recharge is a subscription platform for innovative brands offering customer retention solutions They seek Technical Support roles with 247 coverage em...
View DetailsFull Stack Product Engineer - Jiga
Views in the last 30 days - 0
Jiga is a remotefriendly company focused on empowering engineers with trust autonomy and flexibility They emphasize simplicity ownership and impactful...
View DetailsSenior Design Manager (Infrastructure) - Canonical
Views in the last 30 days - 0
Canonical a leading opensource provider seeks a Senior Design Manager to drive innovation in cloud and AI technologies The role offers remote work glo...
View DetailsSenior Product Designer - Org & Security - Typeform
Views in the last 30 days - 0
This job description outlines a role in developing an intelligent contact management system with AI capabilities The position involves designing user ...
View DetailsExecutive Director Patient Advocacy - Kyverna Therapeutics
Views in the last 30 days - 0
Kyverna Therapeutics is seeking an Executive Director for Patient Advocacy to lead initiatives in autoimmune disease treatment The role involves build...
View Details