Job Description
Rackner is a cloud-native consultancy which enables digital transformation for large organizations through the newest in distributed technologies. We're laser focused on end-to-end application development, DevSecOps, AI/ML, and systems architecture, wrapped up in a methodology focused on cloud-first and cost-effective innovation.
Title: Site Reliability Engineer
Location: Remote
Job Summary:
Rackner is looking for a DevOps Engineer who has deep experience with Kubernetes to join an existing team of DevOps and Cloud Engineers who are building out reusable Infrastructure as Code and CI/CD pipelines, to be deployed in a large enterprise environment. The engineer should have experience in advanced Kubernetes patterns (blue/green updates, stateful workloads, writing custom controllers) in order to contribute to this existing DevOps and cloud architecture.
Essential Functions:
A senior reliability engineer is a senior software developer than ensures that DevSecOps principals are followed through the entire software delivery lifecycle. They cross multiple product teams to get a grasp of a product and/or programs overall state of health. If a product team is in a poor state of health, the medic is responsible for reviving them and bringing them back to a healthy software state. Considered an expert on the CI/CD process and the overall vision of the program.
- Applies fundamental concepts, processes, practices,
- and procedures on technical tasks
- Performs work that requires practical experience and training
- Work is performed under supervision
- Possesses and applies expertise on multiple complex
Minimum Education and/or Experience:
ยท B.S. in Computer Science, Information Systems, or equivalent relevant work experience
Requisite Skills/Abilities:
- Siite reliability engineers (SREs) combine engineering experience and an innate drive to improve existing systems and processes, with the creativity to develop novel solutions to evolving challenges. For organizations, SREs are typically responsible for the availability and reliability of critical platform services and applications, ensuring they meet the requirements of internal and external users. The best SREs are motivated to collaborate with business leaders in building and running sustainable production systems, which can evolve and adapt to changes in a global business environment.
- Experience in Technical Customer Service, Customer Management, and experience in escalations may be required.
- Ability to obtain a Secret clearance
- Creating and maintaining documentation for implementations
- Be available to respond to incidents that impact
- Platform One availability and provide support for service engineers with customer incidents.
- Run our infrastructure with Chef, Ansible, Terraform, GitLab CI/CD, and Kubernetes.
- Build monitoring that alerts on symptoms rather than on outages.
- Document every action so your findings turn into repeatable actions and then into automation.
- Use the GitLab product to run GitLab.com as a first resort and improve the product as much as possible
- Improve operational processes (such as deployments and upgrades) to make them as boring as possible.
- Design, build and maintain core infrastructure that enables GitLab scaling to support hundreds of thousands of concurrent users.
- Debug production issues across services and levels of the stack.
Core Competencies:
- Customer Service & Communication
- Building Relationships
- Business Knowledge / Organizational Acumen
- Self-Motivation/Self Starter
- Leading Self and Others
Benefits:
- 401k with 100% matching up to 6%
- Highly competitive PTO
- Great health insurance with large network of providers
- Medical, Dental and Vision
- Life insurance, and short + long term disability
- Fitness/Gym Benefit
- Industry-Leading Weekly Pay Schedule
- Home office & equipment plan
- Employee Swag, Snacks & Events
Date Posted
11/09/2022
Views
3
Similar Jobs
Senior Solutions Engineer - Commerce Intelligence Platform -
Views in the last 30 days - 0
View Details