Site Reliability Engineer at Rackner

Rackner is a cloud-native consultancy which enables digital transformation for large organizations through the newest in distributed technologies. We're laser focused on end-to-end application development, DevSecOps, AI/ML, and systems architecture, wrapped up in a methodology focused on cloud-first and cost-effective innovation.

Title: Site Reliability Engineer

Location: Remote

Job Summary:

Rackner is looking for a DevOps Engineer who has deep experience with Kubernetes to join an existing team of DevOps and Cloud Engineers who are building out reusable Infrastructure as Code and CI/CD pipelines, to be deployed in a large enterprise environment. The engineer should have experience in advanced Kubernetes patterns (blue/green updates, stateful workloads, writing custom controllers) in order to contribute to this existing DevOps and cloud architecture.

Essential Functions:

A senior reliability engineer is a senior software developer than ensures that DevSecOps principals are followed through the entire software delivery lifecycle. They cross multiple product teams to get a grasp of a product and/or programs overall state of health. If a product team is in a poor state of health, the medic is responsible for reviving them and bringing them back to a healthy software state. Considered an expert on the CI/CD process and the overall vision of the program.

Applies fundamental concepts, processes, practices,
and procedures on technical tasks
Performs work that requires practical experience and training
Work is performed under supervision
Possesses and applies expertise on multiple complex

Minimum Education and/or Experience:

· B.S. in Computer Science, Information Systems, or equivalent relevant work experience

Requisite Skills/Abilities:

Siite reliability engineers (SREs) combine engineering experience and an innate drive to improve existing systems and processes, with the creativity to develop novel solutions to evolving challenges. For organizations, SREs are typically responsible for the availability and reliability of critical platform services and applications, ensuring they meet the requirements of internal and external users. The best SREs are motivated to collaborate with business leaders in building and running sustainable production systems, which can evolve and adapt to changes in a global business environment.
Experience in Technical Customer Service, Customer Management, and experience in escalations may be required.
Ability to obtain a Secret clearance
Creating and maintaining documentation for implementations
Be available to respond to incidents that impact
Platform One availability and provide support for service engineers with customer incidents.
Run our infrastructure with Chef, Ansible, Terraform, GitLab CI/CD, and Kubernetes.
Build monitoring that alerts on symptoms rather than on outages.
Document every action so your findings turn into repeatable actions and then into automation.
Use the GitLab product to run GitLab.com as a first resort and improve the product as much as possible
Improve operational processes (such as deployments and upgrades) to make them as boring as possible.
Design, build and maintain core infrastructure that enables GitLab scaling to support hundreds of thousands of concurrent users.
Debug production issues across services and levels of the stack.

Core Competencies:

Customer Service & Communication
Building Relationships
Business Knowledge / Organizational Acumen
Self-Motivation/Self Starter
Leading Self and Others

Benefits:

401k with 100% matching up to 6%
Highly competitive PTO
Great health insurance with large network of providers
Medical, Dental and Vision
Life insurance, and short + long term disability
Fitness/Gym Benefit
Industry-Leading Weekly Pay Schedule
Home office & equipment plan
Employee Swag, Snacks & Events

Site Reliability Engineer

Company

Location

Type

Job Description

Explore More

Date Posted

Views

Similar Jobs

Senior Software Engineer -

Senior Solutions Engineer - Commerce Intelligence Platform -

Senior Software Engineer -

information technology (IT) analyst - Mindrift

finance director - The Pod Group

Account Manager -