Senior Solution Site Reliability Engineer (Remote)

The Hartford · Hartford, CT

Company

The Hartford

Location

Hartford, CT

Type

Full Time

Job Description

You are a driven and motivated problem solver ready to pursue meaningful work. You strive to make an impact every day & not only at work, but in your personal life and community too. If that sounds like you, then you've landed in the right place.

The Hartford

Senior Solution Site Reliability Engineer

The Hartford's Digital Enablement team is seeking a highly motivated, detail-oriented and results-driven Senior Solution Site Reliability Engineer who is going to be responsible for maintaining the stability of some of our key products. This is a unique opportunity to join a team responsible for creating and maintaining services at The Hartford that enable our users digital adoption. Successful candidates will have experience in delivering quality technical solutions, an inquisitive mindset, as well as the desire to contribute to the larger strategic technical vision. We are always innovating in this space and the right individual would have to be willing to work outside their comfort zone and pioneer new strategies in this fast-paced environment.

In addition, the candidate must have the ability to manage multiple priorities, willingness to understand existing processes and systems, and possess strong interpersonal and communication skills. The candidate should be able to take decisions quickly in consultation with the team members and subject matter experts and be able to build relationships and understand the dynamics and critical nature of the business.

Responsibilities of the position include:

  • The right individual will be highly motivated and self-organized. High level of independence but is also a team player.
  • Manage vendor teams to drive best practices and improve operational efficiencies.
  • Work across the organization to provide business process or design recommendations consistent with long term business and IT strategy
  • Develop and enhance the solution or portfolio based on demand and budget. Optimize operational efficiency.
  • Develop business case for significant enhancements, migration projects.
  • Maintain the reliability of the solution. Lead on call activities to mitigate incidents as quick as possible. Maintains personal responsibility and commitment to respond to and address incidents quickly.
  • Develop effective tooling, alerts, and response to both identify and address reliability risks including automatic problem detection and mitigation.
  • Develop effective tooling and automation in the CI and CD pipelines.
  • Engage with the service consumers to define and design functional and non functional requirements for the solution.
  • Develop and provide training, best practices, sample code to enable consumers to take advantage of the solution as the best degree possible.
  • For open source components, engage actively with the community on bug fixing, new or improved functionality and version currency. Contribute to the sustained success of the component.
  • For COTS components: engage actively with the vendor on bug fixing, new or improved functionality and version currency.
  • Execute in an agile framework (Scrum or Kanban). Effectively communicate questions and impediments to the team as needed.
  • Participate in relevant vendor, community, industry conferences.

Qualifications:

  • Degree in Computer Science or related discipline with a minimum of 5 to 7 years of work experience in IT systems operations and application development. Preferably some experience in an SRE role.
  • Good Software engineering skills preferably with experience in Java, Pega, Identity and Access Management products and Front-End technologies like Angular etc.
  • Understanding of Linux system internals, are familiar with the TCP or IP stack, network routing and load balancing.
  • Command of Observability tools such as DynaTrace, SumoLogic, TrueSight, CloudWatch, automation tools such as Ansible and CI or CD pipeline tools such as Jenkins, UDeploy, SonarQube, AppScan, Nexis.
  • Approach troubleshooting systematically and have a deep sense of ownership for whatever you work on.
  • Ability to root cause sources of instability in a high traffic, distributed system.
  • Experience with configuration and troubleshooting of Linux, Java, Scala, Docker, Kubernetes systems.
  • Understanding of large-scale complex systems from a reliability perspective.
  • Experience with cloud technologies and any certificates like AWS Certified DevOps Engineer, AWS Certified Developer, Microsoft Certified Azure DevOps Engineer, Microsoft Certified Azure Developer, Certified Kubernetes Administrator, Certified Kubernetes Application Developer a plus.
  • Strong relationship building skills
  • Exceptional Communication skills - written and verbal
  • Excellent presentation skills and ability to formulate ideas for presentation to upper management.
Candidates must be authorized to work in the US without company sponsorship

Compensation

The listed annualized base pay range is primarily based on analysis of similar positions in the external market. Actual base pay could vary and may be above or below the listed range based on factors including but not limited to performance, proficiency and demonstration of competencies required for the role. The base pay is just one component of The Hartford's total compensation package for employees. Other rewards may include short-term or annual bonuses, long-term incentives, and on-the-spot recognition. The annualized base pay range for this role is:

$114,240 - $171,360

Equal Opportunity Employer/Females/Minorities/Veterans/Disability/Sexual Orientation/Gender Identity or Expression/Religion/Age

About Us | Culture & Employee Insights | Diversity, Equity and Inclusion | Benefits

Senior Solution Site Reliability Engineer - IE07LE

Skills:

Date Posted

01/14/2023

Views

6

Back to Job Listings Add To Job List Company Profile View Company Reviews
Positive
Subjectivity Score: 0.8