Sr. SRE Engineer, Resilience (Remote)

Enova · Chicago IL

Company

Enova

Location

Chicago IL

Type

Full Time

Job Description

The health and safety of Enova’s employees is our number one priority. Proof of vaccination will be required regardless of work location, unless prohibited by applicable state law. Employees may request an exemption to the vaccination policy due to medical reasons, sincerely held religious beliefs, or as otherwise permitted by applicable state law.

Enova is currently accepting candidates for remote positions in the following eligible states: AL, AK, AR, AZ, CT, GA, IA, ID, IL, IN, KY, LA, MA, ME, MD, MI, MN, MO, MS, NC, ND, NE, NH, NV, NJ, NM, OH, OK, OR, PA, RI, SC, SD, TN, UT, VT, WI, WV, WY.

What you’ll be doing:

In this role, you will help improve the resiliency of our services through technology, incident analysis, and process refinement.

You will work on optimizing how we deal with unexpected complex failures. This includes facilitating our incident response process, running blameless post-incident retrospectives, identifying and learning from recurring high-level trends, and integrating technology to reduce the effort needed to maintain these functions.

You will be responsible for learning how our systems and applications relate holistically so that you can react appropriately during outages and work alongside Subject Matter Experts to drive resolution. You will develop improvements to how we collect and analyze data around failures, adapting as our environment evolves.

You will collaborate with IT, Software Engineering, and product teams to foster a culture of quality where resilience is woven into our technology stack. You will show what different failure modes look like by running experiments (Mock Incidents, Disaster Recovery) and share learnings across the organization.

Your core priorities will be to:

  • Own Enova’s Production Incident Process end-to-end. 
  • Develop processes and technology to sustainably test and improve the resiliency of our services on an ongoing basis, balancing tech and business needs.
  • Manage process refactoring initiatives to ensure risk mitigation is considered, improving customer experience.
  • Collect data, perform trend analysis, and identify patterns of risks and vulnerabilities.
  • Work with leading teams, particularly principal engineers and production managers, to address vulnerabilities.
  • Socialize lessons learned among all teams to bolster the culture of operational ownership. 
  • Be part of our PI PIC (Incident Commander) rotation following training, leading incidents to completion, and driving post-incident analysis (including interviews, contributing factor analysis, incident response analysis, and remediation plans).

What you should have:

  • 3+ years of professional work experience in a technology role such as Software Engineering, Systems, Ops, SRE, or Product Management.
  • Interest in complex distributed systems - how they work, how they can work better, how to know if they are working correctly.
  • Superior analytical, problem solving, and critical thinking skills.
  • Understanding of infrastructure as code (Terraform, Chef, etc.).
  • Experience with query languages (Postgres, SQL, Kafka, etc.).
  • Ability to handle, analyze, and present data. 
  • Comfortable with ambiguity; able to translate ambiguous problems into strong solutions.
  • Demonstrates maturity, good judgment, negotiation, leadership and project management skills.
  • Excellent written and verbal communication skills, including the ability to communicate to different levels of an organization (i.e. on a technical vs. non-technical level).

Nice to have:

  • Experience with full stack development.
  • Experience with handling and leading resolution of major failures of critical systems. 
  • Experience driving large-scale changes.

About Resilience Engineering:

Resilience Engineering is a subset of the Site Reliability Engineering team that strives to drive a culture of continuous resiliency improvement in our systems. We do this by focusing on our incident response process, on incident analysis and learnings, and on creatively solving systemic hurdles to resiliency. We work closely with other Tech, Operations, and Business teams to resolve complex failures and to continuously learn.

Our goal at Enova is to recruit, hire, develop and maintain a diverse workforce. It is our policy to provide equal employment opportunity for all persons and not discriminate in employment decisions by placing the most qualified person in each job, without regard to any classification protected by federal, state, or local law.


About Enova:
Enova is a leading financial technology company providing online financial services through its AI- and machine learning-powered lending platform. Enova serves the needs of non-prime consumers and small businesses, who are frequently underserved by traditional banks. Enova has provided more than 7 million customers with over $40 billion in loans and financing with market-leading products that provide a path for them to improve their financial health. Want to learn more? Just ask any of our almost 1,500 employees.

At Enova, we believe that diversity and inclusion among our teammates is critical to our success as a global company, and we seek to recruit, develop and retain the most talented people from a diverse candidate pool. California applicants: see our California Privacy Policy for Job Applicants.

Date Posted

08/19/2022
