Site Reliability Engineer
Company
Egen
Location
West Suburbs
Type
Full Time
Job Description
Egen is a data and cloud modernization firm helping industry-leading companies achieve digital breakthroughs and deliver for the future, today. We are catalysts for change who create digital breakthroughs at warp speed. Our team of Salesforce, cloud, and data platform experts are trusted by top clients in pursuit of the extraordinary. An Inc. 5000 Fastest Growing Company 7 times, and recently recognized on the Crain’s Chicago Business Fast 50 list, Egen has also been recognized as a Great Place to Work 4 times.
We are seeking a Site Reliability Engineer to ensure system reliability and infrastructure support. You will be responsible for delivering scalability, performance optimization, incident management, and analysis.
Responsibilities:
- Ensure system reliability and uptime of applications depending on the SLA’s
- Monitor system performance metrics and determine the approaches to optimize the system
- Lead incident management efforts with available methodology and document RCA(Root Cause Analysis), lessons learned, and any SOP’s for solving the issue in future
- Work closely with DevOps and Application teams to align priorities, share knowledge and drive continuous improvement initiatives
- Prioritize response efforts based on issue severity, potential impact on users, and business priorities
- Evaluate and approve changes to production systems, balancing the need for innovation with the requirement of stability and reliability
- Optimize resource usage and manage costs by identifying inefficiencies, rightsizing infrastructure resources, and implementing cost-saving measures
What we're looking for:
- 3+ years of SRE experience
- Bachelor’s Degree is preferred but will consider relevant experience as an equivalent
- Scripting: Python, Bash/Shell, Ruby, Java, .Net, SQL
- DataDog, NewRelic, Splunk, Grafana
- Docker, Kubenernets, Linux
- VictorOps, PagerDuty
- Git, Bitbucket
- Troubleshooting complex, intertwined distributed services
- Attention to detail
- Testing, Monitoring, Logging, Alerting
- Documentation
- Incident Management
Date Posted
02/27/2024
Views
0
Similar Jobs
60K Signing | Established ORS Team | 687K Salary + wRVU | Epic | Princeton WV - Jackson Physician Search
Views in the last 30 days - 0
View DetailsMarshall Health Network Geriatrics Opening - Pinnacle Health Group
Views in the last 30 days - 0
View DetailsStaff Engineer, Software Test - Thermo Fisher Scientific
Views in the last 30 days - 0
Education Bachelors or equivalent experience in computer science Engineering or a related field Build develop and maintain robust scalable and reusabl...
View DetailsSoftware Engineer, Product - AutoAssist
Views in the last 30 days - 0
Direct impact on technical strategy and company direction You will report directly to the Founder and CEO and play a pivotal role in shaping both our
View DetailsSoftware Engineer, AI - AutoAssist
Views in the last 30 days - 0
Autonomydriven mindset you thrive when you own the details and project manage your own work Direct impact on AI product strategy and technical direct...
View DetailsSecurity Shift Supervisor - DSI Security Services
Views in the last 30 days - 0
A validdrivers licenseis required Conduct post inspections and ensure compliance with sitespecific procedures Basic to intermediate computer proficien...
View Details