Application Support Engineer, Service Reliability Engineering
Job Description
Team: Support
This position is posted by Jobgether on behalf of a partner company. We are currently looking for an Application Support Engineer, Service Reliability Engineering in Canada.
This role sits at the core of service reliability and production excellence, ensuring the stability, performance, and scalability of critical enterprise systems. You will act as a key technical operator within a global Service Reliability Engineering environment, proactively monitoring system health and responding to incidents to minimize downtime and service disruption. The position combines deep technical troubleshooting with automation and continuous improvement, enabling resilient and efficient infrastructure operations. You will collaborate closely with development, infrastructure, and operations teams to enhance system reliability and embed best practices across the software lifecycle. Working in a fast-paced, cloud-driven environment, you will contribute to designing robust solutions that support high availability and performance at scale. This is a highly impactful role for an engineer passionate about operational excellence, automation, and system resilience.
Accountabilities:
- Design, maintain, and support highly available infrastructure and application systems to ensure reliability and scalability.
- Monitor system performance and health using observability tools, metrics, and alerts to proactively detect issues.
- Lead incident response activities, ensuring rapid resolution of production issues and minimal service disruption.
- Perform root cause analysis for system failures and implement long-term corrective and preventive solutions.
- Develop automation scripts and tools to reduce manual intervention and improve operational efficiency.
- Collaborate with engineering and development teams to integrate reliability best practices into system design and deployment.
- Maintain detailed technical documentation to support troubleshooting, knowledge sharing, and operational continuity.
- Continuously improve monitoring, alerting, and incident management processes to strengthen system resilience.
- 5+ years of experience in application support, Site Reliability Engineering (SRE), or infrastructure engineering roles.
- Strong experience managing highly available, cloud-based, or distributed systems.
- Proficiency in at least one programming or scripting language for automation (e.g., Python, Bash, or similar).
- Solid understanding of monitoring tools, logging systems, and incident management platforms.
- Strong knowledge of system performance optimization, troubleshooting methodologies, and CI/CD pipelines.
- Experience working with databases, distributed architectures, and cloud platforms.
- Strong analytical skills with the ability to interpret system metrics and identify patterns or risks.
- Excellent communication skills for cross-functional collaboration in global environments.
- Ability to remain calm and structured during high-pressure incident situations.
- Nice to have: experience with platforms such as ServiceNow, Salesforce, Oracle, Mulesoft, or similar enterprise tools.
- Strong ownership mindset with a proactive approach to identifying and preventing system issues.
- Competitive compensation aligned with Canadian market standards (with additional eligibility for bonuses)
- Comprehensive health, dental, and vision coverage
- Retirement savings plan with employer matching (DCPP / equivalent)
- Employee stock purchase program (ESPP)
- Paid vacation, sick leave, and statutory holidays
- Flexible work environment supporting work-life balance
- Employee assistance and wellness programs
- Opportunities to work on large-scale, mission-critical global systems
- Continuous learning and professional development opportunities
- Inclusive, diverse, and people-first engineering culture
Requirements:
Benefits:
Explore More
Date Posted
04/10/2026
Views
0
Similar Jobs
Staff Backend Engineer, AST: Composition Analysis - Jobgether
Views in the last 30 days - 0
View DetailsLead Software Engineer - Mobile Development (Crypto Wallets) - Jobgether
Views in the last 30 days - 0
View DetailsEngineering Manager, Core Product Engine (Backend) - Jobgether
Views in the last 30 days - 0
View Details