Service Reliability Engineer (SRE) III
Job Description
About Route Mobile:
Route Mobile is a leading provider of cloud communication platform services, offering innovative solutions to empower businesses to connect and engage with their customers across various channels. With a commitment to reliability and scalability, Route Mobile enables enterprises to enhance customer experiences and drive business growth through seamless communication solutions.
Position Overview:
Route Mobile is seeking an experienced and dedicated Service Reliability Engineer (SRE) III to join our dynamic team. In this role, you will be responsible for ensuring the reliability, availability, and performance of Route Mobile's cloud communication platform. Leveraging your expertise in systems engineering, automation, and monitoring, you will play a critical role in designing, implementing, and maintaining robust infrastructure and services that meet the needs of our customers.
Key Responsibilities:
- Infrastructure Design and Automation: Design, implement, and maintain scalable and resilient infrastructure using infrastructure-as-code principles and automation tools (e.g., Terraform, Ansible, Chef).
- Service Reliability: Monitor the health and performance of Route Mobile's services and systems, proactively identifying and resolving issues to ensure high availability and reliability.
- Incident Management: Respond to incidents and outages promptly, leading the troubleshooting and resolution efforts to minimize impact on customers and restore service functionality.
- Continuous Improvement: Drive continuous improvement initiatives to enhance the reliability, scalability, and performance of Route Mobile's infrastructure and services, leveraging best practices and industry standards.
- Capacity Planning: Conduct capacity planning and performance analysis to anticipate and address scalability requirements, ensuring adequate resources to support current and future workloads.
- Security and Compliance: Implement security best practices and compliance controls to protect Route Mobile's infrastructure and data assets, ensuring compliance with industry regulations and standards
- Change Management: Manage changes to production systems effectively, following change management processes and procedures to minimize risk and ensure stability.
- Documentation and Knowledge Sharing: Create and maintain documentation, runbooks, and operational procedures to facilitate knowledge sharing and onboarding of new team members.
- On-call Rotation: Participate in the on-call rotation schedule to provide 24/7 support for production systems and respond to incidents outside of regular business hours.
Qualifications:
- Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
- 5+ years of experience in systems engineering, infrastructure operations, or site reliability engineering.
- Proficiency in scripting and automation using languages such as Python, Bash, or PowerShell.
- Experience with cloud platforms (e.g., AWS, Azure, Google Cloud) and containerization technologies (e.g., Docker, Kubernetes).
- Strong understanding of networking concepts, distributed systems, and microservices architecture.
- Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack) for system observability and troubleshooting.
- Knowledge of incident management and post-incident analysis processes (e.g., SLIs, SLOs, SLAs).
- Excellent problem-solving and troubleshooting skills, with the ability to diagnose and resolve complex technical issues under pressure.
- Strong communication and collaboration skills, with the ability to work effectively in a fast-paced, team-oriented environment.
- Experience with Agile software development methodologies and DevOps practices is a plus.
Join Route Mobile:
If you are a skilled and motivated Service Reliability Engineer with a passion for ensuring the reliability and performance of cloud services, Route Mobile offers an exciting opportunity to make a significant impact in a rapidly evolving industry. Join us in shaping the future of cloud communication. Apply now!
Date Posted
09/09/2024
Views
0
Similar Jobs
Software Architecture Engineering and Cloud Computing Engineer - The Aerospace Corporation
Views in the last 30 days - 0
The Aerospace Corporation is seeking a Senior Project Engineer with expertise in software architecture engineering and cloud computing The role involv...
View DetailsLead Technical Support Engineer - HERE Technologies
Views in the last 30 days - 0
This role Senior Technical Support Engineer at HERE Technologies involves supporting a diverse portfolio of products and services acting as a technica...
View DetailsPrincipal / Lead Software Engineer- RUST (Algorithmic and Mathematics) - m/w/d - HERE Technologies
Views in the last 30 days - 0
HERE Technologies is seeking a Principal Software Engineer to lead the development of extended services for their VRP solver Tour Planning The role in...
View DetailsSenior Software Engineer (Scala/Java) - HERE Technologies
Views in the last 30 days - 0
HERE Technologies is seeking an experienced backend engineer with strong Java or Scala skills to join the Map Processing Pipelines team The role invol...
View DetailsSoftware Engineering Manager - Cargill
Views in the last 30 days - 0
The Software Engineering Manager job involves setting goals for a team responsible for software project development and delivery ensuring quality stan...
View DetailsSales Development Representative - UK (Remote) - Dscout
Views in the last 30 days - 0
Dscout is a company that specializes in experience research solutions helping innovative companies like Salesforce Sonos Groupon and Best Buy to build...
View Details