Lead Site Reliability Engineer

EPAM Systems Rockaway, NJ

Company

EPAM Systems

Location

Rockaway, NJ

Type

Full Time

Job Description

Join our dynamic team as a Lead Site Reliability Engineer! If you have a substantial background in software and systems engineering and a focus on reliability and scalability in cloud environments, your expertise is needed in managing and communicating with IoT devices via our platform. You will have a critical role in duties such as device registration and connection, bi-directional messaging between devices and the cloud, device state tracking and data storage, issuing alerts and notifications for device state changes, and integrating other cloud services like Device Registry and Firmware Upgrade.
Unlock the potential of remote work in Kyrgyzstan, giving you the flexibility to work from home or access our office in Bishkek.

Want more jobs like this?

Get jobs in Rockaway, NJ delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.


#LI-DNI#LI-VA2

Responsibilities
  • Design, implement, and maintain highly scalable and available systems across Azure cloud architectures
  • Regularly test and implement disaster recovery (DR) plans
  • Configure and enhance monitoring and alerting processes using Prometheus, Grafana, and OpsGenie
  • Develop dashboards to visualize system performance and reliability metrics
  • Use Terraform for infrastructure provisioning and management
  • Support the development team in ongoing projects
  • Communicate with the customer's DevOps team to discuss requirements and collaborate on implementations
  • Enhance release management and CI/CD processes
  • Improve system security based on security team recommendations
  • Document system support processes and design, write and test runbooks for operational tasks and incident response
Requirements
  • Minimum 5 years of experience as a DevOps or SRE engineer
  • Proven experience with Azure cloud architectures
  • Proficiency in Kubernetes and Docker/Linux services
  • Familiarity with monitoring tools: Prometheus, Grafana, OpsGenie
  • Experience with .NET Core and ASP.NET Core applications
  • Strong knowledge of Cosmos DB (both Mongo API & SQL API) and MS SQL Server
  • Expertise in Terraform
  • Experience with CI/CD tools and Azure Networking concepts
  • Excellent communication skills, ability to manage tasks and projects independently
  • Experience with Azure IoT Hub and EventHub is an added advantage
We offer
  • We connect like-minded people::
    • Delivering innovative solutions to industry leaders, making a global impact
    • Enjoyable working environment, whether it is the vibrant office or the comfort of your own home
    • Opportunity to work abroad for up to two months per year
    • Relocation opportunities within our offices in 55+ countries
    • Corporate and social events
  • We invest in your growth::
    • Leadership development, career advising, soft skills and well-being programs
    • Certifications, including GCP, Azure and AWS
    • Unlimited access to LinkedIn Learning, Get Abstract, O'Reilly
    • Free English classes with certified teachers
  • We cover it all::
    • Monetary bonuses for engaging in the referral program
    • Medical & family care package
    • Six trust days per year (sick leave without a medical certificate)
    • Coverage of psychology sessions of your choice
    • Discounts for fitness clubs and sports programs
    • Benefits package (sports activities, a variety of stores and services)
EPAM Kyrgyzstan is a team of technologists and innovators united by a passion for technology. In 2022, we opened our first office in Bishkek that works with the world's leading companies across many different industries. EPAM builds a continuously learning organization and helps its employees reach their full potential and achieve their professional goals through learning. Our agile methodologies, client collaboration frameworks, engineering excellence programs, and hybrid teams offer many career paths and development opportunities.

Apply Now

Date Posted

02/07/2025

Views

0

Back to Job Listings ❤️Add To Job List Company Info View Company Reviews
Neutral
Subjectivity Score: 0

Similar Jobs

Data Leak Protection Analyst - Barclays

Views in the last 30 days - 0

OR for an individual contributor they develop technical expertise in work area acting as an advisor where appropriate

View Details

Certified Electrician - Compass Group

Views in the last 30 days - 0

Utilize hand tools power tools and testing equipment such as ohmmeters to troubleshoot electrical issues Previous experience using hand tools power to...

View Details

Bathroom Installer - Premier Home Pros

Views in the last 30 days - 0

Basic knowledge of hand tools and power tools Must have a valid smart phone or smart device in order to receive work orders and update our CRM

View Details

AWS DevOps Engineer - AllShifts

Views in the last 30 days - 0

The AWS Engineer will be responsible for applying expertise to develop and execute requirements procedures and guidelines for AWS Infrastructure ensur...

View Details

Radar Software Engineer - In-Depth Engineering Corporation

Views in the last 30 days - 0

Perform in an agile fast paced environment applying advanced technologies software architecture design verification validation scientific principles a...

View Details

MIG Welder / Fabricator - Victory Truck Body

Views in the last 30 days - 0

Welding certification considered a plus Welding certification considered a plus Must have experience using equipment common to the welding trade eg Mi...

View Details