Site Reliability Engineer
Job Description
We are looking for a Site Reliability Engineer to join our team and develop software systems and automated solutions for operational aspects in an organization.
Site Reliability Engineer responsibilities include monitoring computer systems and building alerts for various operational issues that computer systems can experience.
Ultimately, you will work with our IT team to ensure our organization can continue to deliver products and services in our computer system environment.
Responsibilities:Â
- You will automate the server provisioning process to reduce the labor of our networking engineering and datacenter operations teamsÂ
- Perform deep dives into both systemic and latent reliability issuesÂ
- Drive standardization efforts across multiple disciplines and services in conjunction with embedded SREs throughout the organizationÂ
- Build and maintain low latency, high performance, scalable systems in a polyglot architectureÂ
- Ensure the team can support massive, global user growth while achieving rigorous SLAsÂ
- Enforce best practices for metrics gathering, monitoring, and alarmingÂ
- Accelerate the team by making service/application deployment and builds fast, simple, and reliableÂ
- Provide basic data administration and optimization, and monitoringÂ
- Provide basic network administration and troubleshootingÂ
- Assist in the technology selection for core computing and software infrastructure along with data persistenceÂ
Qualifications:Â
- Proven work experience as a Site Reliability Engineer or similar role within a Microsoft Azure or AWS cloud/hybrid-cloud environmentÂ
- Experience with infrastructure management tools such as TerraformÂ
- Experience with web server configuration, monitoring, trending, network design, high availabilityÂ
- Proficiency in a scripting languageÂ
- Practical, solid knowledge of shell scripting and at least one higher-level language (Bash or PowerShell preferred)Â
- Comfortable configuring DNS, DHCP, and LAN/WAN technologiesÂ
- Collaborate and communicate asynchronouslyÂ
- Document all the things so you don’t need to learn the same thing twiceÂ
- Have an enthusiastic, go-for-it attitudeÂ
- Relevant training and/or certifications as a Site Reliability EngineerÂ
- US Citizenship required
- Ability to obtain a U.S. Government clearance
Explore More
Date Posted
02/27/2024
Views
1
Similar Jobs
2025 Sensor Modeling and Simulation Analysis Engineer - The Aerospace Corporation
Views in the last 30 days - 0
The Aerospace Corporation is a trusted partner to the nations space programs providing technical expertise and innovative solutions across satellite l...
View DetailsInformation Security Consultant - Application Security Engineer - MassMutual
Views in the last 30 days - 0
MassMutual is seeking an experienced Application Security Engineer to join their dedicated team The role involves driving security best practices cond...
View DetailsRegional Director Public Sector Sales DOW - Chainguard
Views in the last 30 days - 0
The job seeks a Regional Director with sales expertise and security clearance to lead public sector initiatives and build partnerships Responsibilitie...
View DetailsManager, Customer Success - Bold Penguin
Views in the last 30 days - 0
Bold Penguin a leading digital solution platform for small commercial insurance is seeking a Manager of Customer Success The role involves leading a t...
View DetailsManager, Project Manager - Capital One
Views in the last 30 days - 0
Capital One a Fortune 500 company and one of the nations top 10 banks is seeking a Manager Project Manager The role involves leading critical and stra...
View Details