Job Description
The Site Reliability Engineering (SRE) team at Yahoo is the power behind engineering goodness for our production systems. By writing, designing and implementing software to drive velocity, operability, reliability and performance, the SRE team ensures continuous quality on production systems as we embed deeply in the many layers and stages of software development. We are all about: 1) Enabling a culture of ownership and excellence, 2) Engineering processes that are Automated and Agile, 3) Developing tools that are Self-Serve and (Re)Usable.
SRE's mission is to increase the uptime of Yahoo's properties by providing embedded subject matter experts to restore services around the clock. The SREs work together in a fast paced environment on a daily basis to solve complex problems before they impact the experiences of our customers. If you believe in the above, come join us. At Yahoo we want engineers that are self starters and problem solvers with the ability to do so with new and legacy code. Using innovative ideas to solve complex issues, while integrating easily with the running ecosystem, is a key trait that will fit very well in the organization.
About the role:
Do you have a passion for solving technical problems from the application to the network layer? Spend time trying to figure out how something works? Want to make real web applications and back-end systems faster, more reliable, more efficient? This position requires an aggressive troubleshooting expert who can multitask on problems of varying difficulty, priority and time-sensitivity in order to keep Yahoo's website services up and running. This versatile position requires familiarity with all the support concepts of large scale internet companies, such as systems administration, networking, process troubleshooting, and automation.
Key Responsibilities:
- Identify the criticality of incoming issues, prioritize, and troubleshoot appropriately.
- Track issues through multiple ticketing systems and triage to resolution within the defined SLAs
- Aggressively troubleshoot and multitask incidents of varying difficulty and priority with a focus on prioritization of tasks, ensuring that higher priority items are addressed first.
- Develop creative and practical solutions to resolve non-routine problems in your own area of expertise at SRE through analysis of various property and application dependencies.
- Apply broader knowledge of property in remediating complex system issues thus reducing escalation to the development team.
- Provides guidance and technical advice to the Ops team and gets involved as required, to resolve medium and higher severity incidents.
- Trains, guides and delegates work to the Operations team by breaking down information in a systematic and communicable manner from leadership position.
- Work with development teams to harden, enhance, document, and generally improve the operability of our systems.
- Provide a rapid response to escalations, leading to a decrease in response time and the Mean-time-to-resolution (MTTR).
- Utilize monitoring tools to proactively identify issues and trends, while working closely with cross-functional partners to implement solutions.
- Collaborate with service, network, and other operations teams on significant issues.
- Lead by example, deliver results, and eliminate missed opportunities.
- Ideal candidates will possess a broad range of computer science skills. The candidate must be persistent, result oriented, driven, and possess a strong sense of ownership.
- This position requires working night shifts***
Qualifications & Requirements :
- BS in Computer Science, Information Systems, Engineering, or related technical field with 3 years of related working experience (OR) Master's degree in Computer Science, Information Systems, Engineering, or related technical field with 1 year of related working experience.
- Strong analytical/troubleshooting skills with the ability to document/report bugs, test cases, and problem reports.
- Proficient in designing, analyzing and troubleshooting large-scale distributed systems.
- Excellent hands on Linux or Unix or any similar variants; both administration and internals.
- Possess excellent knowledge in OSI stack, TCP/IP networking, DNS, DHCP, SMTP, HTTP, load-balancers and highly available network servers.
- Hands on experience working with config management tools like Ansible, or Chef
- Should be able to interpret system condition by looking at system stats/profiles (e.g. CPU, Memory, Swap, disk capacity).
- Experience programming in at least one of the following languages: C, Java,Shell scripting, Python, Perl or Go
- Have administration background in Openstack, or equivalent virtual machine environment.
- Ability to rapidly learn and assimilate knowledge of complex software and systems, and apply understanding of system architecture when planning operational tasks and strategy.
Yahoo is proud to be an equal opportunity workplace. All qualified applicants will receive consideration for employment without regard to, and will not be discriminated against based on age, race, gender, color, religion, national origin, sexual orientation, gender identity, veteran status, disability or any other protected category. Yahoo is dedicated to providing an accessible environment for all candidates during the application process and for employees during their employment. If you need accessibility assistance and/or a reasonable accommodation due to a disability, please submit a request via the Accommodation Request Form ( www.yahooinc.com/careers/contact-us.html ) or call 408-336-1409. Requests and calls received for non-disability related issues, such as following up on an application, will not receive a response.
At Yahoo, we know that diversity makes us stronger. We are committed to a collaborative, inclusive environment that encourages authenticity and fosters a sense of belonging. We strive for everyone to feel valued, connected, and empowered to reach their potential and contribute their best. Check out our diversity and inclusion ( www.yahooinc.com/diversity/ ) page to learn more.
US Only: Please be aware that Yahoo requires all employees entering a U.S. Yahoo office and/or attending a company event (including client events) are required to be vaccinated for COVID-19. This position will require the successful candidate to obtain and show proof of a vaccination to enter a U.S. Yahoo office and/or attending a company event (including client events). Yahoo is an equal opportunity employer, and will provide reasonable accommodation to those individuals who are unable to be vaccinated consistent with federal, state, and local law.
If hired for this position in Colorado, the compensation range for this position is between $84,000.00 - $140,000.00. The compensation may vary depending on your location, skills and experience. The compensation package may also include additional incentive compensation opportunities in the form of discretionary annual bonus or commissions, plus equity incentives. Yahoo provides industry-leading benefits including healthcare, retirement, company holidays, vacation, sick time, parental leave and an employee assistance program. This information is provided per the Colorado Equal Pay Act.
Currently work for Yahoo? Please apply on our internal career site.
Date Posted
10/12/2022
Views
3
Similar Jobs
Software Engineer Networking Software and Services - xAI
Views in the last 30 days - 0
The text describes xAIs mission to develop AI systems for understanding the universe and advancing human knowledge It outlines a role involving networ...
View DetailsAssociate Technical Support Engineer - Recharge
Views in the last 30 days - 0
Recharge is a subscription platform for innovative brands offering customer retention solutions They seek Technical Support roles with 247 coverage em...
View DetailsFull Stack Product Engineer - Jiga
Views in the last 30 days - 0
Jiga is a remotefriendly company focused on empowering engineers with trust autonomy and flexibility They emphasize simplicity ownership and impactful...
View DetailsSenior Design Manager (Infrastructure) - Canonical
Views in the last 30 days - 0
Canonical a leading opensource provider seeks a Senior Design Manager to drive innovation in cloud and AI technologies The role offers remote work glo...
View DetailsSenior Product Designer - Org & Security - Typeform
Views in the last 30 days - 0
This job description outlines a role in developing an intelligent contact management system with AI capabilities The position involves designing user ...
View DetailsExecutive Director Patient Advocacy - Kyverna Therapeutics
Views in the last 30 days - 0
Kyverna Therapeutics is seeking an Executive Director for Patient Advocacy to lead initiatives in autoimmune disease treatment The role involves build...
View Details