(USA) Director, Software Engineering - Site Reliability Engineering
Job Description
What you'll do...
Job Summary
Walmart Global Tech's Site Reliability Engineering team is built with hybrid systems and software engineers who are responsible and take ownership for reliability, scalability, automation, and mission-critical issues related to uptime, availability and fast rate of improvement of Walmart's e-commerce, stores, and omni-channel platform. Our goal is to build, scale and guard the systems that delights the customers with watchful eye on capacity and performance.
As part of Reliability Engineering & Operations, you'll help to define and execute a unified, reliable, operationally robust set of processes and tools for Walmart Technology & its customers across all channels and geographies.
What you'll do
You will be responsible as a Director in Reliability Engineering and Operations team to ensure that critical parts of Walmart's business are prepared for known events and to address any contingency. You'll have opportunity to manage the complex challenges of micro service and scale which are unique to Walmart's e-commerce, stores, and omni-channel platform, while using your expertise in coding, algorithms, complex triaging and analysis, and large-scale system design. You'll excel if you have enthusiasm to dig deep and a flare for sharp technical communication, prioritization for uptime/availability and organization. To do so, you will need strong skills in following areas:
- Design, write and build tools to improve the reliability, latency, availability, and scalability of Walmart Tech stack including: 1) Engender reliability and availability starting with metrics and measurements. 2) Enable scaling by providing tools, developing training and/or augmenting processes . 3) Build tools/automate to prevent re-occurrence of problem to mission critical products/services. 4) Augment existing instrumentation to build a cohesive picture of the characteristics of our systems with special attention to points of failure.
- Drive team to build and scale fault-tolerant system and services in our hybrid cloud infrastructure.
- Partner with leadership across organization to establish strategic plans and objectives to improve the mean time to detect and mean time to restore.
- Collaborate with Service owners to define the SLOs and build SLIs to ensure systems are meeting the SLAs
What you'll bring
- A recognized Bachelor's / Master's degree in Engineering with 15+ years of experience in Site Reliability Engineering which includes the Service Management (Incident Problem & Change Management), Performance and Capacity Engineering and 5+ years of experience in managing, leading and developing Site Reliability Engineering focused teams with indirect reports around 15.
- Experience in Retail or Site facing transactional web services, and or, internet-based services environment would be added advantage.
- Experience in running Site Reliability or DevOps team with KPI targets on MTTD, MTTR, and availability would be an added advantage.
- You are a highly resilient and responsive team player with high degree of cross functional teaming and are someone who thrives in a fast-paced, dynamic, startup-like environment to drive business results and passionate about Customer delight with every transaction
- You have a great sense of urgency, is a great communicator (written and verbal), active listener and has experience working in a highly matrixed environment with a global footprint and should possess high degree of Emotional Quotient to actively listen and respond/mobilize effective plans directed at our Associates.
#LI-PL1
At Walmart, we offer competitive pay as well as performance-based incentive awards and other great benefits for a happier mind, body, and wallet. Health benefits include medical, vision and dental coverage. Financial benefits include 401(k), stock purchase and company-paid life insurance. Paid time off benefits include PTO (including sick leave), parental leave, family care leave, bereavement, jury duty, and voting. Other benefits include short-term and long-term disability, company discounts, Military Leave Pay, adoption and surrogacy expense reimbursement, and more.
You will also receive PTO and/or PPTO that can be used for vacation, sick leave, holidays, or other purposes. The amount you receive depends on your job classification and length of employment. It will meet or exceed the requirements of paid sick leave laws, where applicable. For information about PTO, see https://one.walmart.com/notices .
Live Better U is a Walmart-paid education benefit program for full-time and part-time associates in Walmart and Sam's Club facilities. Programs range from high school completion to bachelor's degrees, including English Language Learning and short-form certificates. Tuition, books, and fees are completely paid for by Walmart.
Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to a specific plan or program terms. For information about benefits and eligibility, see One.Walmart at https://bit.ly/3iOOb1J .
The annual salary range for this position is $192,000.00-$288,000.00
Additional compensation includes annual or quarterly performance incentives.
Additional compensation for certain positions may also include:
- Regional Pay Zone (RPZ) (based on location)
- Stock equity incentives
Minimum Qualifications...
Outlined below are the required minimum qualifications for this position. If none are listed, there are no minimum qualifications.
Option 1: Bachelor's degree in computer science, computer engineering, computer information systems, software engineering, or related area and6 years' experience in software engineering or related area.
Option 2: 8 years' experience in software engineering or related area.
3 years' supervisory experience.
Preferred Qualifications...
Outlined below are the optional preferred qualifications for this position. If none are listed, there are no preferred qualifications.
Master's degree in computer science, computer engineering, computer information systems, software engineering, or related area and 4 years' experience in software engineering or related area
Primary Location...
640 W California Avenue, Sunnyvale, CA 94086-4828, United States of America
Explore More
Date Posted
01/28/2023
Views
10
Similar Jobs
Technologist, System Design Engineering - Western Digital
Views in the last 30 days - 0
Western Digital is seeking a Technologist with expertise in SSD design hardware design Product Management Memory Systems and system architecture to le...
View DetailsStaff Engineer, System Design Verification Engineering - Western Digital
Views in the last 30 days - 0
Western Digital is seeking a validation engineer to define and track test plans characterize and optimize SSDs and lead bug review meetings The ideal ...
View DetailsSenior Front-End Software Engineer - Percipient.ai
Views in the last 30 days - 0
Percipientai founded in 2017 is a cuttingedge technology company specializing in Computer Vision Artificial Intelligence and Deep Learning They develo...
View DetailsPrincipal Software Engineer (Prisma Access) - Palo Alto Networks
Views in the last 30 days - 0
Palo Alto Networks is a cybersecurity company committed to protecting the digital way of life They are seeking a Principal Software Engineer to build ...
View DetailsPrincipal Engineer Software (Full Stack Developer) - Palo Alto Networks
Views in the last 30 days - 0
Palo Alto Networks is seeking a Senior FullStack Engineer to develop and maintain highperformance web applications collaborating with crossfunctional ...
View DetailsExecutive Assistant - ServiceNow
Views in the last 30 days - 0
ServiceNow a global market leader in AIenhanced technology is seeking a highly organized and experienced executive assistant to support a VP The role ...
View Details