Job Description

ESSENTIAL DUTIES AND RESPONSIBILITIES
- Collaborate with development and other SRE teams to enhance the reliability and efficiency of microservices applications.
- Engage with product development (PD) teams by participating in design reviews and production readiness checks.
- Collaborate with engineering teams, providing product feedback and, where necessary, contributing code to the product
- Work closely with cross-functional teams to ensure seamless integration of new features and services. https://aws.amazon.com/blogs/apn/the-6-pillars-of-the-aws-well-architected-framework/
- Analyze data from observability and monitoring tools to improve operational metrics of microservices as well as the entire platform.
- Leverage end-to-end technical expertise, gained from engaging with multiple PD teams and analyzing observability data, to propose code and design changes that improve SLOs and prevent incidents.
- Create system documentation and training materials to empower and educate our fellow team members
- Take a purist SRE approach to shared multi-tenant infrastructure for resilient, containerized, microservice-based SaaS systems, in addition to customer-centric application environments
- Oversee and automate the team’s growing presence in AWS
- Creatively build and develop tooling to aid in driving 24x7x365 follow-the-sun operations of critical production systems
- Build and maintain observability tooling, metrics, and dashboarding for a global platform product infrastructure
- Improve our incident management lifecycle to identify, mitigate, and learn from reliability risks and issues
Education and Work Experience
- Bachelor’s Degree in Computer Science or related field
- Software engineering and task automation skills with Bash, Python, and/or Go are a must
- Experience supporting web applications running on Java / Apache / Tomcat in a live production environment
- Familiarity with the Agile software development lifecycle
- Deep background with Linux systems and engineering
- Highly experienced with engineering and automating on Amazon Web Services (AWS)
- Prior experience with IaC tools like Terraform/Terragrunt/Terraspace
- Prior experience with DevOps/GitOps tools (Git, Bitbucket, Flux CD, TeamCity) for gate promotions
- Production-at-scale support background in a heavily microservice-based world
- Hands-on engineering and ops expertise in containerization (Docker, Helm, Kubernetes/EKS, CNI and Ingress networking)
- Strong understanding of Single Sign-On, SAML, OAuth (bonus if hands-on experience with Okta)
- Seasoned expertise around X.509 certificate technology and basic concepts of encryption
- Experience working with relational databases such as Aurora Postgres and/or Oracle RDS
- Advanced exposure to application development, web UI (design and development), JSON, application architecture
- Strong experience using observability tools (logging/APM) like Datadog, CloudWatch, and PagerDuty.
- Familiarity with event store/stream-processing technologies like Kafka or AWS SQS
- Understanding of Open Application Model systems such as KubeVela or Crossplane
Personal Qualities and Soft Skills
- You greatly prefer writing code to clicking through a GUI.
- You enjoy teaching, being a mentor to others, and working across boundaries
- Outstanding troubleshooting skills; ability to think critically and display an aptitude for problem solving
- Strong analytical mind with a penchant for process development and enhancement
- A highly positive can-do attitude and a desire to be a team player
- Great communication skills and ability to explain complex technical concepts to a varied audience
- Demonstrate strong follow-through and a strong work ethic, and consistently keep and meet commitments
Other Requirements
- Ability to read, write, and speak English
- Ability to speak in public settings and to interface confidently with customers, partners, and vendors
- Travel – Up to 25% of the job will require travel, approximately a week a month
Date Posted
11/08/2024