Principal Site Reliability Engineer - Remote
Job Description
Donnelley Financial Solutions (DFIN) is a leader in risk and compliance solutions, providing insightful technology, industry expertise and data insights to clients across the globe. We're here to help you make smarter decisions with insightful technology, industry expertise and data insights at every stage of your business and investment lifecycles. As markets fluctuate, regulations evolve and technology advances, we're there. And through it all, we deliver confidence with the right solutions in moments that matter.
The Principal Site Reliability Engineer is responsible for ensuring our SaaS products are fast, stable and optimized for our customers. SREs at DFIN take on availability, performance, change management, monitoring and response, and serve as guardians of non-functional requirements.
You have either an infrastructure background with a programmatic, automated mindset or a software engineering background with infrastructure experience. The SRE goal is to build automated systems that reduce or eliminate the manual work needed to keep our products up, running and performing optimally. We are looking for someone who thrives on collaboration within the team and across other groups and who can operate independently to deliver solutions.
- Champion and implement a culture of SRE to maintain a high-quality platform infrastructure
- Champion and implement application and infrastructure monitoring and alerting to prevent client-impacting issues, ensuring system availability, performance and scalability in line with SLOs and SLAs
- Optimize application performance at scale
- Automate everything including system operational runbooks
- Define and support continuous integration and deployment pipelines (CI/CD) aligned to branching and quality assurance strategies
- Dive deep into technology and stay on the forefront of the latest tools, technologies, and strategies; help evaluate, prototype, and integrate them into work processes
- Perform with broad independence and deliver on project milestones and tasks on schedule while communicating progress regularly
- Build strong relationships with SRE team members and software engineering teams to hold each other accountable for quality expectations
- Learn continuously and apply lessons learned
- Evangelize best practices, eliminate bottlenecks, and improve processes
- BS in Computer Science or equivalent work experience.
- 10+ years demonstrating hands-on technical leadership and business impact in combining software skills with systems to solve complex automation and reliability challenges
- 5+ years working with various cloud providers, containerization technologies, automated deployment frameworks, orchestration frameworks, monitoring, logging, alerting, system internals, networking, databases, distributed systems, and service-oriented architecture
- 5+ years of experience supporting public, client-facing, revenue-generating systems
- 5+ years instrumenting Application Performance Monitoring (APM) using a tool such as New Relic (preferred), DataDog, AppDynamics, etc.
- 3+ years of experience writing software in a modern language or framework such as C#/.NET, Java, JavaScript, Node.js, or React
- 3+ years of experience creating automation with tools such as Azure DevOps, Ansible, Terraform, PowerShell, Python/Bash to build and deploy in a continuous integration (CI) environment and to manage infrastructure as code
- You have a proven track record of implementing load, stress, performance and reliability testing standards at scale to improve service, platform and infrastructure resiliency
- You have experience leading efforts to secure systems in 24x7 production environments
It is the policy of Donnelley Financial Solutions to select, place and manage all its employees without discrimination based on race, color, national origin, gender, age, religion, actual or perceived disability, veteran's status, actual or perceived sexual orientation, genetic information or any other protected status.
If you are a qualified individual with a disability or a disabled veteran, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or access jobs.dfinsolutions.com as a result of your disability. You can request a reasonable accommodation by sending an email to [email protected].
Date Posted
05/31/2023