Sr. Manager, System Reliability Engineering - TrainingPeaks

Peaksware · Greater Boulder Area

Company

Peaksware

Location

Greater Boulder Area

Type

Full Time

Job Description

Are you ready to work on a product impacting millions of people? At TrainingPeaks our user base of athletes and coaches is growing rapidly. To meet their demands TrainingPeaks needs innovators, collaborators, and excellent engineers like you. Together we’re building the world’s best training platform. Join TrainingPeaks today.

You may know us as TrainingPeaks, MakeMusic, TrainHeroic and Alfred Music. All these brands are under the Peaksware umbrella. TrainingPeaks develops software for coaches and athletes to track, analyze and plan endurance training. TrainHeroic develops software solutions for the strength and conditioning needs of coaches and athletes. MakeMusic develops software to transform how music is composed, taught, learned and performed. Alfred Music creates and publishes educational music to help teachers, students, professionals and hobbyists experience the joy of making music.

We would love to have you join our ever-growing team! All applicants will receive equal consideration for employment regardless of gender, race, national origin, age, sexual orientation, gender identity, physical disability, religion, or length of time spent unemployed.

General Summary

As Sr. Manager, System Reliability Engineering, you are a proven leader in transforming business operations in a cloud-based environment. You and the team you lead will be crucial in ensuring the reliability, availability, and performance of our software applications and infrastructure. Your technical experience in System Reliability Engineering will be instrumental in setting and maintaining high standards for system availability, performance, and incident response. In this role, you will think beyond the scope of your team and look for opportunities to improve the organization at large, collaborating across multiple teams and leading the execution of those cross-team initiatives.

You are a continuous learner with a hunger for knowledge. You approach challenges as opportunities to improve. You value team members’ input from all levels, and you actively seek ways to support your colleagues.

You will sit directly with the System Reliability Engineering team, collaborate closely with Product and Engineering teams, and report to the Vice President, Operations. 

Core Functions

  • Team Leadership: Lead, mentor, and manage a team of Site Reliability Engineers, fostering a culture of collaboration, innovation, and excellence.
  • Collaboration: Foster strong relationships with software development, operations, and product teams to ensure a unified approach to system reliability.
  • Strategy and Planning: Develop and implement SRE strategies, goals, and objectives to achieve high system reliability and availability. Collaborate with cross-functional teams to align SRE objectives with business goals.
  • Incident Management & On-Call Rotation: Oversee incident response and resolution processes, ensuring a swift and effective response to system issues. Continuously improve incident management practices. Develop and manage an on-call rotation schedule for SREs and senior Developers, providing 24/7 support for critical systems.
  • Monitoring and Alerting: Establish and maintain robust monitoring and alerting systems to proactively identify and address system performance issues. Drive automation of monitoring and alerting processes.
  • Security and Compliance: Ensure security and compliance of the cloud-based applications by implementing and maintaining robust security measures, access controls, and monitoring systems to protect sensitive data and infrastructure from threats and vulnerabilities.
  • Capacity Planning: Work closely with the SRE and development teams to plan and manage system capacity, ensuring systems can handle current and future workloads.
  • Reliability Engineering: Drive the implementation of best practices for reliability engineering, including chaos engineering, fault tolerance, and system resilience.
  • Performance Optimization: Collaborate with engineering teams to optimize application performance and improve system efficiency.

The work characteristics described here are representative of those an employee encounters while performing the essential functions of this job. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.

Requirements

  • Experience managing a collaborative team of System Reliability Engineers.
  • Proven software engineering experience.
  • Prior experience working in a DevOps or SRE environment and continuous integration/deployment pipelines.
  • Proven experience working in a Cloud environment, ideally deep experience with AWS cloud services.
  • Experience with configuration management platforms such as AWS CloudFormation, and CI/CD pipelines and tools such as AWS CodeBuild/CodeDeploy.
  • Experience with scripting or programming languages (Bash, Python, JavaScript, etc.).
  • Ability to think critically and make decisions at the system level.
  • Experience with relational databases.
  • Experience with monitoring tools (Splunk, ELK, Application monitoring, etc.).
  • Solid understanding of networking and security best practices.
  • Experience with Docker and Container Orchestration technologies.
  • Understanding and implementation of microservices architecture and general software architecture design principles, Clean Architecture concepts, DDD, etc.
  • Impeccable written and verbal communication skills. 
  • Ability to thrive in a collaborative environment.
  • Prior experience taking initiative. A driven professional who makes regular, measurable progress.
  • A willingness to routinely share your experience and knowledge with others.
  • A strong desire to deliver your very best work daily, and an expectation of others to do the same.

Degrees are not required and we value all forms of continued education including traditional four-year degrees, post-graduate degrees, associate degrees, bootcamps, online training, professional certifications, self-teaching, and more.

Don’t meet every single requirement? Don’t worry. We still want to hear from you and encourage you to apply.

Benefits

Compensation

Peaksware/TrainingPeaks is committed to fair and equitable compensation practices. The salary range for this role is $120,489 - $200,815. Final compensation for this role will be determined by various factors such as a candidate’s relevant work experience, skills, and certifications.

This role is eligible for variable compensation including bonus.

Benefits and Perks

Health

  • 100% company-paid Medical for employees with buy-up options
  • Dental
  • Vision
  • Health Savings Account
  • Flexible Spending Account
  • Dependent Care Flexible Spending Account
  • Paid Parental Leave
  • Teladoc
  • Employee Assistance Program (EAP)
  • Additional coverage options such as accident and critical illness insurance and hospital indemnity

Disability and Life

  • Company-paid Short Term Disability
  • Company-paid Long Term Disability
  • Company-paid Basic Life Insurance and AD&D
  • Employee-paid Supplemental Life Insurance for Employee, Spouse, and/or Child

Additional

  • 401(K)
  • 401(K) Matching
  • Pet Insurance
  • 9 paid holidays annually and unlimited Flexible Time Off (FTO)
  • Free TrainingPeaks, TrainHeroic, MakeMusic accounts, and Alfred Music product
  • Access to the Performance and Recovery Center (PARC), our on-site fitness facility
  • Employee only access to on-site locker rooms and showers
  • Employee only access to secure, indoor bike storage
  • Access to our onsite Music Studio
  • An assortment of “grab’n go” fruit and snacks as well as on tap cold brew, kombucha, and beer.
  • Beautiful onsite cafe that includes indoor and outdoor seating and lounge areas.
  • Access to e-bikes available exclusively to Peaksware employees
  • Significant investment in resources for employee growth and development
  • Corporate discounts on select gym memberships and top brand gear
  • Flexible work schedule in a culture of trust

Please contact [email protected] if you require a reasonable accommodation to review our website or to apply online.

Work Environment

This job operates in a professional office environment that is well-lighted, heated, and/or air-conditioned with adequate ventilation and a noise level that is usually moderate. This role routinely uses standard office equipment such as computers, phones, photocopiers and filing cabinets.

All employees must comply with all safety policies, practices and procedures. Report all unsafe activities to your manager and/or Human Resources.

Physical Demands

While performing the duties of this job, the employee is regularly required to sit and move about the facility; use hands to handle, or feel; talk by expressing ideas by means of the spoken word; and hear by perceiving the nature of sounds. The employee is occasionally required to stand, walk, and reach with hands and arms. The employee must occasionally lift and/or move up to 10 pounds. Specific vision abilities required by this job include close vision, distance vision, color vision, peripheral vision, depth perception, and ability to adjust focus.

To view the Peaksware Privacy Policy, click here. By submitting an application, you acknowledge and agree to the Peaksware Privacy Policy.

Apply Now

Date Posted

11/06/2023

Views

1

Back to Job Listings Add To Job List Company Profile View Company Reviews
Positive
Subjectivity Score: 0.9

Similar Jobs

Laser Engineer - Atom Computing

Views in the last 30 days - 0

Atom Computing is hiring a Laser Engineer to manage and scale up laser systems for quantum computers The ideal candidate should have a PhD in Physics ...

View Details

Growth Marketing Specialist - B2B - MakeMusic - Peaksware

Views in the last 30 days - 0

The Growth Marketing Specialist role at Peaksware which includes brands like TrainingPeaks MakeMusic TrainHeroic and Alfred Music is a key position in...

View Details

Business Development Representative - MakeMusic - Peaksware

Views in the last 30 days - 0

Peaksware a company that includes brands like TrainingPeaks MakeMusic TrainHeroic and Alfred Music is seeking a Business Development Representative Th...

View Details

Recruiter - Peaksware - Peaksware

Views in the last 30 days - 0

Peaksware which includes brands like TrainingPeaks MakeMusic TrainHeroic and Alfred Music is seeking a Recruiter for a hybrid role The ideal candidate...

View Details

Growth Marketing Specialist - B2C - MakeMusic - Peaksware

Views in the last 30 days - 0

The Growth Marketing Specialist position at Peaksware which includes brands like TrainingPeaks MakeMusic TrainHeroic and Alfred Music is a key role in...

View Details

Customer & Product Support Specialist - Circadence Corporation

Views in the last 30 days - 0

Circadence an awardwinning USowned cybersecurity training and assessment platforms company is seeking a detailoriented and resourceful Customer Suppor...

View Details