Research Lead / Principal Scientist & Manager Post-Training · Alignment · Reinforcement Learning Autodesk AI Lab at Jobgether - Canada

Team: Product

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Research Lead / Principal Scientist & Manager – Post-Training · Alignment · Reinforcement Learning in Canada.

This role sits at the forefront of frontier AI research, focusing on transforming foundation models into reliable, aligned, and domain-ready systems. You will lead a growing team of AI scientists while remaining deeply hands-on in research, shaping post-training strategies that include reinforcement learning, preference optimization, and agentic reasoning systems. Operating within a highly advanced AI research environment, you will influence both long-term research direction and real-world product impact across industries such as architecture, engineering, manufacturing, and media. The position blends scientific leadership with execution, requiring strong judgment in model behavior, evaluation design, and alignment trade-offs. You will collaborate closely with infrastructure, product, and research teams to ensure scalable and reproducible training workflows. This is a high-impact leadership role where your work directly contributes to advancing trustworthy AI systems used in real-world professional workflows.

Accountabilities

You will define and lead the post-training and alignment research strategy, overseeing how foundation models are refined into robust, safe, and high-performing systems. You will guide both technical direction and team execution while staying actively engaged in experimentation and algorithm development.

Lead post-training strategy across RLHF, preference optimization, and reinforcement learning for complex reasoning systems
Develop novel algorithms to improve model alignment, controllability, reliability, and domain-specific performance
Design and execute experiments to evaluate model behavior, robustness, reasoning quality, and safety
Establish evaluation frameworks for long-horizon reasoning, agentic behavior, and real-world workflow completion
Define model readiness criteria and provide go/no-go recommendations for deployment
Manage, mentor, and grow a team of AI researchers while fostering a high-rigor scientific culture
Collaborate with infrastructure and product teams to build scalable and reproducible training systems
Contribute to publications, patents, and external research visibility in top-tier ML venues
Translate technical findings into clear guidance for leadership and cross-functional stakeholders

Requirements

This role requires deep expertise in reinforcement learning and foundation model post-training, combined with proven research leadership experience. You should bring strong intuition for model behavior, alignment challenges, and large-scale AI system trade-offs.

Extensive hands-on experience with reinforcement learning and post-training methods (RLHF, RLAIF, PPO, DPO, or similar)
Proven experience leading or mentoring AI research teams in industry or academic settings
Strong understanding of alignment challenges, model evaluation, and reasoning systems
Experience designing rigorous evaluation frameworks for AI model performance and readiness
Ability to communicate complex technical concepts and trade-offs to diverse audiences
Background in ML, AI, or RL research, typically supported by a PhD or equivalent industry research experience
Preferred experience in frontier AI labs, agentic AI, or alignment research
Familiarity with large-scale training infrastructure and production AI systems is an asset
Strong publication record in top ML venues is highly valued

Benefits

Competitive compensation package including base salary, bonus, and equity components
Comprehensive health, dental, and vision coverage
Inclusive and collaborative research environment focused on real-world impact
Flexible work arrangements, including remote options across North America and Europe
Strong emphasis on research freedom, publication, and external visibility
Professional development opportunities and access to cutting-edge AI infrastructure
Paid time off, wellness programs, and employee support initiatives
Opportunity to influence frontier AI systems used in high-impact industrial domains

Research Lead / Principal Scientist & Manager Post-Training · Alignment · Reinforcement Learning Autodesk AI Lab

Company

Location

Type

Job Description

Accountabilities

Requirements

Benefits

Explore More

Date Posted

Views

Similar Jobs

Team Lead Data Engineer - Jobgether

Staff Data Scientist - Risk - Jobgether

Senior Technical Lead - Jobgether

Senior Program Manager - Enterprise Change Management - Jobgether

VP of Engineering - Jobgether

VP Enterprise Sales (m/f/d) - Jobgether