Research Lead / Principal Scientist & Manager Post-Training · Alignment · Reinforcement Learning Autodesk AI Lab

Jobgether · Canada

Company

Jobgether

Location

Canada

Type

Full Time

Job Description

Team: Product

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Research Lead / Principal Scientist & Manager – Post-Training · Alignment · Reinforcement Learning in Canada.

This role sits at the forefront of frontier AI research, focusing on transforming foundation models into reliable, aligned, and domain-ready systems. You will lead a growing team of AI scientists while remaining deeply hands-on in research, shaping post-training strategies that include reinforcement learning, preference optimization, and agentic reasoning systems. Operating within a highly advanced AI research environment, you will influence both long-term research direction and real-world product impact across industries such as architecture, engineering, manufacturing, and media. The position blends scientific leadership with execution, requiring strong judgment in model behavior, evaluation design, and alignment trade-offs. You will collaborate closely with infrastructure, product, and research teams to ensure scalable and reproducible training workflows. This is a high-impact leadership role where your work directly contributes to advancing trustworthy AI systems used in real-world professional workflows.

Accountabilities

You will define and lead the post-training and alignment research strategy, overseeing how foundation models are refined into robust, safe, and high-performing systems. You will guide both technical direction and team execution while staying actively engaged in experimentation and algorithm development.

  • Lead post-training strategy across RLHF, preference optimization, and reinforcement learning for complex reasoning systems
  • Develop novel algorithms to improve model alignment, controllability, reliability, and domain-specific performance
  • Design and execute experiments to evaluate model behavior, robustness, reasoning quality, and safety
  • Establish evaluation frameworks for long-horizon reasoning, agentic behavior, and real-world workflow completion
  • Define model readiness criteria and provide go/no-go recommendations for deployment
  • Manage, mentor, and grow a team of AI researchers while fostering a high-rigor scientific culture
  • Collaborate with infrastructure and product teams to build scalable and reproducible training systems
  • Contribute to publications, patents, and external research visibility in top-tier ML venues
  • Translate technical findings into clear guidance for leadership and cross-functional stakeholders
  • Requirements

    This role requires deep expertise in reinforcement learning and foundation model post-training, combined with proven research leadership experience. You should bring strong intuition for model behavior, alignment challenges, and large-scale AI system trade-offs.

    • Extensive hands-on experience with reinforcement learning and post-training methods (RLHF, RLAIF, PPO, DPO, or similar)
    • Proven experience leading or mentoring AI research teams in industry or academic settings
    • Strong understanding of alignment challenges, model evaluation, and reasoning systems
    • Experience designing rigorous evaluation frameworks for AI model performance and readiness
    • Ability to communicate complex technical concepts and trade-offs to diverse audiences
    • Background in ML, AI, or RL research, typically supported by a PhD or equivalent industry research experience
    • Preferred experience in frontier AI labs, agentic AI, or alignment research
    • Familiarity with large-scale training infrastructure and production AI systems is an asset
    • Strong publication record in top ML venues is highly valued
    • Benefits

      • Competitive compensation package including base salary, bonus, and equity components
      • Comprehensive health, dental, and vision coverage
      • Inclusive and collaborative research environment focused on real-world impact
      • Flexible work arrangements, including remote options across North America and Europe
      • Strong emphasis on research freedom, publication, and external visibility
      • Professional development opportunities and access to cutting-edge AI infrastructure
      • Paid time off, wellness programs, and employee support initiatives
      • Opportunity to influence frontier AI systems used in high-impact industrial domains
Apply Now

Date Posted

05/13/2026

Views

0

Back to Job Listings Add To Job List Company Profile View Company Reviews
Neutral
Subjectivity Score: 0
142,000+ Jobs Tracked
12,400+ Companies
1,930 Categories