Data Scientist

· Remote

Location

Remote

Type

Full Time

Job Description

2 Locations
Hybrid
155K-202K Annually
Mid level
Artificial Intelligence • Big Data • Healthtech • Biotech • Pharmaceutical
Our mission is to bring treatments to patients faster and more efficiently. We're an AI-native drug development company.
The Role
As a Data Scientist, you'll build and maintain portfolio systems, design risk frameworks, run backtesting experiments, and collaborate across teams to enhance portfolio analytics using AI-driven predictions.
Summary Generated by Built In
About Formation Bio

Formation Bio is a tech- and AI-driven pharma company differentiated by radically more efficient drug development.

Advancements in AI and drug discovery are creating more candidate drugs than the industry can progress because of the high cost and time of clinical trials. Recognizing that this development bottleneck may ultimately limit the number of new medicines that can reach patients, Formation Bio, founded in 2016 as TrialSpark Inc., has built technology platforms, processes, and capabilities to accelerate all aspects of drug development and clinical trials. Formation Bio partners, acquires, or in-licenses drugs from pharma companies, research organizations, and biotechs to develop programs past clinical proof of concept and beyond, ultimately helping to bring new medicines to patients. The company is backed by investors across pharma and tech, including a16z, Sequoia, Sanofi, Thrive Capital, John Doerr, Spark Capital, SV Angel Growth, and others.

You can read more at the following links:

  • Our Vision for AI in Pharma
  • Our Current Drug Portfolio
  • Our Technology & Platform

At Formation Bio, our values are the driving force behind our mission to revolutionize the pharma industry. Every team and individual at the company shares these same values, and every team and individual plays a key part in our mission to bring new treatments to patients faster and more efficiently.

About the Position 

As a Data Scientist on the platform prediction team, you'll translate our probability of success predictions into measurable portfolio-level outcomes. You'll architect core systems (order management, execution simulation, portfolio construction, risk monitoring, and performance attribution) that let us rigorously evaluate signals from our AI-driven predictions in public and private equities and our internal portfolio.

This role sits at the intersection of quantitative finance, healthcare data, and AI-driven drug development. If you're excited about applying portfolio construction and risk management fundamentals to one of the most consequential prediction problems in healthcare, this is the role.

No other company, hedge fund or pharma, has a technical data science position translating drug development experience into durable AI-native portfolio strategies. The skills you develop here (portfolio construction over assets with radically asymmetric risk profiles, clinical trial analytics, AI/ML in production, and risk management across multi-year horizons) can directly impact the delivery of new and effective therapeutics to patients by best aligning impactful medicines with economic incentives.

Responsibilities

  • Work with the team to implement and maintain the core portfolio engine: order management system, execution simulation layer, portfolio construction service, and performance tracking
  • Design risk frameworks that quantify exposure across a portfolio of drug development bets with radically different risk profiles, timelines, and failure modes
  • Run rigorous backtesting experiments with strict temporal constraints to evaluate Formation strategies against baseline approaches and measure marginal signal from new evidence sources
  • Coordinate across the organization to integrate internal Formation data sources (clinical trial data, genomic evidence, real-world data) and proprietary tooling into portfolio analytics pipelines
  • Work with product and engineering teams to build dashboards and reporting that communicate portfolio performance, risk metrics, and strategy comparisons to both technical and executive stakeholders
  • Collaborate with the broader data science team to ensure portfolio-level evaluation feeds back into model improvement and evidence prioritization

About You 

Required Qualifications

  • MS or PhD in a quantitative field (statistics, finance, physics, computational science, engineering, or related)
  • 1-3 years in a quantitative research, data science, or analytics role; finance, healthcare, academic research, or consulting all count, and substantive internships qualify
  • Strong Python programming skills with experience in data-intensive workflows (pandas, numpy, scipy)
  • Solid grasp of core portfolio construction and risk concepts: position sizing, rebalancing, Sharpe ratio, drawdown, volatility, benchmark comparison
  • Demonstrated ability to work with messy real-world datasets; comfortable with data wrangling, deduplication, and quality assessment
  • Clear communicator who can present quantitative results to both technical peers and business stakeholders

Preferred Qualifications

  • Experience with backtesting frameworks or portfolio simulation (vectorbt, Backtrader, or custom implementations)
  • Exposure to healthcare, pharma, or biotech data (clinical trials, claims data, -omics, real-world evidence)
  • Familiarity with alternative data in a research or investment context
  • Experience with probability-of-success modeling, drug development decision analysis, or health economics
  • Comfort with LLMs or AI/ML pipelines in a production or research setting
  • Familiarity with dashboard/visualization tools (Streamlit, Plotly, Dash) and pipeline orchestration (Dagster, Airflow)

Domain knowledge in healthcare or finance is valued; you do not need both.

Total Compensation Range: $154,500 - $202,000

Compensation

Individual compensation is determined by several factors, including role scope, geographic location, and skills & experience. Your offer will reflect where you fall within the range based on these considerations. In addition to base salary, we offer equity, comprehensive benefits, and generous perks. If the posted range doesn't match your expectations, we still encourage you to apply!

Where We Hire

Formation Bio is prioritizing hiring in key hubs, primarily the New York City and Boston metro areas, with a hybrid model requiring 1-3 days per week in office. Applicants from the Research Triangle (NC) and San Francisco Bay Area may also be considered. Please apply only if you reside in these locations or are willing to relocate.

Equal Opportunity

Formation Bio is committed to building a diverse and inclusive team. We are an equal opportunity employer and welcome candidates from all backgrounds. All qualified applicants will receive consideration for employment without regard to race, color, creed, religion, national origin, ancestry, sex (including pregnancy, childbirth, breastfeeding, and related medical conditions), gender identity or expression, sexual orientation, age, disability, genetic information, marital status, military or veteran status, or any other characteristic protected by federal, state, or local law.

Top Skills

Airflow
Backtrader
Dagster
Dash
NumPy
Pandas
Plotly
Python
SciPy
Streamlit
Vectorbt


The Company
HQ: New York, NY
140 Employees
Year Founded: 2014

What We Do

Formation Bio is a tech-driven pharma company differentiated by radically more efficient drug development. Formation Bio has built a technology platform that optimizes all aspects of drug development, enabling more efficient trial design, faster trial completion, and higher quality trial data capture. Formation Bio acquires clinical-stage drugs from pharma and biotech and develops them faster and more efficiently, unlocking greater value per program and accelerating access to new treatments for patients. Join our culture of innovation, where your work directly contributes to transforming patient care in areas such as rheumatology, dermatology, CNS, and cardiometabolic diseases. Our dynamic environment blends advanced technology with strategic drug development, speeding up the delivery of new treatments. Here, every role plays a part in our mission to bring new treatments to patients faster and more efficiently.

Why Work With Us

Our mission is our roadmap and north star. We are an impact-driven culture that hires for intelligence and low egos. We believe that the best employees are smart and ambitious but also demonstrate humility and curiosity.

Formation Bio Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Typical time on-site: Not Specified
HQ: New York, NY
Our office is located in Manhattan near the Empire State Building. The area is lively and has great food and transportation options!

Date Posted

04/10/2026
