Fateme Nateghi · AI Researcher in Healthcare

About

Much of my current work explores how LLMs and retrieval-augmented generation can support clinical workflows, enhancing specialty consultations, answering patient messages, and post-training models so the responses are accurate, reliable, and clinically useful.

I'm also a Data Science Lead in the ARISE network, advancing open benchmarks and real-world evaluation of clinical AI systems. Before Stanford, I completed my PhD in Biomedical Sciences at KU Leuven, where I focused on semi-supervised learning for time-to-event prediction in partially labeled clinical data.

I'm always happy to talk about clinical AI, research collaborations, or startup ideas. Drop me a line →

Recent Updates

Apr 2026

Contributor to the Stanford HAI AI Index Report 2026, second consecutive year (also 2025).

Feb 2026

Joined STVP as an Emerson Consequential Scholar (Feb–Jun 2026), exploring how research translates into ventures.

2026

Program Committee, ECML PKDD 2026 (2nd consecutive year, also served in 2024).

Dec 2025

Invited talk at Itec (KU Leuven & imec), Toward Trustworthy Clinical AI.

Dec 2025

NOHARM benchmark released, part of the MAST suite. Paper · how it works · leaderboard.

Nov 2025

Aug 2025

Launched an interactive demo for predicting OUD treatment retention with LLM-augmented EHR features ( JAMIA 2025).

Presented SAGE at AMIA 2025 (Atlanta), AI routing of eConsults using LLMs.

Mar 2025

Invited talk at the CTN Quarterly Steering Committee on MOUD retention.

2024

Awarded Stanford Bio-X Interdisciplinary Seed Grant as Lead Postdoc.

Selected Research

Benchmark2026

First, Do NOHARM: Towards Clinically Safe Large Language Models

David Wu*, Fateme Nateghi*, et al. · arXiv 2026 · *Co-first authors

A large-scale benchmark for evaluating the clinical safety of LLMs using real physician-to-physician consultation cases, the foundation of MAST (Medical AI Superintelligence Test).

100 cases + 1,000 perturbed variants · 10 specialties 28 LLMs evaluated 12,747 expert annotations by 29 physicians

Paper → How NOHARM works MAST benchmark suite Live leaderboard

Ongoing

SAGE: Specialist AI for Guiding Experts

Fateme Nateghi, Jonathan H. Chen, et al.

A retrieval-augmented system that helps primary care providers route specialty consultations and surfaces guideline-aligned recommendations during real clinical workflows.

SAGE website →AMIA 2025 talk

JAMIA2025

Predicting Buprenorphine-Naloxone Treatment Retention with LLM-Derived Clinical Features

Fateme Nateghi Haredasht, I. Lopez, S. Tate, et al. · J Am Med Inform Assoc 32(12), 2025 · Co-first authors

Multi-site EHR study (Stanford + NeuroBlu) LLM-extracted features from clinical notes Live demo for clinicians

Combining structured EHR data with LLM-extracted features (chronic pain, liver disease, depression) to predict 6-month retention in buprenorphine-naloxone therapy. Open-sourced an interactive web tool for real-time risk stratification at the bedside.

Live demo → JAMIA 2025 Addiction 2024 GitHub

PhD work

Semi-supervised survival analysis for partially labeled clinical data

KU Leuven · Data-driven Healthcare group

Methods for time-to-event prediction in ICU patients when most labels are censored, improving outcome prediction without requiring fully observed follow-up.

PhD Thesis (PDF) →

View all publications on Google Scholar

Selected Talks

Dec 2025

Toward Trustworthy Clinical AI: Building Safe & Context-Aware Systems for Healthcare

Itec (KU Leuven & imec)

Slides →

Invited

Nov 2025

Right Patient, Right Specialist, Right Time: AI Routing of eConsults Using Language Models

AMIA 2025 Annual Symposium · Atlanta, GA

Slides →

Talk

Mar 2025

Medication for Opioid Use Disorder: Predictability of Retention vs. Attrition

CTN Quarterly Steering Committee Meeting · Virtual

Slides →

Invited

Nov 2024

Enhancing Antibiotic Stewardship: A Machine Learning Approach to Predicting Antibiotic Resistance in Inpatient Care

AMIA 2024 Annual Symposium · San Francisco, CA

Slides →

Talk

Nov 2024

Predicting Treatment Retention in Buprenorphine-Naloxone Therapy Using Multi-Site EHR Data

AMIA 2024 Annual Symposium · San Francisco, CA

Slides →

Talk

Always happy to speak about clinical AI, LLM evaluation, or trustworthy healthcare AI. Invite me to speak →

Leadership & Fellowships

Open Network

Data Science Lead at ARISE

Advancing Healthcare AI through Open Science

Open research network advancing safety, reliability, and real-world evaluation of clinical AI through open benchmarks like NOHARM/MAST.

Fellowship · 2026

Emerson Consequential Scholar

Stanford Technology Ventures Program (STVP)
Feb 2026 – Jun 2026 · Palo Alto, CA

Apprenticeship-style fellowship at Stanford's entrepreneurship engine, exploring how rigorous research translates into ventures that drive consequential, real-world impact.

Grant Lead · 2024

Stanford Bio-X Seed Grant

Lead Postdoctoral Researcher

Lead postdoc on Enhancing Specialty Care with Digital Medical Consultations: A Retrieval-Augmented Language Model Approach. PI: J. H. Chen; Co-PIs: Bernstein, Tibshirani, Goldstein.

Annual Report · 2025 · 2026

Contributor, Stanford HAI AI Index Report

2 consecutive years · Stanford Institute for Human-Centered AI

Contributing to the world's most-cited annual report on AI progress, produced by Stanford HAI.

2026 Report → 2025 Report

Service · 2024 · 2026

Program Committee, ECML PKDD

European Conference on Machine Learning & Principles of KDD

Program Committee member for two editions of ECML PKDD, the premier European venue for ML and data-mining research. Reviewing submissions in clinical ML, survival analysis, and applied AI.

Session Co-organizer · 2025 · 2026 · 2027

Co-organizer, AI & ML in Clinical Medicine at PSB

Pacific Symposium on Biocomputing · 3 consecutive years (2025, 2026, 2027) · Hawai‘i

Co-organizer of the AI & ML in Clinical Medicine session at PSB for three consecutive years. This year's theme (Jan 3–7, 2027): agentic AI, clinical validation, and human-AI collaboration in real-world healthcare, covering agentic AI workflows, prospective clinical evaluation, LLMs in healthcare, multimodal AI, and AI safety & fairness.

Submit a paper

Deadline: Aug 3, 2026

I build clinically reliable AI for healthcare.

About

Recent Updates

Co-organizing the AI & ML in Clinical Medicine session at PSB 2027

Selected Research

First, Do NOHARM: Towards Clinically Safe Large Language Models

SAGE: Specialist AI for Guiding Experts

Predicting Buprenorphine-Naloxone Treatment Retention with LLM-Derived Clinical Features

Semi-supervised survival analysis for partially labeled clinical data

Selected Talks

Toward Trustworthy Clinical AI: Building Safe & Context-Aware Systems for Healthcare

Right Patient, Right Specialist, Right Time: AI Routing of eConsults Using Language Models

Medication for Opioid Use Disorder: Predictability of Retention vs. Attrition

Enhancing Antibiotic Stewardship: A Machine Learning Approach to Predicting Antibiotic Resistance in Inpatient Care

Predicting Treatment Retention in Buprenorphine-Naloxone Therapy Using Multi-Site EHR Data

Leadership & Fellowships

Data Science Lead at ARISE

Emerson Consequential Scholar

Stanford Bio-X Seed Grant

Contributor, Stanford HAI AI Index Report

Program Committee, ECML PKDD

Co-organizer, AI & ML in Clinical Medicine at PSB

Outside the Lab