Selected Publications

You can also find my full list of papers on Google Scholar.


Recent Highlight

First, Do NOHARM: Towards Clinically Safe Large Language Models
David Wu*, Fateme Nateghi*, et al.
arXiv preprint, 2025
*Co-first authors

We introduce NOHARM, a large-scale benchmark for evaluating the clinical safety of large language models using real physician-to-physician consultation cases. NOHARM is the foundation of MAST (Medical AI Superintelligence Test), a new suite of realistic clinical benchmarks for medical AI.

[Live Benchmark] | [Paper]

SAGE – Specialist AI for Guiding Experts
Fateme Nateghi, Jonathan H. Chen, et al.
Ongoing Project
[Live Demo] / [GitHub]

SAGE is a specialist-trained AI system that enhances electronic consultations (eConsults) by:
• Retrieving similar EHR patient cases
• Recommending clinical actions based on real-world data
• Pre-filling templates using LLMs
• Supporting retrieval-augmented generation (RAG)

The goal is to scale access to specialty care and promote guideline-aligned decision-making.

Selected Publications under SAGE

Automated Evaluation of Large Language Model Response Concordance with Human Specialist Responses on Physician-to-Physician eConsult Cases
DJH Wu*, Fateme Nateghi*, D Wu, V Ravi, LG McCoy, Y Weng, K Chopra, JH Chen
medRxiv preprint, 2025
*Equal contribution
[Paper]

Asking the Right Questions: Benchmarking Large Language Models in the Development of Clinical Consultation Templates
LG McCoy*, Fateme Nateghi*, K Chopra, D Wu, DJH Wu, A Conteh, JH Chen
arXiv preprint, 2025
*Equal contribution
[arXiv]


Retrieval-Augmented Guardrails for AI-Drafted Patient-Portal Messages: Error Taxonomy Construction and Large-Scale Evaluation
W. Chen, Fateme Nateghi, K.C. Black, F. Grolleau, E. Alsentzer, J.H. Chen, et al.
arXiv preprint, 2025
[arXiv]

A Multi-Site Machine Learning Model for Predicting Treatment Retention in Opioid Use Disorder
Fateme Nateghi, Sajjad Fouladvand, Steven Tate, Min Min Chan, Joannas Jie Lin Yeow, Kira Griffiths, Ivan Lopez, Jeremiah W. Bertz, Adam Miner, Tina Hernandez-Boussard, Chwen-Yuen Angie Chen, Huiqiong Deng, Keith Humphreys, Anna Lembke, Alexander Vance, Jonathan H. Chen
Addiction, 2024
[Paper] / [GitHub] / [WebApp]

Predicting Outcomes of Acute Kidney Injury Using Machine Learning
Fateme Nateghi, Liesbeth Viaene, Hans Pottel, Wouter De Corte, Celine Vens
Scientific Reports, 2023
[Paper] / [GitHub]

Predicting Survival Outcomes in the Presence of Unlabeled Data
Fateme Nateghi, Celine Vens
Machine Learning, 2022
[Paper] / [GitHub]

Clinical Entity-Augmented Retrieval for Clinical Information Extraction (CLEAR)
I. Lopez, A. Swaminathan, K. Vedula, S. Narayanan, Fateme Nateghi
npj Digital Medicine, 2025
[Paper] / [GitHub]

Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation
I. Lopez, Fateme Nateghi, K. Caoili, J.H. Chen, A. Chaudhari
arXiv preprint, 2025
[arXiv]

MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks
Suhana Bedi, ..., Fateme Nateghi, Jonathan H. Chen, Nigam H. Shah, et al.
arXiv, 2025
[Paper] / [Med-HELM Docs] / [Live Benchmark]

Deconver: A Deconvolutional Network for Medical Image Segmentation
P. Ashtari, S. Noei, Fateme Nateghi, J.H. Chen, G. Jurman, A. Pizurica
arXiv preprint, 2025
[arXiv] / [GitHub]

Quantization-Free Lossy Image Compression Using Integer Matrix Factorization
P. Ashtari, P. Behmandpoor, Fateme Nateghi, J.H. Chen, P. Patrinos
arXiv preprint, 2024
[arXiv] / [GitHub]

Supervised Fuzzy Partitioning
P. Ashtari, Fateme Nateghi, H. Beigy
Pattern Recognition, 2020
[Paper]