Advertisement · 728 × 90

Posts by Emily Alsentzer

Post image

Not too big. Not too small.
#ML4H2025 is just right for deep conversations and building meaningful connections.
Come get your healthy dose of ML research and vitamin D in sunny San Diego!
Register today: ahli.cc/ml4h-register/
It might be your favorite conference of the year!

5 months ago 3 2 0 1
Preview
Reflections from Research Roundtables at the Conference on Health, Inference, and Learning (CHIL) 2025 The 6th Annual Conference on Health, Inference, and Learning (CHIL 2025), hosted by the Association for Health Learning and Inference (AHLI), was held in person on June 25-27, 2025, at the University…

New pre-print from @emilyalsentzer.bsky.social et. al., CHIL 2025 convened experts to discuss critical themes in healthcare AI, including user-centric explainability, bias mitigation, domain adaptation, foundation models, and learning from small medical datasets.
#MedSky #MLSky #MedAI

6 months ago 2 2 0 0
Page 1 of the Perspective "Redefining Bias Audits for Generative AI in Health Care" 

Read full article at ai.nejm.org.

Page 1 of the Perspective "Redefining Bias Audits for Generative AI in Health Care" Read full article at ai.nejm.org.

Perspective by Irene Y. Chen, PhD, and Emily Alsentzer, PhD: Redefining Bias Audits for Generative AI in Health Care nejm.ai/4mhOFte

@irenetrampoline.bsky.social @emilyalsentzer.bsky.social #AI #MedSky #MLSky

7 months ago 0 1 0 0
Post image

🚨 Calling all health AI founders & builders!
Join us at Health AI Builders: A CHIL Unconference — June 25th @ UC Berkeley.

💡 Real talk on AI, regulation, GTM, & fundraising
👥 Small-group convos, big impact
🎯 Apply to attend: lu.ma/2arsxv64
#CHIL2025 #HealthAI #ML4H

10 months ago 1 1 0 0
Preview
Red teaming ChatGPT in medicine to yield real-world insights on model behavior - npj Digital Medicine npj Digital Medicine - Red teaming ChatGPT in medicine to yield real-world insights on model behavior

It’s finally out! We brought a multidisciplinary team of physicians, computer scientists, and engineers to red team LLMs for healthcare uses. And we have shared the dataset! www.nature.com/articles/s41...

1 year ago 44 11 2 4

Required reading If you ever use models to brainstorm.

Really nice study design: Domain expert researchers search for plagiarized papers of LLM- systems that generate research plans. Many successfully find papers, verified by authors of those papers.

But very hard to detect, see screenshot.

1 year ago 51 12 1 1
Video

Clinical #AI has come a long way, but are we losing some of the core principles of NLP? Dr. @emilyalsentzer.bsky.social discusses how early rule-based approaches still offer valuable lessons for today’s AI-powered clinical decision support. Full episode: nejm.ai/4gOGeSo

#MedSky #MLSky

1 year ago 2 1 0 0
Advertisement
AI Grand Rounds
Episode 27
From Clinical Notes to GPT-4: Dr. Emily Alsentzer on Natural Language Processing in Medicine

AI Grand Rounds Episode 27 From Clinical Notes to GPT-4: Dr. Emily Alsentzer on Natural Language Processing in Medicine

Dr. @emilyalsentzer.bsky.social, a Stanford faculty member and expert in clinical #AI, discusses the evolution of natural language processing, the challenges of AI in clinical settings, and what the future holds for open-source medical AI. Full episode: nejm.ai/4gOGeSo

#MedSky #MLSky

1 year ago 17 6 1 0
Editorial

To evaluate LLMs for clinical applications, we must move beyond fixed, narrowly scoped datasets and develop a suite of benchmarks that reflect the complexity and diversity of real-world clinical tasks.

“It’s Time to Bench the Medical Exam Benchmark” by Inioluwa Deborah Raji, B.A.S., Roxana Daneshjou, M.D., Ph.D., and Emily Alsentzer, Ph.D.

Editorial To evaluate LLMs for clinical applications, we must move beyond fixed, narrowly scoped datasets and develop a suite of benchmarks that reflect the complexity and diversity of real-world clinical tasks. “It’s Time to Bench the Medical Exam Benchmark” by Inioluwa Deborah Raji, B.A.S., Roxana Daneshjou, M.D., Ph.D., and Emily Alsentzer, Ph.D.

To move forward as a field, clinicians need to design benchmarks that better align with the tasks LLMs are expected to perform upon deployment. Read the editorial by Inioluwa Deborah Raji, BAS, @roxanadaneshjou.bsky.social, and @emilyalsentzer.bsky.social: nejm.ai/3PK8v1D

#MedSky #AI #MLSky

1 year ago 5 2 0 0
Preview
It’s Time to Bench the Medical Exam Benchmark Medical licensing examinations, such as the United States Medical Licensing Examination, have become the default benchmarks for evaluating large language models (LLMs) in health care. Performance o...

Well said, and very nice!

It’s Time to Bench the Medical Exam Benchmark @roxanadaneshjou.bsky.social @emilyalsentzer.bsky.social

ai.nejm.org/doi/full/10....

1 year ago 6 3 0 0

Medical licensing exams are convenient, but don’t reflect real-world clinical tasks. With LLMs already in EHRs, we need evals to match real-world needs. Let’s partner w/ hospitals piloting these tools to create diverse task-specific evaluations.
w/@rajiinio.bsky.social @roxanadaneshjou.bsky.social

1 year ago 9 2 0 0
Post image

Amazing - 30th anniversary of @pacsym.bio. Celebrating the amazing co-chairs and staff that have made this conference possible! #psb25

1 year ago 13 3 1 0

Submit an abstract to join us in Puerto Rico!

✨focus on real-world deployment of AI into the clinic with practical, in-depth discussions
✨intimate setting for detailed conversations & networking to start new collaborations
✨beachfront hotel 🌴

1 year ago 7 3 0 0
Search Jobs | Microsoft Careers

phd students in ML for health: my team based in cambridge UK is hiring interns for 2025!
jobs.careers.microsoft.com/global/en/sh...

1 year ago 18 4 0 1

Couldn't find a machine learning for health starter pack so I made one. 

DM/Reply if you want to be added!

go.bsky.app/PJKJ8vK

1 year ago 109 29 48 0