Advertisement · 728 × 90

Posts by Timothy Keyes

😭🤣🤪

7 months ago 0 0 0 0

I’m not sure what face I’m making in that screenshot (lol) but I’m glad you enjoyed! (Funny faces notwithstanding)

7 months ago 1 0 1 0
A chat interface showing a conversation starting with a user saying "Are there any talks about evals" and a response "Yes, Simon Couch just gave one, and he hopes you enjoyed it." There's a teddy bear mascot wearing a teal shirt and cap in the bottom right corner of the chat window.

A chat interface showing a conversation starting with a user saying "Are there any talks about evals" and a response "Yes, Simon Couch just gave one, and he hopes you enjoyed it." There's a teddy bear mascot wearing a teal shirt and cap in the bottom right corner of the chat window.

A picture from the side of a conference session room, with myself standing in front.

A picture from the side of a conference session room, with myself standing in front.

Just wrapped up a talk at #positconf2025 about LLM evaluation with R! Such a joy to hang out with #rstats folks in person and hear about what others are working on.

Resources and slides: github.com/simonpcouch/...

7 months ago 25 3 1 0

my most recent hot take: after a year working inside my hospital’s IT system, i’m increasingly convinced that clinical informatics should have been a residency (like pathology or radiology), not a fellowship.

11 months ago 5 0 0 0

idk what it means that so many people I know to be thoughtful, compassionate, and brilliant physicians have told me that - while they love taking care of patients - the growing frustrations and exhaustion they experience in clinical life make the whole thing feel unsustainable in the long-term

1 year ago 3 0 0 0

as i prepare to go back to med school, i am haunted daily by the number of my med school classmates (who are now residents, fellows, or attendings) who have reached out to me to learn about my side-career in health tech because they want to explore alternatives to a full-time clinical career

1 year ago 1 0 1 0

how are you such a legend - this is amazing!

1 year ago 5 0 0 0

all this talk about AI and i’m still sitting here thinking about the warm, familiar embrace of OLS

1 year ago 6 0 0 0

today i got an email in which a statistician cited a paper from over 200 years ago and all i could think is how that’s the kind of power computer scientists wish they had

1 year ago 24 2 4 0

I feel like every paper I read about LLMs in health care takes for granted that the encouraging, but not necessarily transformative, results we’ve seen so far are going to scale upward as LLMs improve. But are we really safe to assume that LLMs will keep getting better? I’m not so sure.

1 year ago 2 0 1 0
Advertisement

anyone who made me learn the brachial plexus will now be forced to understand gradients, sorry i don’t make the rules

1 year ago 0 0 0 0
Meme of a still from a Dr. Phil interview. An unamused girl is pictured with a banner reading “aneska says violence works for her”

Meme of a still from a Dr. Phil interview. An unamused girl is pictured with a banner reading “aneska says violence works for her”

forced my physician collaborators to listen to me talk about partial derivatives today

1 year ago 3 0 1 0

I can already tell that this paper is going triple-platinum in the Keyes household. Thank you for sharing!

1 year ago 0 0 1 0
Preview
Red teaming ChatGPT in medicine to yield real-world insights on model behavior - npj Digital Medicine npj Digital Medicine - Red teaming ChatGPT in medicine to yield real-world insights on model behavior

It’s finally out! We brought a multidisciplinary team of physicians, computer scientists, and engineers to red team LLMs for healthcare uses. And we have shared the dataset! www.nature.com/articles/s41...

1 year ago 44 11 2 4

disclaimer: I haven’t finished medical school yet (i will someday!) so this is mostly stolen valor, but even having a small amount of clinical training has imo helped me so much to understand small details that my technical colleagues don’t really notice or care about

1 year ago 2 0 0 0

i’m working on a few really cool applications of LLMs in health right now (in the real world, in a deployment environment) and my most valuable asset by far in this work isn’t being the most technical person on the team (i’m not); it’s knowing just enough about medicine to ask the right questions

1 year ago 4 0 1 0
AI Grand Rounds
Episode 27
From Clinical Notes to GPT-4: Dr. Emily Alsentzer on Natural Language Processing in Medicine

AI Grand Rounds Episode 27 From Clinical Notes to GPT-4: Dr. Emily Alsentzer on Natural Language Processing in Medicine

Dr. @emilyalsentzer.bsky.social, a Stanford faculty member and expert in clinical #AI, discusses the evolution of natural language processing, the challenges of AI in clinical settings, and what the future holds for open-source medical AI. Full episode: nejm.ai/4gOGeSo

#MedSky #MLSky

1 year ago 17 6 1 0
Advertisement

"What I find hard to reconcile is, on the one hand, we want to not fall behind on AI writ large. And on the other hand, the very people we need to ensure that agility are being let go.”—Nigam Shah @stanfordhai.bsky.social

1 year ago 146 24 9 0

i have to respect how non-technical folks use spreadsheets - they give you these beautiful murals with all kinds of colors and spacing and intricate patterns. i must say it really breaks my heart to immediately flatten it all with read_csv

1 year ago 2 0 0 0

(i appreciate the use of benchmarks and think they’re really valuable, but benchmarks must be drawn from some population of possible observations, right? so there will still be sampling error in any metrics computed using a benchmark, even if all models use the same benchmark)

1 year ago 1 0 0 0

i’ve been trying to understand why reporting some estimate of confidence intervals around performance metrics (or null hypothesis significance testing) is not more common in the machine learning/AI literature. i think this is changing, but there’s still a lot of weird (or missing) statistics ime

1 year ago 2 0 1 0

i think the vast ambiguity in the term “data scientist” causes a lot of headaches. am i an engineer? am i a computer scientist? am i a statistician? who knows!!!

1 year ago 1 0 1 0

started my day writing a few (simple) statistical proofs for a data science project (to justify simplifying a calculation from something complicated to something simpler and equivalent) and wow it really is nice to dust off the old PhD and put it to use every once in a while

1 year ago 2 0 0 0

This was such a wonderful read - thanks for sharing it @emilyriederer.bsky.social! I’ve been using poetry for package development recently, but now I’m eager to try uv!🤞 (And seaborn.objects might be a nice alternative to plotnine too!)

1 year ago 1 0 0 0

This is really cool! Congrats and thanks for sharing!

1 year ago 2 0 0 0

i rounded on stanford’s palliative medicine service today, and it was such a powerful reminder that end-of-life care clinicians are some of the most kind and empathetic people in health care. truly in awe of their ability to bear witness to (and guide people through) such difficult moments

1 year ago 1 0 0 0

Thanks for sharing! I’m excited to bring this to our group for journal club.

1 year ago 2 0 1 0
Advertisement

when someone tells you that they’ve solved an important problem using ai, that’s very exciting! but you should not take their word for it. if they don’t provide very transparent tools for monitoring/evaluating their ai product, it’s pretty safe to assume even *they* don’t know if it actually works

1 year ago 7 0 0 0

i am not an ai doomer by any stretch, but building ai systems (big or small) that solve useful problems in health care is really hard! validation experiments are really hard (and sometimes really expensive) to do! very few ai products work exactly as advertised, and many don’t work at all!

1 year ago 5 0 1 0