Swarat Chaudhuri (@swarat) Bsky

AlphaEvolve: A Gemini-powered coding agent for designing advanced algorithms
New AI agent evolves algorithms for math and practical applications in computing by combining the creativity of large language models with automated evaluators

Announcing AlphaEvolve, our new LLM coding agent that has
- made new scientific discoveries
- discovered algorithms that are now deployed at Google (in Gemini, Transformers, TPU hardware design & data centers)

Blog: deepmind.google/discover/blo...
White paper:
storage.googleapis.com/deepmind-med...

11 months ago 115 40 5 14

US revokes nearly 1,500 student visas: Who are the targets?
Hundreds of students have had their visas cancelled and find themselves in limbo.

One of my PhD students got their visa revoked. I know of other cases amongst my AI colleagues. This is not what investing in US leadership in AI looks like.

www.aljazeera.com/news/2025/4/...

1 year ago 60 23 2 1

Guggenheim Foundation Names 3 at UT in 100th Class of Fellows
Swarat Chaudhuri, a computer scientist, and Feliciano Giustino, a physicist, are among this year’s fellows from The University of Texas at Austin.

Congrats to UT computer scientist Swarat Chaudhuri & UT physicist Feliciano Giustino who were named as Guggenheim Fellows for 2025!

#GuggFellows2025 @guggfellows.bsky.social @utaustin.bsky.social @swarat.bsky.social
cns.utexas.edu/news/accolad...

1 year ago 8 2 0 0

I am honored to be part of the #guggfellows2025 class. My Guggenheim project is on AI systems that can discover new math in an open-ended way. Many thanks to my students, colleagues, and mentors, who inspire me every day and without whom this work wouldn't be possible. www.gf.org/stories/anno...

1 year ago 10 1 1 0

Harvard has set an example for other higher-ed institutions - rejecting an unlawful and ham-handed attempt to stifle academic freedom, while taking steps to make sure students can benefit from an environment of intellectual inquiry, rigorous debate and mutual respect. Let’s hope others follow suit.

1 year ago 89548 18232 1572 746

The #LeanLang Standard Library, under active development at the Lean FRO, envisions providing a reliable and extensible basis for #softwaredevelopment, #softwareverification and #mathematics through verified components, a high-quality API, performance optimization, and best-in-class documentation.

1 year ago 8 4 0 0

Andrew Barto and Richard Sutton are the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning. In a series of papers beginning...

RL is so back!

(well, for some of us, it never really left)

awards.acm.org/about/2024-t...

1 year ago 72 12 1 1

STAND UP FOR SCIENCE March 7, 2025. Washington DC and nationwide. Because science is for everyone.

Calling all scientists and students based in London!

standupforscience2025.org and local groups are organizing rallies around the US to protest against the new administration’s massive and indiscriminate funding cuts to all manner of scientific research…👉🏼🧵
#sciencematters #standupforscience #london

1 year ago 7 3 3 0

Congrats to @amitayush.bsky.social for leading this effort. And thanks to my student George Tsoukalas and collaborator extraordinaire @gregdnlp.bsky.social, who made critical contributions to the work. (3/3)

1 year ago 1 0 0 0

It also has built-in machinery for large-scale, neurally guided proof search. We show that Proofwala's multilingual capabilities can enable transfer across proof assistants. Specifically, our multilingual model can outperform Coq- and Lean-only models at standard proof synthesis metrics. (2/3)

1 year ago 1 0 1 0

Excited about Proofwala, @amitayush.bsky.social's new framework for ML-aided theorem-proving.

* Paper: arxiv.org/abs/2502.04671
* Code: github.com/trishullab/p...

Proofwala allows the collection of proof-step data from multiple proof assistants (Coq and Lean) and multilingual training. (1/3)

1 year ago 21 5 1 1

Upon learning that yesterday would be my last day as a program officer at the National Science Foundation, I shared this parting message with my colleagues. The next few months will be frenetic and stressful for them. Here are some things that you can do to help them with the mission ahead. (1)

1 year ago 2420 825 69 70

DARPA released a Request for Information (RFI) that seeks community feedback on the draft DARPA Guide to Formal Methods to Deliver Resilient Systems for Proposals (“the FMDRS Guide”). You can find the RFI here on Sam.gov.

Details in the image...

1 year ago 5 3 0 0

Proving the Coding Interview: A Benchmark for Formally Verified Code Generation
We introduce the Formally Verified Automated Programming Progress Standards, or FVAPPS, a benchmark of 4715 samples for writing programs and proving their correctness, the largest formal verification ...

Proving the Coding Interview: A Benchmark for Formally Verified Code Generation

“We introduce the Formally Verified Automated Programming Progress Standards, or FVAPPS, a benchmark of 4715 samples […] including 1083 curated and quality controlled samples”

arxiv.org/abs/2502.05714

1 year ago 4 1 1 0

Can LLMs be used to discover interpretable models of human and animal behavior?🤔

Turns out: yes!

Thrilled to share our latest preprint where we used FunSearch to automatically discover symbolic cognitive models of behavior.
1/12

1 year ago 135 45 3 11

How a Canadian scientist and a venomous lizard helped pave the way for Ozempic - National | Globalnews.ca
In 1984, Dr. Daniel Drucker, an endocrinologist from the University of Toronto, discovered a hormone that helped pave the way for popular diabetes drugs such as Ozempic.

This is the most relevant article to NIH and research cuts I’ve seen.

Imagine if this was today, how many people would be saying “Why are we studying Gila Monsters and their impact on diabetes? That’s wasted money!”

globalnews.ca/news/9793403...

1 year ago 48742 12455 1129 436

Super excited: my new @darpa program on AI for pure mathematics!

Exponentiating Mathematics (expMath) aims to accelerate the rate of progress in pure math through the development of an AI collaborator and new professional-level math benchmarks.

sam.gov/opp/4def3c13...

1 year ago 16 5 0 1

The Deep Link Equating Math Proofs and Computer Programs | Quanta Magazine
Mathematical logic and the code of computer programs are, in an exact way, mirror images of each other.

Mathematical proof assistants like Coq and Lean were made possible by a correspondence that established the equivalence between proofs and computation. Read the explainer from our archive:

1 year ago 45 19 0 3

Screening performance and characteristics of breast cancer detected in the Mammography Screening with Artificial Intelligence trial (MASAI): a randomised, controlled, parallel-group, non-inferiority, ...
The findings suggest that AI contributes to the early detection of clinically relevant breast cancer and reduces screen-reading workload without increasing false positives.

New: The largest medical A.I. randomized controlled trial yet performed, enrolling >100,000 women undergoing mammography screening
The use of AI led to 29% higher detection of cancer, no increase of false positives, and reduced workload compared with radiologists w/o AI thelancet.com/journals/lan...

1 year ago 1351 347 38 89

This is one more, and such a profound, way of distinguishing between science and technology: "Technology shouts for itself; science [does not]." (And these days, some technologies truly do themselves shout…)

1 year ago 9 1 1 0

2025 will be an interesting #mathsky year!

1 year ago 19 5 2 0

@ayushkhaitan.bluesky.social, Amitayush Thakur, and I are organizing an #AI4Math panel at the Joint Mathematics Meetings this month. Please spread the word among your math friends! We will post a summary of the discussion after the event.

1 year ago 6 1 0 0

I really enjoy NASA Administrator (!!!) Michael Griffin on the "Real Reasons" versus the "Acceptable Reasons" to go to the moon: spaceref.com/status-repor...

1 year ago 33 8 1 2

You make a good point. AlphaProof will evolve just as the informal approaches have, though.

1 year ago 1 0 0 0

Yeah, I think so, especially if search is permitted at test-time.

1 year ago 0 0 0 0

From what I have seen, LLMs are quite good at that. There are plenty of examples of definitions being used in various contexts in the training data.

1 year ago 0 0 1 0

Can AI do maths yet? Thoughts from a mathematician.
So the big news this week is that o3, OpenAI’s new language model, got 25% on FrontierMath. Let’s start by explaining what this means.

An excellent post by Kevin Buzzard on informal reasoning methods like o3. The key point, one I wholeheartedly agree with, is that informal methods continue to struggle with proof even when they give the correct answers, and this is a critical liability. xenaproject.wordpress.com/2024/12/22/c...

1 year ago 17 4 2 1

Hmm, I wasn’t imagining they would be connected to the account security people at X. But maybe worth a shot. Thank you!

1 year ago 1 0 0 0

We are excited about the potential of this approach in
✅ hard, research-level math tasks
✅ deep assurance of software and hardware systems.

This was a team effort with Kaiyu Yang, Gabriel Poesia, Jingxuan He, Wenda Li, Kristin Lauter, and Dawn Song. Please reach out to us with feedback! (2/2)

1 year ago 3 0 0 0
