NYU Center for Data Science (@nyudatascience) Bsky

UCSF postdoctoral researcher Shailee Jain (@shaileejain.bsky.social) recently visited CDS for the Minds, Brains and Machines Seminar.

She discussed using AI interpretability tools and linguistic theory to better understand the human brain.

Jain is at the Chang lab in UCSF's Dept of Neurosurgery.

16 hours ago 1 0 0 1

The event was hosted by CDS Professor Kyunghyun Cho (@kyunghyuncho.bsky.social ).

D’Ercole spoke at the Global AI Frontier Lab Seminar Series in an individual capacity, rather than as an official representative of the UN.

5 days ago 0 0 0 0

At NYU's Global AI Frontier Lab Seminar Series, James D’Ercole of the UN's Department of Field Support discussed the challenges of deploying AI in peace operations, highlighting a real-time translation system in South Sudan designed for low-resource, high-risk field environments.

5 days ago 0 0 1 0

Undergrads will work on projects involving humanitarian ML, AI for science, visual neuroscience, and learning agents with mentors Carlos Fernandez-Granda, @neurograce.bsky.social, @mengyer.bsky.social, and Mateo Dulce Rubio.

5 days ago 0 0 0 0

Announcing the 2026–2027 Data Science Research Program for Undergraduate Students projects!

5 days ago 0 0 1 0

CreteLing 2026 CreteLing 2026 Summer Schoon in Linguistics

CDS Assoc. Prof. @tallinzen.bsky.social is teaching “Large Language Models and Linguistics” at CreteLing 2026, a summer school running July 18–31, in Crete, Greece.

The course examines how LLMs contribute to the scientific investigation of human language.

linguistics.philology.uoc.gr/cssl26/

6 days ago 1 1 0 0

CDS PhD students Aramis Tanelus, Varun Yerram, Grégoire Lambrecht, Carmel Pe’er, Jiacheng (Patrick) Shen, Nishka Pant, Niket Patel, Ellen Su, Boyi Yang, Sara Dragutinovic, and Shashwat Singh (@shashwat1002.bsky.social) met to celebrate their first year at CDS.

1 week ago 0 0 0 0

VeriSoftBench: Repository-Scale Formal Verification Benchmarks for Lean Large language models have achieved striking results in interactive theorem proving, particularly in Lean. However, most benchmarks for LLM-based proof automation are drawn from mathematics in the Mat...

Courant Faculty Fellow @jqchen.bsky.social, CDS Assoc Prof @gregdnlp.bsky.social, and Yutong Xin & @idillig.bsky.social from UT-Austin introduced VeriSoftBench, a benchmark of 500 Lean 4 proof obligations for software verification.

GitHub: github.com/utopia-group...

arXiv: arxiv.org/abs/2602.18307

1 week ago 0 0 0 0

CDS Assoc. Prof. of Music Technology and Data Science Brian McFee discusses his research focus at the intersection of audio signals and large language models.

He works on music information retrieval, investigating the representations of audio learned by LLMs and other systems.

1 week ago 1 0 0 0

CDS Associated Prof. Vasant Dhar (Stern School of Business, author, podcast host) moderated the CDS Startup Panel.

Panelists Haftan Eckholdt (5x founder), Mike King (iPullRank), and CDS alumnus Suvir Wadhwa (Flite) discussed the path to startup success.

1 week ago 0 0 0 0

Building the Science of Scaling: Improving the Efficiency of Deep Learning Optimizers A profound regime change in the field of optimization may be around the corner. For a decade, the Adam optimizer has overwhelmingly…

Using advanced AI optimizers like Muon doesn’t have to rely on guesswork.

Courant PhD students Shikai Qiu and Zixi (Charlie) Chen, CDS PhD Student Hoang Phan, CDS Asst. Prof. Qi Lei, and CDS Prof. @andrewgwils.bsky.social bridge theory and practice.

nyudatascience.medium.com/building-the...

1 week ago 1 1 0 0

Huge congratulations to Julia Stoyanovich, Institute Associate Professor and Director of the Center for Responsible AI, on being named on The 2026 Above & Beyond: Women list by City & State NY.

#NYUTandonMade
www.cityandstateny.com/power-lists/...

1 week ago 3 2 0 0

CDS Associate Professor of Mathematics and Data Science Carlos Fernandez-Granda on his research into foundational models.

He is excited to explore the “potential and limitations of these models for applications in healthcare, scientific imaging.”

1 week ago 0 0 0 0

CDS and the Wasserman Center recently hosted an Interview & Technical Prep Workshop for CDS undergrads.

The session involved technical problem-solving tips, interview strategies, and insights into what recruiters prioritize during the hiring process.

2 weeks ago 1 0 0 0

CDS students recently gathered at the CDS Open Space for "Interview Prep Jeopardy" organized by the CDS Graduate Student Community-Building Group (GCBG) and Women in Data Science (WiDS).

Teams answered questions inspired by real data science interviews to win prizes of CDS merch.

2 weeks ago 0 0 0 0

April 2026 CDS Research Feature

The April CDS Research Feature has dropped 📧 Yann LeCun brings us Rectified LpJEPA, Jiajian Ma develops a new deep learning model to lower multiple sclerosis diagnosis error rates, and more on this month’s feature! t.e2ma.net/webview/hcix...

2 weeks ago 1 1 0 0

CDS PhD student Anthony GX-Chen explains his paper, “KL-Regularized Reinforcement Learning Is Designed to Mode Collapse.”

See blog linked below for more.

2 weeks ago 0 0 0 0

Why Are AI Answers So Predictable? Uncovering the Math Behind Diversity Loss When humans are asked to pick a random number between one and one hundred, they offer a wide spread of choices, but when a large language…

Why do AI models repeat the same answers?

CDS PhD student Anthony GX-Chen, CDS Associate Professor Rajesh Ranganath, and co-authors show that diversity loss and mode collapse are built-in features of reinforcement learning.

nyudatascience.medium.com/why-are-ai-a...

2 weeks ago 0 0 0 1

Universal priors: solving empirical Bayes via Bayesian inference and pretraining We theoretically justify the recent empirical finding of [Teh et al., 2025] that a transformer pretrained on synthetically generated data achieves strong performance on empirical Bayes (EB) problems. ...

CDS Asst. Prof. Yanjun Han and colleagues at NYU and MIT explains why transformers trained on synthetic data excel at empirical Bayes (EB) problems.

By using universal priors, these models adapt to new data through posterior contraction.

arxiv.org/abs/2602.15136

2 weeks ago 1 0 0 0

Project proposals this year by CDS Faculty Fellow Mateo Dulce Rubio, CDS Assoc. Prof. Carlos Fernandez-Grande, Asst. Prof. Grace Lindsay (@neurograce.bsky.social), and Asst. Prof. Mengye Ren (@mengyer.bsky.social)!

2 weeks ago 1 1 0 0

Applications for the Data Science Research Program for Undergraduate Students are open!

- Work directly with CDS faculty and researchers on cutting-edge projects
- In-person
- 2026-2027 academic year
- 10 hours/week
- Funded

Deadline: May 1, 2026

cds.nyu.edu/research-pro...

2 weeks ago 2 1 1 0

Beyond Language Modeling: An Exploration of Multimodal Pretraining The visual world offers a critical axis for advancing foundation models beyond language. Despite growing interest in this direction, the design space for native multimodal models remains opaque. We pr...

CDS-affiliated Asst Prof @saining.bsky.social, CDS founding director @yann-lecun.bsky.social, and others released a paper on multimodal pretraining that focuses on moving past the limitations of standard language modeling to create more versatile systems.

arxiv.org/abs/2603.03276

2 weeks ago 1 0 0 0

CDS Asst. Prof. Grace Lindsay (@neurograce.bsky.social) shares research directions she is excited about:

- Building foundation models for satellite imagery in climate applications

- Using recent compute increases to build embodied models that navigate complex environments by mimicking the brain

3 weeks ago 0 0 0 0

CDS-affiliated Professor of Biochemistry and Molecular Pharmacology Itai Yanai (@itaiyanai.bsky.social), & CDS-affiliated Associate Professor of Computer Science Benjamin Peherstorfer.

(3/3)

3 weeks ago 0 0 0 0

Among the "highly cited researchers": CDS-affiliated Professor of Biology Rahul Satija, CDS-affiliated Professor of Politics Joshua Tucker (@jatucker.bsky.social),

(2/3)

3 weeks ago 0 0 1 0

Clarivate named NYU a top research institution for its concentration of "highly cited researchers," which is defined as being in the top 0.1%.

NYU is home to 0.4% of these researchers, putting it among the top 70 research powerhouses worldwide.

www.nyu.edu/about/news-p...

(1/3)

3 weeks ago 0 0 1 0

Testing the Limits of AI in Research Mathematics Artificial intelligence models are now capable of executing highly complex, research-level mathematics, signaling a fundamental shift in…

AI is becoming incredibly powerful at solving complex math, but it requires human supervision.

CDS Silver Prof Julia Kempe (@kempelab.bsky.social) tested top models on unpublished math questions, exploring their capabilities and limits in a new paper.

nyudatascience.medium.com/testing-the-...

3 weeks ago 2 0 0 0

CDS Clinical Professor of Data Science and Psychology Pascal Wallisch discusses cognitive diversity.

His work uses network data to predict individual differences.

“Two people are very, very different mentally. That is not noise.”

3 weeks ago 0 0 0 0

CDS MS student Adarsh Tiwari joined NYU Wasserman’s Global Career Trek in London this spring break, meeting with alumni and firms across finance and consulting in an international setting.

Visits included SCOR, Deloitte, Bank of America, Patron Capital, and Bloomberg.

3 weeks ago 0 0 0 0

Teaching LLMs to reason like Bayesians Google researchers demonstrate how Bayesian teaching through supervised fine-tuning enables LLMs to approximate optimal probabilistic reasoning and generalize to new domains.

Google Research's blog post: research.google/blog/teachin...

2/2

4 weeks ago 0 0 0 0

Posts by NYU Center for Data Science