Advertisement · 728 × 90

Posts by Abhinav Kumar

Preview
Theoretical Foundations of Conformal Prediction This book is about conformal prediction and related inferential techniques that build on permutation tests and exchangeability. These techniques are useful in a diverse array of tasks, including hypot...

The new conformal prediction book now seems to be final after a bunch of updates: arxiv.org/abs/2411.118...

1 month ago 11 3 1 0

In my #OTD threads I often try to draw attention to important scientists who historically haven't received the same recognition as their peers.

A science education involves a lot of lore: Stories and anecdotes meant to flesh out a narrative about the development of a field...

1 month ago 60 18 2 3
Post image Post image Post image

I have added a new tutorial on discrete diffusion models:
github.com/gpeyre/ot4ml

1 month ago 57 17 0 0
Preview
Joseph Y. Halpern Obituary February 13, 2026 - Bangs Funeral Home View Joseph Y. Halpern's obituary, send flowers, find service dates, and sign the guestbook.

Was saddened today to hear the passing of Joseph Halpern. I knew him from my undergraduate days at Cornell, for part of which he was the department chair.
www.bangsfuneralhome.com/obituaries/j...

2 months ago 10 4 0 1
Post image

If you are interested in all things ✨causal inference✨, please join our multidisciplinary Causal Inference Interest Group (CIIG). We host monthly seminars featuring speakers with various academic backgrounds and research interests.

Links below.

cc @clscohorts.bsky.social @pwgtennant.bsky.social

3 months ago 47 29 1 1
We propose a method for estimating long-term treatment effects with many short-term proxy outcomes: a central challenge when experimenting on digital platforms. We formalize this challenge as a latent variable problem where observed proxies are noisy measures of a low-dimensional set of unobserved surrogates that mediate treatment effects. Through theoretical analysis and simulations, we demonstrate that regularized regression methods substantially outperform naive proxy selection. We show in particular that the bias of Ridge regression decreases as more proxies are added, with closed-form expressions for the bias-variance tradeoff. We illustrate our method with an empirical application to the California GAIN experiment.

We propose a method for estimating long-term treatment effects with many short-term proxy outcomes: a central challenge when experimenting on digital platforms. We formalize this challenge as a latent variable problem where observed proxies are noisy measures of a low-dimensional set of unobserved surrogates that mediate treatment effects. Through theoretical analysis and simulations, we demonstrate that regularized regression methods substantially outperform naive proxy selection. We show in particular that the bias of Ridge regression decreases as more proxies are added, with closed-form expressions for the bias-variance tradeoff. We illustrate our method with an empirical application to the California GAIN experiment.

arXiv📈🤖
Long-Term Causal Inference with Many Noisy Proxies
By Lal, Imbens, Hull

3 months ago 11 5 0 0
Post image

I've just started reading this new book by Shekhar Khare, a renowned number theorist at UCLA, and I highly recommend it. It takes the reader on an intellectual adventure, with very illuminating analogies and vivid storytelling. It's a book about some brilliant math, but it also has a great heart.

3 months ago 47 7 2 0
Advertisement
Post image

We introduce epiplexity, a new measure of information that provides a foundation for how to select, generate, or transform data for learning systems. We have been working on this for almost 2 years, and I cannot contain my excitement! arxiv.org/abs/2601.03220 1/7

3 months ago 143 34 9 9
Preview
Inference on Local Variable Importance Measures for Heterogeneous Treatment Effects We provide an inferential framework to assess variable importance for heterogeneous treatment effects. This assessment is especially useful in high-risk domains such as medicine, where decision makers...

New paper: Inference on Local Variable Importance Measures for Heterogeneous Treatment Effects (with Peter B. Gilbert & @alexluedtke.bsky.social).

Preprint: arxiv.org/abs/2510.18843.

4 months ago 4 2 1 0
Gradient optimization methods: the benefits of instability
Gradient optimization methods: the benefits of instability YouTube video by Sydney Mathematical Research Institute - SMRI

Gradient optimization methods: the benefits of instability — Peter Bartlett, UC Berkeley

www.youtube.com/watch?v=wEgT...

#MathSky #SMRISeminar

4 months ago 16 6 0 0
Post image

Read last night. Very nice. arxiv.org/abs/2512.01868

4 months ago 21 4 2 1
Preview
The official home of the Python Programming Language

TLDR; The PSF has made the decision to put our community and our shared diversity, equity, and inclusion values ahead of seeking $1.5M in new revenue. Please read and share. pyfound.blogspot.com/2025/10/NSF-...
🧵

5 months ago 6406 2749 123 450
Preview
A Martingale Kernel Two-Sample Test The Maximum Mean Discrepancy (MMD) is a widely used multivariate distance metric for two-sample testing. The standard MMD test statistic has an intractable null distribution typically requiring costly...

A nice variant of the kernel two-sample test. arxiv.org/abs/2510.11853

Sketch of the idea: The MMD is the core of a commonly used nonparametric test for distribution testing. It works by embedding distributions into a RKHS and comparing their mean embeddings. [+]

6 months ago 5 1 1 0
Post image

As a grad student, the biologist Yitzhi “Patrick” Cai helped program 𝘌. 𝘤𝘰𝘭𝘪 bacteria to become a biosensor for arsenic contamination in drinking water. Today, he is leading a global effort to build the first-ever synthetic eukaryotic genome. www.quantamagazine.org/hes-gleaning...

6 months ago 36 5 1 1

We have two new mentees who are offering their time via office hours! Please show Sandeep Silwal and Kevin Tian some love and sign up to meet them!
let-all.com/officehours....

6 months ago 9 3 1 0

The paper this talk is based on is quite impressive arxiv.org/abs/2507.04441 one of those cases where you see direct real actionable insight using the categorical hammer.

6 months ago 3 1 1 0
Advertisement
Simulating Time With Square-Root Space (And With Details) - Ryan Williams
Simulating Time With Square-Root Space (And With Details) - Ryan Williams YouTube video by Institute for Advanced Study

Today at IAS, I gave a 2 hr 15 mins lecture on why TIME[t] is in SPACE[√(t log t)]. You can watch it here!
www.youtube.com/watch?v=ThLv...

6 months ago 39 7 1 0
Post image

Aligning an AI with human preferences might be hard. But there is more than one AI out there, and users can choose which to use. Can we get the benefits of a fully aligned AI without solving the alignment problem? In a new paper we study a setting in which the answer is yes.

7 months ago 27 4 1 0
Preview
Egan conjecture holds Given a Euclidean simplex of dimension n⩾2 let its radii of inscribed and circumscribed spheres be r and R, and the distance between the centers of th…

Sergei Drozdov has published his nice proof using hyperbolic simplexes of the necessary and sufficient condition on the radii of spheres that sit inside and outside a Euclidean simplex in any dimension.

www.sciencedirect.com/science/arti...

7 months ago 17 3 2 0

The workshops focused on (in chronological order):

- Variational Inference (youtube.com/playlist?lis...)
- Optimal Transport (youtube.com/playlist?lis...)
- Parallel Computing (youtube.com/playlist?lis...), and
- Computational Physics (youtube.com/playlist?lis...).

7 months ago 3 2 1 0
Post image

The mathematician Lingrui Ge recently helped find a new way to understand the solutions of almost-periodic functions, important equations that appear in quantum physics. The work has helped cement an intriguing connection between number theory and physics. www.quantamagazine.org/ten-martini-...

7 months ago 45 11 0 0
Post image

Pretty cool: the "Fundamental Examples" of independence structures in Non-Commutative Probability.

7 months ago 5 3 1 0

Carlos Cinelli, Avi Feller, Guido Imbens, Edward Kennedy, Sara Magliacane, Jose Zubizarreta
Challenges in Statistics: A Dozen Challenges in Causality and Causal Inference
https://arxiv.org/abs/2508.17099

7 months ago 10 4 0 0
Advertisement

Adi Shamir's advice to young researchers:

1. Read, read, read. Back in the eighties, I read every cryptography paper out there. Once that became impossible, I read the abstract of every paper. Now I read at least every title.

1 year ago 16 4 2 1
Preview
An Ode to the Spherical Cow How Imperfect Models Drive Scientific Discovery

This week's post is about why spherical cows are physics' mascot ⚛️🧪

open.substack.com/pub/nirmalya...

8 months ago 32 11 3 0
Post image

Big fan of this perspective:

11 months ago 44 8 2 0
Lessons from Paula Harris / by Sophie Huiberts
Lessons from Paula Harris / by Sophie Huiberts YouTube video by Mixed Integer Programming

This is about one of my greatest inspirations. It would mean a lot to me if you gave it a watch

8 months ago 22 7 0 1

Regardless of whether you plan to use them in applications, everyone should learn about Gaussian processes, and Bayesian methods. They provide a foundation for reasoning about model construction and all sorts of deep learning behaviour that would otherwise appear mysterious.

8 months ago 55 6 3 0
E5: What Confounding Really Is
E5: What Confounding Really Is YouTube video by Causal Foundations

After a bit of a summer pause, I'm back to making episodes. In this episode, I explain the notion of confounding, and clarify why confounders should not be thought of as alternate explanations of an observed effect.

youtu.be/kAgS7cltBhM

8 months ago 4 1 0 0
Post image

Randomized trials (RCTs) help evaluate if deploying AI/ML systems actually improves outcomes (e.g., survival rates in a healthcare context).

But AI/ML systems can change: Do we need a new RCT every time we update the model? Not necessarily, as we show in our UAI paper! arxiv.org/abs/2502.09467

8 months ago 5 1 1 0