Gowthami Somepalli (@gowthami) Bsky

Domain Ontologies: Indispensable for Knowledge Graph Construction AI slop is all around and increasingly extraction of useful information will face difficulties as we start to feed more noise into the already noisy world of knowledge. We are in an era of unpreced…

What’s the right resolution for such ontologies? 1,000-10,000 seems like the sweet spot.

H/t @aneeshsathe.com
aneeshsathe.com/2025/01/15/d...

1 year ago 5 2 0 0

Deep Learning Classics and Trends - Google Groups

About to send my last DLCT email of the year today (in 2 hours).

Join the 7-year-old mailing list if you haven't heard of it. (And if you have heard of it but haven't joined, I trust that it's a well thought decision that suits you the best.)

groups.google.com/g/deep-learn...

1 year ago 13 2 0 0

The recording of my #NeurIPS2024 workshop talk on multimodal iterative refinement is now available to everyone who registered: neurips.cc/virtual/2024...

My talk starts at 1:10:45 into the recording.

I believe this will be made publicly available eventually, but I'm not sure when exactly!

1 year ago 36 4 1 0

[M2L 2024] Transformers - Lucas Beyer YouTube video by Mediterranean Machine Learning (M2L) summer school

One of the best tutorials for understanding Transformers!

📽️ Watch here: www.youtube.com/watch?v=bMXq...

Big thanks to @giffmana.ai for this excellent content! 🙌

1 year ago 54 8 0 0

Anne Gagneux, Ségolène Martin, @quentinbertrand.bsky.social Remi Emonet and I wrote a tutorial blog post on flow matching: dl.heeere.com/conditional-... with lots of illustrations and intuition!

We got this idea after their cool work on improving Plug and Play with FM: arxiv.org/abs/2410.02423

1 year ago 356 102 12 11

congratulations, @ian-goodfellow.bsky.social, for the test-of-time award at @neuripsconf.bsky.social!

this award reminds me of how GAN started with this one email ian sent to the Mila (then Lisa) lab mailing list in May 2014. super insightful and amazing execution!

1 year ago 187 27 3 3

Maybe I’m cynical 🙈, this feels more like a KPI meeting activity than something that’s actually useful. There are 1000s of open datasets on HF which are barely used which are curated with a task in mind.

1 year ago 5 0 0 0

Trying to build a "books you must read" list for my lab that everyone gets when they enter. Right now its:

- Sutton and Barto
- The Structure of Scientific Revolutions
- Strunk and White
- Maybe "Prediction, Learning, and Games", TBD

Kinda curious what's missing in an RL / science curriculum

1 year ago 141 11 36 1

On Subjective Uncertainty Quantification and Calibration in Natural Language Generation Applications of large language models often involve the generation of free-form responses, in which case uncertainty quantification becomes challenging. This is due to the need to identify task-specif...

This is a simple and good paper, which somehow nobody working on these things cites, or even seems to be aware of arxiv.org/abs/2406.05213 It is simple idea that seems useful; it formulates the subjective uncertainty for natural language generation in a decision-theoretic setup.

1 year ago 27 3 2 1

A real-time (or very fast) open-source txt2video model dropped: LTXV.

HF: huggingface.co/Lightricks/L...
Gradio: huggingface.co/spaces/Light...
Github: github.com/Lightricks/L...

Look at that prompt example though. Need to be a proper writer to get that quality.

1 year ago 89 10 6 1

1. Computing standard errors of the mean using the Central Limit Theorem 2. When questions are drawn in related groups, computing clustered standard errors 3. Reducing variance by resampling answers and by analyzing next-token probabilities 4. When two models are being compared, conducting statistical inference on the questionlevel paired differences, rather than the population-level summary statistics 5. Using power analysis to determine whether an eval (or a random subsample) is capable of testing a hypothesis of interest

Perhaps an unpopular opinion, but I don't think the problem with Large Language Model evaluations is the lack of error bars.

1 year ago 110 5 9 2

let me say it once more: "the gap between OAI/Anthropic/Meta/etc. and a large group of companies all over the world you've never cared to know of, in terms of LM pre-training? tiny"

1 year ago 77 8 12 1

👋

1 year ago 1 0 0 0

The return of the Autoregressive Image Model: AIMv2 now going multimodal.
Excellent work by @alaaelnouby.bsky.social & team with code and checkpoints already up:

arxiv.org/abs/2411.14402

1 year ago 46 8 1 0

Extending Video Masked Autoencoders to 128 frames Video understanding has witnessed significant progress with recent video foundation models demonstrating strong performance owing to self-supervised pre-training objectives; Masked Autoencoders (MAE) ...

Interesting paper on arxiv this morning: arxiv.org/abs/2411.13683
It's a video masked autoencoder in which you learn which tokens to mask to process fewer of them and scale to longer videos. It's a #NeurIPS2024 apparently.
I wonder if there could be such strategy in the pure generative setup.

1 year ago 47 4 0 0

Very true. Completely forgot about this. However I don’t believe this model is a true reflection of VLMs trained from scratch are capable of though… or maybe my hypothesis is wrong. 🤷‍♀️

1 year ago 0 0 0 0

💯 ! Haven’t seen a single VLM where everything is trained from scratch

1 year ago 0 0 3 0

I have the same thing on, and it’s giving me follow notifications but not comments (which is very stupid! 🥲)

1 year ago 1 0 0 0

I’m not getting notifications for comments here, anyone facing the same issue?

1 year ago 0 0 1 0

What softwares do I actually use on my Mac as a software enthusiast? • Mimansa Jaiswal With several years of using a Mac, it took me time to settle down on a set of apps and softwares that I can heartily recommend. I try almost 30 a month, but end up using around 20 in total for everyth...

Posts by Gowthami Somepalli