Advertisement Β· 728 Γ— 90

Posts by Alexandra Gomez-Villa πŸ³οΈβ€πŸŒˆ

@neuripsconf.bsky.social, a quick question: are we going to have a competition track this year?

6 days ago 0 0 0 0
Post image

Children exhibit visual understanding from limited experience, orders of magnitude less than our best models.

We introduce the Zero-shot World Model (ZWM). Trained on a single child's visual experience, BabyZWM rapidly generates competence across diverse benchmarks with no task-specific training. 🧡

1 week ago 55 24 1 4
Post image

How do we make attention actually capture context?

Exclusive Self Attention (XSA) is an interesting variant that improves attention with minimal cost in speed & memory.

Check out the video here: youtu.be/2eZKT4H9_iQ

1 week ago 14 4 0 0
Dolby Laboratories hiring AI PhD Intern, Sound Experiences Lab in Barcelona, Catalonia, Spain | LinkedIn Posted 10:05:54 AM. Join the leader in entertainment innovation and help us design the future. At Dolby, science meets…See this and similar jobs on LinkedIn.

Are you a PhD student specializing in audio or AI? Interested in an internship with us in Barcelona? Apply here!

We've had the pleasure of working with talented interns in the past, and the experience has been mutually very rewarding.

www.linkedin.com/jobs/view/40...

1 year ago 7 2 0 0
Post image

I have just read this book and I strongly identify with the authors. In the increasingly accelerated landscape of machine learning (where a paper from two years ago is considered "old"), I think these kinds of reflections are really important for researchers in academia.

1 year ago 4 0 0 0
Post image Post image Post image Post image

A long human life is about 90 years. You can visualize that as weeks. I wrote an R/shiny app and a Python CLI (click) that'll make this plot for you when you input your birthday, and give you a printable 8.5x11 PDF. https://github.com/stephenturner/lifeweeks #Rstats 🧡 1/5

1 year ago 18 2 1 0

SIGGRAPH'25 (form): 12 days.
RSS'25 (abs): 13 days.
SIGGRAPH'25 (paper-md5): 19 days.
RSS'25 (paper): 20 days.
ICML'25: 26 days.
RLC'25 (abs): 41 days.
RLC'25 (paper): 48 days.
IROS'25: 56 days.
ICCV'25: 61 days.

1 year ago 6 1 0 1
Post image

Researcher: "We let the data speak for itself."

Earlier that day:

1 year ago 7958 1008 97 69
Advertisement

What was the most important machine learning paper in 2024?

My Famous Deep Learning Papers list (that I use in teaching) does not include any new ideas from the last year.

papers.baulab.info

Which single new paper would you add?

1 year ago 55 11 10 0
Video

New* video! If you’ve ever wondered what topology is, this problem is one of the best examples I know of to give an authentic sense of what it’s all about: youtu.be/IQqtsm-bBRU

1 year ago 385 52 4 4
Preview
Predictions for AI in 2025: Collaborative Agents, AI Skepticism, and New Risks Leading Stanford faculty offer their expectations for artificial intelligence in the new year.

What’s on the horizon for AI in 2025? Leading Stanford faculty offer their expectations in the new year. hai.stanford.edu/news/predict...

1 year ago 14 3 1 0

A post by @cloneofsimo on Twitter made me write up some lore about residuals, ResNets, and Transformers. And I couldn't resist sliding in the usual cautionary tale about small/mid-scale != large-scale.

Blogpost: lb.eyer.be/s/residuals....

1 year ago 81 9 3 2
Preview
a man in a suit and tie is holding a newspaper in his hand ALT: a man in a suit and tie is holding a newspaper in his hand

Registration for #DLBCN 2024 is now open for the general public. Get your ticket today !

sites.google.com/view/dlbcn20...

1 year ago 2 2 0 0
Post image

Qui Gon Jinn sharing some insightful prompting wisdom πŸ‘ŒπŸΌ

1 year ago 11 3 1 0
What is a camera calibration?
What is a camera calibration? YouTube video by Main Street Autonomy

Nice high-level explainer of camera calibration by Main Street Autonomy.
youtu.be/IHzRSLvRW9c

1 year ago 35 2 0 0

Welcome @egavves.bsky.social!

Been wondering where did all of the computer vision people on X go? Check out my 2 starter packs πŸ‘‡

1 year ago 9 3 1 0
Advertisement
Preview
Potentialities of science comics for science communication: lessons from the classroom The aim of this pilot study was to understand how the use of science comics, centred on complex scientific knowledge, can promote students' engagement with science, in order to discuss its poten...

πŸ”¬ Weekend Science Spotlight πŸ§ͺ

Let’s share a recent study by someone else worth spotlighting!

My pick is from @jscicom.bsky.social: jcom.sissa.it/article/pubi...

It explores how science comics boost engagement & understanding to simplify complex topics.

What’s yours? πŸ€”

#SciComm #SciArt

1 year ago 51 17 1 5
Preview
Reclaiming AI as a Theoretical Tool for Cognitive Science - Computational Brain & Behavior The idea that human cognition is, or can be understood as, a form of computation is a useful conceptual tool for cognitive science. It was a foundational assumption during the birth of cognitive scien...

Full paper:

van Rooij, I., Guest, O., Adolfi, F. et al. Reclaiming AI as a Theoretical Tool for Cognitive Science. Comput Brain Behav (2024). doi.org/10.1007/s421...

1 year ago 68 18 5 3
Preview
GitHub - anthropics/courses: Anthropic's educational courses Anthropic's educational courses. Contribute to anthropics/courses development by creating an account on GitHub.

Anthropic has some comprehensive resources covering tool use and AI agents that few know about.

The repository features five courses, from prompt engineering to evaluation and tool use.

- Sahar Mor.

github.com/anthropics/c...

1 year ago 19 5 1 0

On the Karpathy-Schmidhuber discussion: here is a slide I sometimes present on the history and origins of attention, which is shared between ML, image-processing (work of Jean-Michel Morel), NLP and computer vision.

1 year ago 66 7 6 1
Post image

One of my astute grad students made the observation that the meme also works if you switch the grad student and PI. 🀣

1 year ago 6 1 0 0
Post image Post image

I am super hyped and happy with our recent paper on a ✨VampPrior 2.0✨: Hierarchical VAE with a diffusion-based VampPrior! πŸ¦‡ We got SOTA VAE results on CIFAR-10! Kudos to Anna Kuzina because this TMLR paper is the last chapter in her PhD thesis 🀩
πŸ“ƒ tinyurl.com/22rvzc4f
πŸ’» github.com/AKuzina/dvp_...

1 year ago 75 9 1 0
Post image
1 year ago 55 8 0 1
Advertisement
Preview
Trending Papers - November ✨ - a zh-ai-community Collection Most upvoted paper on the Daily Papers

The most upvoted papers from the Chinese community on the Daily Papers - NovemberπŸ”₯
huggingface.co/collections/...

1 year ago 1 1 0 0

I don't really see a clear path where we keep an open internet that is not mostly full of AIs talking to each other. We can't reliably detect AI content, it is cheap and easy to generate, and there are lots of incentives to do so, even besides scams.

You can see the problem on all the social sites.

1 year ago 271 40 28 19
Post image

Have you ever wondered how to train an autoregressive generative transformer on text and raw pixels, without a pretrained visual tokenizer (e.g. VQ-VAE)?

We have been pondering this during summer and developed a new model: JetFormer πŸŒŠπŸ€–

arxiv.org/abs/2411.19722

A thread πŸ‘‡

1/

1 year ago 153 37 4 7

I learned about this paper (arxiv.org/abs/2406.09413) when Alexei gave this wonderful talk at the U. They trained 60K diffusion models, each for a different person's visual identity. Sampling weights from this set creates a model for a novel identity.

1 year ago 20 6 1 0

Science rarely affords you the luxury of being exactly right about anything. Critics of your work will expect you to have all the answers at once, but in fact progress is more often about being vaguely right and working your way toward the truth one small step at a time.

1 year ago 39 4 2 0