Advertisement Β· 728 Γ— 90

Posts by Thiemo Alldieck

(5) The success of scaling text, images, video should be an argument *for* scaling, not *against* other modalities.

(6) Efficiency matters. Hoping models become as efficient as existing alternatives without exploring to improve those alternatives is blindfolding us.

6/6

2 months ago 1 0 0 0

(4) As pointed out already by Aleks Holynski on Twitter, text tokens, and pixels are equally handcrafted. If we accept those as valid, singling out 3D as "too handcrafted" is logically inconsistent.

5/6

2 months ago 1 0 1 0

(3) We humans build spatial memory through physical interaction. I don't see how models can develop true spatial understanding without building a spatial memory themselves. 3D representations seem way more helpful here than observing 2D pixel streams.

4/6

2 months ago 1 0 1 0

(2) 3D is more than its representation. While specific data structures will evolve or disappear, 3D is the fundamental concept our world is grounded in. It will always be worth studying, even if models learn it implicitly (which we currently just hope for).

3/6

2 months ago 1 0 1 0

(1) Computer vision was developed to solve "real" problems like measuring, quality control, medical imaging, or mapping. These aren't just "fake tasks" waiting for an embodied agent.

2/6

2 months ago 1 0 1 0

Great read! Here are my 2 cents: I agree with the push toward end-to-end learning, however, the conclusion that CV will simply "go away" feels too dramatic and overly simplified. Here is what I believe was overlooked: 🧡

(cross posting from Twitter)

1/6

2 months ago 1 0 1 0

Project page
*links to*
Huggingface paper page
*links to*
arXiv abstract
*links to*
PDF

🫠🫠🫠

5 months ago 0 0 0 0

We are looking for Student Researchers to work with us in ZurichπŸ‡¨πŸ‡­ next year!

If you work on depth and/or 3D reconstruction, please reach out!

Europe-based position:
www.google.com/about/career...

US-based position:
www.google.com/about/career...

5 months ago 3 0 0 0
Advertisement

Find me today at 4:30pm at the Google booth - let's chat! #CVPR2025

10 months ago 0 0 0 0

On my way to #CVPR2025 πŸ›«

Looking forward to connect!

10 months ago 4 0 0 0

If you expect a service (paper published), pay a price (review others). Isn't it that simple?

1 year ago 4 0 0 0
Post image

Excited to share that today our paper recommender platform www.scholar-inbox.com has reached 20k users! We hope to reach 100k by the end of the year.. Lots of new features are being worked on currently and rolled out soon.

1 year ago 190 26 12 8
Post image

My group is looking for motivated PhD students that want to work on the future of digital humans.
Within the ERC project 'LeMo: Learning Digital Humans in Motion' there are two open positions:

www.career.tu-darmstadt.de/HPv3.Jobs/TU...

www.career.tu-darmstadt.de/HPv3.Jobs/TU...

1 year ago 19 7 0 0

hey everyone - I am now also active here and excited about computer vision and machine learning stuff. πŸŽ‰

1 year ago 47 5 3 0
Preview
a cartoon character named charlie brown is putting a letter in a mailbox ALT: a cartoon character named charlie brown is putting a letter in a mailbox

πŸ˜•

1 year ago 1 0 0 0
Advertisement

Scroll Reverser is another one...

1 year ago 1 0 0 0

Come and work with us πŸ’ͺ

1 year ago 1 0 0 0

☝️

1 year ago 1 0 0 0