Vicente Ordonez (@vicenteor) Bsky

⏰The ICCV 2025 discussion phase will close soon! ⏰

As a reviewer, it's your job to:
-📖 Read author rebuttals
-🗣️ Engage in discussions
-✅ Submit your final rating & justification

Your responsible engagement is critical for a fair review process!

Deadline: May 27!

10 months ago 5 2 0 0

My solidarity to Harvard colleagues and my respect for maintaining dignity during troubled times. Their leadership makes their standing in the global stage well deserved.

10 months ago 20 2 0 0

An owl which is the mascot of Rice is formed on the foam of a cappuccino

Cappuccino served at the 50th year celebration of the School of Engineering and Computing at Rice.

1 year ago 19 0 0 0

Group picture with my PhD students #studio-ghibli

1 year ago 5 0 0 0

Entrepreneurs 4 NSF - Formstack

Adding here an example of what I was going for: entrepreneurs4nsf.formstack.com/forms/petiti...

1 year ago 4 1 0 1

Entrepreneurs 4 NSF - Formstack

Adding here an example of what I was going for: entrepreneurs4nsf.formstack.com/forms/petiti...

1 year ago 4 1 0 1

I wish more authors of papers I have missed citing would email me. At the same time I have only done this sparely and mostly with people I already know first hand or had some kind of interaction in the past and in no case I believe this was deliberate. It is just hard to keep up sometimes.

1 year ago 7 0 0 0

Thanks CVPR for bringing the conversation here

1 year ago 5 0 0 0

We just have to believe

1 year ago 3 0 1 0

I remember when Twitter was this small and familiar. I had/have some followers on Twitter that now are low key celebrities but probably followed me long way back before they became low key celebrities.

1 year ago 7 0 1 0

By popular demand, we are extending #CVPR2025 coverage to Bluesky. Stay tuned!

1 year ago 124 17 5 2

But if it does happen in the open, my hope is that it’s a concerted effort that includes academics as much as a larger coalition of people who agrees this issue is important.

1 year ago 2 0 1 0

There should be pushback publicly but it should not be only self serving. If people only express concerns when it directly affects them that’s a sad state of things.

1 year ago 0 0 0 0

Indeed I don’t feel that way. I think there should be pushback but I also think the general public should be as concerned. Complaining and pushing back will happen but maybe not in the open.

1 year ago 1 0 1 0

That said there's no reason to not continue using our institutional channels to continue championing science and education.

1 year ago 2 0 1 0

One thing we can do going forward is to work so that the general community gets convinced why funding science and having strong research institutions is a good thing. This time we might have to learn from our mistakes.

1 year ago 2 1 1 0

There are so many more things to be outraged about at the moment than complaining about federal funding for science. Especially when coming from academics, it is a bit too self-serving at this moment. Yes it is bad and the effects will be long lasting but so will be a dozen other things.

1 year ago 7 0 1 0

Great times for innovation ahead!

1 year ago 3 0 0 0

These are models that can perfectly be run on most cheap hardware unlike the full large R1. But I wouldn’t be surprised we will see R1 quality running on more accessible hardware.

1 year ago 4 0 1 0

Everyone concentrates on o1 and R1 but even the base 7B or 1.5B models seem better than the very first public version of ChatGPT (3.5-turbo) that took the world by surprise.

1 year ago 3 0 1 0

I think it’s exciting to see LLMs that are open source and on par with the top models accesible only through APIs. DeepSeek and before that Llama-3.

1 year ago 4 0 1 0

Can pretrained diffusion models be connected for cross-modal generation?

📢 Introducing AV-Link ♾️

Bridging unimodal diffusion models in one self-contained framework to enable:
📽️ ➡️ 🔊 Video-to-Audio generation.
🔊 ➡️ 📽️ Audio-to-Video generation.

🌐: snap-research.github.io/AVLink/

⤵️ Results

1 year ago 7 3 1 1

AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation We propose AV-Link, a unified framework for Video-to-Audio and Audio-to-Video generation that leverages the activations of frozen video and audio diffusion models for temporally-aligned cross-modal co...

Link to arxiv preprint: arxiv.org/abs/2412.15191

1 year ago 2 0 0 0

Check this recent work by my PhD student Moayed. He has been doing amazing work on Generative AI for images, video and audio. We introduce AV-Link ♾️, an unified approach for audio-video generation. Our generated audio is the best in terms of synchronization with video actions. Check thread below.

1 year ago 7 1 1 0

I still don’t feel quite more productive in the era of LLMs. There are very few things I can do better but far from what I hear from anecdotes. I wonder what would be the one low hanging fruit I should be delegating to LLMs.

1 year ago 2 1 2 0

Happy New Year Everyone!

1 year ago 8 0 0 0

I remember watching this video circa 2009-2010.

1 year ago 4 0 0 0

End of year celebration with the great Moshe Vardi @myvardi.bsky.social is one of the perks of Rice. Hopefully he will be active on Bluesky soon.

1 year ago 9 1 0 0

news.rice.edu/news/2024/ri...

1 year ago 1 0 0 0

Job alert! 🚨 (a bit special but anyway)

If you are (or know someone who is) a Phd about to graduate or just graduated, and have to skip some in between time, I currently have a PostDoc position that could run for 6 months.

Needed: Experience with publishing at Top A conferences.
Just ping me. 👍

1 year ago 23 7 0 0

Posts by Vicente Ordonez