Advertisement · 728 × 90

Posts by Hugo Malard

Post image

An Eye for an Ear: Zero-shot Audio Description Leveraging an Image Captioner using Audiovisual Distribution Alignment

1 year ago 6 1 0 0
Post image

If you want to learn more about audio-visual alignment and how to use it to give audio abilities to your VLM, stop by our @NeurIPSConf poster #3602 (East exhibit hall A-C) today at 11am!

1 year ago 2 0 0 1

Hi! Could you please add me?

1 year ago 2 0 1 0

Amazing! Would love to be added!

1 year ago 1 0 0 0