Posts by Prem Seetharaman

Fun model to work on! More fun stuff to come!

1 year ago 2 0 0 0

good story from someone completely unrelated to me i swear

1 year ago 4 0 0 0

👀

1 year ago 3 0 0 0

neat - i think all these spaces are basically a linear layer / permutation away from each other. with one codebook (or a vae setup) you could maybe just solve it with the embedding matrices directly, no audio needed

1 year ago 1 0 0 0

Great work from @hugofloresgarcia.bsky.social’s internship at Adobe - turn your voice into basically anything!

1 year ago 4 1 0 0

Introducing MultiFoley, a video-aware audio generation method with multimodal controls! 🎉

⌨️Make a typewriter sound like a piano 🎹
🐱Make a cat meow like a lion roars! 🦁
⏱️Perfectly time existing SFX 💥 to a video

Link to research in comments:
by Adobe Research

1 year ago 40 5 2 0

Check out our new work on video-guided audio gen with a focus on fine-grained creative control! Done by @czyang.bsky.social during an internship with our group at Adobe Research. Super fun model!

1 year ago 10 2 0 0

A nifty application of depth estimation, creating a mockup of a digital design on real-world objects: sniklaus.com/mockup

1 year ago 9 2 0 0

Here's one that seems to catch a bit more "thread-like" content, sorts by recency instead of likes, and drops arxiv bots: bsky.app/profile/psee.... Seems to work ok for now, and catches some non-ML threads too

1 year ago 1 0 0 0

Made a feed that tries to index paper threads only: bsky.app/profile/psee.... To get into the feed, make a post with "arxiv.org" in the post somewhere + don't be a bot. My tiny contribution to the recent migration! Built w/ @skyfeed.app. Planning on some paper threads of my own soon...

1 year ago 7 2 0 1

For those of you who haven't yet, give scholar-inbox.com a try! It's a free personal paper recommender which helps you stay up-to-date by sending daily/weekly paper digests directly to your inbox. Your votes train your own classifier, and you can have a peek at its feature words. Here are mine!

1 year ago 17 6 2 2

I initiated a starter pack for Audio ML. Let me know if you'd like to be added/removed.
go.bsky.app/LGmct4z

1 year ago 68 22 46 1