Fun model to work on! More fun stuff to come!
Posts by Prem Seetharaman
good story from someone completely unrelated to me i swear
👀
neat - i think all these spaces are basically a linear layer / permutation away from each other. with one codebook (or a vae setup) you could maybe just solve it with the embedding matrices directly, no audio needed
Great work from @hugofloresgarcia.bsky.social’s internship at Adobe - turn your voice into basically anything!
Introducing MultiFoley, a video-aware audio generation method with multimodal controls! 🎉
⌨️Make a typewriter sound like a piano 🎹
🐱Make a cat meow like a lion roars! 🦁
⏱️Perfectly time existing SFX 💥 to a video
Link to research in comments:
by Adobe Research
Check out our new work on video-guided audio gen with a focus on fine-grained creative control! Done by @czyang.bsky.social during an internship with our group at Adobe Research. Super fun model!
A nifty application of depth estimation, creating a mockup of a digital design on real-world objects: sniklaus.com/mockup
Here's one that seems to catch a bit more "thread-like" content, sorts by recency instead of likes, and drops arxiv bots: bsky.app/profile/psee.... Seems to work ok for now, and catches some non-ML threads too
Made a feed that tries to index paper threads only: bsky.app/profile/psee.... To get into the feed, make a post with "arxiv.org" in the post somewhere + don't be a bot. My tiny contribution to the recent migration! Built w/ @skyfeed.app. Planning on some paper threads of my own soon...
For those of you who haven't yet, give scholar-inbox.com a try! It's a free personal paper recommender which helps you stay up-to-date by sending daily/weekly paper digests directly to your inbox. Your votes train your own classifier, and you can have a peek at its feature words. Here are mine!
Made a feed that tries to index paper threads only: bsky.app/profile/psee.... To get into the feed, make a post with "arxiv.org" in the post somewhere + don't be a bot. My tiny contribution to the recent migration! Built w/ @skyfeed.app. Planning on some paper threads of my own soon...
I initiated a starter pack for Audio ML. Let me know if you'd like to be added/removed.
go.bsky.app/LGmct4z