
Posts by Santiago Castro

https://tinyurl.com/BristolCVLectureship

Pls RT
Permanent Assistant Professor (Lecturer) position in Computer Vision @bristoluni.bsky.social [DL 6 Jan 2025]
This is a research+teaching permanent post within MaVi group uob-mavi.github.io in Computer Science. Suitable for strong postdocs or exceptional PhD graduates.
t.co/k7sRRyfx9o
1/2

1 year ago 22 14 1 1
A screenshot showing the usage quota of my repositories storage, which is 5x more than full.

HuggingFace is limiting repositories' storage 😱

1 year ago 4 0 0 1

Can all graduate programs please accept a universal letter system like Interfolio so we don’t have to upload 100 letters individually?! The time waste is insane.

Students are telling me that only *two* of their applications accept Interfolio!

1 year ago 4 2 0 0

A librarian who previously worked at the British Library created a relatively small dataset of bsky posts, hundreds of times smaller than previous researchers' datasets, to help folks create toxicity filters and stuff.

So people bullied him & posted death threats.

He took it down.

Nice one, folks.

1 year ago 583 59 28 11

Personally, reviewing for NeurIPS a couple years back changed me as a reviewer. For one paper I rejected, I kept citing it throughout the year to people for a finding it had. This made me realise it was a good paper, it just had some easy targets for rejection.

1 year ago 67 8 2 1

Do you know what rating you’ll give after reading the intro? Are your confidence scores 4 or higher? Do you not respond in rebuttal phases? Are you worried how it will look if your rating is the only 8 among 3’s? This thread is for you.

1 year ago 77 20 4 3
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models
We present CAT4D, a method for creating 4D (dynamic 3D) scenes from monocular video. CAT4D leverages a multi-view video diffusion model trained on a diverse combination of datasets to enable novel vie...

We just dropped CAT4D, text to dynamic 3D models that you can render in real time. Not posting a video because Bluesky is garbage in this respect; go straight to the real time viewer on a desktop browser and look around. The cat kneading dough is my favorite.
cat-4d.github.io

1 year ago 114 11 3 1

In the HuggingFace/Bluesky incident, the problem goes deeper than whether the data is "public" or "private"

What matters to people is whether their data was collected, which data was collected, how it may be used, and who it may be used by

1 year ago 118 27 3 8

ACL syntax track reviewers >> almost any other conference.

These folks care about their sub-field and I learn something new every time!

1 year ago 12 2 1 1

On July 26th, Nancy Pelosi sells 5000 shares of $MSFT Microsoft.

On Nov 27, 2024, the FTC announces it is launching a wide-ranging US antitrust probe against $MSFT.

This was Nancy Pelosi's largest sale from her portfolio in two years, with $MSFT now trading below her sale price.

1 year ago 194 26 20 3

We are looking for the current best multi-view full-body 3d pose estimation model/software with Remi Cadene

Any good advice?

Should preferably include hand pose estimation in addition to body

Better if able to use multiple cameras as inputs (multi-view)

for open-source low cost robot teleop

1 year ago 20 3 4 0
I'm standing next to the poster holding a dice tray with a blue dice inside

Today I presented my MSc. work "Exploring approaches to Improvisational Interactive Storytelling" in the student seminar.

I narrated a basic setting and used a dice to explain the gamemastering mechanisms to the committee ☺️

OK, now I have to write my thesis! 😅

1 year ago 5 2 0 0

Check out our new work on video-guided audio gen with a focus on fine-grained creative control! Done by @czyang.bsky.social during an internship with our group at Adobe Research. Super fun model!

1 year ago 10 2 0 0

🎥 Introducing MultiFoley, a video-aware audio generation method with multimodal controls! 🔊
We can
⌨️Make a typewriter sound like a piano 🎹
🐱Make a cat meow like a lion roars! 🦁
⏱️Perfectly time existing SFX 💥 to a video.

arXiv: arxiv.org/abs/2411.17698
website: ificl.github.io/MultiFoley/

1 year ago 42 12 2 6

Rare personal tweet:
Subletting our furnished apartment in Brooklyn for the spring at a significant discount. It's quite nice, in a fun location, and under price. Email me if you are interested and I will send pictures.

1 year ago 25 7 0 0

The FATE group at @msftresearch.bsky.social NYC is accepting applications for 2025 interns. 🥳🎉

For full consideration, apply by 12/18.

jobs.careers.microsoft.com/global/en/jo...

Interested in AI evaluation? Apply for the STAC internship too!

jobs.careers.microsoft.com/global/en/jo...

1 year ago 73 35 4 1

If you want to help improve peer review, we are looking for a new Co-CTO for ACL Rolling Review!

Requirements:
- Post-PhD
- Experienced with Python (including command line use)
- Time commitment of 3 hours a week on average (but note that you are not expected to review while serving)

Contact me!

1 year ago 7 5 0 0

🌍✨Announcing the 4th edition of the NLP for Positive Impact workshop at #ACL2025 in Vienna!
Come join us and explore various social applications of NLP!
📢 Call for papers & more details coming soon!
🔗sites.google.com/view/nlp4positiveimpact/...

1 year ago 25 7 1 1
WhatsApp will soon transcribe your voice messages
Finally, an easy way to skim through lengthy voice clips.

WhatsApp will soon transcribe your voice messages

1 year ago 87 12 10 10

If you're interested in embeddings and SQLite you should be paying attention to sqlite-vec

Lots of neat stuff in this release - and the blog post provides a very clear explanation of what it can do

1 year ago 92 9 2 1