Advertisement · 728 × 90

Posts by Matheus Gadelha

Post image

This work was led by our amazing intern Oindrila Saha (UMass Amherst -- now at Adobe!), with Vojtech Krs, Radomir Mech, Kevin Blackburn-Matzen, and Subhransu Maji (UMass Amherst).

#ICLR2026 #ImageGeneration #ComputerVision

4 weeks ago 1 0 0 0
Post image

To train and evaluate at this scale, we introduce SIGMA-Set27K: a synthetic dataset with 100k+ unique subjects across 27k images, providing identity, structure, and spatial annotations. Training data at this level of annotation density did not previously exist for this task.

4 weeks ago 1 0 1 0
Post image

The framework also supports single- and multi-subject insertion in one pass, subject reposing, free-form masks not seen during training, and mixing different granularity levels in a single generation.

4 weeks ago 0 0 1 0
Post image

Subject identity is preserved while style varies freely via text. The same objects can be re-lit, re-stylized, or reposed without losing their appearance.

4 weeks ago 0 0 1 0
Post image

SIGMA-Gen also accepts spatial guidance at varying levels of precision — coarse 2D bounding boxes, 3D boxes, or pixel-level segmentation masks and depth maps — with a single model. You provide as much structure as you have; the model fills the rest.

4 weeks ago 0 0 1 0
Preview
SIGMA-GEN: Structure and Identity Guided Multi-subject Assembly for Image Generation We present SIGMA-GEN, a unified framework for multi-identity preserving image generation. Unlike prior approaches, SIGMA-GEN is the first to enable single-pass multi-subject identity-preserved generat...

The key challenge: placing multiple specific subjects in one scene while preserving each identity. Prior methods handle subjects independently or lose fidelity at compositing time. SIGMA-Gen does this in a single forward pass.

Project page: oindrilasaha.github.io/SIGMA-Gen/

4 weeks ago 0 0 1 0
Post image

Text-to-image models are remarkable at generation, but they end up deciding on their own what goes where. If you need specific subjects in a specific arrangement, you are left writing prompts and sampling until something usable appears. SIGMA-Gen, our new ICLR 2026 paper, tries to change that.

4 weeks ago 4 0 1 0

I am very likely guilty of this myself, but I agree and really appreciate your take!

1 month ago 0 1 0 0

PSA

(this sounds like something pretty important that I should know but could have lived the rest of my life in shame without ever learning it)

1 month ago 1 0 0 0

🚀 Excited to share REPPO, a new on-policy RL agent!

TL;DR: Replace PPO with REPPO for fewer hyperparameter headaches and more robust training.

REPPO, led by @cvoelcker.bsky.social, will be presented at ICLR 2026. How does it work? 🧵👇

2 months ago 25 10 1 0
Advertisement

Great resource!

This is specially important for people starting their research careers.

Getting to know different research communities is really important! Technical insight is crucial, but science is ultimately a social endeavor -- understanding and communicating to your peers is key.

3 months ago 3 1 1 0
Preview
Technology and Social Isolation: From Cars to “AI” In this blog post, I tell a story of how some technologies from the past century or so have, overall, led to increased social isolation in the United States, as we replaced in-person social interactio...

As we in the US gather with our families this weekend, it's a good time to consider the ways that technology brings us together—but also keeps us apart. In this blog post, I tell a story about how technology may have tended to increase social isolation.
aaronhertzmann.com/2025/10/26/i...

4 months ago 7 4 0 0
Post image

Breaking: we release a fully synthetic generalist dataset for pretraining, SYNTH and two new SOTA reasoning models exclusively trained on it. Despite having seen only 200 billion tokens, Baguettotron is currently best-in-class in its size range. pleias.fr/blog/blogsyn...

5 months ago 187 34 3 17

Session this afternoon (in 30 minutes)!!

Poster 153 — see you there!

5 months ago 2 0 0 0
Post image

I wrote a notebook for a lecture/exercice on image generation with flow matching. The idea is to use FM to render images composed of simple shapes using their attributes (type, size, color, etc). Not super useful but fun and easy to train!
colab.research.google.com/drive/16GJyb...

Comments welcome!

9 months ago 41 8 2 0

Oh nvm I read “our ICCV paper…” haha

10 months ago 1 0 0 0

Are the results out? I see nothing in OpenReview :-(

10 months ago 1 0 2 0
CVPR 2025 Workshop List

For folks attending CVPR: is there a website where I can see the list of workshops, their location AND time? Day and time are empty when I access cvpr.thecvf.com/Conferences/...

10 months ago 1 0 0 0
Advertisement

I will be in Nashville until Saturday for CVPR'25 \o/

DM if you want to meet!

10 months ago 7 0 0 0
Wilhem receiving the award on stage

Wilhem receiving the award on stage

🏅Honored to have been awarded at #Eurographics25 for our paper on #LipschitzPruning to speed-up SDF rendering!

👉 The paper's page: wbrbr.org/publications...

Congrats to @wbrbr.bsky.social, M. Sanchez, @axelparis.bsky.social, T. Lambert, @tamyboubekeur.bsky.social, M. Paulin and T. Thonat!

11 months ago 25 4 0 0
Preview
GitHub - JelteF/PyLaTeX: A Python library for creating LaTeX files A Python library for creating LaTeX files. Contribute to JelteF/PyLaTeX development by creating an account on GitHub.

I usually write a python script that prints some .npy file as tex a .tex table. It is also useful as an easy way to share results throughout the project, so I consider this as part of the codebase. I heard that people that are more serious about such practice use smth like github.com/JelteF/PyLaTeX

11 months ago 2 0 0 0

Comecei a fazer por causa disso e por causa do vim, mas até no overleaf é vantagem

11 months ago 0 0 0 0

Eu n uso pra coisas que faço sozinho (anotações, apresentações, etc). Mas pra trabalho colaborativo é meio q obrigatório. Estudantes vão entrar em revolta se vc usar git hahaha

11 months ago 0 0 1 0

NeurIPS and SIGGRAPH Asia deadline are coming.

Make your life easier: read this thread.

11 months ago 7 2 1 0

Let's gooo!!! \o/

Probably my first time visiting Brazil for professional reasons :-)

11 months ago 4 0 0 0

What features did you find particularly useful?

I liked asking questions about the code base and the tab completion seems nice, but I've been getting unhelpful suggestions for all the "agentic" stuff.

1 year ago 1 0 1 0

By popular demand, we are extending #CVPR2025 coverage to Bluesky. Stay tuned!

1 year ago 124 17 5 2
Video

Exciting news! MegaSAM code is out🔥 & the updated Shape of Motion results with MegaSAM are really impressive! A year ago I didn't think we could make any progress on these videos: shape-of-motion.github.io/results.html
Huge congrats to everyone involved and the community 🎉

1 year ago 74 17 3 0
Advertisement

*it

1 year ago 0 0 0 0

I understand the sentiment, but it is important for people to know that is currently does not reflect reviewer guidelines at CVPR: cvpr.thecvf.com/Conferences/...

“(…) you should include specific feedback on ways the authors can improve their papers.”

1 year ago 5 0 2 0