(@maximo2100) Bsky - nopzon.com

github.com/cgtuebingen/...
3D-RE-GEN
3D Reconstruction of Indoor Scenes with a Generative Framework

3 months ago 0 0 0 0

name-that-part.github.io
Name That Part:
3D Part Segmentation and Naming

3 months ago 0 0 0 0

github.com/thu-ml/Turbo...
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

3 months ago 0 0 0 0

huggingface.co/papers/2512....
EgoX: Egocentric Video Generation from a Single Exocentric Video

4 months ago 0 0 0 0

github.com/zai-org/SCAIL
Character Animation/motion transfer

4 months ago 0 0 0 0

github.com/ssj9596/One-...
One-to-All Animation: Alignment-Free
Character Animation and Image Pose Transfer

4 months ago 0 0 0 0

github.com/ali-vilab/Wa...
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance.
Indicate a trajectory.

4 months ago 0 0 0 0

huggingface.co/papers/2512....
EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture

4 months ago 0 0 0 0

videocof.github.io
Unified Video Editing with Temporal Reasoner

4 months ago 1 0 0 0

huggingface.co/papers/2511....
MotionV2V: Editing Motion in a Video

4 months ago 2 0 0 0

huggingface.co/papers/2511....
One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer. Arxiv: 2511.22940

4 months ago 1 0 0 0

huggingface.co/dx8152/Qwen-...
Qwen-Edit-2509-Light-Migration Lora

4 months ago 0 0 0 0

huggingface.co/aquif-ai/aqu...
Text to image based on Wan 2.2.

4 months ago 0 0 0 0

github.com/Tencent-Huny...
HunyuanVideo-1.5: A lightweight video generation model

4 months ago 1 0 0 0

kszpxxzmc.github.io/ViSAudio-pro...
ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation

4 months ago 0 0 0 0

huggingface.co/meituan-long...
LongCat-Image-Edit (Qwen based)

4 months ago 0 0 0 0

lightx-ai.github.io
Light-X is a video generation framework that jointly controls camera trajectory and illumination from monocular videos.

4 months ago 0 0 0 0

github.com/stuttlepress...
This workflow uses Wan VACE (Wan 2.2 Fun VACE or Wan 2.1 VACE, your choice!) to smooth out awkward motion transitions between video clips.

4 months ago 0 0 0 0

firstframego.github.io
First Frame Go

4 months ago 0 0 0 0

huggingface.co/papers/2512....
RELIC: Interactive Video World Model with Long-Horizon Memory

4 months ago 1 0 0 0

magicquill.art/v2/
MagicQuill V2: Precise and Interactive Image Editing with Layered Visual Cues.

4 months ago 0 0 0 0

arxiv.org/abs/2512.03041
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework

4 months ago 0 0 0 0

tuna-ai.org
TUNA leverages unified visual representations to enable image/video understanding, image/video generation, and image editing within a single framework. By Meta.

4 months ago 1 0 0 0

kr1sjfu.github.io/iMontage-web/
iMontage: Image editing, In-context Generation, storyboards.

4 months ago 0 0 0 0

Currently, Qwen Image Edit is your open source alternative to google's nano banana.

4 months ago 0 0 0 0

www.microsoft.com/en-us/resear...
Fara-7B: An Agentic Model for Computer Use. Open-source by Microsoft. A computer Use Agent (CUA) model that leverages computer interfaces, such as a mouse and keyboard, to complete tasks.

4 months ago 0 0 0 0

arxiv.org/abs/2511.19320
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation.

4 months ago 0 0 0 0

huggingface.co/Tongyi-MAI/Z...
Z-Image is a powerful and highly efficient image generation model with 6B parameters. Currently there are three variants: Turbo, Base, Edit.

4 months ago 0 0 0 0

huggingface.co/ByteDance/Sa...
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

4 months ago 0 0 0 0

github.com/Tencent-Huny...
Video generation model

4 months ago 0 0 0 0

Posts by