github.com/cgtuebingen/...
3D-RE-GEN
3D Reconstruction of Indoor Scenes with a Generative Framework
name-that-part.github.io
Name That Part: 3D Part Segmentation and Naming
github.com/thu-ml/Turbo...
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
huggingface.co/papers/2512....
EgoX: Egocentric Video Generation from a Single Exocentric Video
github.com/zai-org/SCAIL
SCAIL: character animation and motion transfer
github.com/ssj9596/One-...
One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer
github.com/ali-vilab/Wa...
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance.
Indicate a trajectory.
huggingface.co/papers/2512....
EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture
videocof.github.io
Unified Video Editing with Temporal Reasoner
huggingface.co/papers/2511....
MotionV2V: Editing Motion in a Video
huggingface.co/papers/2511....
One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer. arXiv: 2511.22940
huggingface.co/dx8152/Qwen-...
Qwen-Edit-2509-Light-Migration LoRA
huggingface.co/aquif-ai/aqu...
Text-to-image model based on Wan 2.2.
github.com/Tencent-Huny...
HunyuanVideo-1.5: A lightweight video generation model
kszpxxzmc.github.io/ViSAudio-pro...
ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation
huggingface.co/meituan-long...
LongCat-Image-Edit (Qwen based)
lightx-ai.github.io
Light-X is a video generation framework that jointly controls camera trajectory and illumination from monocular videos.
github.com/stuttlepress...
This workflow uses Wan VACE (Wan 2.2 Fun VACE or Wan 2.1 VACE, your choice!) to smooth out awkward motion transitions between video clips.
firstframego.github.io
First Frame Go
huggingface.co/papers/2512....
RELIC: Interactive Video World Model with Long-Horizon Memory
magicquill.art/v2/
MagicQuill V2: Precise and Interactive Image Editing with Layered Visual Cues.
arxiv.org/abs/2512.03041
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework
tuna-ai.org
TUNA leverages unified visual representations to enable image/video understanding, image/video generation, and image editing within a single framework. By Meta.
kr1sjfu.github.io/iMontage-web/
iMontage: Image editing, In-context Generation, storyboards.
Currently, Qwen Image Edit is your open-source alternative to Google's Nano Banana.
www.microsoft.com/en-us/resear...
Fara-7B: An Agentic Model for Computer Use. Open-sourced by Microsoft. A Computer Use Agent (CUA) model that uses computer interfaces, such as a mouse and keyboard, to complete tasks.
arxiv.org/abs/2511.19320
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation.
huggingface.co/Tongyi-MAI/Z...
Z-Image is a powerful and highly efficient image generation model with 6B parameters. Currently there are three variants: Turbo, Base, Edit.
huggingface.co/ByteDance/Sa...
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
github.com/Tencent-Huny...
Video generation model