← all repositories

JIA-Lab-research/DreamOmni2

A multimodal deep learning model that edits and generates images based on natural language instructions.

2k stars Python Image · Video · Audio
DreamOmni2
Velocity · 7d
+8.0
★ / day
Trend
steady
star history

DreamOmni2 is a unified image generation and editing model that accepts multimodal instructions combining text and images. It enables users to modify existing images or create new ones through natural language guidance. The model was published at CVPR 2026 as a Highlight paper and is available with pretrained weights on HuggingFace along with interactive demo spaces.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.