lukasHoel/text2room
Text2Room generates textured 3D meshes from text prompts using 2D text-to-image diffusion models.

Velocity (7d): +0.9 ★/day · Trend: steady
This project creates navigable, textured 3D room meshes from natural language prompts. It iteratively projects text-to-image outputs into a 3D scene using monocular depth estimation and blends them with the existing geometry. Stable Diffusion inpaints the image content, depth maps establish the geometric structure, and the resulting multi-view renderings are fused into a single cohesive textured mesh.
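To make the depth-based projection step concrete, here is a minimal sketch of lifting a depth map into 3D points with a pinhole camera model. It is an illustration only, not the repository's code: the function name `backproject_depth`, the intrinsics `fx`, `fy`, `cx`, `cy`, and the toy depth values are all assumptions made for this example.

```python
from typing import List, Tuple

Point3D = Tuple[float, float, float]


def backproject_depth(
    depth: List[List[float]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> List[Point3D]:
    """Lift each pixel with a valid depth into camera-space 3D coordinates.

    Hypothetical helper: assumes a pinhole camera with focal lengths (fx, fy)
    and principal point (cx, cy), with depth measured along the camera z axis.
    """
    points: List[Point3D] = []
    for v, row in enumerate(depth):          # v indexes image rows
        for u, z in enumerate(row):          # u indexes image columns
            if z <= 0:
                # Non-positive depth is treated as "no measurement" and skipped.
                continue
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            points.append((x, y, z))
    return points


# Toy 2x2 depth map (assumed values); the 0.0 entry marks a missing depth.
depth = [
    [2.0, 2.0],
    [0.0, 4.0],
]
points = backproject_depth(depth, fx=1.0, fy=1.0, cx=0.5, cy=0.5)
print(points)
```

In a full pipeline of this kind, points like these would be connected into triangles and merged with the existing scene mesh; that step is omitted here because the description above does not specify how the fusion is performed.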