
Share
Researchers have unlocked a new method to generate detailed 3D objects using UV maps and image diffusion models, transforming complex shapes into manageable images for easier rendering.
In a significant advancement for 3D generative models, researchers from Simon Fraser University and City University of Hong Kong have introduced Object Images (Omages), a novel approach to generating realistic 3D objects with detailed UV maps. This method, which won the Best Paper Award at 3DV 2025, effectively converts complex 3D shapes into manageable 64x64 pixel images, making it possible to use image generation models like Diffusion Transformers for 3D shape creation.
The core idea behind Omages is to encapsulate the surface geometry, appearance, and patch structures of a 3D object within a 2D image format. This addresses the challenges of geometric and semantic irregularity in polygonal meshes, which are common in high-quality human-made 3D assets but difficult to capture with traditional 3D generative models.
Recent advancements in 3D generative models have been impressive, but they often treat 3D shapes as static "statues," lacking the rich geometric and semantic details found in human-made 3D assets. For example, a high-quality 3D model of a headphone with intricate parts (like the one shown below) is difficult to capture using current methods. Similarly, a pack of books standing closely together (another example provided) presents significant challenges for single-view reconstruction techniques.

Preprocessing:
Generation:
The researchers evaluated their method on the ABO dataset, comparing it to recent 3D generative models. The results showed that Omages achieve point cloud FID (Fréchet Inception Distance) scores comparable to state-of-the-art methods, while naturally supporting PBR (Physically Based Rendering) material generation.
Object Images (Omages) represent a significant step forward in 3D generative modeling. By converting complex 3D shapes into manageable 2D images, this approach addresses the challenges of geometric and semantic irregularity, enabling the use of powerful image generation models for 3D shape creation. The results on the ABO dataset demonstrate the effectiveness of Omages, making them a promising tool for practitioners in fields such as 3D modeling, animation, and interactive applications.
Tags
Original Sources
↗ https://omages.github.io/?utm_source=tldrai
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
8 August 2024
133 articles
Related Articles
Related Articles
More Stories