
Share
MeshFormer revolutionizes 3D mesh generation with its efficient, single-pass process using sparse views and explicit 3D bias, outpacing rivals by requiring fewer resources and less time for training.
At NeurIPS 2024, a team of researchers from UC San Diego, Hillbot Inc., Zhejiang University, and UCLA introduced MeshFormer, a novel approach to high-quality 3D mesh generation. MeshFormer stands out for its ability to produce detailed, textured meshes with fine-grained geometric details in a single feed-forward pass, all while being trained efficiently on just 8 H100 GPUs over two days. This is a significant improvement over concurrent methods that often require more than one hundred GPUs and complex multi-stage training processes.
MeshFormer takes a sparse set of multi-view RGB images and normal maps as input. These inputs can be estimated using existing 2D diffusion models, which significantly aids in guiding the geometry's learning process. The model uses a 3D feature volume representation, where features are stored in 3D sparse voxels.

MeshFormer's efficiency is demonstrated by its ability to be trained on just 8 H100 GPUs in two days, a stark contrast to other methods that require significantly more resources. The model's performance is validated through benchmarks on datasets such as GSO (Google Scanned Objects) and OmniObject3D, where it consistently produces high-quality meshes with fine-grained details.
MeshFormer represents a significant advancement in 3D mesh generation by efficiently leveraging 3D native structures and input guidance. Its ability to produce high-quality, textured meshes with fine-grained details in a single pass, while being trained on limited resources, makes it a promising tool for various applications in computer vision and graphics.
Tags
Original Sources
↗ https://meshformer3d.github.io/?utm_source=tldrai
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
21 August 2024
88 articles
Related Articles
Related Articles
More Stories