
Share
This innovative model from ICLR 2025 uses sparse unposed images to create real-time 3D reconstructions with Gaussian splats, offering a practical solution for various applications without complex training requirements.
At ICLR 2025 (Oral, top 1.8%), a team of researchers from ETH Zurich, NVIDIA, Microsoft, and UC Merced introduced NoPoSplat, a groundbreaking feed-forward model for 3D scene reconstruction using 3D Gaussians from unposed sparse multi-view images. This method, trained solely with photometric loss, achieves real-time 3D Gaussian reconstruction during inference, making it highly practical for real-world applications.
NoPoSplat's architecture is designed to handle sparse unposed images by directly reconstructing 3D Gaussians in a canonical space. Here’s a breakdown of the key components:
For simplicity, let’s consider a two-view setup:

NoPoSplat was evaluated on the RealEstate10k dataset, a challenging benchmark for novel view synthesis. The results demonstrate its superior performance compared to previous pose-free methods:
NoPoSplat’s two-stage coarse-to-fine pipeline for pose estimation is another significant contribution. Trained without ground truth depth or explicit matching loss, the method significantly outperforms state-of-the-art methods:
NoPoSplat represents a significant advance in pose-free generalizable 3D reconstruction. By simplifying the reconstruction process and addressing scale ambiguity, it achieves superior performance in novel view synthesis and pose estimation tasks. This work has broad implications for real-world applications, from augmented reality to robotics.
Tags
Original Sources
↗ https://noposplat.github.io/?utm_source=tldrai
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
4 November 2024
88 articles
Related Articles
Related Articles
More Stories