OGGSplat is a novel open-vocabulary generalizable 3D reconstruction framework
that extends the field-of-view from sparse input images by leveraging semantic priors.
OGGSplat can extrapolate the Gaussians beyond the input view by leveraging semantic priors from diffusion models.
OGGSplat is a novel generalizable 3D reconstruction framework that extends the field-of-view from sparse input images by leveraging semantic priors. It introduces an Open Gaussian Growing approach that combines RGB-semantic joint inpainting guided by bidirectional diffusion models to extrapolate unseen regions beyond the input views. The inpainted images are then lifted back into 3D space for progressive optimization of Gaussian parameters, enabling efficient and high-quality semantic-aware scene reconstruction.
OGGSplat effectively extrapolates to unseen regions while maintaining both high visual fidelity and semantic plausibility.
OGGSplat demonstrates strong semantic-aware scene reconstruction capabilities, even when handling out-of-distribution samples or just two-view images captured directly from a smartphone camera.
@article{wang2025oggsplat, title={OGGSplat: Open Gaussian Growing for Generalizable Reconstruction with Expanded Field-of-View}, author={Wang, Yanbo and Wang, Ziyi and Zheng, Wenzhao and Zhou, Jie and Lu, Jiwen}, journal={arXiv preprint arXiv:2506.05204}, year={2025} }