TY - GEN
T1 - PromptGS
T2 - 2025 IEEE International Workshop on Multimedia Signal Processing, MMSP 2025
AU - Wang, Xun
AU - Xue, Xutao
AU - Kang, Xubing
AU - Li, Siyuan
AU - Utsho, Shayer Shabab
AU - Li, Kun
AU - Ji, Mengqi
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Reconstructing tiny floating objects in large-scale 3D scene remains a fundamental challenge for 3D Gaussian Splatting (3DGS). These objects often receive insufficient point density and gradient supervision during training due to limited visibility and low image-space saliency, making them difficult to recover even after prolonged optimization. We present PromptGS, a visual prompting framework that incorporates lightweight human input to guide the 3DGS optimization process. PromptGS fuses projected 2D error maps with user-specified spatial prompts to form a 3D attention field, which acts as an optimization prior to guide Gaussian densification, adaptive resampling, and multiview selection. This mechanism directs training efforts toward regions with high semantic relevance but low point density, improving reconstruction in areas that are frequently overlooked. Furthermore, we design a Gaussian scoring function that ranks candidates based on their improvement potential, ensuring efficient resource allocation. Moreover, PromptGS achieves multiview consistent rendering of small objects, indicating that their geometry and appearance are faithfully reconstructed in 3D space rather than approximated through view-dependent texture projection. Experiments on public benchmarks and challenging synthetic scenes demonstrate that PromptGS consistently outperforms existing methods in both visual fidelity and efficiency.
AB - Reconstructing tiny floating objects in large-scale 3D scene remains a fundamental challenge for 3D Gaussian Splatting (3DGS). These objects often receive insufficient point density and gradient supervision during training due to limited visibility and low image-space saliency, making them difficult to recover even after prolonged optimization. We present PromptGS, a visual prompting framework that incorporates lightweight human input to guide the 3DGS optimization process. PromptGS fuses projected 2D error maps with user-specified spatial prompts to form a 3D attention field, which acts as an optimization prior to guide Gaussian densification, adaptive resampling, and multiview selection. This mechanism directs training efforts toward regions with high semantic relevance but low point density, improving reconstruction in areas that are frequently overlooked. Furthermore, we design a Gaussian scoring function that ranks candidates based on their improvement potential, ensuring efficient resource allocation. Moreover, PromptGS achieves multiview consistent rendering of small objects, indicating that their geometry and appearance are faithfully reconstructed in 3D space rather than approximated through view-dependent texture projection. Experiments on public benchmarks and challenging synthetic scenes demonstrate that PromptGS consistently outperforms existing methods in both visual fidelity and efficiency.
KW - Gaussian Splatting
KW - multiview consistency
KW - tiny object reconstruction
KW - visual prompt 3D reconstruction
UR - https://www.scopus.com/pages/publications/105032960846
U2 - 10.1109/MMSP64401.2025.11324335
DO - 10.1109/MMSP64401.2025.11324335
M3 - 会议稿件
AN - SCOPUS:105032960846
T3 - 2025 IEEE International Workshop on Multimedia Signal Processing, MMSP 2025
SP - 228
EP - 233
BT - 2025 IEEE International Workshop on Multimedia Signal Processing, MMSP 2025
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 21 September 2025 through 23 September 2025
ER -