TY - GEN
T1 - VISPI
T2 - 2024 Eurographics Italian Chapter Conference on Smart Tools and Applications in Graphics, STAG 2024
AU - Shah, Uzair
AU - Jashari, Sara
AU - Tukur, Muhammad
AU - Pintore, Giovanni
AU - Gobbetti, Enrico
AU - Schneider, Jens
AU - Agus, Marco
N1 - Publisher Copyright:
© 2024 The Authors.
PY - 2024/11
Y1 - 2024/11
N2 - Taking a 360° image is the quickest and most cost-effective way to capture the entire environment around the viewer in a form that can be directly exploited for creating immersive content [PBAG23]. In this work, we introduce novel solutions for the virtual staging of indoor environments, supporting automatic emptying, object insertion, and relighting. Our solution, dubbed VISPI (Virtual Staging Pipeline for Single Indoor Panoramic Images), integrates data-driven processing components, that take advantage of the analysis of knowledge learned from massive data collections, within a real-time rendering and editing system, allowing for interactive restaging of indoor scenes. Key components of VISPI include: i) a holistic architecture based on a multi-task vision transformer for extracting geometry, semantic, and material information from a single panoramic image, ii) a lighting model based on spherical Gaussians, iii) a method for lighting estimation from the geometric, semantic, and material signals, and iv) a real-time editing and rendering component. The proposed framework provides an interactive and user-friendly solution for creating immersive visualizations of indoor spaces. We present a preliminary assessment of VISPI using a synthetic dataset - Structured3D - and demonstrate its application in creating restaged indoor scenes.
AB - Taking a 360° image is the quickest and most cost-effective way to capture the entire environment around the viewer in a form that can be directly exploited for creating immersive content [PBAG23]. In this work, we introduce novel solutions for the virtual staging of indoor environments, supporting automatic emptying, object insertion, and relighting. Our solution, dubbed VISPI (Virtual Staging Pipeline for Single Indoor Panoramic Images), integrates data-driven processing components, that take advantage of the analysis of knowledge learned from massive data collections, within a real-time rendering and editing system, allowing for interactive restaging of indoor scenes. Key components of VISPI include: i) a holistic architecture based on a multi-task vision transformer for extracting geometry, semantic, and material information from a single panoramic image, ii) a lighting model based on spherical Gaussians, iii) a method for lighting estimation from the geometric, semantic, and material signals, and iv) a real-time editing and rendering component. The proposed framework provides an interactive and user-friendly solution for creating immersive visualizations of indoor spaces. We present a preliminary assessment of VISPI using a synthetic dataset - Structured3D - and demonstrate its application in creating restaged indoor scenes.
UR - https://www.scopus.com/pages/publications/85216249717
U2 - 10.2312/stag.20241334
DO - 10.2312/stag.20241334
M3 - Conference contribution
AN - SCOPUS:85216249717
T3 - Eurographics Italian Chapter Proceedings - Smart Tools and Applications in Graphics, STAG
BT - Smart Tools and Applications in Graphics - Eurographics Italian Chapter Conference, STAG 2024
A2 - Fellner, Dieter
PB - Eurographics Association
Y2 - 14 November 2024 through 15 November 2024
ER -