ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation

Liu, Zhengzhe; Dai, Peng; Li, Ruihui; Qi, Xiaojuan; Fu, Chi Wing

Appears in Collections:
- Electrical & Electronic Engineering: Conference papers

Conference Paper: ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation

Title	ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation
Authors	Liu, Zhengzhe; Dai, Peng; Li, Ruihui; Qi, Xiaojuan; Fu, Chi Wing
Issue Date	1-May-2023
Abstract	Text-guided 3D shape generation remains challenging due to the absence of a large paired text-shape dataset, the substantial semantic gap between these two modalities, and the structural complexity of 3D shapes. This paper presents a new framework called Image as Stepping Stone (ISS) for the task, introducing 2D images as a stepping stone to connect the two modalities and to eliminate the need for paired text-shape data. Our key contribution is a two-stage feature-space-alignment approach that maps CLIP features to shapes by harnessing a pre-trained single-view reconstruction (SVR) model with multi-view supervision: we first map the CLIP image feature to the detail-rich shape space in the SVR model, then map the CLIP text feature to the shape space and optimize the mapping by encouraging CLIP consistency between the input text and the rendered images. Further, we formulate a text-guided shape stylization module to dress up the output shapes with novel structures and textures. Beyond existing works on 3D shape generation from text, our new approach is general for creating shapes in a broad range of categories, without requiring paired text-shape data. Experimental results show that our approach outperforms state-of-the-art methods and our baselines in terms of fidelity and consistency with text. Further, our approach can stylize the generated shapes with both realistic and fantasy structures and textures.
Persistent Identifier	http://hdl.handle.net/10722/337308
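
The abstract's second alignment stage optimizes the text-to-shape mapping by encouraging CLIP consistency between the input text and images rendered from the generated shape. The following is a minimal sketch of how such a consistency score could be computed, assuming the text feature and the per-view image features are already available as plain lists of floats. The function names `cosine_similarity` and `clip_consistency_loss` are hypothetical, and the "one minus mean cosine similarity" form is an assumption for illustration; this record does not state the exact loss used in the paper.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def clip_consistency_loss(text_feature, view_features):
    """Hypothetical consistency loss: 1 minus the mean cosine similarity
    between one text feature and the features of several rendered views.

    Assumption: features come from a CLIP-style encoder; here they are
    just lists of floats so the sketch stays self-contained.
    """
    sims = [cosine_similarity(text_feature, v) for v in view_features]
    return 1.0 - sum(sims) / len(sims)
```

A lower value means the rendered views agree more closely with the text in feature space. Averaging over several views mirrors the multi-view supervision mentioned in the abstract, though the weighting of views here is an illustrative choice.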

DC Field	Value	Language
dc.contributor.author	Liu, Zhengzhe	-
dc.contributor.author	Dai, Peng	-
dc.contributor.author	Li, Ruihui	-
dc.contributor.author	Qi, Xiaojuan	-
dc.contributor.author	Fu, Chi Wing	-
dc.date.accessioned	2024-03-11T10:19:40Z	-
dc.date.available	2024-03-11T10:19:40Z	-
dc.date.issued	2023-05-01	-
dc.identifier.uri	http://hdl.handle.net/10722/337308	-
dc.language	eng	-
dc.relation.ispartof	The 11th International Conference on Learning Representations (ICLR 2023) (01/05/2023-05/05/2023, Kigali, Rwanda)	-
dc.title	ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation	-
dc.type	Conference_Paper	-
