File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Text-to-structure interpretation of user requests in BIM interaction

TitleText-to-structure interpretation of user requests in BIM interaction
Authors
Issue Date14-Mar-2025
PublisherElsevier
Citation
Automation in Construction, 2025, v. 174 How to Cite?
Abstract

Numerous efforts have been devoted to utilizing a natural language-based interface for BIM interaction. These interfaces require extracting user's intent (i.e., the operation type) and slots (i.e., the targeted elements and properties). However, there is a lack of a fine-grained approach for extracting intent and slot information simultaneously. This paper introduces a text-to-structure approach based on language models to interpret user requests for BIM interaction (T2S4BIM). It proposed a synthetic data generation method and a curated dataset as data support. Employing Transformer-based models, T2S4BIM converts unstructured user requests into a structured format with intent and slot information. Experiments demonstrated that T2S4BIM outperformed existing approaches, with encoder-decoder models like T5 and FLAN-T5 achieving performance comparable to larger, decoder-only models such as Llama3.1-8B and Qwen2.5-7B, while improving efficiency. The practical applicability of T2S4BIM was illustrated through a Revit plug-in that interprets user requests and executes corresponding actions (e.g., manipulating object properties).


Persistent Identifierhttp://hdl.handle.net/10722/355248
ISSN
2023 Impact Factor: 9.6
2023 SCImago Journal Rankings: 2.626

 

DC FieldValueLanguage
dc.contributor.authorWei, Yinyi-
dc.contributor.authorLi, Xiao-
dc.contributor.authorPetzold, Frank-
dc.date.accessioned2025-03-29T00:35:34Z-
dc.date.available2025-03-29T00:35:34Z-
dc.date.issued2025-03-14-
dc.identifier.citationAutomation in Construction, 2025, v. 174-
dc.identifier.issn0926-5805-
dc.identifier.urihttp://hdl.handle.net/10722/355248-
dc.description.abstract<p>Numerous efforts have been devoted to utilizing a natural language-based interface for BIM interaction. These interfaces require extracting user's intent (i.e., the operation type) and slots (i.e., the targeted elements and properties). However, there is a lack of a fine-grained approach for extracting intent and slot information simultaneously. This paper introduces a text-to-structure approach based on language models to interpret user requests for BIM interaction (T2S4BIM). It proposed a synthetic data generation method and a curated dataset as data support. Employing Transformer-based models, T2S4BIM converts unstructured user requests into a structured format with intent and slot information. Experiments demonstrated that T2S4BIM outperformed existing approaches, with encoder-decoder models like T5 and FLAN-T5 achieving performance comparable to larger, decoder-only models such as Llama3.1-8B and Qwen2.5-7B, while improving efficiency. The practical applicability of T2S4BIM was illustrated through a Revit plug-in that interprets user requests and executes corresponding actions (e.g., manipulating object properties).<br></p>-
dc.languageeng-
dc.publisherElsevier-
dc.relation.ispartofAutomation in Construction-
dc.titleText-to-structure interpretation of user requests in BIM interaction-
dc.typeArticle-
dc.identifier.doi10.1016/j.autcon.2025.106119-
dc.identifier.volume174-
dc.identifier.eissn1872-7891-
dc.identifier.issnl0926-5805-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats