File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Retrieval augmented generation-driven information retrieval and question answering in construction management

TitleRetrieval augmented generation-driven information retrieval and question answering in construction management
Authors
KeywordsConstruction management
Large language model
Retrieval augmented generation
Issue Date1-May-2025
PublisherElsevier
Citation
Advanced Engineering Informatics, 2025, v. 65 How to Cite?
Abstract

Construction management is a communication-intensive field, requiring prompt responses to queries from various stakeholders to ensure project continuity. However, retrieving accurate information from project documents is hampered by the mismatch in granularity between queries and vast contents and by inherent ambiguities in information. Large language models (LLMs) and retrieval-augmented generation (RAG) offer new opportunities to address the challenges. However, their effectiveness is limited by the segmentation of documents and insufficient consideration of engineers’ preferences. Therefore, we propose a novel paradigm: RAG for Construction Management (RAG4CM). It includes three components: 1) a pipeline that parses project documents into hierarchical structures to establish a knowledge pool; 2) novel RAG search algorithms; and 3) a user preference learning mechanism. The first two components enhance granularity alignment and RAG results by integrating document-level hierarchical features with raw contents. The preference learning realizes continuously improved responses along with user-system interactions. We developed a prototype system and conducted extensive experiments, demonstrating that the knowledge pool efficiently accommodates texts, tables, and images. RAG4CM realized a 0.924 Top-3 and 0.898 answer accuracy, surpassing both open-source frameworks and commercial products. In addition, preference learning further increases answer accuracy by 1.3 % to 9.5 %. Consequently, RAG4CM enables multi-source information retrieval in a user-friendly manner, improving communication efficiency and facilitating construction management activities.


Persistent Identifierhttp://hdl.handle.net/10722/354753
ISSN
2023 Impact Factor: 8.0
2023 SCImago Journal Rankings: 1.731

 

DC FieldValueLanguage
dc.contributor.authorWu, Chengke-
dc.contributor.authorDing, Wenjun-
dc.contributor.authorJin, Qisen-
dc.contributor.authorJiang, Junjie-
dc.contributor.authorJiang, Rui-
dc.contributor.authorXiao, Qinge-
dc.contributor.authorLiao, Longhui-
dc.contributor.authorLi, Xiao-
dc.date.accessioned2025-03-07T00:35:13Z-
dc.date.available2025-03-07T00:35:13Z-
dc.date.issued2025-05-01-
dc.identifier.citationAdvanced Engineering Informatics, 2025, v. 65-
dc.identifier.issn1474-0346-
dc.identifier.urihttp://hdl.handle.net/10722/354753-
dc.description.abstract<p>Construction management is a communication-intensive field, requiring prompt responses to queries from various stakeholders to ensure project continuity. However, retrieving accurate information from project documents is hampered by the mismatch in granularity between queries and vast contents and by inherent ambiguities in information. Large language models (LLMs) and retrieval-augmented generation (RAG) offer new opportunities to address the challenges. However, their effectiveness is limited by the segmentation of documents and insufficient consideration of engineers’ preferences. Therefore, we propose a novel paradigm: RAG for Construction Management (RAG4CM). It includes three components: 1) a pipeline that parses project documents into hierarchical structures to establish a knowledge pool; 2) novel RAG search algorithms; and 3) a user preference learning mechanism. The first two components enhance granularity alignment and RAG results by integrating document-level hierarchical features with raw contents. The preference learning realizes continuously improved responses along with user-system interactions. We developed a prototype system and conducted extensive experiments, demonstrating that the knowledge pool efficiently accommodates texts, tables, and images. RAG4CM realized a 0.924 Top-3 and 0.898 answer accuracy, surpassing both open-source frameworks and commercial products. In addition, preference learning further increases answer accuracy by 1.3 % to 9.5 %. Consequently, RAG4CM enables multi-source information retrieval in a user-friendly manner, improving communication efficiency and facilitating construction management activities.<br></p>-
dc.languageeng-
dc.publisherElsevier-
dc.relation.ispartofAdvanced Engineering Informatics-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subjectConstruction management-
dc.subjectLarge language model-
dc.subjectRetrieval augmented generation-
dc.titleRetrieval augmented generation-driven information retrieval and question answering in construction management-
dc.typeArticle-
dc.identifier.doi10.1016/j.aei.2025.103158-
dc.identifier.scopuseid_2-s2.0-85216930420-
dc.identifier.volume65-
dc.identifier.eissn1873-5320-
dc.identifier.issnl1474-0346-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats