File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1016/j.aei.2025.103158
- Scopus: eid_2-s2.0-85216930420
- Find via
Supplementary
-
Citations:
- Scopus: 1
- Appears in Collections:
Article: Retrieval augmented generation-driven information retrieval and question answering in construction management
Title | Retrieval augmented generation-driven information retrieval and question answering in construction management |
---|---|
Authors | |
Keywords | Construction management Large language model Retrieval augmented generation |
Issue Date | 1-May-2025 |
Publisher | Elsevier |
Citation | Advanced Engineering Informatics, 2025, v. 65 How to Cite? |
Abstract | Construction management is a communication-intensive field, requiring prompt responses to queries from various stakeholders to ensure project continuity. However, retrieving accurate information from project documents is hampered by the mismatch in granularity between queries and vast contents and by inherent ambiguities in information. Large language models (LLMs) and retrieval-augmented generation (RAG) offer new opportunities to address the challenges. However, their effectiveness is limited by the segmentation of documents and insufficient consideration of engineers’ preferences. Therefore, we propose a novel paradigm: RAG for Construction Management (RAG4CM). It includes three components: 1) a pipeline that parses project documents into hierarchical structures to establish a knowledge pool; 2) novel RAG search algorithms; and 3) a user preference learning mechanism. The first two components enhance granularity alignment and RAG results by integrating document-level hierarchical features with raw contents. The preference learning realizes continuously improved responses along with user-system interactions. We developed a prototype system and conducted extensive experiments, demonstrating that the knowledge pool efficiently accommodates texts, tables, and images. RAG4CM realized a 0.924 Top-3 and 0.898 answer accuracy, surpassing both open-source frameworks and commercial products. In addition, preference learning further increases answer accuracy by 1.3 % to 9.5 %. Consequently, RAG4CM enables multi-source information retrieval in a user-friendly manner, improving communication efficiency and facilitating construction management activities. |
Persistent Identifier | http://hdl.handle.net/10722/354753 |
ISSN | 2023 Impact Factor: 8.0 2023 SCImago Journal Rankings: 1.731 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Wu, Chengke | - |
dc.contributor.author | Ding, Wenjun | - |
dc.contributor.author | Jin, Qisen | - |
dc.contributor.author | Jiang, Junjie | - |
dc.contributor.author | Jiang, Rui | - |
dc.contributor.author | Xiao, Qinge | - |
dc.contributor.author | Liao, Longhui | - |
dc.contributor.author | Li, Xiao | - |
dc.date.accessioned | 2025-03-07T00:35:13Z | - |
dc.date.available | 2025-03-07T00:35:13Z | - |
dc.date.issued | 2025-05-01 | - |
dc.identifier.citation | Advanced Engineering Informatics, 2025, v. 65 | - |
dc.identifier.issn | 1474-0346 | - |
dc.identifier.uri | http://hdl.handle.net/10722/354753 | - |
dc.description.abstract | <p>Construction management is a communication-intensive field, requiring prompt responses to queries from various stakeholders to ensure project continuity. However, retrieving accurate information from project documents is hampered by the mismatch in granularity between queries and vast contents and by inherent ambiguities in information. Large language models (LLMs) and retrieval-augmented generation (RAG) offer new opportunities to address the challenges. However, their effectiveness is limited by the segmentation of documents and insufficient consideration of engineers’ preferences. Therefore, we propose a novel paradigm: RAG for Construction Management (RAG4CM). It includes three components: 1) a pipeline that parses project documents into hierarchical structures to establish a knowledge pool; 2) novel RAG search algorithms; and 3) a user preference learning mechanism. The first two components enhance granularity alignment and RAG results by integrating document-level hierarchical features with raw contents. The preference learning realizes continuously improved responses along with user-system interactions. We developed a prototype system and conducted extensive experiments, demonstrating that the knowledge pool efficiently accommodates texts, tables, and images. RAG4CM realized a 0.924 Top-3 and 0.898 answer accuracy, surpassing both open-source frameworks and commercial products. In addition, preference learning further increases answer accuracy by 1.3 % to 9.5 %. Consequently, RAG4CM enables multi-source information retrieval in a user-friendly manner, improving communication efficiency and facilitating construction management activities.<br></p> | - |
dc.language | eng | - |
dc.publisher | Elsevier | - |
dc.relation.ispartof | Advanced Engineering Informatics | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject | Construction management | - |
dc.subject | Large language model | - |
dc.subject | Retrieval augmented generation | - |
dc.title | Retrieval augmented generation-driven information retrieval and question answering in construction management | - |
dc.type | Article | - |
dc.identifier.doi | 10.1016/j.aei.2025.103158 | - |
dc.identifier.scopus | eid_2-s2.0-85216930420 | - |
dc.identifier.volume | 65 | - |
dc.identifier.eissn | 1873-5320 | - |
dc.identifier.issnl | 1474-0346 | - |