File Download
Supplementary
-
Citations:
- Appears in Collections:
postgraduate thesis: Data mining and dialog act modeling in educational dialog
Title | Data mining and dialog act modeling in educational dialog |
---|---|
Authors | |
Advisors | Advisor(s):Kao, CM |
Issue Date | 2023 |
Publisher | The University of Hong Kong (Pokfulam, Hong Kong) |
Citation | Shan, D. [單大鵬]. (2023). Data mining and dialog act modeling in educational dialog. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. |
Abstract | Educational dialogue provides an essential perspective on the learning process, encapsulating the intricate dynamics between teachers and students. Traditional methods of analyzing educational dialogue, however, are labor-intensive and challenging for large-scale analysis. This constraint limits our capacity to fully utilize the growing volumes of educational dialogue data, particularly from online learning platforms. This thesis presents a comprehensive solution to this issue through three pivotal studies, aiming to expedite and enhance the data analysis process to inform and improve educational practices.
The initial study focuses on an exploratory data analysis of tutor engagement within large-scale online tutoring, emphasizing superficial features like the duration of tutoring sessions. While this approach discloses intriguing trends, the lack of dialogue act coding restricts our capability to extract pedagogical insights, highlighting the necessity for more nuanced methods of analysis. In the subsequent study, we employ transformer-based pre-trained language models as automatic dialogue act annotators within two key educational dialogue settings: online tutoring and classroom interactions. Despite achieving high accuracy, these models face challenges in online settings due to the imbalanced distribution of dialogue acts. We conduct a comparison with the state-of-the-art language model, ChatGPT, revealing limitations due to its input constraints. To address these shortcomings, we introduce a data augmentation pipeline, utilizing GPT3.5’s generative capabilities to enhance the automatic annotator’s ability to manage less frequently observed dialogue acts. The third study employs the improved automatic annotator in two practical contexts. Initially, we interpret the classifications made by the automatic annotator into keyword importance, illustrating its potential as an instructional tool for educators to refine their discursive techniques, leading to more effective and enriched pedagogical practice. Secondly, we execute a data mining analysis of the dialogic pattern in online tutoring data, uncovering more profound, pedagogically relevant patterns. These insights validate existing educational theories and contribute to the development of pedagogical thought, thereby enhancing the teaching-learning dynamic.
This thesis underscores the potency of data mining and dialogue act modeling in deriving significant insights from vast educational dialogue data. By facilitating the analysis process, the research not only broadens our understanding of effective teaching and learning strategies but also provides innovative tools to advance educational practices in a digitally enhanced learning landscape. |
Degree | Doctor of Philosophy |
Subject | Communication in education - Data processing |
Dept/Program | Computer Science |
Persistent Identifier | http://hdl.handle.net/10722/350284 |
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Kao, CM | - |
dc.contributor.author | Shan, Dapeng | - |
dc.contributor.author | 單大鵬 | - |
dc.date.accessioned | 2024-10-21T08:16:11Z | - |
dc.date.available | 2024-10-21T08:16:11Z | - |
dc.date.issued | 2023 | - |
dc.identifier.citation | Shan, D. [單大鵬]. (2023). Data mining and dialog act modeling in educational dialog. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. | - |
dc.identifier.uri | http://hdl.handle.net/10722/350284 | - |
dc.description.abstract | Educational dialogue provides an essential perspective on the learning process, encapsulating the intricate dynamics between teachers and students. Traditional methods of analyzing educational dialogue, however, are labor-intensive and challenging for large-scale analysis. This constraint limits our capacity to fully utilize the growing volumes of educational dialogue data, particularly from online learning platforms. This thesis presents a comprehensive solution to this issue through three pivotal studies, aiming to expedite and enhance the data analysis process to inform and improve educational practices. The initial study focuses on an exploratory data analysis of tutor engagement within large-scale online tutoring, emphasizing superficial features like the duration of tutoring sessions. While this approach discloses intriguing trends, the lack of dialogue act coding restricts our capability to extract pedagogical insights, highlighting the necessity for more nuanced methods of analysis. In the subsequent study, we employ transformer-based pre-trained language models as automatic dialogue act annotators within two key educational dialogue settings: online tutoring and classroom interactions. Despite achieving high accuracy, these models face challenges in online settings due to the imbalanced distribution of dialogue acts. We conduct a comparison with the state-of-the-art language model, ChatGPT, revealing limitations due to its input constraints. To address these shortcomings, we introduce a data augmentation pipeline, utilizing GPT3.5’s generative capabilities to enhance the automatic annotator’s ability to manage less frequently observed dialogue acts. The third study employs the improved automatic annotator in two practical contexts. Initially, we interpret the classifications made by the automatic annotator into keyword importance, illustrating its potential as an instructional tool for educators to refine their discursive techniques, leading to more effective and enriched pedagogical practice. Secondly, we execute a data mining analysis of the dialogic pattern in online tutoring data, uncovering more profound, pedagogically relevant patterns. These insights validate existing educational theories and contribute to the development of pedagogical thought, thereby enhancing the teaching-learning dynamic. This thesis underscores the potency of data mining and dialogue act modeling in deriving significant insights from vast educational dialogue data. By facilitating the analysis process, the research not only broadens our understanding of effective teaching and learning strategies but also provides innovative tools to advance educational practices in a digitally enhanced learning landscape. | - |
dc.language | eng | - |
dc.publisher | The University of Hong Kong (Pokfulam, Hong Kong) | - |
dc.relation.ispartof | HKU Theses Online (HKUTO) | - |
dc.rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works. | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject.lcsh | Communication in education - Data processing | - |
dc.title | Data mining and dialog act modeling in educational dialog | - |
dc.type | PG_Thesis | - |
dc.description.thesisname | Doctor of Philosophy | - |
dc.description.thesislevel | Doctoral | - |
dc.description.thesisdiscipline | Computer Science | - |
dc.description.nature | published_or_final_version | - |
dc.date.hkucongregation | 2023 | - |
dc.identifier.mmsid | 991044736496303414 | - |