Domain-specific deep learning with environmental and medical applications

Zhang, Qi; 章琦

File Download

FullText.pdf

Supplementary

Citations:
Appears in Collections:
- HKU Theses Online
- Electrical & Electronic Engineering: Theses

postgraduate thesis: Domain-specific deep learning with environmental and medical applications

Title	Domain-specific deep learning with environmental and medical applications
Authors	Zhang, Qi 章琦
Advisors	Advisor(s):Li, VOK Lam, JCK
Issue Date	2023
Publisher	The University of Hong Kong (Pokfulam, Hong Kong)
Citation	Zhang, Q. [章琦]. (2023). Domain-specific deep learning with environmental and medical applications. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
Abstract	Deep learning has achieved unprecedented success in various fields ranging from image recognition to natural language processing. However, despite their prowess, deep learning models can encounter issues related to data inefficiency, overfitting, lack of interpretability and sensitivity to dataset shifts. Furthermore, these models often function as "black boxes", providing little insight into the internal workings of the model or the decision-making process. To address these challenges, Domain-specific Deep Learning has risen as a growing trend. Domain-specific Deep Learning refers to the integration of domain-specific knowledge in deep learning models. Problem-specific insights can be utilized to either enhance the deep learning model or facilitate the training process. This approach aims to leverage the expressivity and learning capacity of neural networks while benefitting from the precision, robustness and interpretability provided by domain-specific knowledge. This study proposes several different approaches for incorporating domain-specific knowledge into deep learning models. We provide examples of how these methods can be applied in the fields of biomedical and environmental science, illustrating their wide-ranging benefits. Chapter 3 integrates domain knowledge with a deep learning model by acting on the feature space. Our proposed Deep-AIR model fills the existing research gap in air pollution modelling by exploiting domain-specific features (including Air Pollution, Weather, Urban Morphology, Transport and Time-sensitive features), and by employing a hybrid CNN-LSTM structure to capture the spatial-temporal features and 1×1 convolution layers to enhance the learning of temporal and spatial interaction. Chapter 4 injects domain knowledge into a deep learning model by exploiting regularisation and post-processing schemes with prior insights. We develop a deep learning-based 3D cell segmentation pipeline, 3DCellSeg, to address challenges in cell segmentation. A custom loss function (3DCellSeg Loss) is introduced to tackle the clumped cell problem, and an efficient touching area-based clustering algorithm (TASCAN) is developed to separate 3D cells from foreground masks. Chapter 5 explores a deeper combination of knowledge and network, by guiding the construction of a neural network with domain-specific knowledge. We develop a novel hierarchical graph attention network to identify highly correlational GMs (HGMs) of PTSD. Domain-specific knowledge, including somatic mutations, genes, PTSD pathways and their correlations have been incorporated into the graph structures. Our study is carefully guided by prominent PTSD literature and clinical experts within the field; highly salient HGMs generated from our model are further verified by existing PTSD-related authoritative medical journals. Our study illustrates the utility and significance of a hybrid approach, integrating both AI and expert-guided/domain-specific knowledge for the thorough identification of biomarkers. Our work is among the first to systematically explore different approaches to integrate domain-specific knowledge with deep learning models, emphasising not only the theoretical contributions but also the practical implications. Our experimental results demonstrate the improvement of deep learning models on both inference precision, and interpretability and robustness. This study underpins the imperative of converging the power of deep learning with the richness of domain-specific knowledge, thus facilitating the advancement of more sophisticated, effective and trustworthy AI systems.
Degree	Doctor of Philosophy
Subject	Data integration (Computer science) Deep learning (Machine learning)
Dept/Program	Electrical and Electronic Engineering
Persistent Identifier	http://hdl.handle.net/10722/335941

DC Field	Value	Language
dc.contributor.advisor	Li, VOK	-
dc.contributor.advisor	Lam, JCK	-
dc.contributor.author	Zhang, Qi	-
dc.contributor.author	章琦	-
dc.date.accessioned	2023-12-29T04:05:01Z	-
dc.date.available	2023-12-29T04:05:01Z	-
dc.date.issued	2023	-
dc.identifier.citation	Zhang, Q. [章琦]. (2023). Domain-specific deep learning with environmental and medical applications. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.	-
dc.identifier.uri	http://hdl.handle.net/10722/335941	-
dc.description.abstract	Deep learning has achieved unprecedented success in various fields ranging from image recognition to natural language processing. However, despite their prowess, deep learning models can encounter issues related to data inefficiency, overfitting, lack of interpretability and sensitivity to dataset shifts. Furthermore, these models often function as "black boxes", providing little insight into the internal workings of the model or the decision-making process. To address these challenges, Domain-specific Deep Learning has risen as a growing trend. Domain-specific Deep Learning refers to the integration of domain-specific knowledge in deep learning models. Problem-specific insights can be utilized to either enhance the deep learning model or facilitate the training process. This approach aims to leverage the expressivity and learning capacity of neural networks while benefitting from the precision, robustness and interpretability provided by domain-specific knowledge. This study proposes several different approaches for incorporating domain-specific knowledge into deep learning models. We provide examples of how these methods can be applied in the fields of biomedical and environmental science, illustrating their wide-ranging benefits. Chapter 3 integrates domain knowledge with a deep learning model by acting on the feature space. Our proposed Deep-AIR model fills the existing research gap in air pollution modelling by exploiting domain-specific features (including Air Pollution, Weather, Urban Morphology, Transport and Time-sensitive features), and by employing a hybrid CNN-LSTM structure to capture the spatial-temporal features and 1×1 convolution layers to enhance the learning of temporal and spatial interaction. Chapter 4 injects domain knowledge into a deep learning model by exploiting regularisation and post-processing schemes with prior insights. We develop a deep learning-based 3D cell segmentation pipeline, 3DCellSeg, to address challenges in cell segmentation. A custom loss function (3DCellSeg Loss) is introduced to tackle the clumped cell problem, and an efficient touching area-based clustering algorithm (TASCAN) is developed to separate 3D cells from foreground masks. Chapter 5 explores a deeper combination of knowledge and network, by guiding the construction of a neural network with domain-specific knowledge. We develop a novel hierarchical graph attention network to identify highly correlational GMs (HGMs) of PTSD. Domain-specific knowledge, including somatic mutations, genes, PTSD pathways and their correlations have been incorporated into the graph structures. Our study is carefully guided by prominent PTSD literature and clinical experts within the field; highly salient HGMs generated from our model are further verified by existing PTSD-related authoritative medical journals. Our study illustrates the utility and significance of a hybrid approach, integrating both AI and expert-guided/domain-specific knowledge for the thorough identification of biomarkers. Our work is among the first to systematically explore different approaches to integrate domain-specific knowledge with deep learning models, emphasising not only the theoretical contributions but also the practical implications. Our experimental results demonstrate the improvement of deep learning models on both inference precision, and interpretability and robustness. This study underpins the imperative of converging the power of deep learning with the richness of domain-specific knowledge, thus facilitating the advancement of more sophisticated, effective and trustworthy AI systems.	-
dc.language	eng	-
dc.publisher	The University of Hong Kong (Pokfulam, Hong Kong)	-
dc.relation.ispartof	HKU Theses Online (HKUTO)	-
dc.rights	The author retains all proprietary rights, (such as patent rights) and the right to use in future works.	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject.lcsh	Data integration (Computer science)	-
dc.subject.lcsh	Deep learning (Machine learning)	-
dc.title	Domain-specific deep learning with environmental and medical applications	-
dc.type	PG_Thesis	-
dc.description.thesisname	Doctor of Philosophy	-
dc.description.thesislevel	Doctoral	-
dc.description.thesisdiscipline	Electrical and Electronic Engineering	-
dc.description.nature	published_or_final_version	-
dc.date.hkucongregation	2024	-
dc.identifier.mmsid	991044751041103414	-

File Download

Supplementary

postgraduate thesis: Domain-specific deep learning with environmental and medical applications

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats