File Download
Supplementary

postgraduate thesis: Advanced neural networks and their application for medical image segmentation

TitleAdvanced neural networks and their application for medical image segmentation
Authors
Advisors
Advisor(s):Chan, SCWu, YC
Issue Date2025
PublisherThe University of Hong Kong (Pokfulam, Hong Kong)
Citation
Qi, W. [祁文博]. (2025). Advanced neural networks and their application for medical image segmentation. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
AbstractConvolutional neural networks have shown significant promise in medical image segmentation, providing crucial insights for early diagnosis, biopsy planning, and clinical therapies. However, accurate segmentation is challenging due to inherent image characteristics (\textit{i.e.}, low contrast, speckle noise in ultrasound), variations in target shape and size, and domain shift across datasets. This thesis aims to develop advanced neural networks for medical image segmentation. Specifically, we shall consider breast tumor segmentation from ultrasound images, ultrasound thyroid nodule segmentation, and domain generalization in prostate MRI segmentation. Firstly, we propose an innovative Multi-scale Dynamic Fusion Network (MDF-Net) to segment the ultrasound breast tumors. It is structured as an end-to-end two-stage architecture, comprising a trunk sub-network responsible for multi-scale feature selection and a refinement sub-network that is optimized to enhance feature exploration and fusion, thereby minimizing impairments. Building upon the UNet++, the trunk network features dense skip connections to facilitate connectivity between features across different scales. Additionally, we introduce multi-scale deep supervision to capture more discriminative features and attenuate inaccuracies stemming from speckle noise. The refinement sub-network leverages a structurally optimized MDF mechanism to enhance initial segmentation at coarser scales and delve into inter-subject variation insights at finer scales. Evaluation of two publicly available datasets demonstrates that our proposed MDF-Net outperforms state-of-the-art approaches in terms of Dice coefficient and other evaluation metrics. Secondly, existing multi-task learning methods for thyroid nodule segmentation suffer from 1) the distribution gap between different datasets and 2) inconsistency in loss calculation for different tasks. We propose a novel STR-Net to address these issues. Specifically, we propose a Multi-mix Data Augmentation that randomly crops the foreground and background of gland and nodule images and mixes them to generate new samples. Furthermore, we propose a new Thyroid-Region Prior Guided Refinement Network by adding Multi-scale Deep Supervision and Nodule Refinement Structure. Moreover, a Teacher-Student Semi-supervised Framework is constructed with our proposed network to maintain consistency in the multi-task feature alignment. Finally, an Edge Distance Regularization method is proposed for post-processing to make nodule segmentation boundaries smoother and flatter. Extensive experiments on two datasets have demonstrated the effectiveness of our method. Lastly, we propose a bidirectional Gated Recurrent Unit (GRU) based refinement network with simple and effective Patch Mixing and Risk Extrapolation (PMRE) schemes for multi-site prostate MRI segmentation. It employs a large convolution kernel-based multiscale feature encoder to extract multiscale features from consecutive 2D slices and a recurrent bidirectional ConvGRU-based contracting decoder to fuse the 3D segmentation features from a coarse-to-fine strategy. To enhance the generalization capability and robustness of the network across diverse target domains, a novel PMRE domain generalization approach is introduced by leveraging data manipulation, network design optimizations, and risk extrapolation. Specifically, an effective Domain Patch Mixing mechanism, which interpolates patches from different domains, is proposed for effective data augmentation. A simple and effective Segmentation Risk Extrapolation scheme is proposed to minimize the performance spread of the network over all the multisite samples. Experimental results from six commonly used source domains of prostate show that the proposed framework performs better than state-of-the-art algorithms.
DegreeDoctor of Philosophy
SubjectNeural networks (Computer science)
Diagnostic imaging - Data processing
Dept/ProgramElectrical and Electronic Engineering
Persistent Identifierhttp://hdl.handle.net/10722/356571

 

DC FieldValueLanguage
dc.contributor.advisorChan, SC-
dc.contributor.advisorWu, YC-
dc.contributor.authorQi, Wenbo-
dc.contributor.author祁文博-
dc.date.accessioned2025-06-05T09:31:11Z-
dc.date.available2025-06-05T09:31:11Z-
dc.date.issued2025-
dc.identifier.citationQi, W. [祁文博]. (2025). Advanced neural networks and their application for medical image segmentation. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.-
dc.identifier.urihttp://hdl.handle.net/10722/356571-
dc.description.abstractConvolutional neural networks have shown significant promise in medical image segmentation, providing crucial insights for early diagnosis, biopsy planning, and clinical therapies. However, accurate segmentation is challenging due to inherent image characteristics (\textit{i.e.}, low contrast, speckle noise in ultrasound), variations in target shape and size, and domain shift across datasets. This thesis aims to develop advanced neural networks for medical image segmentation. Specifically, we shall consider breast tumor segmentation from ultrasound images, ultrasound thyroid nodule segmentation, and domain generalization in prostate MRI segmentation. Firstly, we propose an innovative Multi-scale Dynamic Fusion Network (MDF-Net) to segment the ultrasound breast tumors. It is structured as an end-to-end two-stage architecture, comprising a trunk sub-network responsible for multi-scale feature selection and a refinement sub-network that is optimized to enhance feature exploration and fusion, thereby minimizing impairments. Building upon the UNet++, the trunk network features dense skip connections to facilitate connectivity between features across different scales. Additionally, we introduce multi-scale deep supervision to capture more discriminative features and attenuate inaccuracies stemming from speckle noise. The refinement sub-network leverages a structurally optimized MDF mechanism to enhance initial segmentation at coarser scales and delve into inter-subject variation insights at finer scales. Evaluation of two publicly available datasets demonstrates that our proposed MDF-Net outperforms state-of-the-art approaches in terms of Dice coefficient and other evaluation metrics. Secondly, existing multi-task learning methods for thyroid nodule segmentation suffer from 1) the distribution gap between different datasets and 2) inconsistency in loss calculation for different tasks. We propose a novel STR-Net to address these issues. Specifically, we propose a Multi-mix Data Augmentation that randomly crops the foreground and background of gland and nodule images and mixes them to generate new samples. Furthermore, we propose a new Thyroid-Region Prior Guided Refinement Network by adding Multi-scale Deep Supervision and Nodule Refinement Structure. Moreover, a Teacher-Student Semi-supervised Framework is constructed with our proposed network to maintain consistency in the multi-task feature alignment. Finally, an Edge Distance Regularization method is proposed for post-processing to make nodule segmentation boundaries smoother and flatter. Extensive experiments on two datasets have demonstrated the effectiveness of our method. Lastly, we propose a bidirectional Gated Recurrent Unit (GRU) based refinement network with simple and effective Patch Mixing and Risk Extrapolation (PMRE) schemes for multi-site prostate MRI segmentation. It employs a large convolution kernel-based multiscale feature encoder to extract multiscale features from consecutive 2D slices and a recurrent bidirectional ConvGRU-based contracting decoder to fuse the 3D segmentation features from a coarse-to-fine strategy. To enhance the generalization capability and robustness of the network across diverse target domains, a novel PMRE domain generalization approach is introduced by leveraging data manipulation, network design optimizations, and risk extrapolation. Specifically, an effective Domain Patch Mixing mechanism, which interpolates patches from different domains, is proposed for effective data augmentation. A simple and effective Segmentation Risk Extrapolation scheme is proposed to minimize the performance spread of the network over all the multisite samples. Experimental results from six commonly used source domains of prostate show that the proposed framework performs better than state-of-the-art algorithms.-
dc.languageeng-
dc.publisherThe University of Hong Kong (Pokfulam, Hong Kong)-
dc.relation.ispartofHKU Theses Online (HKUTO)-
dc.rightsThe author retains all proprietary rights, (such as patent rights) and the right to use in future works.-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subject.lcshNeural networks (Computer science)-
dc.subject.lcshDiagnostic imaging - Data processing-
dc.titleAdvanced neural networks and their application for medical image segmentation-
dc.typePG_Thesis-
dc.description.thesisnameDoctor of Philosophy-
dc.description.thesislevelDoctoral-
dc.description.thesisdisciplineElectrical and Electronic Engineering-
dc.description.naturepublished_or_final_version-
dc.date.hkucongregation2025-
dc.identifier.mmsid991044970873903414-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats