File Download
Supplementary
-
Citations:
- Appears in Collections:
postgraduate thesis: Generalization in autonomous driving systems
| Title | Generalization in autonomous driving systems |
|---|---|
| Authors | |
| Advisors | Advisor(s):Wu, YC |
| Issue Date | 2025 |
| Publisher | The University of Hong Kong (Pokfulam, Hong Kong) |
| Citation | Kou, W. [寇卫斌]. (2025). Generalization in autonomous driving systems. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. |
| Abstract |
Deep Learning (DL)-based autonomous driving (AD) develops rapidly. However, AD models generally face poor generalization. Specifically, on the one hand, AD models trained from data in a particular geographical region perform poorly when applied in other regions due to data domain-shift; on the other hand, AD models generally underperform in adverse weather conditions owing to invisibility or lack of structural information. To tackle the geographically related generalization problem, Hierarchical Federated Learning (HFL) offers a potential solution by collaborative privacy-preserving training over distributed datasets from different regions. Unfortunately, existing HFL suffers from three major challenges: (I) slow convergence because data from different cities are with disparate statistical properties; (II) inefficient use of communication resource owing to HFL's traditional static resource scheduling policy; (III) disregard of limited communication resource in practice. This thesis will introduce strategies to solve these challenges. In addition, to improve the AD model generalization in diverse weather conditions, a generalizable scheme without relying on clear reference images is proposed.
To solve the HFL challenges, this thesis firstly proposes FedGau to accelerate convergence. FedGau models the statistical distribution of both single RGB image and RGB datasets as Gaussian distributions, allowing for aggregation weights to be designed based on data statistical properties rather than purely based on volume. Alongside FedGau, this thesis introduces AdapRS, which dynamically adjusts the number of model exchanges based on real-time performance metrics of the global model. Comprehensive experiments show that FedGau accelerates the HFL model convergence and AdapRS significantly reduces unnecessary communication overhead. On the other hand, to address the communication resource constraint in HFL, this thesis proposes CRCHFL framework. CRCHFL prioritizes different stages of the HFL process to minimize generalization errors. Extensive experiments demonstrate CRCHFL outperforms existing FL methods under strict communication resource constraints.
After above HFL challenges are tackled, AD models can enhance generalization via training on large-scale and diverse data from various regions. As time progresses, the data involved in the federated AD system expands. Such ever-expanding data may eventually result in the model under-fitting, leading to poor generalization. To handle this under-fitting limitation, this thesis explores the potential of Large Vision Models (LVMs) to rectify the problem. Precisely, this thesis proposes a pFedLVM framework, which incorporates LVMs into federated AD systems, and exchanges latent features instead of entire LVM parameters between the central server and participating vehicles. This significantly reduces communication demands while allowing for personalized model updates tailored to local driving conditions.
Finally, this thesis improves AD model generalization in adverse weather conditions through a novel strategy termed AdvImmu. This approach, which integrates techniques such as Locally Sequential Mechanism (LSM), Globally Shuffled Mechanism (GSM), and Unfolded Regularizers (URs), enhances model robustness across various weather conditions without relying on clear weather references. Additionally, to address the challenge of lacking frame-wise annotations in adverse conditions, AdvImmu incorporates the foundation model SAM with a clustering algorithm SBICAC to generate pseudo-labels, thereby enabling effective training in complex scenarios.
|
| Degree | Doctor of Philosophy |
| Subject | Automated vehicles |
| Dept/Program | Electrical and Electronic Engineering |
| Persistent Identifier | http://hdl.handle.net/10722/356604 |
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.advisor | Wu, YC | - |
| dc.contributor.author | Kou, Weibin | - |
| dc.contributor.author | 寇卫斌 | - |
| dc.date.accessioned | 2025-06-05T09:31:24Z | - |
| dc.date.available | 2025-06-05T09:31:24Z | - |
| dc.date.issued | 2025 | - |
| dc.identifier.citation | Kou, W. [寇卫斌]. (2025). Generalization in autonomous driving systems. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. | - |
| dc.identifier.uri | http://hdl.handle.net/10722/356604 | - |
| dc.description.abstract | Deep Learning (DL)-based autonomous driving (AD) develops rapidly. However, AD models generally face poor generalization. Specifically, on the one hand, AD models trained from data in a particular geographical region perform poorly when applied in other regions due to data domain-shift; on the other hand, AD models generally underperform in adverse weather conditions owing to invisibility or lack of structural information. To tackle the geographically related generalization problem, Hierarchical Federated Learning (HFL) offers a potential solution by collaborative privacy-preserving training over distributed datasets from different regions. Unfortunately, existing HFL suffers from three major challenges: (I) slow convergence because data from different cities are with disparate statistical properties; (II) inefficient use of communication resource owing to HFL's traditional static resource scheduling policy; (III) disregard of limited communication resource in practice. This thesis will introduce strategies to solve these challenges. In addition, to improve the AD model generalization in diverse weather conditions, a generalizable scheme without relying on clear reference images is proposed. To solve the HFL challenges, this thesis firstly proposes FedGau to accelerate convergence. FedGau models the statistical distribution of both single RGB image and RGB datasets as Gaussian distributions, allowing for aggregation weights to be designed based on data statistical properties rather than purely based on volume. Alongside FedGau, this thesis introduces AdapRS, which dynamically adjusts the number of model exchanges based on real-time performance metrics of the global model. Comprehensive experiments show that FedGau accelerates the HFL model convergence and AdapRS significantly reduces unnecessary communication overhead. On the other hand, to address the communication resource constraint in HFL, this thesis proposes CRCHFL framework. CRCHFL prioritizes different stages of the HFL process to minimize generalization errors. Extensive experiments demonstrate CRCHFL outperforms existing FL methods under strict communication resource constraints. After above HFL challenges are tackled, AD models can enhance generalization via training on large-scale and diverse data from various regions. As time progresses, the data involved in the federated AD system expands. Such ever-expanding data may eventually result in the model under-fitting, leading to poor generalization. To handle this under-fitting limitation, this thesis explores the potential of Large Vision Models (LVMs) to rectify the problem. Precisely, this thesis proposes a pFedLVM framework, which incorporates LVMs into federated AD systems, and exchanges latent features instead of entire LVM parameters between the central server and participating vehicles. This significantly reduces communication demands while allowing for personalized model updates tailored to local driving conditions. Finally, this thesis improves AD model generalization in adverse weather conditions through a novel strategy termed AdvImmu. This approach, which integrates techniques such as Locally Sequential Mechanism (LSM), Globally Shuffled Mechanism (GSM), and Unfolded Regularizers (URs), enhances model robustness across various weather conditions without relying on clear weather references. Additionally, to address the challenge of lacking frame-wise annotations in adverse conditions, AdvImmu incorporates the foundation model SAM with a clustering algorithm SBICAC to generate pseudo-labels, thereby enabling effective training in complex scenarios. | - |
| dc.language | eng | - |
| dc.publisher | The University of Hong Kong (Pokfulam, Hong Kong) | - |
| dc.relation.ispartof | HKU Theses Online (HKUTO) | - |
| dc.rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works. | - |
| dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
| dc.subject.lcsh | Automated vehicles | - |
| dc.title | Generalization in autonomous driving systems | - |
| dc.type | PG_Thesis | - |
| dc.description.thesisname | Doctor of Philosophy | - |
| dc.description.thesislevel | Doctoral | - |
| dc.description.thesisdiscipline | Electrical and Electronic Engineering | - |
| dc.description.nature | published_or_final_version | - |
| dc.date.hkucongregation | 2025 | - |
| dc.identifier.mmsid | 991044970873203414 | - |
