Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1148/ryai.220082
- Scopus: eid_2-s2.0-85161384989
- PMID: 37293342
- WOS: WOS:001124231100002
Article: Influence of Data Distribution on Federated Learning Performance in Tumor Segmentation
Title | Influence of Data Distribution on Federated Learning Performance in Tumor Segmentation |
---|---|
Authors | Luo, G; Liu, T; Lu, J; Chen, X; Yu, L; Wu, J; Chen, DZ; Cai, W |
Keywords | Abdomen/GI; Brain/Brain Stem; Comparative Studies; Convolutional Neural Network (CNN); CT; Data Distribution; Federated Deep Learning; Liver; MR Imaging; Tumor Segmentation |
Issue Date | 26-Apr-2023 |
Publisher | Radiological Society of North America |
Citation | Radiology: Artificial Intelligence, 2023, v. 5, n. 3 |
Abstract | Purpose: To investigate the correlation between differences in data distributions and federated deep learning (Fed-DL) algorithm performance in tumor segmentation on CT and MR images. Materials and Methods: Two Fed-DL datasets were retrospectively collected (from November 2020 to December 2021): one dataset of liver tumor CT images (Federated Imaging in Liver Tumor Segmentation [or, FILTS]; three sites, 692 scans) and one publicly available dataset of brain tumor MR images (Federated Tumor Segmentation [or, FeTS]; 23 sites, 1251 scans). Scans from both datasets were grouped according to site, tumor type, tumor size, dataset size, and tumor intensity. To quantify differences in data distributions, the following four distance metrics were calculated: earth mover’s distance (EMD), Bhattacharyya distance (BD), χ² distance (CSD), and Kolmogorov-Smirnov distance (KSD). Both federated and centralized nnU-Net models were trained by using the same grouped datasets. Fed-DL model performance was evaluated by using the ratio of Dice coefficients, θ, between federated and centralized models trained and tested on the same 80:20 split datasets. Results: The Dice coefficient ratio (θ) between federated and centralized models was strongly negatively correlated with the distances between data distributions, with correlation coefficients of −0.920 for EMD, −0.893 for BD, and −0.899 for CSD. However, KSD was weakly correlated with θ, with a correlation coefficient of −0.479. Conclusion: Performance of Fed-DL models in tumor segmentation on CT and MRI datasets was strongly negatively correlated with the distances between data distributions. |
Persistent Identifier | http://hdl.handle.net/10722/331044 |
ISSN | 2638-6100 (2023 Impact Factor: 8.1; 2023 SCImago Journal Rankings: 2.602) |
ISI Accession Number ID | WOS:001124231100002 |
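
The abstract names four distances between data distributions (EMD, BD, CSD, KSD) without giving formulas. The following is a minimal sketch of one common way to compute each between two one-dimensional histograms, such as intensity histograms from two sites. The function names, the symmetric form of the χ² distance, and the unit bin width are assumptions made for illustration; the record does not state which definitions the authors used.

```python
import numpy as np


def normalize(hist):
    """Scale a non-negative histogram so its bins sum to 1."""
    h = np.asarray(hist, dtype=float)
    return h / h.sum()


def emd_1d(p, q, bin_width=1.0):
    """Earth mover's distance for 1-D histograms on a shared, evenly spaced grid.

    In one dimension this equals the area between the two cumulative
    distributions (assumed bin width: 1.0 unless given).
    """
    return float(np.sum(np.abs(np.cumsum(p) - np.cumsum(q))) * bin_width)


def bhattacharyya(p, q, eps=1e-12):
    """Bhattacharyya distance: -ln of the Bhattacharyya coefficient.

    The coefficient is clipped at eps so disjoint histograms give a large
    finite value instead of infinity.
    """
    bc = np.sum(np.sqrt(p * q))
    return float(-np.log(max(bc, eps)))


def chi_square(p, q, eps=1e-12):
    """Symmetric chi-square distance (one of several conventions in use)."""
    return float(0.5 * np.sum((p - q) ** 2 / (p + q + eps)))


def kolmogorov_smirnov(p, q):
    """Kolmogorov-Smirnov distance: largest gap between the two CDFs."""
    return float(np.max(np.abs(np.cumsum(p) - np.cumsum(q))))


if __name__ == "__main__":
    # Toy histograms for illustration only; these are not data from the study.
    site_a = normalize([4, 10, 6, 2])
    site_b = normalize([1, 5, 9, 7])
    for name, fn in [("EMD", emd_1d), ("BD", bhattacharyya),
                     ("CSD", chi_square), ("KSD", kolmogorov_smirnov)]:
        print(name, round(fn(site_a, site_b), 4))
```

All four functions expect histograms that share the same bins and have already been normalized; each returns 0 for identical inputs and grows as the distributions diverge.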
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Luo, G | - |
dc.contributor.author | Liu, T | - |
dc.contributor.author | Lu, J | - |
dc.contributor.author | Chen, X | - |
dc.contributor.author | Yu, L | - |
dc.contributor.author | Wu, J | - |
dc.contributor.author | Chen, DZ | - |
dc.contributor.author | Cai, W | - |
dc.date.accessioned | 2023-09-21T06:52:18Z | - |
dc.date.available | 2023-09-21T06:52:18Z | - |
dc.date.issued | 2023-04-26 | - |
dc.identifier.citation | Radiology: Artificial Intelligence, 2023, v. 5, n. 3 | - |
dc.identifier.issn | 2638-6100 | - |
dc.identifier.uri | http://hdl.handle.net/10722/331044 | - |
dc.description.abstract | Purpose: To investigate the correlation between differences in data distributions and federated deep learning (Fed-DL) algorithm performance in tumor segmentation on CT and MR images. Materials and Methods: Two Fed-DL datasets were retrospectively collected (from November 2020 to December 2021): one dataset of liver tumor CT images (Federated Imaging in Liver Tumor Segmentation [or, FILTS]; three sites, 692 scans) and one publicly available dataset of brain tumor MR images (Federated Tumor Segmentation [or, FeTS]; 23 sites, 1251 scans). Scans from both datasets were grouped according to site, tumor type, tumor size, dataset size, and tumor intensity. To quantify differences in data distributions, the following four distance metrics were calculated: earth mover’s distance (EMD), Bhattacharyya distance (BD), χ² distance (CSD), and Kolmogorov-Smirnov distance (KSD). Both federated and centralized nnU-Net models were trained by using the same grouped datasets. Fed-DL model performance was evaluated by using the ratio of Dice coefficients, θ, between federated and centralized models trained and tested on the same 80:20 split datasets. Results: The Dice coefficient ratio (θ) between federated and centralized models was strongly negatively correlated with the distances between data distributions, with correlation coefficients of −0.920 for EMD, −0.893 for BD, and −0.899 for CSD. However, KSD was weakly correlated with θ, with a correlation coefficient of −0.479. Conclusion: Performance of Fed-DL models in tumor segmentation on CT and MRI datasets was strongly negatively correlated with the distances between data distributions. | - |
dc.language | eng | - |
dc.publisher | Radiological Society of North America | - |
dc.relation.ispartof | Radiology: Artificial Intelligence | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject | Abdomen/GI | - |
dc.subject | Brain/Brain Stem | - |
dc.subject | Comparative Studies | - |
dc.subject | Convolutional Neural Network (CNN) | - |
dc.subject | CT | - |
dc.subject | Data Distribution | - |
dc.subject | Federated Deep Learning | - |
dc.subject | Liver | - |
dc.subject | MR Imaging | - |
dc.subject | Tumor Segmentation | - |
dc.title | Influence of Data Distribution on Federated Learning Performance in Tumor Segmentation | - |
dc.type | Article | - |
dc.identifier.doi | 10.1148/ryai.220082 | - |
dc.identifier.pmid | 37293342 | - |
dc.identifier.scopus | eid_2-s2.0-85161384989 | - |
dc.identifier.volume | 5 | - |
dc.identifier.issue | 3 | - |
dc.identifier.eissn | 2638-6100 | - |
dc.identifier.isi | WOS:001124231100002 | - |
dc.identifier.issnl | 2638-6100 | - |
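
The abstract evaluates Fed-DL performance with the ratio of Dice coefficients, θ, between federated and centralized models, and then correlates θ with distribution distance. A minimal sketch of that pipeline follows. The helper names are hypothetical, and Pearson correlation is an assumption, since the record does not say which correlation coefficient was used.

```python
import numpy as np


def dice(pred, truth):
    """Dice coefficient between two binary segmentation masks."""
    pred = np.asarray(pred, dtype=bool)
    truth = np.asarray(truth, dtype=bool)
    denom = pred.sum() + truth.sum()
    if denom == 0:
        # Both masks empty: treat as perfect agreement (a common convention).
        return 1.0
    return float(2.0 * np.logical_and(pred, truth).sum() / denom)


def dice_ratio(dice_federated, dice_centralized):
    """theta: federated Dice divided by centralized Dice on the same test split."""
    return dice_federated / dice_centralized


def pearson(x, y):
    """Pearson correlation coefficient (assumed; other coefficients are possible)."""
    return float(np.corrcoef(x, y)[0, 1])


if __name__ == "__main__":
    # Placeholder values for illustration only; not results from the study.
    distances = [0.1, 0.3, 0.5, 0.8]
    thetas = [dice_ratio(f, c) for f, c in
              [(0.78, 0.80), (0.72, 0.80), (0.65, 0.80), (0.52, 0.80)]]
    print("correlation between distance and theta:", pearson(distances, thetas))
```

A θ close to 1 means the federated model matches the centralized one; a negative correlation between distance and θ corresponds to the trend the abstract reports.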