File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Influence of Data Distribution on Federated Learning Performance in Tumor Segmentation

TitleInfluence of Data Distribution on Federated Learning Performance in Tumor Segmentation
Authors
KeywordsAbdomen/GI
Brain/ Brain Stem
Comparative Studies
Convolutional Neural Network (CNN)
CT
Data Distribution
Federated Deep Learning
Liver
MR Imaging
Tumor Segmentation
Issue Date26-Apr-2023
PublisherRadiological Society of North America
Citation
Radiology: Artificial Intelligence, 2023, v. 5, n. 3 How to Cite?
AbstractPurpose: To investigate the correlation between differences in data distributions and federated deep learning (Fed-DL) algorithm performance in tumor segmentation on CT and MR images. Materials and Methods: Two Fed-DL datasets were retrospectively collected (from November 2020 to December 2021): one dataset of liver tumor CT images (Federated Imaging in Liver Tumor Segmentation [or, FILTS]; three sites, 692 scans) and one publicly avail-able dataset of brain tumor MR images (Federated Tumor Segmentation [or, FeTS]; 23 sites, 1251 scans). Scans from both datasets were grouped according to site, tumor type, tumor size, dataset size, and tumor intensity. To quantify differences in data distributions, the following four distance metrics were calculated: earth mover’s distance (EMD), Bhattacharyya distance (BD), χ2 distance (CSD), and Kolmogorov-Smirnov distance (KSD). Both federated and centralized nnU-Net models were trained by using the same grouped datasets. Fed-DL model performance was evaluated by using the ratio of Dice coefficients, θ, between federated and centralized models trained and tested on the same 80:20 split datasets. Results: The Dice coefficient ratio (θ) between federated and centralized models was strongly negatively correlated with the distances between data distributions, with correlation coefficients of −0.920 for EMD, −0.893 for BD, and −0.899 for CSD. However, KSD was weakly correlated with θ, with a correlation coefficient of −0.479. Conclusion: Performance of Fed-DL models in tumor segmentation on CT and MRI datasets was strongly negatively correlated with the distances between data distributions.
Persistent Identifierhttp://hdl.handle.net/10722/331044
ISSN

 

DC FieldValueLanguage
dc.contributor.authorLuo, G-
dc.contributor.authorLiu, T-
dc.contributor.authorLu, J-
dc.contributor.authorChen, X-
dc.contributor.authorYu, L-
dc.contributor.authorWu, J-
dc.contributor.authorChen, DZ-
dc.contributor.authorCai, W-
dc.date.accessioned2023-09-21T06:52:18Z-
dc.date.available2023-09-21T06:52:18Z-
dc.date.issued2023-04-26-
dc.identifier.citationRadiology: Artificial Intelligence, 2023, v. 5, n. 3-
dc.identifier.issn2638-6100-
dc.identifier.urihttp://hdl.handle.net/10722/331044-
dc.description.abstractPurpose: To investigate the correlation between differences in data distributions and federated deep learning (Fed-DL) algorithm performance in tumor segmentation on CT and MR images. Materials and Methods: Two Fed-DL datasets were retrospectively collected (from November 2020 to December 2021): one dataset of liver tumor CT images (Federated Imaging in Liver Tumor Segmentation [or, FILTS]; three sites, 692 scans) and one publicly avail-able dataset of brain tumor MR images (Federated Tumor Segmentation [or, FeTS]; 23 sites, 1251 scans). Scans from both datasets were grouped according to site, tumor type, tumor size, dataset size, and tumor intensity. To quantify differences in data distributions, the following four distance metrics were calculated: earth mover’s distance (EMD), Bhattacharyya distance (BD), χ2 distance (CSD), and Kolmogorov-Smirnov distance (KSD). Both federated and centralized nnU-Net models were trained by using the same grouped datasets. Fed-DL model performance was evaluated by using the ratio of Dice coefficients, θ, between federated and centralized models trained and tested on the same 80:20 split datasets. Results: The Dice coefficient ratio (θ) between federated and centralized models was strongly negatively correlated with the distances between data distributions, with correlation coefficients of −0.920 for EMD, −0.893 for BD, and −0.899 for CSD. However, KSD was weakly correlated with θ, with a correlation coefficient of −0.479. Conclusion: Performance of Fed-DL models in tumor segmentation on CT and MRI datasets was strongly negatively correlated with the distances between data distributions.-
dc.languageeng-
dc.publisherRadiological Society of North America-
dc.relation.ispartofRadiology: Artificial Intelligence-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subjectAbdomen/GI-
dc.subjectBrain/ Brain Stem-
dc.subjectComparative Studies-
dc.subjectConvolutional Neural Network (CNN)-
dc.subjectCT-
dc.subjectData Distribution-
dc.subjectFederated Deep Learning-
dc.subjectLiver-
dc.subjectMR Imaging-
dc.subjectTumor Segmentation-
dc.titleInfluence of Data Distribution on Federated Learning Performance in Tumor Segmentation-
dc.typeArticle-
dc.identifier.doi10.1148/ryai.220082-
dc.identifier.pmid37293342-
dc.identifier.scopuseid_2-s2.0-85161384989-
dc.identifier.volume5-
dc.identifier.issue3-
dc.identifier.eissn2638-6100-
dc.identifier.issnl2638-6100-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats