Influence of Data Distribution on Federated Learning Performance in Tumor Segmentation

Luo, G; Liu, T; Lu, J; Chen, X; Yu, L; Wu, J; Chen, DZ; Cai, W

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1148/ryai.220082
Scopus: eid_2-s2.0-85161384989
PMID: 37293342
Find via

Supplementary

Citations:
- Scopus: 0
- PubMed Central: 0
Appears in Collections:
- Statistics & Actuarial Science: Journal/Magazine Articles

Article: Influence of Data Distribution on Federated Learning Performance in Tumor Segmentation

Title	Influence of Data Distribution on Federated Learning Performance in Tumor Segmentation
Authors	Luo, G Liu, T Lu, J Chen, X Yu, L Wu, J Chen, DZ Cai, W
Keywords	Abdomen/GI Brain/ Brain Stem Comparative Studies Convolutional Neural Network (CNN) CT Data Distribution Federated Deep Learning Liver MR Imaging Tumor Segmentation
Issue Date	26-Apr-2023
Publisher	Radiological Society of North America
Citation	Radiology: Artificial Intelligence, 2023, v. 5, n. 3 How to Cite? DOI: http://dx.doi.org/10.1148/ryai.220082
Abstract	Purpose: To investigate the correlation between differences in data distributions and federated deep learning (Fed-DL) algorithm performance in tumor segmentation on CT and MR images. Materials and Methods: Two Fed-DL datasets were retrospectively collected (from November 2020 to December 2021): one dataset of liver tumor CT images (Federated Imaging in Liver Tumor Segmentation [or, FILTS]; three sites, 692 scans) and one publicly avail-able dataset of brain tumor MR images (Federated Tumor Segmentation [or, FeTS]; 23 sites, 1251 scans). Scans from both datasets were grouped according to site, tumor type, tumor size, dataset size, and tumor intensity. To quantify differences in data distributions, the following four distance metrics were calculated: earth mover’s distance (EMD), Bhattacharyya distance (BD), χ2 distance (CSD), and Kolmogorov-Smirnov distance (KSD). Both federated and centralized nnU-Net models were trained by using the same grouped datasets. Fed-DL model performance was evaluated by using the ratio of Dice coefficients, θ, between federated and centralized models trained and tested on the same 80:20 split datasets. Results: The Dice coefficient ratio (θ) between federated and centralized models was strongly negatively correlated with the distances between data distributions, with correlation coefficients of −0.920 for EMD, −0.893 for BD, and −0.899 for CSD. However, KSD was weakly correlated with θ, with a correlation coefficient of −0.479. Conclusion: Performance of Fed-DL models in tumor segmentation on CT and MRI datasets was strongly negatively correlated with the distances between data distributions.
Persistent Identifier	http://hdl.handle.net/10722/331044
ISSN	2638-6100

DC Field	Value	Language
dc.contributor.author	Luo, G	-
dc.contributor.author	Liu, T	-
dc.contributor.author	Lu, J	-
dc.contributor.author	Chen, X	-
dc.contributor.author	Yu, L	-
dc.contributor.author	Wu, J	-
dc.contributor.author	Chen, DZ	-
dc.contributor.author	Cai, W	-
dc.date.accessioned	2023-09-21T06:52:18Z	-
dc.date.available	2023-09-21T06:52:18Z	-
dc.date.issued	2023-04-26	-
dc.identifier.citation	Radiology: Artificial Intelligence, 2023, v. 5, n. 3	-
dc.identifier.issn	2638-6100	-
dc.identifier.uri	http://hdl.handle.net/10722/331044	-
dc.description.abstract	Purpose: To investigate the correlation between differences in data distributions and federated deep learning (Fed-DL) algorithm performance in tumor segmentation on CT and MR images. Materials and Methods: Two Fed-DL datasets were retrospectively collected (from November 2020 to December 2021): one dataset of liver tumor CT images (Federated Imaging in Liver Tumor Segmentation [or, FILTS]; three sites, 692 scans) and one publicly avail-able dataset of brain tumor MR images (Federated Tumor Segmentation [or, FeTS]; 23 sites, 1251 scans). Scans from both datasets were grouped according to site, tumor type, tumor size, dataset size, and tumor intensity. To quantify differences in data distributions, the following four distance metrics were calculated: earth mover’s distance (EMD), Bhattacharyya distance (BD), χ2 distance (CSD), and Kolmogorov-Smirnov distance (KSD). Both federated and centralized nnU-Net models were trained by using the same grouped datasets. Fed-DL model performance was evaluated by using the ratio of Dice coefficients, θ, between federated and centralized models trained and tested on the same 80:20 split datasets. Results: The Dice coefficient ratio (θ) between federated and centralized models was strongly negatively correlated with the distances between data distributions, with correlation coefficients of −0.920 for EMD, −0.893 for BD, and −0.899 for CSD. However, KSD was weakly correlated with θ, with a correlation coefficient of −0.479. Conclusion: Performance of Fed-DL models in tumor segmentation on CT and MRI datasets was strongly negatively correlated with the distances between data distributions.	-
dc.language	eng	-
dc.publisher	Radiological Society of North America	-
dc.relation.ispartof	Radiology: Artificial Intelligence	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject	Abdomen/GI	-
dc.subject	Brain/ Brain Stem	-
dc.subject	Comparative Studies	-
dc.subject	Convolutional Neural Network (CNN)	-
dc.subject	CT	-
dc.subject	Data Distribution	-
dc.subject	Federated Deep Learning	-
dc.subject	Liver	-
dc.subject	MR Imaging	-
dc.subject	Tumor Segmentation	-
dc.title	Influence of Data Distribution on Federated Learning Performance in Tumor Segmentation	-
dc.type	Article	-
dc.identifier.doi	10.1148/ryai.220082	-
dc.identifier.pmid	37293342	-
dc.identifier.scopus	eid_2-s2.0-85161384989	-
dc.identifier.volume	5	-
dc.identifier.issue	3	-
dc.identifier.eissn	2638-6100	-
dc.identifier.issnl	2638-6100	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: Influence of Data Distribution on Federated Learning Performance in Tumor Segmentation

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats