Data privacy-aware machine learning approach in pancreatic cancer diagnosis

dc.authorid0000-0002-5877-0177
dc.authorwosidV-8861-2017
dc.contributor.authorAkmeşe, Ömer Faruk
dc.date.accessioned2024-09-20T07:38:08Z
dc.date.available2024-09-20T07:38:08Z
dc.date.issued2024en_US
dc.departmentHitit Üniversitesi, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü
dc.description.abstractProblem Pancreatic ductal adenocarcinoma (PDAC) is considered a highly lethal cancer due to its advanced stage diagnosis. The fve-year survival rate after diagnosis is less than 10%. However, if diagnosed early, the fve-year survival rate can reach up to 70%. Early diagnosis of PDAC can aid treatment and improve survival rates by taking necessary precautions. The challenge is to develop a reliable, data privacy-aware machine learning approach that can accurately diagnose pancreatic cancer with biomarkers. Aim The study aims to diagnose a patient’s pancreatic cancer while ensuring the confdentiality of patient records. In addition, the study aims to guide researchers and clinicians in developing innovative methods for diagnosing pancreatic cancer. Methods Machine learning, a branch of artifcial intelligence, can identify patterns by analyzing large datasets. The study pre-processed a dataset containing urine biomarkers with operations such as flling in missing values, cleaning outliers, and feature selection. The data was encrypted using the Fernet encryption algorithm to ensure confdentiality. Ten separate machine learning models were applied to predict individuals with PDAC. Performance metrics such as F1 score, recall, precision, and accuracy were used in the modeling process. Results Among the 590 clinical records analyzed, 199 (33.7%) belonged to patients with pancreatic cancer, 208 (35.3%) to patients with non-cancerous pancreatic disorders (such as benign hepatobiliary disease), and 183 (31%) to healthy individuals. The LGBM algorithm showed the highest efciency by achieving an accuracy of 98.8%. The accuracy of the other algorithms ranged from 98 to 86%. In order to understand which features are more critical and which data the model is based on, the analysis found that the features “plasma_CA19_9”, REG1A, TFF1, and LYVE1 have high importance levels. The LIME analysis also analyzed which features of the model are important in the decision-making process. Conclusions This research outlines a data privacy-aware machine learning tool for predicting PDAC. The results show that a promising approach can be presented for clinical application. Future research should expand the dataset and focus on validation by applying it to various populations.
dc.description.provenanceSubmitted by Zeynep Umut NARİN (umutarslan@hitit.edu.tr) on 2024-09-20T07:36:00Z No. of bitstreams: 1 omer-faruk-akmese2024.pdf: 1678503 bytes, checksum: 85f96f1c779c2ca274178d752c1de362 (MD5)en
dc.description.provenanceApproved for entry into archive by Zeynep Umut NARİN (umutarslan@hitit.edu.tr) on 2024-09-20T07:38:08Z (GMT) No. of bitstreams: 1 omer-faruk-akmese2024.pdf: 1678503 bytes, checksum: 85f96f1c779c2ca274178d752c1de362 (MD5)en
dc.description.provenanceMade available in DSpace on 2024-09-20T07:38:08Z (GMT). No. of bitstreams: 1 omer-faruk-akmese2024.pdf: 1678503 bytes, checksum: 85f96f1c779c2ca274178d752c1de362 (MD5) Previous issue date: 2024en
dc.identifier.citationAkmeşe, Ö. F. (2024). Data privacy-aware machine learning approach in pancreatic cancer diagnosis. BMC Medical Informatics and Decision Making, 24(1), 1-14.
dc.identifier.doi10.1186/s12911-024-02657-2
dc.identifier.issn1472-6947
dc.identifier.pmid39237927
dc.identifier.scopusqualityN/A
dc.identifier.urihttps://doi.org/10.1186/s12911-024-02657-2
dc.identifier.urihttps://hdl.handle.net/11491/9055
dc.identifier.wosWOS:001306379600001
dc.identifier.wosqualityQ2
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.indekslendigikaynakPubMed
dc.language.isoen
dc.publisherBMC
dc.relation.ispartofBMC MEDICAL INFORMATICS AND DECISION MAKING
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/openAccess
dc.subjectPancreas canceren_US
dc.subjectDiagnosisen_US
dc.subjectMachine learningen_US
dc.subjectClassifcationen_US
dc.subjectData privacy-awareen_US
dc.titleData privacy-aware machine learning approach in pancreatic cancer diagnosis
dc.typeArticle

Dosyalar

Orijinal paket
Listeleniyor 1 - 1 / 1
Yükleniyor...
Küçük Resim
İsim:
omer-faruk-akmese2024.pdf
Boyut:
1.6 MB
Biçim:
Adobe Portable Document Format
Açıklama:
Tam Metin / Full Text
Lisans paketi
Listeleniyor 1 - 1 / 1
[ X ]
İsim:
license.txt
Boyut:
1.44 KB
Biçim:
Item-specific license agreed upon to submission
Açıklama: