Please use this identifier to cite or link to this item: http://dx.doi.org/10.25673/117452
Full metadata record
DC FieldValueLanguage
dc.contributor.authorGriewing, Sebastian-
dc.contributor.authorLechner, Fabian-
dc.contributor.authorGremke, Niklas-
dc.contributor.authorLukáč, Štefan-
dc.contributor.authorJanni, Wolfgang-
dc.contributor.authorWallwiener, Markus-
dc.contributor.authorWagner, Uwe-
dc.contributor.authorHirsch, Martin-
dc.contributor.authorKuhn, Sebastian-
dc.date.accessioned2024-12-06T07:15:14Z-
dc.date.available2024-12-06T07:15:14Z-
dc.date.issued2024-
dc.identifier.urihttps://opendata.uni-halle.de//handle/1981185920/119411-
dc.identifier.urihttp://dx.doi.org/10.25673/117452-
dc.description.abstractPurpose: Large language models (LLM) show potential for decision support in breast cancer care. Their use in clinical care is currently prohibited by lack of control over sources used for decision-making, explainability of the decision-making process and health data security issues. Recent development of Small Language Models (SLM) is discussed to address these challenges. This preclinical proof-of-concept study tailors an open-source SLM to the German breast cancer guideline (BC-SLM) to evaluate initial clinical accuracy and technical functionality in a preclinical simulation. Methods: A multidisciplinary tumor board (MTB) is used as the gold-standard to assess the initial clinical accuracy in terms of concordance of the BC-SLM with MTB and comparing it to two publicly available LLM, ChatGPT3.5 and 4. The study includes 20 fictional patient profiles and recommendations for 5 treatment modalities, resulting in 100 binary treatment recommendations (recommended or not recommended). Statistical evaluation includes concordance with MTB in % including Cohen’s Kappa statistic (κ). Technical functionality is assessed qualitatively in terms of local hosting, adherence to the guideline and information retrieval. Results: The overall concordance amounts to 86% for BC-SLM (κ = 0.721, p < 0.001), 90% for ChatGPT4 (κ = 0.820, p < 0.001) and 83% for ChatGPT3.5 (κ = 0.661, p < 0.001). Specific concordance for each treatment modality ranges from 65 to 100% for BC-SLM, 85–100% for ChatGPT4, and 55–95% for ChatGPT3.5. The BC-SLM is locally functional, adheres to the standards of the German breast cancer guideline and provides referenced sections for its decision-making. Conclusion: The tailored BC-SLM shows initial clinical accuracy and technical functionality, with concordance to the MTB that is comparable to publicly-available LLMs like ChatGPT4 and 3.5. This serves as a proof-of-concept for adapting a SLM to an oncological disease and its guideline to address prevailing issues with LLM by ensuring decision transparency, explainability, source control, and data security, which represents a necessary step towards clinical validation and safe use of language models in clinical oncology.eng
dc.language.isoeng-
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/-
dc.subject.ddc610-
dc.titleProof-of-concept study of a small language model chatbot for breast cancer decision support - a transparent, source-controlled, explainable and data-secure approacheng
dc.typeArticle-
local.versionTypepublishedVersion-
local.bibliographicCitation.journaltitleJournal of cancer research and clinical oncology-
local.bibliographicCitation.volume150-
local.bibliographicCitation.publishernameSpringer-
local.bibliographicCitation.publisherplaceBerlin-
local.bibliographicCitation.doi10.1007/s00432-024-05964-3-
local.openaccesstrue-
dc.identifier.ppn190885894X-
cbs.publication.displayform2024-
local.bibliographicCitation.year2024-
cbs.sru.importDate2024-12-06T07:14:30Z-
local.bibliographicCitationEnthalten in Journal of cancer research and clinical oncology - Berlin : Springer, 1904-
local.accessrights.dnbfree-
Appears in Collections:Open Access Publikationen der MLU

Files in This Item:
File Description SizeFormat 
s00432-024-05964-3.pdf1.13 MBAdobe PDFThumbnail
View/Open