Please use this identifier to cite or link to this item:
http://dx.doi.org/10.25673/117931
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Schmidt-Barbo, Paul | - |
dc.contributor.author | Kalweit, Gabriel | - |
dc.contributor.author | Naouar, Mehdi | - |
dc.contributor.author | Paschold, Lisa | - |
dc.contributor.author | Willscher, Edith | - |
dc.contributor.author | Schultheiß, Christoph | - |
dc.contributor.author | Märkl, Bruno | - |
dc.contributor.author | Dirnhofer, Stefan | - |
dc.contributor.author | Tzankov, Alexandar | - |
dc.contributor.author | Binder, Mascha | - |
dc.contributor.author | Kalweit, Maria | - |
dc.date.accessioned | 2025-01-27T07:58:30Z | - |
dc.date.available | 2025-01-27T07:58:30Z | - |
dc.date.issued | 2024 | - |
dc.identifier.uri | https://opendata.uni-halle.de//handle/1981185920/119891 | - |
dc.identifier.uri | http://dx.doi.org/10.25673/117931 | - |
dc.description.abstract | The classification of B cell lymphomas, mainly based on light microscopy evaluation by a pathologist, requires many years of training. Since the B cell receptor (BCR) of the lymphoma clonotype and the microenvironmental immune architecture are important features discriminating different lymphoma subsets, we asked whether BCR repertoire next-generation sequencing (NGS) of lymphoma-infiltrated tissues in conjunction with machine learning algorithms could have diagnostic utility in the subclassification of these cancers. We trained a random forest and a linear classifier via logistic regression based on patterns of clonal distribution, VDJ gene usage and physico-chemical properties of the top-n most frequently represented clonotypes in the BCR repertoires of 620 paradigmatic lymphoma samples, comprising nodular lymphocyte predominant B cell lymphoma (NLPBL), diffuse large B cell lymphoma (DLBCL) and chronic lymphocytic leukemia (CLL), alongside 291 control samples. With regard to DLBCL and CLL, the models demonstrated optimal performance when utilizing only the most prevalent clonotype for classification, while in NLPBL, which has a dominant background of non-malignant bystander cells, a broader array of clonotypes enhanced model accuracy. Surprisingly, the straightforward logistic regression model performed best in this seemingly complex classification problem, suggesting linear separability in our chosen dimensions. It achieved a weighted F1-score of 0.84 on a test cohort including 125 samples from all three lymphoma entities and 58 samples from healthy individuals. Together, we provide proof-of-concept that at least the three studied lymphoma entities can be differentiated from each other using BCR repertoire NGS on lymphoma-infiltrated tissues by a trained machine learning model. | eng |
dc.language.iso | eng | - |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | - |
dc.subject.ddc | 610 | - |
dc.title | Detection of disease-specific signatures in B cell repertoires of lymphomas using machine learning | eng |
dc.type | Article | - |
local.versionType | publishedVersion | - |
local.bibliographicCitation.journaltitle | PLoS Computational Biology | - |
local.bibliographicCitation.volume | 20 | - |
local.bibliographicCitation.issue | 7 | - |
local.bibliographicCitation.publishername | Public Library of Science | - |
local.bibliographicCitation.publisherplace | San Francisco, Calif. | - |
local.bibliographicCitation.doi | 10.1371/journal.pcbi.1011570 | - |
local.openaccess | true | - |
dc.identifier.ppn | 1899185097 | - |
cbs.publication.displayform | 2024 | - |
local.bibliographicCitation.year | 2024 | - |
cbs.sru.importDate | 2025-01-27T07:57:48Z | - |
local.bibliographicCitation | Contained in PLoS Computational Biology - San Francisco, Calif. : Public Library of Science, 2005 | - |
local.accessrights.dnb | free | - |
Appears in Collections: | Open Access Publikationen der MLU |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
journal.pcbi.1011570.pdf | | 2.36 MB | Adobe PDF | View/Open |