Please use this identifier to cite or link to this item: http://dx.doi.org/10.25673/122639
Title: Fast barcode calling based on k-mer distances
Author(s): Uphoff, Riko Corwin
Schüler, Steffen
Große, IvoLook up in the Integrated Authority File of the German National Library
Müller-Hannemann, MatthiasLook up in the Integrated Authority File of the German National Library
Issue Date: 2026
Type: Article
Language: English
Abstract: DNA barcodes, which are short DNA strings, are regularly used as tags in pooled sequencing experiments to enable the identification of reads originating from the same sample. A crucial task in the subsequent analysis of pooled sequences is barcode calling, where one must identify the corresponding barcode for each read. This task is computationally challenging when the probability of synthesis and sequencing errors is high, like in photolithographic microarray synthesis. Identifying the most similar barcode for each read is a theoretically attractive solution for barcode calling. However, an all-to-all exact similarity calculation is practically infeasible for applications with millions of barcodes and billions of reads. Hence, several computational approaches for barcode calling have been proposed, but the challenge of developing an efficient and precise computational approach remains. Here, we propose a simple, yet highly effective new barcode calling approach that uses a filtering technique based on precomputed k-mer lists. We find that this approach has a slightly higher accuracy than the state-of-the-art approach, is more than 500 times faster than that, and allows barcode calling for one million barcodes and one billion reads per day on a server GPU. The same throughput can even be realized using a CPU-parallel implementation.
URI: https://opendata.uni-halle.de//handle/1981185920/124584
http://dx.doi.org/10.25673/122639
Open Access: Open access publication
License: (CC BY-NC 4.0) Creative Commons Attribution NonCommercial 4.0(CC BY-NC 4.0) Creative Commons Attribution NonCommercial 4.0
Journal Title: PNAS nexus
Publisher: Oxford University Press
Publisher Place: Oxford
Volume: 5
Issue: 2
Original Publication: 10.1093/pnasnexus/pgag001
Appears in Collections:Open Access Publikationen der MLU

Files in This Item:
File SizeFormat 
pgag001.pdf670.82 kBAdobe PDFView/Open