Terug naar overzicht

Breaking free from references: a consensus-based approach for community profiling with long amplicon nanopore data

Third-generation sequencing platforms, such as Oxford Nanopore Technology (ONT), have made it possible to characterize communities through the sequencing of long amplicons. While this theoretically allows for an increased taxonomic resolution compared to short-read sequencing platforms such as Illumina, the high error rate remains problematic for accurately identifying the community members present within a sample. Here, we present and validate CONCOMPRA, a tool that allows the detection of closely related strains within a community by drafting and mapping to consensus sequences. We show that CONCOMPRA outperforms several other tools for profiling bacterial communities using full-length 16S rRNA gene sequencing. Since CONCOMPRA does not rely on a sequence database for profiling communities, it is applicable to systems and amplicons for which little to no reference data exists. Our validation test shows that the amplification of long PCR products is likely to produce chimeric byproducts that inflate alpha diversity and skew community structure, stressing the importance of chimera detection. CONCOMPRA is available on GitHub (https://github.com/willem-stock/CONCOMPRA).

Details

Aantal pagina's 10
Volume 26
Tijdschrift nummer 1
Pagina's (van-tot) 1
Type A1: Web of Science-artikel
Categorie Onderzoek
Tijdschrift Briefings in Bioinformatics
Issns 1477-4054
Taal Engels
Bibtex

@misc{4cd79ef0-14d7-4ed5-a244-9bd6dead447c,
title = "Breaking free from references: a consensus-based approach for community profiling with long amplicon nanopore data",
abstract = "Third-generation sequencing platforms, such as Oxford Nanopore Technology (ONT), have made it possible to characterize communities through the sequencing of long amplicons. While this theoretically allows for an increased taxonomic resolution compared to short-read sequencing platforms such as Illumina, the high error rate remains problematic for accurately identifying the community members present within a sample. Here, we present and validate CONCOMPRA, a tool that allows the detection of closely related strains within a community by drafting and mapping to consensus sequences. We show that CONCOMPRA outperforms several other tools for profiling bacterial communities using full-length 16S rRNA gene sequencing. Since CONCOMPRA does not rely on a sequence database for profiling communities, it is applicable to systems and amplicons for which little to no reference data exists. Our validation test shows that the amplification of long PCR products is likely to produce chimeric byproducts that inflate alpha diversity and skew community structure, stressing the importance of chimera detection. CONCOMPRA is available on GitHub (https://github.com/willem-stock/CONCOMPRA).",
author = "Willem Stock and Coralie Rousseau and Glen Dierickx and Sofie D’hondt and Luz Amadei Martínez and Simon M. Dittami and Luna M. van der Loos and Olivier De Clerck",
year = "2024",
month = dec,
day = "16",
doi = "https://doi.org/10.1093/bib/bbae642",
language = "Nederlands",
publisher = "Instituut voor Natuur- en Bosonderzoek",
address = "België,
type = "Other"
}

Auteurs

Willem Stock
Coralie Rousseau
Glen Dierickx
Sofie D’hondt
Luz Amadei Martínez
Simon M. Dittami
Luna M. van der Loos
Olivier De Clerck