Science Cast

PIMENTO: A PrIMEr infereNce TOolkit to facilitate large-scale calling of amplicon sequence variants

librarianJuly 7, 2025 3:58am

Views (0)
Comments (0)

Export Citation

Voice is AI-generated

Connected to paperThis paper is a preprint and has not been certified by peer review

PIMENTO: A PrIMEr infereNce TOolkit to facilitate large-scale calling of amplicon sequence variants

bioRxivPDFJuly 6, 2025 12:00am

Authors

Atallah, C.; Richardson, L.; Beracochea, M.; Finn, R. D.

Abstract

The calling of amplicon sequence variants from DNA metabarcoding data is a common method of revealing the taxonomic makeup of environmental samples. A significant hurdle to the large-scale calling of amplicon sequence variants from publicly available nucleotide datasets is the presence of primer sequences in reads, the removal of which is a necessary pre-processing step for this form of analysis. Further, as the details of which primers were used is rarely associated with the sequence records, there is a need for a method that can automatically infer the presence and identity of primers in sequencing data. In this work, we introduce PIMENTO, a Python package which uses a dual-strategy approach for identifying primers that are present in sequencing reads to enable their removal, and therefore facilitate amplicon sequence variant calling at scale.

TwitterandLinkedIn

0 comments

Add comment

PIMENTO: A PrIMEr infereNce TOolkit to facilitate large-scale calling of amplicon sequence variants

PIMENTO: A PrIMEr infereNce TOolkit to facilitate large-scale calling of amplicon sequence variants

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments