A lightweight codon-based DNA Transformer for Regulatory Region Identification in the Genome

Avatar
Poster
Voice is AI-generated
Connected to paperThis paper is a preprint and has not been certified by peer review

A lightweight codon-based DNA Transformer for Regulatory Region Identification in the Genome

Authors

Karthik, A. S. P.; Das, A. B.

Abstract

We developed a lightweight codon-based DNA Transformer equipped with multi-head self-attention and an adaptive classifier head, which achieves exon intron classification with high accuracy and also has moderate accuracy in CDS classification and splice site recognition. We named this model as ExIT (Exon-Intron Transformer). We have implemented codon tokenization for this model. This has been validated on the human genome with external validation from the chimpanzee genome. Further benchmarking has implied that our model is better than the existing models in the above tasks.

Follow Us on

0 comments

Add comment