A lightweight codon-based DNA Transformer for Regulatory Region Identification in the Genome
Voice is AI-generated
Connected to paperThis paper is a preprint and has not been certified by peer review
A lightweight codon-based DNA Transformer for Regulatory Region Identification in the Genome
Karthik, A. S. P.; Das, A. B.
AbstractWe developed a lightweight codon-based DNA Transformer equipped with multi-head self-attention and an adaptive classifier head, which achieves exon intron classification with high accuracy and also has moderate accuracy in CDS classification and splice site recognition. We named this model as ExIT (Exon-Intron Transformer). We have implemented codon tokenization for this model. This has been validated on the human genome with external validation from the chimpanzee genome. Further benchmarking has implied that our model is better than the existing models in the above tasks.