Chamorro Parejo, AndreĢs David
Loading...
1 results
Publication Search Results
Now showing 1 - 1 of 1
Publication Albertlast: A bidirectional encoder representation of a transformer's approach for the estimation of Line-1 content(2023-05-12) Chamorro Parejo, AndreĢs David; Seguel CampodoĢnico, Juan Jaime; College of Engineering; Ramos, Kenneth; SchuĢtz Schmuck, Marko; Rivera Gallego, Wilson; Arzuaga Cruz, Emmanuel; Department of Computer Science and Engineering; RodrıĢguez RomaĢn, DanielTechnological breakthroughs in high-throughput sequencing platforms have triggered a revolution in genomics. This revolution has significantly augmented an already large number of genomic datasets, and their sizes. Every increase in the amount of data brings about challenges to the ability to process it. For certain bioinformatics tasks, it is no longer possible, or desirable, to rely exclusively on classical alignment and mapping methods. This is, for example, the case of methods for the identification of LINE-1 in the genome, which present challenges in accurately identifying the variations associated with the inserts in a sample. This dissertation developed a masking model using the Bidirectional Encoder Representations from Transformers (BERT) technique and used it to develop a transformer classification model. The final product is an innovative alignment-free system that detect and analyze polymorphic LINE-1 insertions and content estimation in a sample.