Protein Secondary Structure Prediction Using Transformers
Manzi Kevin Maxime
arXiv:2512.08613·cs.AI·Published 2025-12-09
Predicting protein secondary structures such as alpha helices, beta sheets, and coils from amino acid sequences is essential for understanding protein function. This work presents a transformer-based model that applies attention mechanisms to protein sequence data to predict structural motifs. A sliding-window data augmentation technique is used on the CB513 dataset to expand the training samples. The transformer shows strong ability to generalize across variable-length sequences while effectively capturing both local and long-range residue interactions.
TopicsProtein & Biomolecules
Tagsprotein-function
arXiv categoriescs.AI
arXiv abstract pagePDF