Protein Secondary Structure Prediction Using Transformers

Manzi Kevin Maxime

arXiv:2512.08613·cs.AI·Published 2025-12-09

Predicting protein secondary structures such as alpha helices, beta sheets, and coils from amino acid sequences is essential for understanding protein function. This work presents a transformer-based model that applies attention mechanisms to protein sequence data to predict structural motifs. A sliding-window data augmentation technique is used on the CB513 dataset to expand the training samples. The transformer shows strong ability to generalize across variable-length sequences while effectively capturing both local and long-range residue interactions.

TopicsProtein & Biomolecules

Tagsprotein-function

arXiv categoriescs.AI

arXiv abstract pagePDF