SEUIR Repository

Multi-S3p: protein secondary structure prediction with specialized multi-network and self-attention-based deep learning model

Show simple item record

dc.contributor.author Mohamed Mufassirin, M. M.
dc.contributor.author Hakim Newton, M. A.
dc.contributor.author Julia, Rahman
dc.contributor.author Abdul Sattar
dc.date.accessioned 2023-07-05T04:24:59Z
dc.date.available 2023-07-05T04:24:59Z
dc.date.issued 2023-06-05
dc.identifier.citation IEEE Access,Vol. 11, 2023, pp. 57083-57096. en_US
dc.identifier.issn 2169-3536 (Online)
dc.identifier.uri http://ir.lib.seu.ac.lk/handle/123456789/6720
dc.description.abstract Protein structure prediction (PSP) is a vital challenge in bioinformatics, structural biology and drug discovery. Protein secondary structure (SS) prediction is critical since three-dimensional (3D) structures are primarily made up of secondary structures. With the advancement of deep learning approaches, SS classification accuracy has been significantly improved. Many existing methods use an ensemble of complex neural networks to improve SS prediction. Because of the high dimensionality of the hyperparam eter space, deep neural networks with complex architectures are typically challenging to train effectively. Also, predicting secondary structures in the boundary regions between different types of SS is challenging. This study presents Multi-S3P, which employs bidirectional Long-Short-Term-Memory (BILSTM) and Convolutional Neural Networks (CNN) with a self-attention mechanism to improve the secondary structure prediction using an effective training strategy to capture the unique characteristics of each type of secondary structure and combine them more effectively. The ensemble of CNN and BILSTM can learn both contextual information and long-range interactions between the residues. In addition, using a self-attention mechanism allows the model to focus on the most important features for improving performance. We used the SPOT 1D dataset for the training and validation of our model using a set of four input features derived from amino acid sequences. Further, the model was tested on four popular independent test datasets and compared with various state-of-the-art predictors. The presented results show that Multi-S3P outperformed the other methods in terms of Q3, Q8 accuracy and other performance metrics, achieving the highest Q3 accuracy of 87.57% and a Q8 accuracy of 77.56% on the TEST2016 test set. More importantly, Multi-S3P demonstrates high performance in SS boundary regions. Our experiment also demonstrates that the combination of different input features and a multi-network-based training strategy significantly improved the performance. en_US
dc.language.iso en_US en_US
dc.publisher Institute of Electrical and Electronics Engineers Inc en_US
dc.subject Deep Learning en_US
dc.subject Convolutional Neural Network en_US
dc.subject Protein Structure Prediction en_US
dc.subject Protein Secondary Structure en_US
dc.subject Recurrent Neural Network en_US
dc.title Multi-S3p: protein secondary structure prediction with specialized multi-network and self-attention-based deep learning model en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

  • Research Articles [915]
    THESE ARE RESEARCH ARTICLES OF ACADEMIC STAFF, PUBLISHED IN JOURNALS AND PROCEEDINGS ELSWHERE

Show simple item record

Search SEUIR


Advanced Search

Browse

My Account