TY - JOUR
T1 - DeepB3Pred
T2 - blood–brain barrier peptide predictor using stacked BiGRU model with novel features
AU - Arif, Muhammad
AU - Musleh, Saleh
AU - Alam, Tanvir
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - Background: The blood–brain barrier (B3) acts as a membrane that is a major concern in treating central nervous system (CNS) disorders. The B3 penetrating peptides (B3PPs) play a significant role in delivering therapeutic drugs to a wide range of disorder diseases such as multiple sclerosis, Parkinson’s disease, and Alzheimer’s disease. Therefore, the correct identification of drug agents is important for the disease treatment. Generally, the computational methods are more cost effective and faster than conventional wet-lab methods in predicting B3PPs. Consequently, we have developed a novel deep learning-based predictor called DeepB3Pred that accurately predicts the B3PPs and non-B3PPs from sequence data. Results: In the proposed method, we used three types of novel features namely Pseduo residue energy content matric (PseRECM), graphical and statistical-based feature engineering (GSFE), and composition-transition and distribution (CTD)-based features. These features capture the energy-, graphical-, and compositional-based properties of from the primary peptide sequences. The data skewness is recognized as an inevitable issue that was tackled by employing a random under sampling technique. The extracted data were fed into various deep learning, i.e., stacked bidirectional gated recurrent unit (BiGRU), Deep Forest, and machine learning models, i.e., CatBoost, Support Vector Machine. BiGRU-based DeepB3Pred model attained better results than the other state-of-the-art B3PPs predictors. The prediction efficacy of the proposed model on fivefold cross-validation in terms of accuracy is 0.945, MCC of 0.877, and area under the curve (AUC) of 0.965. The generalization performance on the unseen data is reported as 0.869 for accuracy, 0.635 for MCC, and 0.933 for AUC. Conclusion: We believe our research will accelerate the peptide-based drug discovery for neurological diseases in particular.
AB - Background: The blood–brain barrier (B3) acts as a membrane that is a major concern in treating central nervous system (CNS) disorders. The B3 penetrating peptides (B3PPs) play a significant role in delivering therapeutic drugs to a wide range of disorder diseases such as multiple sclerosis, Parkinson’s disease, and Alzheimer’s disease. Therefore, the correct identification of drug agents is important for the disease treatment. Generally, the computational methods are more cost effective and faster than conventional wet-lab methods in predicting B3PPs. Consequently, we have developed a novel deep learning-based predictor called DeepB3Pred that accurately predicts the B3PPs and non-B3PPs from sequence data. Results: In the proposed method, we used three types of novel features namely Pseduo residue energy content matric (PseRECM), graphical and statistical-based feature engineering (GSFE), and composition-transition and distribution (CTD)-based features. These features capture the energy-, graphical-, and compositional-based properties of from the primary peptide sequences. The data skewness is recognized as an inevitable issue that was tackled by employing a random under sampling technique. The extracted data were fed into various deep learning, i.e., stacked bidirectional gated recurrent unit (BiGRU), Deep Forest, and machine learning models, i.e., CatBoost, Support Vector Machine. BiGRU-based DeepB3Pred model attained better results than the other state-of-the-art B3PPs predictors. The prediction efficacy of the proposed model on fivefold cross-validation in terms of accuracy is 0.945, MCC of 0.877, and area under the curve (AUC) of 0.965. The generalization performance on the unseen data is reported as 0.869 for accuracy, 0.635 for MCC, and 0.933 for AUC. Conclusion: We believe our research will accelerate the peptide-based drug discovery for neurological diseases in particular.
KW - Bioinformatics
KW - Blood–brain barrier peptides (BPs)
KW - Deep learning
KW - Feature extraction
KW - Machine learning
UR - https://www.scopus.com/pages/publications/105020288166
U2 - 10.1186/s12915-025-02419-0
DO - 10.1186/s12915-025-02419-0
M3 - Article
C2 - 41162940
AN - SCOPUS:105020288166
SN - 1741-7007
VL - 23
JO - BMC Biology
JF - BMC Biology
IS - 1
M1 - 325
ER -