Математическая биология и биоинформатика
RUS  ENG    ЖУРНАЛЫ   ПЕРСОНАЛИИ   ОРГАНИЗАЦИИ   КОНФЕРЕНЦИИ   СЕМИНАРЫ   ВИДЕОТЕКА   ПАКЕТ AMSBIB  
Общая информация
Последний выпуск
Архив
Импакт-фактор

Поиск публикаций
Поиск ссылок

RSS
Последний выпуск
Текущие выпуски
Архивные выпуски
Что такое RSS



Матем. биология и биоинформ.:
Год:
Том:
Выпуск:
Страница:
Найти






Персональный вход:
Логин:
Пароль:
Запомнить пароль
Войти
Забыли пароль?
Регистрация


Математическая биология и биоинформатика, 2023, том 18, выпуск 1, страницы 113–127
DOI: https://doi.org/10.17537/2023.18.113
(Mi mbb512)
 

Биоинформатика

Exploiting ensemble learning and negative sample space for predicting extracellular matrix receptor interactions

Abhigyan Natha, Sudama Rathorea, Pangambam Sendash Singhb

a Department of Biochemistry, Pt. Jawahar Lal Nehru Memorial Medical College, Raipur, India
b Department of Computer Science, Banaras Hindu University
Список литературы:
Аннотация: The extracellular matrix (ECM) is best described as a dynamic three-dimensional mesh of various macromolecules. These include proteoglycans (e.g., perlecan andagrin), non-proteoglycan polysaccharides (e.g., hyaluronan), and fibrous proteins (e.g., collagen, elastin, fibronectin, and laminin). ECM proteins are involved in various biological functions and their functionality is largely governed by interaction with other ECM proteins as well as trans-membrane receptors including integrins, proteoglycans such assyndecan, other glycoproteins, and members of the immunoglobulin superfamily. In the present work, a machine learning approach is developed using sequence and evolutionary features for predicting ECM protein-receptor interactions. Two different feature vector representations, namely fusion of feature vectors and average of feature vectors are used within corporation of the best representation employing feature selection. The current results show that the feature vector representation is an important aspect of ECM protein interaction prediction, and that the average of feature vectors performed better than the fusion of feature vectors. The best prediction model with boosted random forest resulted in 72.6% overall accuracy, 74.4% sensitivity and 70.7% specificity with the 200 best features obtained using the ReliefF feature selection algorithm. Further, a comparative analysis was performed for negative sample subset selection using three sampling methods, namely random sampling, $k$-Means sampling, and Uniform sampling. $k$-Means based representative sampling resulted in enhanced accuracy (75.5% accuracy with 80.8% sensitivity, 68.1% specificity and 0.801 AUC) for the prediction of ECM protein-receptor interactions in comparison to the other sampling methods. On comparison with other three state of the art PPI predictors, it is observed that the latter displayed low sensitivity but higher specificity. The current work presents the first machine learning based prediction model specifically developed for ECM protein-receptor interactions.
Ключевые слова: ECM receptor interaction, Boosting; Boosted Random Forest, ReliefF, Random Sampling, $k$-Means, Uniform Sampling.
Материал поступил в редакцию 19.01.2023, 24.02.2023, опубликован 24.04.2023
Тип публикации: Статья
Язык публикации: английский
Образец цитирования: Abhigyan Nath, Sudama Rathore, Pangambam Sendash Singh, “Exploiting ensemble learning and negative sample space for predicting extracellular matrix receptor interactions”, Матем. биология и биоинформ., 18:1 (2023), 113–127
Цитирование в формате AMSBIB
\RBibitem{NatRatSin23}
\by Abhigyan~Nath, Sudama~Rathore, Pangambam~Sendash~Singh
\paper Exploiting ensemble learning and negative sample space for predicting extracellular matrix receptor interactions
\jour Матем. биология и биоинформ.
\yr 2023
\vol 18
\issue 1
\pages 113--127
\mathnet{http://mi.mathnet.ru/mbb512}
\crossref{https://doi.org/10.17537/2023.18.113}
Образцы ссылок на эту страницу:
  • https://www.mathnet.ru/rus/mbb512
  • https://www.mathnet.ru/rus/mbb/v18/i1/p113
  • Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Статистика просмотров:
    Страница аннотации:28
    PDF полного текста:13
    Список литературы:9
     
      Обратная связь:
     Пользовательское соглашение  Регистрация посетителей портала  Логотипы © Математический институт им. В. А. Стеклова РАН, 2024