APPLYING HYPERGRAPHS TO STUDIES IN QUANTITATIVE BIOLOGY
Received: 05th June 2024 Revised: 10th June 2024, 11th June 2024 Accepted: 10th June 2024
DOI:
https://doi.org/10.20319/lijhls.2024.9.2133Keywords:
Hypergraph, Hypergraph Model, Hypergraph Classifier, GraphAbstract
The objective of this research is to demonstrate hypergraph versatility and applicability for modeling diverse biological systems. The inherent structure of hypergraphs allows for encoding of higher-order feature interactions, providing a flexible framework for efficient models that can enhance our understanding of physical phenomena and one that can be generalized across various datasets. By adopting innovative methods including centrality measure and populations of models rather than singular instances, biases and overfitting tendencies are mitigated, again presenting promise for application across a broad spectrum of biological systems. Furthermore, emphasis is placed on the significance of probabilistic distribution analysis in elucidating threshold selection and feature relevance while maintaining high levels of accuracy. Our results demonstrate the advantages of hypergraph models on two different datasets; with the first on gene expression and the identification of outlier genes and the second on classifying starch grains. There is significant scope in the application of the hypergraph to a wider class of biological systems, with the potential to improve understanding of the biological processes.
References
Barton, S., Coster, A., Donovan, D. and Lefevre, J. (2024). A classification model based on a population of hypergraphs. arXiv, 2405.15063. URL https://arxiv.org/abs/2405.15063
Barton, S., Broad, Z., Ortiz-Barrientos, D., Donovan, D. and Lefevre, J.(2023). Hypergraphs and centrality measures identifying key features in gene expression data. Mathematical Biosciences, 366, 109089. DOI https://doi.org/10.1016/j.mbs.2023.109089
Breiman, L. (2001) Random forests. Machine Learning, 45, 5-32. DOI https://doi.org/10.1023/A:1010933404324
Berge, C. (1984) Hypergraphs: combinatorics of finite sets. Elsevier 45. URL https://books.google.com.au/books/about/Hypergraphs.html?id=jEyfse-EKf8C&redir_esc=y
Broad, Z., Lefevre, J., Wilkinson, M., Barton, S., Barbier, F., Jung, H., Donovan, D. and Ortiz-Barrientos, D. (2023). Gene expression divergence during adaptation to contrasting environments. arXiv. DOI 10.22541/au.169823548.87378722/v1
Coster, A.C.F. and Field, J.H. (2015) What starch grain is that? A geometric morphometric approach to determining plant species origin. Journal of Archaeological Science, 58, 9-25. DOI https://doi.org/10.1016/j.jas.2015.03.014
Dash, M. and Liu, H. (1997) Feature selection for classification. Intelligent data analysis, 1(1-4), 131-156. DOI https://doi.org/10.1016/S1088-467X(97)00008-5
Díaz-Uriarte, R. and Alvarez de Andrés, S. (2006) Gene selection and classification of microarray data using random forest. BMC Bioinformatics, 7, 1-13. DOI https://doi.org/10.1186/1471-2105-7-3
Di D., Shi F., Yan F., Xia L., Mo Z., Ding Z., Shan F., Song B., Li S., Wei Y., Shao Y., Han M., Gao Y., Sui H., Gao Y. and Shen D. (2021). Hypergraph learning for identification of COVID-19 with CT imaging, Medical Image Analysis, 68, 101910. DOI 10.1016/j.media.2020.101910
Li, D., Xu, Z., Li, S. and Sun, X. (2013). Link prediction in social networks based on hypergraph. Proceedings of the 22nd international conference on world wide web, 41-42. URL https://api.semanticscholar.org/CorpusID:15302229
Zhou, D., Huang, J. and Schölkopf, B. (2006) Learning with hypergraphs: Clustering, classification, and embedding. Advances in neural information processing systems, 19. URL https://proceedings.neurips.cc/paper_files/paper/2006/file/dff8e9c2ac33381546d96deea9922999-Paper.pdf
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Copyright of Published Articles
Author(s) retain the article copyright and publishing rights without any restrictions.
All published work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.