Article
A Multiobjective Evolutionary Conceptual Clustering Methodology for Gene Annotation Within Structural Databases: A Case of Study on the Gene Ontology Database
Author/s | Romero Zaliz, Rocío C.
Rubio Escudero, Cristina Perren Cobb, J. Herrera, Francisco Cordón, Óscar Zwir, Igor |
Department | Universidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticos |
Publication Date | 2008 |
Deposit Date | 2022-11-28 |
Published in |
|
Abstract | Current tools and techniques devoted to examine the
content of large databases are often hampered by their inability
to support searches based on criteria that are meaningful to
their users. These shortcomings are ... Current tools and techniques devoted to examine the content of large databases are often hampered by their inability to support searches based on criteria that are meaningful to their users. These shortcomings are particularly evident in data banks storing representations of structural data such as biological networks. Conceptual clustering techniques have demonstrated to be appropriate for uncovering relationships between features that characterize objects in structural data. However, typical con ceptual clustering approaches normally recover the most obvious relations, but fail to discover the lessfrequent but more informative underlying data associations. The combination of evolutionary algorithms with multiobjective and multimodal optimization techniques constitutes a suitable tool for solving this problem. We propose a novel conceptual clustering methodology termed evolutionary multiobjective conceptual clustering (EMO-CC), re lying on the NSGA-II multiobjective (MO) genetic algorithm. We apply this methodology to identify conceptual models in struc tural databases generated from gene ontologies. These models can explain and predict phenotypes in the immunoinflammatory response problem, similar to those provided by gene expression or other genetic markers. The analysis of these results reveals that our approach uncovers cohesive clusters, even those comprising a small number of observations explained by several features, which allows describing objects and their interactions from different perspectives and at different levels of detail. |
Funding agencies | Ministerio de Ciencia Y Tecnología (MCYT). España |
Project ID. | TIC-2003-00877
BIO2004-0270E TIN2006-12879 |
Citation | Romero Zaliz, R.C., Rubio Escudero, C., Perren Cobb, J., Herrera, F., Cordón, Ó. y Zwir, I. (2008). A Multiobjective Evolutionary Conceptual Clustering Methodology for Gene Annotation Within Structural Databases: A Case of Study on the Gene Ontology Database. IEEE Transactions on Evolutionary Computation, 12 (6), 679-701. https://doi.org/10.1109/TEVC.2008.915995. |
Files | Size | Format | View | Description |
---|---|---|---|---|
A Multiobjective Evolutionary ... | 2.230Mb | [PDF] | View/ | |