Artículo
Graph coloring for extracting discriminative genes in cancer data
Autor/es | Mahfouz, Mohamed A.
Nepomuceno Chamorro, Juan Antonio |
Departamento | Universidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticos |
Fecha de publicación | 2019 |
Fecha de depósito | 2022-05-27 |
Publicado en |
|
Resumen | Background and objective: The major difficulty of the analysis of the input gene
expression data in a microarray-based approach for an automated diagnosis of can cer is the large number of genes (high dimensionality) with ... Background and objective: The major difficulty of the analysis of the input gene expression data in a microarray-based approach for an automated diagnosis of can cer is the large number of genes (high dimensionality) with many irrelevant genes (noise) compared to the very small number of samples. This research study tackles the dimensionality reduction challenge in this area. Methods: This research study introduces a dimension-reduction technique termed graph coloring approach (GCA) for microarray data-based cancer classification based on analyzing the absolute correlation between gene–gene pairs and partitioning genes into several hubs using graph coloring. GCA starts by a gene-selection step in which top relevant genes are selected using a biserial correlation. Each time, a gene from an ordered list of top relevant genes is selected as the hub gene (representative) and redundant genes are added to its group; the process is repeated recursively for the remaining genes. A gene is considered redundant if its absolute correlation with the hub gene is greater than a controlling threshold. A suitable range for the threshold is estimated by computing a percentage graph for the absolute correlation between gene–gene pairs. Each value in the estimated range for the threshold can efficiently produce a new feature subset. Results: GCA achieved significant improvement over several existing techniques in terms of higher accuracy and a smaller number of features. Also, genes selected by this method are relevant genes according to the information stored in scientific repositories. Conclusions: The proposed dimension-reduction technique can help biologists accu rately predict cancer in several areas of the body. |
Cita | Mahfouz, M.A. y Nepomuceno Chamorro, J.A. (2019). Graph coloring for extracting discriminative genes in cancer data. Annals of Human Genetics, 83 (3), 141-159. |
Ficheros | Tamaño | Formato | Ver | Descripción |
---|---|---|---|---|
Graph coloring for extracting ... | 755.4Kb | [PDF] | Ver/ | |