Mostrar el registro sencillo del ítem
Trabajo Fin de Grado
Machine Learning: aplicación a datos RNA-Seq
dc.contributor.advisor | Cubiles de la Vega, María Dolores | es |
dc.contributor.advisor | Enguix González, Alicia | es |
dc.creator | Fernández Delgado, Marta | es |
dc.date.accessioned | 2016-09-29T06:15:11Z | |
dc.date.available | 2016-09-29T06:15:11Z | |
dc.date.issued | 2016-09 | |
dc.identifier.citation | Fernández Delgado, M. (201-). Machine Learning: aplicación a datos RNA-Seq. (Trabajo fin de grado inédito). Universidad de Sevilla, Sevilla. | |
dc.identifier.uri | http://hdl.handle.net/11441/46276 | |
dc.description.abstract | Over the past years, biomedicine has experienced a revolution. This is partly due to the very high volume of information existing today that has been produced by both clinical and pharmacological studies as well as Omics data generation. At the same time, the development of statistical and computer techniques that allow its analysis has led to a new area of knowledge: Bioinformatics. The main aim of this study is to compare different Machine Learning techniques applied to a set of genetic data that were obtained through the RNA-Seq method. This method sequences cDNA in a massive way through a high-performance platform in order to gather global information about the DNA of a sample. Due to the efficacy, reproducibility and performance of the high-throughput sequencing, the RNA-Seq method allows researchers to measure the level of gene expression, detect alternative splicing, mutations, etc. It is also included in this study the theoretic and practical description of several supervised learning methods such Classification Trees, Bagging, Random Forest and Boosting. The efficacy of these methods has been measured by different rates that can be subsequently compared. The said rates are: sensitivity, specificity, area under the ROC curve, global hit rate, positive predictive value (PPV), negative predictive value (NPV) and balance accuracy. This study is complemented by the implementation of R software. It presents different graphs (bar charts, box-andwhisker plot, etc.) as well as numerical data of the obtained predictions for each of the methods. | es |
dc.format | application/pdf | es |
dc.language.iso | spa | es |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 Internacional | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.title | Machine Learning: aplicación a datos RNA-Seq | es |
dc.type | info:eu-repo/semantics/bachelorThesis | es |
dc.type.version | info:eu-repo/semantics/publishedVersion | es |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | es |
dc.contributor.affiliation | Universidad de Sevilla. Departamento de Estadística e Investigación Operativa | es |
dc.description.degree | Universidad de Sevilla. Grado en Matemáticas | es |
dc.contributor.group | Universidad de Sevilla. FQM153: Estadística e Investigación Operativa | es |
idus.format.extent | 55 p. | es |
dc.identifier.idus | https://idus.us.es/xmlui/handle/11441/46276 |
Ficheros | Tamaño | Formato | Ver | Descripción |
---|---|---|---|---|
Fernández Delgado Marta TFG.pdf | 1.050Mb | [PDF] | Ver/ | |