Mostrar el registro sencillo del ítem

Trabajo Fin de Grado

dc.contributor.advisorCubiles de la Vega, María Doloreses
dc.creatorSánchez Santana, Sara del Carmenes
dc.date.accessioned2016-05-05T11:57:29Z
dc.date.available2016-05-05T11:57:29Z
dc.date.issued2015-06
dc.identifier.citationSánchez Santana, S.d.C. (2015). Análisis de datos de RNA-Seq comparación de métodos para el estudio de expresión génica diferencial. (Trabajo Fin de Grado Inédito). Universidad de Sevilla, Sevilla.
dc.identifier.urihttp://hdl.handle.net/11441/40809
dc.description.abstractRNA-Seq is a next generation sequencing (NGS) procedure of the DNA for discovering, profiling and quantifying RNA transcripts, so this technology allows for measure gene expression. The analysis of RNA-Seq data, which comprise discrete counts of reads mapped to genes or transcripts, is made up of different parts: quality control checks on raw sequence data,mapping reads to a reference genome, counts of reads and the detection of differentially expressed genes across different biological conditions. The Bioconductor project provides tools for identifying differential expression for RNA-Seq data by means of the use of R statistical programming language. We focus our attention on four Bioconductor packages: EdgeR, NOISeq, DESeq2 and Limma. Now let’s see the main features of these packages. - EdgeR: This package is based on the negative binomial distribution. The default method for the normalization of data is the trimmed mean of M-values (TMM). For filter out transcripts with very low counts, EdgeR establishes a minimum of counts per millions (CPM). Finally, it uses the exact test for the binomial negative to determinate differentially expressed genes. - NOISeq: It is based on non-parametric approaches for the differential expression analysis of RNA-Seq data. In this case, the normalization default technique implemented is RPKM (reads per kilobase per million). The method to filter out transcripts with low counts is CPM (counts per millions). The differential expression analysis between two experimental conditions is based on the probability of each transcript or gene of being differently expressed: it is obtained by comparing the differential expression statistics, M-D values, of that transcript or gene against distribution of changes in expression values when comparing replicates within the same condition, noise distribution. - DESeq2: It is based on negative binomial generalized linear models. The technique used for the normalization is normalisation based on the estimation of the effective library size. DESEq2 filters out the transcripts which very low counts by means of the mean of normalized counts for each gene. Finally, it uses the Wald test for determining differentially expressed genes. - Limma: This Bioconductor package was designed originally for the analysis of Microarray data, but it has been adapted for the analysis of RNA-Seq data. It is based on the use of linear models. The normalization default technique is the trimmed mean of M-values (TMM). To filter 3 out transcripts with very low counts, Limma uses the minimum of counts per millions (CPM). This package uses the t-test for determining differentially expressed genes. We analyze RNA-Seq data of eight patients with asthma. There are four untreated patients and four patients with dexamethasone, a potent glucocorticoid. We want to identify the number of differentially expressed transcripts between these two experimental conditions: untreateddexamethasone. It can be done by using the four Bioconductor packages seen previously. The obtained results are: 853 differentially expressed transcripts with EdgeR, 415 transcripts with NOISeq, 1212 transcripts with DESeq2, and 719 with Limma. These four packages agree on 277 differentially expressed transcripts between untreated patients and treated (dexamethasone) patients.es
dc.formatapplication/pdfes
dc.language.isospaes
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.titleAnálisis de datos de RNA-Seq comparación de métodos para el estudio de expresión génica diferenciales
dc.typeinfo:eu-repo/semantics/bachelorThesises
dc.type.versioninfo:eu-repo/semantics/publishedVersiones
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses
dc.contributor.affiliationUniversidad de Sevilla. Departamento de Estadística e Investigación Operativaes
dc.description.degreeUniversidad de Sevilla. Grado en Estadísticaes
idus.format.extent64 p.es
dc.identifier.idushttps://idus.us.es/xmlui/handle/11441/40809

FicherosTamañoFormatoVerDescripción
Sánchez Santana Sara del Carmen ...1.379MbIcon   [PDF] Ver/Abrir  

Este registro aparece en las siguientes colecciones

Mostrar el registro sencillo del ítem

Attribution-NonCommercial-NoDerivatives 4.0 Internacional
Excepto si se señala otra cosa, la licencia del ítem se describe como: Attribution-NonCommercial-NoDerivatives 4.0 Internacional