Analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome

RNA-seq analysis has enabled the evaluation of transcriptional changes in many species including nonmodel organisms. However, in most species only a single reference genome is available and RNA-seq reads from highly divergent varieties are typically aligned to this reference. Here, we quantify the i...

Descripción completa

Detalles Bibliográficos
Autores principales: Slabaugh, Erin, Desai, Jigar S., Sartor, Ryan C., Lawas, Lovely Mae F., Jagadish, Krishna S.V., Doherty, Colleen J.
Formato: Journal Article
Lenguaje:Inglés
Publicado: Cold Spring Harbor Laboratory 2019
Acceso en línea:https://hdl.handle.net/10568/164713
_version_ 1855531753825894400
author Slabaugh, Erin
Desai, Jigar S.
Sartor, Ryan C.
Lawas, Lovely Mae F.
Jagadish, Krishna S.V.
Doherty, Colleen J.
author_browse Desai, Jigar S.
Doherty, Colleen J.
Jagadish, Krishna S.V.
Lawas, Lovely Mae F.
Sartor, Ryan C.
Slabaugh, Erin
author_facet Slabaugh, Erin
Desai, Jigar S.
Sartor, Ryan C.
Lawas, Lovely Mae F.
Jagadish, Krishna S.V.
Doherty, Colleen J.
author_sort Slabaugh, Erin
collection Repository of Agricultural Research Outputs (CGSpace)
description RNA-seq analysis has enabled the evaluation of transcriptional changes in many species including nonmodel organisms. However, in most species only a single reference genome is available and RNA-seq reads from highly divergent varieties are typically aligned to this reference. Here, we quantify the impacts of the choice of mapping genome in rice where three high-quality reference genomes are available. We aligned RNA-seq data from a popular productive rice variety to three different reference genomes and found that the identification of differentially expressed genes differed depending on which reference genome was used for mapping. Furthermore, the ability to detect differentially used transcript isoforms was profoundly affected by the choice of reference genome: Only 30% of the differentially used splicing features were detected when reads were mapped to the more commonly used, but more distantly related reference genome. This demonstrated that gene expression and splicing analysis varies considerably depending on the mapping reference genome, and that analysis of individuals that are distantly related to an available reference genome may be improved by acquisition of new genomic reference material. We observed that these differences in transcriptome analysis are, in part, due to the presence of single nucleotide polymorphisms between the sequenced individual and each respective reference genome, as well as annotation differences between the reference genomes that exist even between syntenic orthologs. We conclude that even between two closely related genomes of similar quality, using the reference genome that is most closely related to the species being sampled significantly improves transcriptome analysis.
format Journal Article
id CGSpace164713
institution CGIAR Consortium
language Inglés
publishDate 2019
publishDateRange 2019
publishDateSort 2019
publisher Cold Spring Harbor Laboratory
publisherStr Cold Spring Harbor Laboratory
record_format dspace
spelling CGSpace1647132025-05-14T10:24:18Z Analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome Slabaugh, Erin Desai, Jigar S. Sartor, Ryan C. Lawas, Lovely Mae F. Jagadish, Krishna S.V. Doherty, Colleen J. RNA-seq analysis has enabled the evaluation of transcriptional changes in many species including nonmodel organisms. However, in most species only a single reference genome is available and RNA-seq reads from highly divergent varieties are typically aligned to this reference. Here, we quantify the impacts of the choice of mapping genome in rice where three high-quality reference genomes are available. We aligned RNA-seq data from a popular productive rice variety to three different reference genomes and found that the identification of differentially expressed genes differed depending on which reference genome was used for mapping. Furthermore, the ability to detect differentially used transcript isoforms was profoundly affected by the choice of reference genome: Only 30% of the differentially used splicing features were detected when reads were mapped to the more commonly used, but more distantly related reference genome. This demonstrated that gene expression and splicing analysis varies considerably depending on the mapping reference genome, and that analysis of individuals that are distantly related to an available reference genome may be improved by acquisition of new genomic reference material. We observed that these differences in transcriptome analysis are, in part, due to the presence of single nucleotide polymorphisms between the sequenced individual and each respective reference genome, as well as annotation differences between the reference genomes that exist even between syntenic orthologs. We conclude that even between two closely related genomes of similar quality, using the reference genome that is most closely related to the species being sampled significantly improves transcriptome analysis. 2019-06 2024-12-19T12:54:13Z 2024-12-19T12:54:13Z Journal Article https://hdl.handle.net/10568/164713 en Cold Spring Harbor Laboratory Slabaugh, Erin; Desai, Jigar S.; Sartor, Ryan C.; Lawas, Lovely Mae F.; Jagadish, S.V. Krishna and Doherty, Colleen J. 2019. Analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome. RNA, Volume 25 no. 6 p. 669-684
spellingShingle Slabaugh, Erin
Desai, Jigar S.
Sartor, Ryan C.
Lawas, Lovely Mae F.
Jagadish, Krishna S.V.
Doherty, Colleen J.
Analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome
title Analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome
title_full Analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome
title_fullStr Analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome
title_full_unstemmed Analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome
title_short Analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome
title_sort analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome
url https://hdl.handle.net/10568/164713
work_keys_str_mv AT slabaugherin analysisofdifferentialgeneexpressionandalternativesplicingissignificantlyinfluencedbychoiceofreferencegenome
AT desaijigars analysisofdifferentialgeneexpressionandalternativesplicingissignificantlyinfluencedbychoiceofreferencegenome
AT sartorryanc analysisofdifferentialgeneexpressionandalternativesplicingissignificantlyinfluencedbychoiceofreferencegenome
AT lawaslovelymaef analysisofdifferentialgeneexpressionandalternativesplicingissignificantlyinfluencedbychoiceofreferencegenome
AT jagadishkrishnasv analysisofdifferentialgeneexpressionandalternativesplicingissignificantlyinfluencedbychoiceofreferencegenome
AT dohertycolleenj analysisofdifferentialgeneexpressionandalternativesplicingissignificantlyinfluencedbychoiceofreferencegenome