Virus detection by high-throughput sequencing of small RNAs: large-scale performance testing of sequence analysis strategies

Recent developments in high-throughput sequencing (HTS), also called next-generation sequencing (NGS), technologies and bioinformatics have drastically changed research on viral pathogens and spurred growing interest in the field of virus diagnostics. However, the reliability of HTS-based virus dete...

Descripción completa

Detalles Bibliográficos
Autores principales: Massart, S., Chiumenti, M., Jonghe, K., Glover, R., Haegeman, A., Koloniuk, I., Kominek, P., Kreuze, Jan F., Kutnjak, D., Lotos, L., Maclot, F., Maliogka, V., Maree, Hans J.
Formato: Journal Article
Lenguaje:Inglés
Publicado: Scientific Societies 2019
Materias:
Acceso en línea:https://hdl.handle.net/10568/100881
_version_ 1855522747758673920
author Massart, S.
Chiumenti, M.
Jonghe, K.
Glover, R.
Haegeman, A.
Koloniuk, I.
Kominek, P.
Kreuze, Jan F.
Kutnjak, D.
Lotos, L.
Maclot, F.
Maliogka, V.
Maree, Hans J.
author_browse Chiumenti, M.
Glover, R.
Haegeman, A.
Jonghe, K.
Koloniuk, I.
Kominek, P.
Kreuze, Jan F.
Kutnjak, D.
Lotos, L.
Maclot, F.
Maliogka, V.
Maree, Hans J.
Massart, S.
author_facet Massart, S.
Chiumenti, M.
Jonghe, K.
Glover, R.
Haegeman, A.
Koloniuk, I.
Kominek, P.
Kreuze, Jan F.
Kutnjak, D.
Lotos, L.
Maclot, F.
Maliogka, V.
Maree, Hans J.
author_sort Massart, S.
collection Repository of Agricultural Research Outputs (CGSpace)
description Recent developments in high-throughput sequencing (HTS), also called next-generation sequencing (NGS), technologies and bioinformatics have drastically changed research on viral pathogens and spurred growing interest in the field of virus diagnostics. However, the reliability of HTS-based virus detection protocols must be evaluated before adopting them for diagnostics. Many different bioinformatics algorithms aimed at detecting viruses in HTS data have been reported but little attention has been paid thus far to their sensitivity and reliability for diagnostic purposes. Therefore, we compared the ability of 21 plant virology laboratories, each employing a different bioinformatics pipeline, to detect 12 plant viruses through a double-blind large-scale performance test using 10 datasets of 21- to 24-nucleotide small RNA (sRNA) sequences from three different infected plants. The sensitivity of virus detection ranged between 35 and 100% among participants, with a marked negative effect when sequence depth decreased. The false-positive detection rate was very low and mainly related to the identification of host genome-integrated viral sequences or misinterpretation of the results. Reproducibility was high (91.6%). This work revealed the key influence of bioinformatics strategies for the sensitive detection of viruses in HTS sRNA datasets and, more specifically (i) the difficulty in detecting viral agents when they are novel or their sRNA abundance is low, (ii) the influence of key parameters at both assembly and annotation steps, (iii) the importance of completeness of reference sequence databases, and (iv) the significant level of scientific expertise needed when interpreting pipeline results. Overall, this work underlines key parameters and proposes recommendations for reliable sRNA-based detection of known and unknown viruses.
format Journal Article
id CGSpace100881
institution CGIAR Consortium
language Inglés
publishDate 2019
publishDateRange 2019
publishDateSort 2019
publisher Scientific Societies
publisherStr Scientific Societies
record_format dspace
spelling CGSpace1008812025-12-08T09:54:28Z Virus detection by high-throughput sequencing of small RNAs: large-scale performance testing of sequence analysis strategies Massart, S. Chiumenti, M. Jonghe, K. Glover, R. Haegeman, A. Koloniuk, I. Kominek, P. Kreuze, Jan F. Kutnjak, D. Lotos, L. Maclot, F. Maliogka, V. Maree, Hans J. diagnosis viroids potatoes viruses rna Recent developments in high-throughput sequencing (HTS), also called next-generation sequencing (NGS), technologies and bioinformatics have drastically changed research on viral pathogens and spurred growing interest in the field of virus diagnostics. However, the reliability of HTS-based virus detection protocols must be evaluated before adopting them for diagnostics. Many different bioinformatics algorithms aimed at detecting viruses in HTS data have been reported but little attention has been paid thus far to their sensitivity and reliability for diagnostic purposes. Therefore, we compared the ability of 21 plant virology laboratories, each employing a different bioinformatics pipeline, to detect 12 plant viruses through a double-blind large-scale performance test using 10 datasets of 21- to 24-nucleotide small RNA (sRNA) sequences from three different infected plants. The sensitivity of virus detection ranged between 35 and 100% among participants, with a marked negative effect when sequence depth decreased. The false-positive detection rate was very low and mainly related to the identification of host genome-integrated viral sequences or misinterpretation of the results. Reproducibility was high (91.6%). This work revealed the key influence of bioinformatics strategies for the sensitive detection of viruses in HTS sRNA datasets and, more specifically (i) the difficulty in detecting viral agents when they are novel or their sRNA abundance is low, (ii) the influence of key parameters at both assembly and annotation steps, (iii) the importance of completeness of reference sequence databases, and (iv) the significant level of scientific expertise needed when interpreting pipeline results. Overall, this work underlines key parameters and proposes recommendations for reliable sRNA-based detection of known and unknown viruses. 2019-03 2019-04-23T21:20:08Z 2019-04-23T21:20:08Z Journal Article https://hdl.handle.net/10568/100881 en Open Access Scientific Societies Massart, S.; Chiumenti, M.; Jonghe, K.; Glover, R.; Haegeman, A.; Koloniuk, I.; Kominek, P.; Kreuze, J.F.; Kutnjak, D.; Lotos, L.; Maclot, F.; Maliogka, V.; Maree, Hans J.; Koloniuk, I. 2019. Virus detection by high-throughput sequencing of small RNAs: large-scale performance testing of sequence analysis strategies. Phytopathology. ISSN 0031-949X. 109:3. pp. 488-497.
spellingShingle diagnosis
viroids
potatoes
viruses
rna
Massart, S.
Chiumenti, M.
Jonghe, K.
Glover, R.
Haegeman, A.
Koloniuk, I.
Kominek, P.
Kreuze, Jan F.
Kutnjak, D.
Lotos, L.
Maclot, F.
Maliogka, V.
Maree, Hans J.
Virus detection by high-throughput sequencing of small RNAs: large-scale performance testing of sequence analysis strategies
title Virus detection by high-throughput sequencing of small RNAs: large-scale performance testing of sequence analysis strategies
title_full Virus detection by high-throughput sequencing of small RNAs: large-scale performance testing of sequence analysis strategies
title_fullStr Virus detection by high-throughput sequencing of small RNAs: large-scale performance testing of sequence analysis strategies
title_full_unstemmed Virus detection by high-throughput sequencing of small RNAs: large-scale performance testing of sequence analysis strategies
title_short Virus detection by high-throughput sequencing of small RNAs: large-scale performance testing of sequence analysis strategies
title_sort virus detection by high throughput sequencing of small rnas large scale performance testing of sequence analysis strategies
topic diagnosis
viroids
potatoes
viruses
rna
url https://hdl.handle.net/10568/100881
work_keys_str_mv AT massarts virusdetectionbyhighthroughputsequencingofsmallrnaslargescaleperformancetestingofsequenceanalysisstrategies
AT chiumentim virusdetectionbyhighthroughputsequencingofsmallrnaslargescaleperformancetestingofsequenceanalysisstrategies
AT jonghek virusdetectionbyhighthroughputsequencingofsmallrnaslargescaleperformancetestingofsequenceanalysisstrategies
AT gloverr virusdetectionbyhighthroughputsequencingofsmallrnaslargescaleperformancetestingofsequenceanalysisstrategies
AT haegemana virusdetectionbyhighthroughputsequencingofsmallrnaslargescaleperformancetestingofsequenceanalysisstrategies
AT koloniuki virusdetectionbyhighthroughputsequencingofsmallrnaslargescaleperformancetestingofsequenceanalysisstrategies
AT kominekp virusdetectionbyhighthroughputsequencingofsmallrnaslargescaleperformancetestingofsequenceanalysisstrategies
AT kreuzejanf virusdetectionbyhighthroughputsequencingofsmallrnaslargescaleperformancetestingofsequenceanalysisstrategies
AT kutnjakd virusdetectionbyhighthroughputsequencingofsmallrnaslargescaleperformancetestingofsequenceanalysisstrategies
AT lotosl virusdetectionbyhighthroughputsequencingofsmallrnaslargescaleperformancetestingofsequenceanalysisstrategies
AT maclotf virusdetectionbyhighthroughputsequencingofsmallrnaslargescaleperformancetestingofsequenceanalysisstrategies
AT maliogkav virusdetectionbyhighthroughputsequencingofsmallrnaslargescaleperformancetestingofsequenceanalysisstrategies
AT mareehansj virusdetectionbyhighthroughputsequencingofsmallrnaslargescaleperformancetestingofsequenceanalysisstrategies