Distribution and asymptotic behavior of the phylogenetic transfer distance

The transfer distance (TD) was introduced in the classification framework and studied in the context of phylogenetic tree matching. Recently, Lemoine et al. (2018) showed that TD can be a powerful tool to assess the branch support of phylogenies with large data sets, thus providing a relevant altern...

Descripción completa

Detalles Bibliográficos
Autores principales: Felipe, M.D., Domelevo Entfellner, Jean-Baka, Lemoine, F., Truszkowski, J., Gascuel, O.
Formato: Preprint
Lenguaje:Inglés
Publicado: Cold Spring Harbor Laboratory 2018
Materias:
Acceso en línea:https://hdl.handle.net/10568/97458
_version_ 1855532312574296064
author Felipe, M.D.
Domelevo Entfellner, Jean-Baka
Lemoine, F.
Truszkowski, J.
Gascuel, O.
author_browse Domelevo Entfellner, Jean-Baka
Felipe, M.D.
Gascuel, O.
Lemoine, F.
Truszkowski, J.
author_facet Felipe, M.D.
Domelevo Entfellner, Jean-Baka
Lemoine, F.
Truszkowski, J.
Gascuel, O.
author_sort Felipe, M.D.
collection Repository of Agricultural Research Outputs (CGSpace)
description The transfer distance (TD) was introduced in the classification framework and studied in the context of phylogenetic tree matching. Recently, Lemoine et al. (2018) showed that TD can be a powerful tool to assess the branch support of phylogenies with large data sets, thus providing a relevant alternative to Felsenstein's bootstrap. This distance allows a reference branch β in a reference tree T to be compared to a branch b from another tree T, both on the same set of n taxa. The TD between these branches is the number of taxa that must be transferred from one side of b to the other in order to obtain β. By taking the minimum TD from β to all branches in T we define the transfer index, denoted by ϕ(β,T), measuring the degree of agreement of T with β. Let us consider a reference branch β having p tips on its light side and define the transfer support (TS) as 1 - ϕ(β,T)/(p-1). The aim of this article is to provide evidence that p-1 is a meaningful normalization constant in the definition of TS, and measure the statistical significance of TS, assuming that β is compared to a tree T drawn according to a null model. We obtain several results that shed light on these questions in a number of settings. In particular, we study the asymptotic behavior of TS when n tends to ∞, and fully characterize the distribution of ϕ when T is a caterpillar tree.
format Preprint
id CGSpace97458
institution CGIAR Consortium
language Inglés
publishDate 2018
publishDateRange 2018
publishDateSort 2018
publisher Cold Spring Harbor Laboratory
publisherStr Cold Spring Harbor Laboratory
record_format dspace
spelling CGSpace974582024-11-13T09:00:15Z Distribution and asymptotic behavior of the phylogenetic transfer distance Felipe, M.D. Domelevo Entfellner, Jean-Baka Lemoine, F. Truszkowski, J. Gascuel, O. trees forests The transfer distance (TD) was introduced in the classification framework and studied in the context of phylogenetic tree matching. Recently, Lemoine et al. (2018) showed that TD can be a powerful tool to assess the branch support of phylogenies with large data sets, thus providing a relevant alternative to Felsenstein's bootstrap. This distance allows a reference branch β in a reference tree T to be compared to a branch b from another tree T, both on the same set of n taxa. The TD between these branches is the number of taxa that must be transferred from one side of b to the other in order to obtain β. By taking the minimum TD from β to all branches in T we define the transfer index, denoted by ϕ(β,T), measuring the degree of agreement of T with β. Let us consider a reference branch β having p tips on its light side and define the transfer support (TS) as 1 - ϕ(β,T)/(p-1). The aim of this article is to provide evidence that p-1 is a meaningful normalization constant in the definition of TS, and measure the statistical significance of TS, assuming that β is compared to a tree T drawn according to a null model. We obtain several results that shed light on these questions in a number of settings. In particular, we study the asymptotic behavior of TS when n tends to ∞, and fully characterize the distribution of ϕ when T is a caterpillar tree. 2018 2018-09-21T06:56:01Z 2018-09-21T06:56:01Z Preprint https://hdl.handle.net/10568/97458 en Open Access Cold Spring Harbor Laboratory Felipe, M.D., Domelevo Entfellner, J.-B., Lemoine, F., Truszkowski, J. and Gascuel, O. 2018. Distribution and asymptotic behavior of the phylogenetic transfer distance. bioRxiv
spellingShingle trees
forests
Felipe, M.D.
Domelevo Entfellner, Jean-Baka
Lemoine, F.
Truszkowski, J.
Gascuel, O.
Distribution and asymptotic behavior of the phylogenetic transfer distance
title Distribution and asymptotic behavior of the phylogenetic transfer distance
title_full Distribution and asymptotic behavior of the phylogenetic transfer distance
title_fullStr Distribution and asymptotic behavior of the phylogenetic transfer distance
title_full_unstemmed Distribution and asymptotic behavior of the phylogenetic transfer distance
title_short Distribution and asymptotic behavior of the phylogenetic transfer distance
title_sort distribution and asymptotic behavior of the phylogenetic transfer distance
topic trees
forests
url https://hdl.handle.net/10568/97458
work_keys_str_mv AT felipemd distributionandasymptoticbehaviorofthephylogenetictransferdistance
AT domelevoentfellnerjeanbaka distributionandasymptoticbehaviorofthephylogenetictransferdistance
AT lemoinef distributionandasymptoticbehaviorofthephylogenetictransferdistance
AT truszkowskij distributionandasymptoticbehaviorofthephylogenetictransferdistance
AT gascuelo distributionandasymptoticbehaviorofthephylogenetictransferdistance