Large-scale genomic sequence data resolve the deepest divergences in the legume phylogeny and support a near-simultaneous evolutionary origin of all six subfamilies
Peer reviewed, Journal article
MetadataShow full item record
Original versionNew Phytologist. 2020, 225 (3), 1355-1369. 10.1111/nph.16290
Phylogenomics is increasingly used to infer deep‐branching relationships while revealing the complexity of evolutionary processes such as incomplete lineage sorting, hybridization/introgression and polyploidization. We investigate the deep‐branching relationships among subfamilies of the Leguminosae (or Fabaceae), the third largest angiosperm family. Despite their ecological and economic importance, a robust phylogenetic framework for legumes based on genome‐scale sequence data is lacking. We generated alignments of 72 chloroplast genes and 7621 homologous nuclear‐encoded proteins, for 157 and 76 taxa, respectively. We analysed these with maximum likelihood, Bayesian inference, and a multispecies coalescent summary method, and evaluated support for alternative topologies across gene trees. We resolve the deepest divergences in the legume phylogeny despite lack of phylogenetic signal across all chloroplast genes and the majority of nuclear genes. Strongly supported conflict in the remainder of nuclear genes is suggestive of incomplete lineage sorting. All six subfamilies originated nearly simultaneously, suggesting that the prevailing view of some subfamilies as ‘basal’ or ‘early‐diverging’ with respect to others should be abandoned, which has important implications for understanding the evolution of legume diversity and traits. Our study highlights the limits of phylogenetic resolution in relation to rapid successive speciation.