Inferring Taxonomic Affinities and Genetic Distances Using Morphological Features Extracted from Specimen Images: A Case Study with a Bivalve Data Set
Technical University Ilmenau, Data-intensive Systems and Visualization Group (dAI.SY), Ilmenau 98693, Germany. ORCID iD: 0000-0002-4440-3317
Swedish Museum of Natural History, Department of Paleobiology, Stockholm 104 05, Sweden.
Max Planck Institute for Biogeochemistry, Department of Biogeochemical Integration, Jena 07745, Germany.
Max Planck Institute for Biogeochemistry, Department of Biogeochemical Integration, Jena 07745, Germany; German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig, Germany.
2024 (English). In: Systematic Biology, ISSN 1063-5157, E-ISSN 1076-836X, Vol. 73, p. 920-940. Article in journal (Refereed), Published
Abstract [en]

Reconstructing the tree of life and understanding the relationships of taxa are core questions in evolutionary and systematic biology. The main advances in this field in the last decades were derived from molecular phylogenetics; however, for most species, molecular data are not available. Here, we explore the applicability of 2 deep learning methods—supervised classification approaches and unsupervised similarity learning—to infer organism relationships from specimen images. As a basis, we assembled an image data set covering 4144 bivalve species belonging to 74 families across all orders and subclasses of the extant Bivalvia, with molecular phylogenetic data being available for all families and a complete taxonomic hierarchy for all species. The suitability of this data set for deep learning experiments was evidenced by an ablation study resulting in almost 80% accuracy for identifications on the species level. Three sets of experiments were performed using our data set. First, we included taxonomic hierarchy and genetic distances in a supervised learning approach to obtain predictions on several taxonomic levels simultaneously. Here, we stimulated the model to consider features shared between closely related taxa to be more critical for their classification than features shared with distantly related taxa, imprinting phylogenetic and taxonomic affinities into the architecture and training procedure. Second, we used transfer learning and similarity learning approaches for zero-shot experiments to identify the higher-level taxonomic affinities of test species that the models had not been trained on. The models assigned the unknown species to their respective genera with approximately 48% and 67% accuracy. Lastly, we used unsupervised similarity learning to infer the relatedness of the images without prior knowledge of their taxonomic or phylogenetic affinities. 
The results clearly showed similarities between visual appearance and genetic relationships at the higher taxonomic levels. The correlation was 0.6 for the most species-rich subclass (Imparidentia), ranging from 0.5 to 0.7 for the orders with the most images. Overall, the correlation between visual similarity and genetic distances at the family level was 0.78. However, fine-grained reconstructions based on these observed correlations, such as sister–taxa relationships, require further work. Overall, our results broaden the applicability of automated taxon identification systems and provide a new avenue for estimating phylogenetic relationships from specimen images.
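
Illustrative note: the family-level comparison of visual similarity and genetic distance described in the abstract can be pictured with a minimal sketch. It is not the authors' pipeline. It assumes each image has already been turned into an embedding vector by some model, summarises each family by the centroid of its embeddings, uses cosine distance as the visual distance, and uses Pearson correlation over all family pairs. The function names, the toy vectors, the placeholder family labels, and the toy genetic distances are all hypothetical.

```python
import math
from itertools import combinations


def centroid(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def cosine_distance(u, v):
    """1 - cosine similarity; assumes neither vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def family_level_correlation(embeddings, families, genetic_distance):
    """Correlate visual and genetic distances over all pairs of families.

    embeddings: one vector per image (assumed to come from a trained model).
    families: family label of each image, same order as embeddings.
    genetic_distance: dict mapping frozenset({family_a, family_b}) to a distance.
    """
    # Group image embeddings by family and reduce each group to its centroid.
    by_family = {}
    for vec, fam in zip(embeddings, families):
        by_family.setdefault(fam, []).append(vec)
    centroids = {fam: centroid(vecs) for fam, vecs in by_family.items()}

    # Collect matching visual and genetic distances for every family pair.
    visual, genetic = [], []
    for fam_a, fam_b in combinations(sorted(centroids), 2):
        visual.append(cosine_distance(centroids[fam_a], centroids[fam_b]))
        genetic.append(genetic_distance[frozenset((fam_a, fam_b))])
    return pearson(visual, genetic)


# Hypothetical toy data: three placeholder families, two images each.
toy_embeddings = [[1.0, 0.0], [1.0, 0.1], [0.9, 0.3], [0.8, 0.4], [0.0, 1.0], [0.1, 0.9]]
toy_families = ["FamilyA", "FamilyA", "FamilyB", "FamilyB", "FamilyC", "FamilyC"]
toy_genetic = {
    frozenset(("FamilyA", "FamilyB")): 0.1,
    frozenset(("FamilyA", "FamilyC")): 0.9,
    frozenset(("FamilyB", "FamilyC")): 0.8,
}

r = family_level_correlation(toy_embeddings, toy_families, toy_genetic)
print(round(r, 2))
```

Because pairwise distances are not independent observations, a coefficient obtained this way is usually assessed with a permutation procedure such as a Mantel test rather than a standard significance test.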

Place, publisher, year, edition, pages
Oxford: Oxford Academic, 2024. Vol. 73, p. 920-940
Keywords [en]
Bivalves, deep learning, morphology inference, phylogenetics, similarity learning
National Category
Biological Systematics; Other Earth Sciences
Research subject
Diversity of life; The changing Earth
Identifiers
URN: urn:nbn:se:nrm:diva-5709
DOI: 10.1093/sysbio/syae042
OAI: oai:DiVA.org:nrm-5709
DiVA, id: diva2:1917311
Note

The project was supported by funds from the Federal Ministry of Food and Agriculture (BMEL) based on a decision of the parliament of the Federal Republic of Germany via the Federal Office for Agriculture and Food (BLE) under the Federal Programme for Ecological Farming and Other Forms of Sustainable Agriculture. Federal Ministry of Food and Agriculture (BMEL) grant: 2819NA106 (M.H. and P.M.). This study was funded by the German Ministry of Education and Research (BMBF) grant: 01IS20062 (L.K., J.W., M.H., and P.M.).

Available from: 2024-12-02. Created: 2024-12-02. Last updated: 2025-09-12. Bibliographically approved.

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text: https://academic.oup.com/sysbio/article/73/6/920/7719299

Search in DiVA

By author/editor
Hofmann, Martin; Kiel, Steffen