2018 ESA Annual Meeting (August 5 -- 10)

COS 115-10 - Towards an ecological trait-data standard

Thursday, August 9, 2018: 4:40 PM
R07, New Orleans Ernest N. Morial Convention Center
Florian D. Schneider, Department of Biology, Darmstadt University of Technology, Darmstadt, Germany, Caterina Penone, Institute of Plant Sciences, University of Bern, Bern, Switzerland, Malte Jochum, University of Bern, Switzerland, Gaƫtane Le-Provost, Biodiversity and Climate Research Centre (BiK-F), Senckenberg, Frankfurt, Germany, Andreas Ostrowski, University of Jena, Germany and Nadja K. Simons, Chair for Terrestrial Ecology, Technical University of Munich, Freising, Germany
Background/Question/Methods

Trait-based research spans from evolutionary studies of individual-level properties to global patterns of biodiversity and ecosystem functioning. An increasing number of trait data is available for several organism groups while being published as open access data on a variety of file hosting services. Standardization between datasets is thus generally lacking due to heterogeneous data formats and types. Therefore, data compilation remains a difficult and time consuming task.

We reviewed existing trait databases and online services, as well as initiatives for trait data standardization. Together with data providers and users we identified a need for a minimal trait-data terminology flexible enough to include traits from all types of organisms but simple enough to be adopted by different research communities. We then tested this structure using trait datasets from several research groups working on different taxa and questions in a large project (Biodiversity Exploratories). Finally, we developed an R-based tool to convert any trait dataset into the proposed standard.

Results/Conclusions

In order to facilitate reproducibility of analyses, the reuse and share of data and the combination of datasets from multiple sources, we propose a standardized vocabulary for trait data that is compatible with existing ontologies. By relying on unambiguous identifiers, the proposed minimal vocabulary for trait data captures the different degrees of resolution and measurement detail for multiple use cases of trait-based research. It further encourages the use of global Uniform Resource Identifiers (URI) for taxa and trait definitions, methods and units, thereby following the standards for a semantic web of scientific data.

The R-package facilitates the upload of own data to hosting services but also simplifies the access to published trait data. It offers direct access to trait datasets that have been published in the public domain or under creative commons licenses. All these products are available through the Github platform (https://github.com/EcologicalTraitData) with the aim of a continuous collaboration and improvement with the research community. With these decentralized tools for trait data harmonization, we intend to facilitate the exchange and analysis of trait data for ecological research.