A semantic model for species description applied to the ensign wasps (Hymenoptera: Evaniidae) of New Caledonia

James P. Balhoff, Istvan Miko, Matthew J. Yoder, Patricia L. Mullins, Andrew Robert Deans

Research output: Contribution to journal › Article

31 Citations (Scopus)

Abstract

Taxonomic descriptions are unparalleled sources of knowledge of life's phenotypic diversity. As natural language prose, these data sets are largely refractory to computation and integration with other sources of phenotypic data. By formalizing taxonomic descriptions using ontology-based semantic representation, we aim to increase the reusability and computability of taxonomists' primary data. Here, we present a revision of the ensign wasp (Hymenoptera: Evaniidae) fauna of New Caledonia using this new model for species description. Descriptive matrices, specimen data, and taxonomic nomenclature are gathered in a unified Web-based application, mx, then exported as both traditional taxonomic treatments and semantic statements using the OWL Web Ontology Language. Character:character-state combinations are then annotated following the entity-quality phenotype model, originally developed to represent mutant model organism phenotype data; concepts of anatomy are drawn from the Hymenoptera Anatomy Ontology and linked to phenotype descriptors from the Phenotypic Quality Ontology. The resulting set of semantic statements is provided in Resource Description Framework format. Applying the model to real data, that is, specimens, taxonomic names, diagnoses, descriptions, and redescriptions, provides us with a foundation to discuss limitations and potential benefits such as automated data integration and reasoner-driven queries. Four species of ensign wasp are now known to occur in New Caledonia: Szepligetella levipetiolata, Szepligetella deercreeki Deans and Mikó sp. nov., Szepligetella irwini Deans and Mikó sp. nov., and the nearly cosmopolitan Evania appendigaster. A fifth species, Szepligetella sericea, including Szepligetella impressa, syn. nov., has not yet been collected in New Caledonia but can be found on islands throughout the Pacific and so is included in the diagnostic key. 
[Biodiversity informatics; Evaniidae; New Caledonia; new species; ontology; semantic phenotypes; semantic species description; taxonomy.]

Original language: English (US)
Pages (from-to): 639-659
Number of pages: 21
Journal: Systematic Biology
Volume: 62
Issue number: 5
DOI: 10.1093/sysbio/syt028
State: Published - Sep 1 2013

Fingerprint

Szepligetella
Evaniidae
New Caledonia
Wasps
Hymenoptera
wasp
Semantics
phenotype
Szepligetella sericea
Phenotype
new species
Anatomy
Language
anatomy
Pacific Islands
Pacific Ocean Islands
redescriptions
Informatics
Information Storage and Retrieval
Biodiversity

All Science Journal Classification (ASJC) codes

  • Ecology, Evolution, Behavior and Systematics
  • Genetics

Cite this

Balhoff, James P.; Miko, Istvan; Yoder, Matthew J.; Mullins, Patricia L.; Deans, Andrew Robert. / A semantic model for species description applied to the ensign wasps (Hymenoptera: Evaniidae) of New Caledonia. In: Systematic Biology. 2013; Vol. 62, No. 5. pp. 639-659.
@article{dbf520d7f99449dd88d3ae90440b5e70,
title = "A semantic model for species description applied to the ensign wasps (Hymenoptera: Evaniidae) of New Caledonia",
abstract = "Taxonomic descriptions are unparalleled sources of knowledge of life's phenotypic diversity. As natural language prose, these data sets are largely refractory to computation and integration with other sources of phenotypic data. By formalizing taxonomic descriptions using ontology-based semantic representation, we aim to increase the reusability and computability of taxonomists' primary data. Here, we present a revision of the ensign wasp (Hymenoptera: Evaniidae) fauna of New Caledonia using this new model for species description. Descriptive matrices, specimen data, and taxonomic nomenclature are gathered in a unified Web-based application, mx, then exported as both traditional taxonomic treatments and semantic statements using the OWL Web Ontology Language. Character:character-state combinations are then annotated following the entity-quality phenotype model, originally developed to represent mutant model organism phenotype data; concepts of anatomy are drawn from the Hymenoptera Anatomy Ontology and linked to phenotype descriptors from the Phenotypic Quality Ontology. The resulting set of semantic statements is provided in Resource Description Framework format. Applying the model to real data, that is, specimens, taxonomic names, diagnoses, descriptions, and redescriptions, provides us with a foundation to discuss limitations and potential benefits such as automated data integration and reasoner-driven queries. Four species of ensign wasp are now known to occur in New Caledonia: Szepligetella levipetiolata, Szepligetella deercreeki Deans and Mik{\'o} sp. nov., Szepligetella irwini Deans and Mik{\'o} sp. nov., and the nearly cosmopolitan Evania appendigaster. A fifth species, Szepligetella sericea, including Szepligetella impressa, syn. nov., has not yet been collected in New Caledonia but can be found on islands throughout the Pacific and so is included in the diagnostic key. 
[Biodiversity informatics; Evaniidae; New Caledonia; new species; ontology; semantic phenotypes; semantic species description; taxonomy.]",
author = "Balhoff, {James P.} and Miko, Istvan and Yoder, {Matthew J.} and Mullins, {Patricia L.} and Deans, {Andrew Robert}",
year = "2013",
month = "9",
day = "1",
doi = "10.1093/sysbio/syt028",
language = "English (US)",
volume = "62",
pages = "639--659",
journal = "Systematic Biology",
issn = "1063-5157",
publisher = "Oxford University Press",
number = "5",
}

A semantic model for species description applied to the ensign wasps (Hymenoptera: Evaniidae) of New Caledonia. / Balhoff, James P.; Miko, Istvan; Yoder, Matthew J.; Mullins, Patricia L.; Deans, Andrew Robert.

In: Systematic Biology, Vol. 62, No. 5, 01.09.2013, p. 639-659.

TY - JOUR

T1 - A semantic model for species description applied to the ensign wasps (Hymenoptera: Evaniidae) of New Caledonia

AU - Balhoff, James P.

AU - Miko, Istvan

AU - Yoder, Matthew J.

AU - Mullins, Patricia L.

AU - Deans, Andrew Robert

PY - 2013/9/1

Y1 - 2013/9/1

AB - Taxonomic descriptions are unparalleled sources of knowledge of life's phenotypic diversity. As natural language prose, these data sets are largely refractory to computation and integration with other sources of phenotypic data. By formalizing taxonomic descriptions using ontology-based semantic representation, we aim to increase the reusability and computability of taxonomists' primary data. Here, we present a revision of the ensign wasp (Hymenoptera: Evaniidae) fauna of New Caledonia using this new model for species description. Descriptive matrices, specimen data, and taxonomic nomenclature are gathered in a unified Web-based application, mx, then exported as both traditional taxonomic treatments and semantic statements using the OWL Web Ontology Language. Character:character-state combinations are then annotated following the entity-quality phenotype model, originally developed to represent mutant model organism phenotype data; concepts of anatomy are drawn from the Hymenoptera Anatomy Ontology and linked to phenotype descriptors from the Phenotypic Quality Ontology. The resulting set of semantic statements is provided in Resource Description Framework format. Applying the model to real data, that is, specimens, taxonomic names, diagnoses, descriptions, and redescriptions, provides us with a foundation to discuss limitations and potential benefits such as automated data integration and reasoner-driven queries. Four species of ensign wasp are now known to occur in New Caledonia: Szepligetella levipetiolata, Szepligetella deercreeki Deans and Mikó sp. nov., Szepligetella irwini Deans and Mikó sp. nov., and the nearly cosmopolitan Evania appendigaster. A fifth species, Szepligetella sericea, including Szepligetella impressa, syn. nov., has not yet been collected in New Caledonia but can be found on islands throughout the Pacific and so is included in the diagnostic key. 
[Biodiversity informatics; Evaniidae; New Caledonia; new species; ontology; semantic phenotypes; semantic species description; taxonomy.]

UR - http://www.scopus.com/inward/record.url?scp=84882786376&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84882786376&partnerID=8YFLogxK

U2 - 10.1093/sysbio/syt028

DO - 10.1093/sysbio/syt028

M3 - Article

VL - 62

SP - 639

EP - 659

JO - Systematic Biology

JF - Systematic Biology

SN - 1063-5157

IS - 5

ER -