Initial implementation of a comparative data analysis ontology. - Inserm - Institut national de la santé et de la recherche médicale Accéder directement au contenu
Article Dans Une Revue Evolutionary Bioinformatics Année : 2009

Initial implementation of a comparative data analysis ontology.

Résumé

Comparative analysis is used throughout biology. When entities under comparison (e.g. proteins, genomes, species) are related by descent, evolutionary theory provides a framework that, in principle, allows N-ary comparisons of entities, while controlling for non-independence due to relatedness. Powerful software tools exist for specialized applications of this approach, yet it remains under-utilized in the absence of a unifying informatics infrastructure. A key step in developing such an infrastructure is the definition of a formal ontology. The analysis of use cases and existing formalisms suggests that a significant component of evolutionary analysis involves a core problem of inferring a character history, relying on key concepts: "Operational Taxonomic Units" (OTUs), representing the entities to be compared; "character-state data" representing the observations compared among OTUs; "phylogenetic tree", representing the historical path of evolution among the entities; and "transitions", the inferred evolutionary changes in states of characters that account for observations. Using the Web Ontology Language (OWL), we have defined these and other fundamental concepts in a Comparative Data Analysis Ontology (CDAO). CDAO has been evaluated for its ability to represent token data sets and to support simple forms of reasoning. With further development, CDAO will provide a basis for tools (for semantic transformation, data retrieval, validation, integration, etc.) that make it easier for software developers and biomedical researchers to apply evolutionary methods of inference to diverse types of data, so as to integrate this powerful framework for reasoning into their research.
Fichier principal
Vignette du fichier
f_EBO-5-Stoltzfus-et-al_2088.pdf (6.34 Mo) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

inserm-00438663 , version 1 (08-12-2009)

Identifiants

  • HAL Id : inserm-00438663 , version 1
  • PUBMED : 19812726

Citer

Francisco Prosdocimi, Brandon Chisham, Enrico Pontelli, Julie D. Thompson, Arlin Stoltzfus. Initial implementation of a comparative data analysis ontology.. Evolutionary Bioinformatics, 2009, 5, pp.47-66. ⟨inserm-00438663⟩
116 Consultations
276 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More