Skip to Main content Skip to Navigation
Journal articles

Finding biomarkers in non-model species: literature mining of transcription factors involved in bovine embryo development

Abstract : Background: Since processes in well-known model organisms have specific features different from those in Bos taurus, the organism under study, a good way to describe gene regulation in ruminant embryos would be a species-specific consideration of closely related species to cattle, sheep and pig. However, as highlighted by a recent report, gene dictionaries in pig are smaller than in cattle, bringing a risk to reduce the gene resources to be mined (and so for sheep dictionaries). Bioinformatics approaches that allow an integration of available information on gene function in model organisms, taking into account their specificity, are thus needed. Besides these closely related and biologically relevant species, there is indeed much more knowledge of (i) trophoblast proliferation and differentiation or (ii) embryogenesis in human and mouse species, which provides opportunities for reconstructing proliferation and/or differentiation processes in other mammalian embryos, including ruminants. The necessary knowledge can be obtained partly from (i) stem cell or cancer research to supply useful information on molecular agents or molecular interactions at work in cell proliferation and (ii) mouse embryogenesis to supply useful information on embryo differentiation. However, the total number of publications for all these topics and species is great and their manual processing would be tedious and time consuming. This is why we used text mining for automated text analysis and automated knowledge extraction. To evaluate the quality of this "mining", we took advantage of studies that reported gene expression profiles during the elongation of bovine embryos and defined a list of transcription factors (or TF, n = 64) that we used as biological "gold standard". When successful, the "mining" approach would identify them all, as well as novel ones. Methods: To gain knowledge on molecular-genetic regulations in a non model organism, we offer an approach based on literature-mining and score arrangement of data from model organisms. This approach was applied to identify novel transcription factors during bovine blastocyst elongation, a process that is not observed in rodents and primates. As a result, searching through human and mouse corpuses, we identified numerous bovine homologs, among which 11 to 14% of transcription factors including the gold standard TF as well as novel TF potentially important to gene regulation in ruminant embryo development. The scripts of the workflow are written in Perl and available on demand. They require data input coming from all various databases for any kind of biological issue once the data has been prepared according to keywords for the studied topic and species; we can provide data sample to illustrate the use and functionality of the workflow.
Complete list of metadatas

Cited literature [58 references]  Display  Hide  Download

https://www.hal.inserm.fr/inserm-02440326
Contributor : Séverine Degrelle <>
Submitted on : Wednesday, January 15, 2020 - 10:22:50 AM
Last modification on : Tuesday, March 17, 2020 - 3:44:43 AM
Document(s) archivé(s) le : Thursday, April 16, 2020 - 1:10:46 PM

File

C4_Turenne_BDM_2012.pdf
Publication funded by an institution

Identifiers

Collections

Citation

Nicolas Turenne, Evgeniy Tiys, Vladimir Ivanisenko, Nikolay Yudin, Elena Ignatieva, et al.. Finding biomarkers in non-model species: literature mining of transcription factors involved in bovine embryo development. BioData Mining, BioMed Central, 2012, 5 (1), pp.12. ⟨10.1186/1756-0381-5-12⟩. ⟨inserm-02440326⟩

Share

Metrics

Record views

62

Files downloads

165