Using regular expressions to extract information on pacemaker implantation procedures from clinical reports.

Abstract: Objective: This study evaluated natural language processing methods for extracting clinical data from free text in surgical reports on cardiac pacing and defibrillation, in order to populate a registry. Methods: The information extraction system that we developed is a named entity recognition system based on GATE and regular expressions. A total of 232 reports were analyzed. For each report, we performed manual abstraction, collected the information stored in the registry, and ran information extraction with our system. Sensitivity, positive predictive value and accuracy were used to evaluate our method. Results: Our system extracted information, including numeric data, text, and combinations of numbers and strings, with high sensitivity (>90%) and very high positive predictive value (>95%). Its precision was higher than that of the original registry database, which was populated by manual input. Conclusion: This tool, based on GATE open source components, provides a robust approach to extracting information from documents in a specific narrow domain such as pacemaker reports. Further evaluation is needed before applying it to broader domains.
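As an illustration of the approach described in the abstract, the following is a minimal sketch of regular-expression extraction scored against manual abstraction. It uses Python's re module rather than GATE, and the field names, patterns, and sample report are hypothetical: the abstract does not give the authors' actual rules. Sensitivity is computed as TP / (TP + FN) and positive predictive value as TP / (TP + FP).

```python
import re

# Hypothetical patterns: the abstract does not list the authors' rules,
# so these field names and expressions are illustrative assumptions only.
PATTERNS = {
    "pacing_threshold_v": re.compile(
        r"threshold\s*(?:of|:)?\s*(\d+(?:\.\d+)?)\s*V\b", re.IGNORECASE),
    "lead_impedance_ohm": re.compile(
        r"impedance\s*(?:of|:)?\s*(\d+)\s*(?:ohms?|Ω)", re.IGNORECASE),
    "pacing_mode": re.compile(r"\b(DDDR?|VVIR?|AAIR?)\b"),
}


def extract(report):
    """Return the first captured value for each field, or None if absent."""
    found = {}
    for field, pattern in PATTERNS.items():
        match = pattern.search(report)
        found[field] = match.group(1) if match else None
    return found


def score(extracted, gold):
    """Return (sensitivity, ppv) against manually abstracted gold values."""
    tp = fp = fn = 0
    for field, truth in gold.items():
        value = extracted.get(field)
        if truth is not None and value == truth:
            tp += 1  # extracted value agrees with manual abstraction
        else:
            if value is not None:
                fp += 1  # something was extracted, but it is not correct
            if truth is not None:
                fn += 1  # a documented value was missed or mis-extracted
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    ppv = tp / (tp + fp) if tp + fp else 0.0
    return sensitivity, ppv


# Invented sample report and gold standard, for demonstration only.
report = "DDD pacemaker implanted. Ventricular threshold: 0.5 V, impedance 620 ohms."
gold = {"pacing_threshold_v": "0.5", "lead_impedance_ohm": "600", "pacing_mode": "DDD"}

sensitivity, ppv = score(extract(report), gold)
```

In this sketch a wrongly extracted value counts as both a false positive and a false negative, which is one common convention; the abstract does not state which convention the study used.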
Document type:
Journal article
AMIA .. Annual Symposium proceedings [electronic resource] / AMIA Symposium. AMIA Symposium., 2008, pp.81-5

Cited literature [19 references]

http://www.hal.inserm.fr/inserm-00413991
Contributor: Rosier Arnaud
Submitted on: Monday, 7 September 2009 - 15:16:22
Last modified on: Wednesday, 16 May 2018 - 11:23:40
Document(s) archived on: Saturday, 26 November 2016 - 12:48:12

File

Restricted access
File visible on: never


Identifiers

  • HAL Id : inserm-00413991, version 1
  • PUBMED : 18998970


Citation

Arnaud Rosier, Anita Burgun, Philippe Mabo. Using regular expressions to extract information on pacemaker implantation procedures from clinical reports. AMIA .. Annual Symposium proceedings [electronic resource] / AMIA Symposium. AMIA Symposium., 2008, pp.81-5. 〈inserm-00413991〉
