D. Friedman and R. Parrish, The population health record: concepts, definition, design, and implementation, Journal of the American Medical Informatics Association, vol.17, issue.4, pp.359-366, 2010.
DOI : 10.1136/jamia.2009.001578

D. Kalra, Electronic health record standards, Yearb Med Inform, pp.136-180, 2006.

F. Farsi, T. Durand, and H. Spacagna, Médicalisation des systèmes d'information -Innovation en Rhône-Alpes: création et exploitation d'une échelle d'autoévaluation. Gestions Hospitalières, pp.7-11, 2006.

T. Durand, H. Spacagna, C. Verdier, P. Biron, and F. A. , The Rhone-Alpes health platform, Methods Inf Med, vol.46, issue.4, pp.451-457, 2007.

C. Quantin, C. Guinot, A. Tursz, J. Salomez, C. Rogier et al., Article original, Revue d'??pid??miologie et de Sant?? Publique, vol.54, issue.2, pp.177-184, 2006.
DOI : 10.1016/S0398-7620(06)76711-6

C. Quantin, O. Cohen, B. Riandey, and F. Allaert, Unique Patient Concept: A key choice for European epidemiology, International Journal of Medical Informatics, vol.76, issue.5-6, pp.5-6419, 2007.
DOI : 10.1016/j.ijmedinf.2006.09.006

A. Roberts, R. Gaizauskas, M. Hepple, G. Demetriou, Y. Guo et al., Building a semantically annotated corpus of clinical texts, Journal of Biomedical Informatics, vol.42, issue.5, pp.950-966, 2009.
DOI : 10.1016/j.jbi.2008.12.013

A. Coden, G. Savova, I. Sominsky, M. Tanenblatt, J. Masanz et al., Automatically extracting cancer disease characteristics from pathology reports into a Disease Knowledge Representation Model, Journal of Biomedical Informatics, vol.42, issue.5, pp.42937-949, 2009.
DOI : 10.1016/j.jbi.2008.12.005

R. Krishna, K. Kelleher, and E. Stahlberg, Patient Confidentiality in the Research Use of Clinical Medical Databases, American Journal of Public Health, vol.97, issue.4, pp.654-658, 2007.
DOI : 10.2105/AJPH.2006.090902

C. Grouin, A. Rosier, O. Dameron, and P. Zweigenbaum, Une proc??dure d???anonymisation ?? deux niveaux pour cr??er un corpus de comptes rendus hospitaliers, Informatique et Santé, vol.17, pp.23-34, 2009.
DOI : 10.1007/978-2-287-99305-3_3

D. Proux, P. Marchal, F. Segond, I. Kergourlay, S. Darmoni et al., Natural language processing to detect risk patterns related to hospital acquired infections Bulgaria: Borvets, Proceedings of the International workshop biomedical information extraction, pp.35-41

B. Dean, J. Lam, J. Natoli, Q. Butler, D. Aguilar et al., Review: Use of Electronic Medical Records for Health Outcomes Research: A Literature Review, Medical Care Research and Review, vol.66, issue.6, pp.611-638, 2009.
DOI : 10.1177/1077558709332440

E. Lau, F. Mowat, M. Kelsh, J. Legg, N. Engel-nitz et al., PCN129 USE OF ELECTRONIC MEDICAL RECORDS (EMR) FOR ONCOLOGY OUTCOMES RESEARCH: ASSESSING THE COMPARABILITY OF EMR INFORMATION TO PATIENT REGISTRY AND HEALTH CLAIMS DATA, Value in Health, vol.14, issue.3, pp.259-272, 2011.
DOI : 10.1016/j.jval.2011.02.983

D. Proux, F. Segond, S. Gerbier, and M. Metzger, Addressing Risk Assessment for Patient Safety in Hospitals through Information Extraction in Medical Reports, Intelligent Information Processing IV. IFIP International Federation for Information Processing, pp.230-239, 2009.
DOI : 10.1007/978-0-387-87685-6_28

URL : https://hal.archives-ouvertes.fr/hal-00428294

M. Metzger, Q. Gicquel, D. Proux, S. Pereira, I. Kergourlay et al., Development of an automated detection tool for healthcareassociated infections based on screening natural language medical reports, AMIA Annu Symp Proc, p.967, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00539388

S. Meystre, G. Savova, K. Kipper-schuler, and J. Hurdle, Extracting information from textual documents in the electronic health record: a review of recent research, Yearb Med Inform, pp.128-172, 2008.

W. Chapman and K. Cohen, Current issues in biomedical text mining and natural language processing, Journal of Biomedical Informatics, vol.42, issue.5, pp.757-759, 2009.
DOI : 10.1016/j.jbi.2009.09.001

URL : http://doi.org/10.1016/j.jbi.2009.09.001

H. Murff, F. Fitzhenry, M. Matheny, N. Gentry, K. Kotter et al., Automated Identification of Postoperative Complications Within an Electronic Medical Record Using Natural Language Processing, JAMA, vol.306, issue.8, pp.306848-855, 2011.
DOI : 10.1001/jama.2011.1204

C. Friedman, S. Johnson, and J. Starren, Architectural requirements for multipurpose natural language processor in the clinical environment, Proceedings of the Annual Symposium on Computer Applications in Medical Care, pp.347-51, 1995.

H. Stenzhorn, E. Pacheco, P. Nohama, and S. Schulz, Automatic mapping of clinical documentation to SNOMED CT. Stud Health Technol Inform, pp.228-232, 2009.

A. Mykowiecka, M. Marciniak, and A. Kupsc, Rule-based information extraction from patients??? clinical data, Journal of Biomedical Informatics, vol.42, issue.5, pp.923-936, 2009.
DOI : 10.1016/j.jbi.2009.07.007

URL : https://hal.archives-ouvertes.fr/inria-00420999

M. Fieschi, La gouvernance de l'interopérabilité sémantique est au coeur du développement des systèmes d'information en santé -Rapport à la Ministre de la Santé et des Sports, 2009.

S. Sakji, Q. Gicquel, S. Pereira, I. Kergoulay, D. Proux et al., Evaluation of a French medical multi-terminology indexer for the manual annotation of natural language medical reports of healthcareassociated infections Cape Town, South Africa, Proceedings of the 13thWorld Congress on Medical Informatics, pp.252-256, 2010.

M. Obenshain, Abstract, Infection Control & Hospital Epidemiology, vol.47, issue.08, pp.690-695, 2004.
DOI : 10.1067/mic.2000.109883

S. Brossette, A. Sprague, W. Jones, and S. Moser, A data mining system for infection control surveillance, Methods Inf Med, vol.39, pp.303-310, 2000.

G. Saporta, Epidémiologie et data mining ou fouille de données. In L'épidémiologie humaine: conditions de son développement en France et rôle des mathématiques, pp.137-142

F. Laforest, S. Frénot, and N. Almasri, Dossier médical semi-structuré pour des interfaces de saisie multi-modales. Revue Documents numériques, pp.29-46, 2002.
DOI : 10.3166/dn.6.1-2.29-46

URL : http://www.cairn.info/load_pdf.php?ID_ARTICLE=DN_061_0029

O. Boussaïd, P. Garçanski, F. Masseglia, and B. Trousse, In Fouille des données complexes, Revue des nouvelles technologies de l'information Cépaduès Edition, 2005.

C. Brodley and M. Friedl, Identifying mislabeled training data, J Artificial Intelligence Res, vol.11, pp.131-167, 1999.

I. Guyon and A. Elisseeff, An introduction to variable and feature selection, J Machine Learning Res, vol.3, pp.1157-1182, 2003.

S. Pakhomov, S. Weston, S. Jacobsen, C. Chute, R. Meverden et al., Electronic medical records for clinical research: application to the identification of heart failure, Am J Manag Care, vol.13, issue.6, pp.281-288, 2007.

N. Terrin, C. Schmid, J. Griffith, D. Agostino, R. Selker et al., External validity of predictive models: a comparison of logistic regression, classification trees, and neural networks, Journal of Clinical Epidemiology, vol.56, issue.8, pp.56721-729, 2003.
DOI : 10.1016/S0895-4356(03)00120-3

T. Dietterich, Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms, Neural Computation, vol.6, issue.7, pp.1895-1923, 1998.
DOI : 10.1007/BF00058655

S. Dudoit and M. Van-der-laan, Multiple testing procedures with applications to, Genomics, 2008.

N. Chawla, N. Japkowicz, and A. Kolcz, Editorial, ACM SIGKDD Explorations Newsletter, vol.6, issue.1, pp.1-6, 2004.
DOI : 10.1145/1007730.1007733

N. Japkowicz and S. Stephen, The class imbalance problem: a systematic study. Intelligent Data Analysis, pp.429-450, 2002.

G. Batista, R. Prati, and M. Monard, A study of the behavior of several methods for balancing machine learning training data, ACM SIGKDD Explorations Newsletter, vol.6, issue.1, pp.20-29, 2004.
DOI : 10.1145/1007730.1007735

P. Domingos, MetaCost, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '99, pp.155-164, 1999.
DOI : 10.1145/312129.312220

P. Lenca, S. Lallich, T. Do, and K. Pham, A Comparison of Different Off-Centered Entropies to Deal with Class Imbalance for Decision Trees, Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp.634-643, 2008.
DOI : 10.1007/978-3-540-68125-0_59

G. Ritschard, V. Pisetta, and D. Zighed, Inducing and evaluating classification trees with statistical implicative criteria. Statistical Implicative Analysis: Theory and Applications. Series Studies in Computational Intelligence, pp.397-420, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00519611

G. Weiss, Mining with rarity, ACM SIGKDD Explorations Newsletter, vol.6, issue.1, pp.7-19, 2004.
DOI : 10.1145/1007730.1007734

Q. Gicquel, D. Proux, P. Marchal, C. Hagège, Y. Berrouane et al., Evaluation d'un outil d'aide à l'anonymisation des documents médicaux basé sur le traitement automatique du langage naturel. In Systèmes d'information pour l'amélioration de la qualité en santé: comptes rendus des quatorzièmes Journées francophones d'informatique médicale, 23 et 24 septembre, pp.165-176, 2011.
DOI : 10.1007/978-2-8178-0285-5_15

D. Centers, . Control, and . Prevention, Automated detection and reporting of notifiable diseases using electronic medical records versus passive surveillance -Massachusetts, MMWR Morb Mortal Wkly Rep, vol.57, issue.14, pp.373-376, 2006.

M. Klompas, G. Haney, D. Church, R. Lazarus, X. Hou et al., Automated Identification of Acute Hepatitis B Using Electronic Medical Record Data to Facilitate Public Health Surveillance, PLoS ONE, vol.46, issue.7, p.2626, 2008.
DOI : 10.1371/journal.pone.0002626.t003

W. Stead, H. Lin, W. Stead, and H. Lin, Community in Health Care Informatics. National-Research-Council: Free executive summary. In Computational technology for effective healthcare: immediate steps and strategic directions, pp.1-12

C. Schoen, R. Osborn, M. Doty, D. Squires, J. Peugh et al., A Survey Of Primary Care Physicians In Eleven Countries, 2009: Perspectives On Care, Costs, And Experiences, Health Affairs, vol.28, issue.6, pp.28-1171, 2009.
DOI : 10.1377/hlthaff.28.6.w1171

J. Anderson, Social, ethical and legal barriers to e-health, Int J Med Inform, vol.76, pp.5-6480, 2007.

J. Officiel-de-la-république-française, Arrêté du 22 septembre 2011 portant approbation de la convention nationale des médecins généralistes et spécialistes, 2011.

M. Apkon and P. Singhaviranon, Impact of an electronic information system on physician workflow and data collection in the intensive care unit, Intensive Care Medicine, vol.27, issue.1, pp.122-130, 2001.
DOI : 10.1007/s001340000777

J. Denny, A. Spickard, K. Johnson, N. Peterson, J. Peterson et al., Evaluation of a Method to Identify and Categorize Section Headers in Clinical Documents, Journal of the American Medical Informatics Association, vol.16, issue.6, pp.806-815, 2009.
DOI : 10.1197/jamia.M3037

C. Quantin, G. Coatrieux, M. Fassa, V. Breton, D. Jaquet-chiffelle et al., Centralised versus decentralised management of patients' medical records, Stud Health Technol Inform, vol.150, pp.700-704, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00473701