S. N. Murphy, G. Weber, M. Mendis, V. Gainer, H. C. Chueh, S. Churchill, and I. Kohane, Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2), Journal of the American Medical Informatics Association, vol.17, pp.124-130, 2010.

J. Escudié, B. Rance, G. Malamut, S. Khater, A. Burgun et al., A novel data-driven workflow combining literature and electronic health records to estimate comorbidities burden for a specific disease: a case study on autoimmune comorbidities in patients with celiac disease, BMC Med Inform Decis Mak, vol.17, 2017.

E. D'hondt, X. Tannier, and A. Névéol, Redundancy in French Electronic Health Records: A preliminary study, pp.21-30, 2015.

J. O. Wrenn, D. M. Stein, S. Bakken, and P. D. Stetson, Quantifying clinical narrative redundancy in an electronic health record, Journal of the American Medical Informatics Association, vol.17, pp.49-53, 2010.

R. Zhang, S. Pakhomov, B. T. McInnes, and G. B. Melton, Evaluating measures of redundancy in clinical texts, AMIA Annu Symp Proc, vol.2011, pp.1612-1620, 2011.

R. Cohen, M. Elhadad, and N. Elhadad, Redundancy in electronic health record corpora: analysis, impact on text mining performance and mitigation strategies, BMC Bioinformatics, vol.14, p.10, 2013.

S. F. Altschul, T. L. Madden, A. A. Schäffer, J. Zhang, Z. Zhang et al., Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res, vol.25, pp.3389-3402, 1997.

R. A. Gabriel, T. Kuo, J. McAuley, and C. Hsu, Identifying and characterizing highly similar notes in big clinical note datasets, Journal of Biomedical Informatics, vol.82, pp.63-69, 2018.

E. Zapletal, N. Rodon, N. Grabar, and P. Degoulet, Methodology of integration of a clinical data warehouse with a clinical information system: the HEGP case, Stud Health Technol Inform, vol.160, pp.193-197, 2010.

Romedi, référentiel ouvert du …, 2018.

P. Di Tommaso, M. Chatzou, P. Prieto, E. Palumbo, and C. Notredame, Nextflow: A tool for deploying reproducible computational pipelines, 2015.

Docker Inc., Docker: Build, Ship, and Run Any App, Anywhere.

J. Strötgen and M. Gertz, Multilingual and cross-domain temporal tagging, Language Resources and Evaluation, vol.47, pp.269-298, 2013.

Z. Gu, 2018.

Address for correspondence: Bastien Rance, European Hospital Georges Pompidou, 20 rue Leblanc, 75015 Paris, France. Email: bastien.rance@aphp.fr