B. André, T. Vercauteren, A. M. Buchner, M. W. Shahid, M. B. Wallace et al., An Image Retrieval Approach to Setup Difficulty Levels in Training Systems for Endomicroscopy Diagnosis, Proc MICCAI'10, pp.480-487, 2010.
DOI : 10.1007/978-3-642-15745-5_59

S. Arya and D. M. Mount, Approximate nearest neighbor queries in fixed dimensions, Proc ACM-SIAM SODA'93, pp.271-280, 1993.

T. Blum, H. Feussner, and N. Navab, Modeling and Segmentation of Surgical Workflow from Laparoscopic Video, Proc. MICCAI'10, pp.400-407, 2010.
DOI : 10.1007/978-3-642-15711-0_50

E. Bruno, N. Moenne-loccoz, and S. Marchand-maillet, Design of Multimodal Dissimilarity Spaces for Retrieval of Video Documents, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.30, issue.9, pp.1520-1533, 2008.
DOI : 10.1109/TPAMI.2007.70801

A. M. Cano, F. Gayá, P. Lamata, P. Sánchez-gonzález, and E. J. Gómez, Laparoscopic Tool Tracking Method for Augmented Reality Surgical Applications, Proc LNCS'08, pp.191-196, 2008.
DOI : 10.1007/978-3-540-70521-5_21

Y. Cao, D. Liu, W. Tavanapong, J. Wong, J. Oh et al., Computer-Aided Detection of Diagnostic and Therapeutic Operations in Colonoscopy Videos, IEEE Transactions on Biomedical Engineering, vol.54, issue.7, pp.1268-1279, 2007.
DOI : 10.1109/TBME.2007.890734

X. Castells, M. Comas, M. Castilla, F. Cots, and S. Alarcón, Clinical outcomes and costs of cataract surgery performed by planned ECCE and phacoemulsification, International Ophthalmology, vol.22, issue.6, pp.363-367, 1998.
DOI : 10.1023/A:1006484411524

S. Dev, W. F. Mieler, J. S. Pulido, and R. A. Mittra, Visual outcomes after pars plana vitrectomy for epiretinal membranes associated with pars planitis, Ophthalmology, vol.106, issue.6, pp.1086-1090, 1999.
DOI : 10.1016/S0161-6420(99)90247-6

M. Douze, H. Jégou, and C. Schmid, An Image-Based Approach to Video Copy Detection With Spatio-Temporal Post-Filtering, IEEE Transactions on Multimedia, vol.12, issue.4, pp.257-266, 2010.
DOI : 10.1109/TMM.2010.2046265

URL : https://hal.archives-ouvertes.fr/inria-00604034

O. Duchenne, I. Laptev, J. Sivic, F. Bach, and J. Ponce, Automatic annotation of human actions in video, 2009 IEEE 12th International Conference on Computer Vision, pp.1491-1498, 2009.
DOI : 10.1109/ICCV.2009.5459279

A. Dyana, M. P. Subramanian, and S. Das, Combining Features for Shape and Motion Trajectory of Video Objects for Efficient Content Based Video Retrieval, 2009 Seventh International Conference on Advances in Pattern Recognition, pp.113-116, 2009.
DOI : 10.1109/ICAPR.2009.37

H. P. Gao and Z. Q. Yang, Content Based Video Retrieval Using Spatiotemporal Salient Objects, 2010 International Symposium on Intelligence Information Processing and Trusted Computing, pp.689-692, 2010.
DOI : 10.1109/IPTC.2010.30

S. Giannarou and G. Yang, Content-Based Surgical Workflow Representation Using Probabilistic Motion Modeling, LNCS MIAR'10, pp.314-323, 2010.
DOI : 10.1007/978-3-642-15699-1_33

A. Gionis, P. Indyk, and R. Motwani, Similarity search in high dimensions via hashing, Proc VLDB'99, pp.518-529, 1999.

J. A. Hanley and B. J. Mcneil, The meaning and use of the area under a receiver operating characteristic (ROC) curve., Radiology, vol.143, issue.1, pp.29-36, 1982.
DOI : 10.1148/radiology.143.1.7063747

B. B. Haro, L. Zappella, and R. Vidal, Surgical gesture classification from video data, Proc. MICCAI'12, pp.34-41, 2012.

Z. Harris, Distributional structure, pp.146-62, 1954.

S. C. Hoi and M. R. Lyu, A Multimodal and Multilevel Ranking Framework for Content-Based Video Retrieval, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07, pp.1225-1228, 2007.
DOI : 10.1109/ICASSP.2007.367297

W. Hu, D. Xie, Z. Fu, W. Zeng, and S. Maybank, Semantic-Based Surveillance Video Retrieval, IEEE Transactions on Image Processing, vol.16, issue.4, pp.1168-1181, 2007.
DOI : 10.1109/TIP.2006.891352

M. Huang, W. Yang, M. Yu, Z. Lu, Q. Feng et al., Retrieval of Brain Tumors with Region-Specific Bag-of-Visual-Words Representations in Contrast-Enhanced MRI Images, Computational and Mathematical Methods in Medicine, vol.18, issue.8, 2012.
DOI : 10.1109/TPAMI.2010.147

K. Juan and H. Cuiying, Content-based video retrieval system research, Proc ICCSIT'10, pp.701-704, 2010.

F. Lalys, L. Riffaud, D. Bouget, and P. Jannin, An applicationdependent framework for the recognition of high-level surgical tasks in the OR, Proc. MICCAI'11, pp.331-338, 2011.
URL : https://hal.archives-ouvertes.fr/inserm-00617015

F. Lalys, L. Riffaud, D. Bouget, and P. Jannin, A Framework for the Recognition of High-Level Surgical Tasks From Video Images for Cataract Surgeries, IEEE Transactions on Biomedical Engineering, vol.59, issue.4, pp.966-76, 2012.
DOI : 10.1109/TBME.2011.2181168

URL : https://hal.archives-ouvertes.fr/inserm-00669682

I. Laptev, On Space-Time Interest Points, International Journal of Computer Vision, vol.17, issue.8, pp.107-123, 2005.
DOI : 10.1007/s11263-005-1838-7

C. L. Lawson and B. J. Hanson, Solving Least Squares Problems, 1974.
DOI : 10.1137/1.9781611971217

B. D. Lucas and T. Kanade, An iterative image registration technique with an application to stereo vision, Proc IUW'81, pp.121-130, 1981.

M. Marsza-lek, I. Laptev, and C. Schmid, Actions in context, Proc IEEE CVPR'09, pp.2929-2936, 2009.

X. Naturel and P. Gros, Detecting repeats for video structuring, Multimedia Tools and Applications, vol.13, issue.3, pp.233-252, 2008.
DOI : 10.1007/s11042-007-0180-1

URL : https://hal.archives-ouvertes.fr/inria-00568177

O. Neill, M. Ryan, and C. , Grammatical evolution, IEEE Trans Evol Comput, vol.5, issue.4, 2001.

N. Padoy, T. Blum, S. Ahmadi, H. Feussner, M. Berger et al., Statistical modeling and recognition of surgical workflow, Medical Image Analysis, vol.16, issue.3, pp.632-641, 2012.
DOI : 10.1016/j.media.2010.10.001

URL : https://hal.archives-ouvertes.fr/inria-00526493

B. V. Patel, A. V. Deorankar, and B. B. Meshram, Content based video retrieval using entropy, edge detection, black and white color features, 2010 2nd International Conference on Computer Engineering and Technology, pp.272-276, 2010.
DOI : 10.1109/ICCET.2010.5486262

G. Piriou, P. Bouthemy, and J. Yao, Recognition of Dynamic Video Contents With Global Probabilistic Models of Visual Motion, IEEE Transactions on Image Processing, vol.15, issue.11, pp.3417-3430, 2006.
DOI : 10.1109/TIP.2006.881963

URL : https://hal.archives-ouvertes.fr/hal-00453197

G. Quellec, M. Lamard, G. Cazuguel, B. Cochener, and C. Roux, Fast Wavelet-Based Image Characterization for Highly Adaptive Image Retrieval, IEEE Transactions on Image Processing, vol.21, issue.4, pp.1613-1623, 2012.
DOI : 10.1109/TIP.2011.2180915

URL : https://hal.archives-ouvertes.fr/hal-00945400

C. E. Reiley and G. D. Hager, Task versus Subtask Surgical Skill Evaluation of Robotic Minimally Invasive Surgery, Proc. MICCAI'09, pp.435-442, 2009.
DOI : 10.1007/978-3-642-04268-3_54

H. Sakoe and S. Chiba, Dynamic programming algorithm optimization for spoken word recognition, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.26, issue.1, pp.43-49, 1978.
DOI : 10.1109/TASSP.1978.1163055

R. Schapire, Strength of weak learnability, Mach Learn, vol.5, pp.197-227, 1990.

A. F. Smeaton, P. Over, and W. Kraaij, Evaluation campaigns and TRECVid, Proceedings of the 8th ACM international workshop on Multimedia information retrieval , MIR '06, pp.321-330, 2006.
DOI : 10.1145/1178677.1178722

A. W. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain, Content-based image retrieval at the end of the early years, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, issue.12, pp.1349-1380, 2000.
DOI : 10.1109/34.895972

T. Syeda-mahmood, D. Ponceleon, and J. Yang, Validating cardiac echo diagnosis through video similarity, Proceedings of the 13th annual ACM international conference on Multimedia , MULTIMEDIA '05, pp.527-530, 2005.
DOI : 10.1145/1101149.1101268

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.331.898

T. Tamaki and J. Yoshimuta, Computer-aided colorectal tumor classification in NBI endoscopy using local features, Medical Image Analysis, vol.17, issue.1, pp.78-100, 2013.
DOI : 10.1016/j.media.2012.08.003

L. Tao, E. Elhamifar, S. Khudanpur, G. D. Hager, and R. Vidal, Sparse Hidden Markov Models for Surgical Gesture Classification and Skill Evaluation, Proc IPCAI'12, pp.167-177, 2012.
DOI : 10.1007/978-3-642-30618-1_17

D. Xu and S. F. Chang, Video event recognition using kernel methods with multilevel temporal alignment, IEEE Trans Pattern Anal Mach Intell, vol.30, issue.11, pp.1985-1997, 2008.

. Katiacharrì-ere-was-born, She received the engineering degree in engineering and life sciences from TPS (previously named ENSPS) She is currently a 1st year Ph Her research interests include content-based video retrieval for medical applications, 2011, and the M.S degree in imaging, robotics and biomedical engineering from the University of Student at the LaTIM Inserm Research Unit 1101 and Telecom Bretagne, 1986.

M. Lamard-was-born-in-bordeaux and F. , He received the M.S. degree in applied mathematics from the University of Bordeaux, France, in 1995, and the Ph.D. degree in signal processing and telecommunication from the University He joined the LaTIM Inserm Research Unit 1101 in 2000, where he is currently a Research Associate His research interests include image processing, 3-D reconstruction, content-based image retrieval, and information fusion for medical applications, 1968.