A framework for the recognition of high-level surgical tasks from video images for cataract surgeries.

Florent Lalys; Laurent Riffaud; David Bouget; Pierre Jannin

doi:10.1109/TBME.2011.2181168

Article Dans Une Revue IEEE Transactions on Biomedical Engineering Année : 2012

A framework for the recognition of high-level surgical tasks from video images for cataract surgeries.

(1) , (2, 1) , (1) , (1)

1
2

Florent Lalys

Fonction : Auteur correspondant
PersonId : 884998

Connectez-vous pour contacter l'auteur

Vision, Action et Gestion d'informations en Santé

Laurent Riffaud

Fonction : Auteur
PersonId : 884994

Service de neurochirurgie [Rennes] = Neurosurgery [Rennes]

Vision, Action et Gestion d'informations en Santé

David Bouget

Fonction : Auteur
PersonId : 920386

Vision, Action et Gestion d'informations en Santé

Pierre Jannin

Fonction : Auteur
PersonId : 740200
IdHAL : pierre-jannin
ORCID : 0000-0002-7415-071X
IdRef : 116295848

Vision, Action et Gestion d'informations en Santé

Résumé

The need for a better integration of the new generation of computer-assisted-surgical systems has been recently emphasized. One necessity to achieve this objective is to retrieve data from the operating room (OR) with different sensors, then to derive models from these data. Recently, the use of videos from cameras in the OR has demonstrated its efficiency. In this paper, we propose a framework to assist in the development of systems for the automatic recognition of high-level surgical tasks using microscope videos analysis. We validated its use on cataract procedures. The idea is to combine state-of-the-art computer vision techniques with time series analysis. The first step of the framework consisted in the definition of several visual cues for extracting semantic information, therefore, characterizing each frame of the video. Five different pieces of image-based classifiers were, therefore, implemented. A step of pupil segmentation was also applied for dedicated visual cue detection. Time series classification algorithms were then applied to model time-varying data. Dynamic time warping and hidden Markov models were tested. This association combined the advantages of all methods for better understanding of the problem. The framework was finally validated through various studies. Six binary visual cues were chosen along with 12 phases to detect, obtaining accuracies of 94%.

Domaines

Neurosciences [q-bio.NC] Apprentissage [cs.LG] Imagerie médicale Vision par ordinateur et reconnaissance de formes [cs.CV]

Fichier principal

TBE_2012_V7.pdf (1.57 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Florent Lalys : Connectez-vous pour contacter le contributeur

https://inserm.hal.science/inserm-00669682

Soumis le : lundi 13 février 2012-16:20:02

Dernière modification le : mardi 12 décembre 2023-14:45:01

Archivage à long terme le : lundi 14 mai 2012-02:45:08

Dates et versions

inserm-00669682 , version 1 (13-02-2012)

Identifiants

HAL Id : inserm-00669682 , version 1
DOI : 10.1109/TBME.2011.2181168
PUBMED : 22203700

Citer

Florent Lalys, Laurent Riffaud, David Bouget, Pierre Jannin. A framework for the recognition of high-level surgical tasks from video images for cataract surgeries.. IEEE Transactions on Biomedical Engineering, 2012, 59 (4), pp.966-76. ⟨10.1109/TBME.2011.2181168⟩. ⟨inserm-00669682⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSERM INSTITUT-TELECOM EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA LTSI IRISA-D5 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM UR1-BIO-SA

447 Consultations

500 Téléchargements

A framework for the recognition of high-level surgical tasks from video images for cataract surgeries.

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager