Patch-based adaptive weighting with segmentation and scale (PAWSS) for visual tracking in surgical video

Xiaofei Du; Maximilian Allan; Sebastian Bodenstedt; Lena Maier-Hein; Stefanie Speidel; Alessio Dore; Danail Stoyanov

doi:10.1016/j.media.2019.07.002

Med Image Anal. 2019 Oct:57:120-135. doi: 10.1016/j.media.2019.07.002. Epub 2019 Jul 4.

Authors

Xiaofei Du¹, Maximilian Allan², Sebastian Bodenstedt³, Lena Maier-Hein⁴, Stefanie Speidel⁵, Alessio Dore⁶, Danail Stoyanov⁷

Affiliations

¹ Wellcome / EPSRC Centre for Interventional and Surgical Sciences (WEISS), University College London, UK. Electronic address: xiaofei.du.13@ucl.ac.uk.
² Intuitive Surgical Inc., USA. Electronic address: Max.allan@intusurg.com.
³ Karlsruhe Institute of Technology, Karlsruhe, Germany. Electronic address: sebastian.bodenstedt@nct-dresden.de.
⁴ Division of Computer-Assisted Medical Interventions (CAMI), German Cancer Research Center (DKFZ), Heidelberg, Germany. Electronic address: l.maier-hein@dkfz-heidelberg.de.
⁵ Karlsruhe Institute of Technology, Karlsruhe, Germany. Electronic address: stefanie.speidel@nct-dresden.de.
⁶ Deliveroo, London, UK. Electronic address: alessio.dore@deliveroo.co.uk.
⁷ Wellcome / EPSRC Centre for Interventional and Surgical Sciences (WEISS), University College London, UK. Electronic address: danail.stoyanov@ucl.ac.uk.

Abstract

Vision-based tracking is an important component for building computer-assisted interventions in minimally invasive surgery, as it facilitates estimation of motion for instruments and anatomical targets. Tracking-by-detection algorithms are widely used for visual tracking: the problem is treated as a classification task, and an appearance model of the tracking target is updated over time using online learning. In challenging conditions such as surgical scenes, where tracking targets deform and vary in scale, the update step is prone to including background information in the appearance model or to failing to estimate changes of scale, either of which degrades the performance of the classifier. In this paper, we propose a Patch-based Adaptive Weighting with Segmentation and Scale (PAWSS) tracking framework that tackles both the scale and the background problems. A simple but effective colour-based segmentation model is used to suppress background information, and multi-scale samples are extracted to enrich the training pool, which allows the tracker to handle both incremental and abrupt scale variations between frames. Experimentally, we evaluate our approach on the Online Tracking Benchmark (OTB) dataset and the Visual Object Tracking (VOT) challenge datasets, showing that it outperforms recent state-of-the-art trackers. It especially improves the success rate score on the OTB dataset, while on the VOT datasets PAWSS ranks among the top trackers while operating at real-time frame rates. Focusing on the application of PAWSS to surgical scenes, we evaluate on the MICCAI 2015 instrument tracking challenge and on in vivo datasets, showing that our approach performs best among all submitted methods and also has promising performance on in vivo surgical instrument tracking.

Keywords: Computer assisted interventions; Surgical instrument tracking; Tracking-by-detection; Visual object tracking.
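
The two ideas named in the abstract, colour-based background suppression and multi-scale sampling, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the segmentation model, the patch layout, or the scale set, so the joint RGB histogram, the regular patch grid, the scale factors, and every function name below are assumptions chosen only for illustration.

```python
import numpy as np


def colour_histogram(pixels, bins=8):
    """Normalised joint RGB histogram of an (N, 3) uint8 pixel array."""
    idx = (pixels.astype(np.int64) * bins) // 256  # per-channel bin index
    flat = idx[:, 0] * bins * bins + idx[:, 1] * bins + idx[:, 2]
    hist = np.bincount(flat, minlength=bins ** 3).astype(np.float64)
    return hist / max(hist.sum(), 1.0)


def foreground_probability(image, box, bins=8, eps=1e-6):
    """Per-pixel foreground score from colour histograms inside/outside box.

    Assumption: a histogram-ratio model stands in for the paper's
    colour-based segmentation, which the abstract does not describe.
    """
    x, y, w, h = box
    mask = np.zeros(image.shape[:2], dtype=bool)
    mask[y:y + h, x:x + w] = True
    fg = colour_histogram(image[mask], bins)
    bg = colour_histogram(image[~mask], bins)
    idx = (image.astype(np.int64) * bins) // 256
    flat = idx[..., 0] * bins * bins + idx[..., 1] * bins + idx[..., 2]
    return fg[flat] / (fg[flat] + bg[flat] + eps)


def patch_weights(prob, box, grid=4):
    """Mean foreground score in each cell of a grid x grid patch layout."""
    x, y, w, h = box
    ys = np.linspace(y, y + h, grid + 1).astype(int)
    xs = np.linspace(x, x + w, grid + 1).astype(int)
    weights = np.zeros((grid, grid))
    for i in range(grid):
        for j in range(grid):
            weights[i, j] = prob[ys[i]:ys[i + 1], xs[j]:xs[j + 1]].mean()
    return weights


def multi_scale_boxes(box, scales=(0.9, 1.0, 1.1)):
    """Candidate boxes sharing the centre of `box`, resized by each scale."""
    x, y, w, h = box
    cx, cy = x + w / 2.0, y + h / 2.0
    boxes = []
    for s in scales:
        sw, sh = max(1, round(w * s)), max(1, round(h * s))
        boxes.append((round(cx - sw / 2.0), round(cy - sh / 2.0), sw, sh))
    return boxes
```

In a tracker of this kind, patches whose colours resemble the background would receive low weights and contribute less to the appearance model, while the scaled boxes would provide extra samples at several target sizes. How PAWSS combines these steps with its online classifier is described in the full paper, not in this record.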

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Humans
Image Processing, Computer-Assisted / methods*
Minimally Invasive Surgical Procedures / instrumentation*
Robotic Surgical Procedures / instrumentation*
Surgery, Computer-Assisted / instrumentation*
Surgical Instruments
User-Computer Interface
Video Recording*

Grants and funding

WT_/Wellcome Trust/United Kingdom