Detection of the Pharyngeal Phase in the Videofluoroscopic Swallowing Study Using Inflated 3D Convolutional Networks
The videofluoroscopic swallowing study (VFSS) is a standard diagnostic tool for dysphagia. Previous computer-assisted analyses of VFSS required manual preparation to mark several anatomical structures and to select time intervals of interest, such as the pharyngeal phase of swallowing. These steps remain costly and challenging for clinicians. In this study, we present a novel approach that detects the pharyngeal phase of swallowing across whole VFSS video clips using Inflated 3D Convolutional Networks (I3D), without additional manual annotations.
Keywords: Action classification, Dysphagia, Inflated 3D convolutional networks, Videofluoroscopic swallowing study