Quantitative Analysis of Healthy and Pathological Vocal Fold Vibrations using an Optical Flow based Waveform
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2019, Vol 10, Issue 4
Abstract
The objective assessment of vocal fold vibrations is important in diagnosing several vocal diseases. Because the vibrations are very fast, high-speed videoendoscopy is commonly used to record the vocal fold movements. Two steps are typically carried out to automatically quantify laryngeal parameters and assess the vibrations. The first step maps the spatio-temporal information contained in the video recordings into a representation that facilitates the analysis of the vibrations. Numerous techniques are reported in the literature, but most of them require segmenting every image of the video, which is a complex task. The second step quantifies laryngeal parameters in order to assess the vibrations. To this end, most existing approaches require additional processing of the representation to derive those parameters. Furthermore, for some reported representations, assessing the symmetry and the periodicity of the vocal fold dynamics requires defining parameters that are specific to the representation under consideration, which makes it difficult to compare the existing techniques. To alleviate these problems, the present study investigates the use of a recently proposed representation, the optical flow based waveform, to objectively quantify the laryngeal parameters. This waveform is retained because it does not require segmenting all the images of the video. Furthermore, the present work shows that the vibrations can be quantified automatically from this waveform without any additional processing. Moreover, common laryngeal parameters are used; hence, no representation-specific parameters need to be defined for the automatic assessment of the vibrations. Experiments conducted on healthy and pathological phonation show the accuracy of the waveform. In addition, it is more sensitive to pathological phonation than the state-of-the-art techniques.
Authors and Affiliations
Heyfa Ammar