Abstract

Lamiaa A. Elrefaei, Tahani Q. Alhassan, Shefaa S. Omar, An Arabic Visual Dataset for Visual Speech Recognition, Procedia Computer Science, Volume 163, 2019, Pages 400-409, ISSN 1877-0509, https://doi.org/10.1016/j.procs.2019.12.122.

• 2020

Back

Publication Information

Authors Not Available

Keywords Not Available

Journal Not Available

Publisher Not Available

Volume Not Available

Issue Not Available

Pages Not Available

publication.type International

Paper Link Open Link

Supplementary Materials Not Available

Abstract

Visual speech recognition (VSR) has received increasing attention in recent decades due to its potential uses in many applications. As for any recognition system, useful materials for training and testing are required. For VSR system development, the training and testing materials are videos representing the visual speech of the words. This paper presents the Arabic Visual Speech Dataset (AVSD) for visual speech recognition. The dataset contains 1100 videos for 10 daily communication words collected from 22 speakers and recorded using smartphones’ cameras in high-resolution and high-framerate. The process of building the dataset, including design, acquisition, post-processing phases are described in the paper. Finally, the results of evaluating AVSD using a VSR system are presented and discussed.