Theme-Logo
  • Login
  • Home
  • Course
  • Publication
  • Theses
  • Reports
  • Published books
  • Workshops / Conferences
  • Supervised PhD
  • Supervised MSc
  • Supervised projects
  • Education
  • Language skills
  • Positions
  • Memberships and awards
  • Committees
  • Experience
  • Scientific activites
  • In links
  • Outgoinglinks
  • News
  • Gallery
publication name Lamiaa A. Elrefaei, Tahani Q. Alhassan, Shefaa S. Omar, An Arabic Visual Dataset for Visual Speech Recognition, Procedia Computer Science, Volume 163, 2019, Pages 400-409, ISSN 1877-0509, https://doi.org/10.1016/j.procs.2019.12.122.
Authors
year 2020
keywords
journal
volume Not Available
issue Not Available
pages Not Available
publisher Not Available
Local/International International
Paper Link https://www.sciencedirect.com/science/article/pii/S1877050919321611?via%3Dihub
Full paper download
Supplementary materials Not Available
Abstract

Visual speech recognition (VSR) has received increasing attention in recent decades due to its potential uses in many applications. As for any recognition system, useful materials for training and testing are required. For VSR system development, the training and testing materials are videos representing the visual speech of the words. This paper presents the Arabic Visual Speech Dataset (AVSD) for visual speech recognition. The dataset contains 1100 videos for 10 daily communication words collected from 22 speakers and recorded using smartphones’ cameras in high-resolution and high-framerate. The process of building the dataset, including design, acquisition, post-processing phases are described in the paper. Finally, the results of evaluating AVSD using a VSR system are presented and discussed.

Benha University © 2023 Designed and developed by portal team - Benha University