Ponencia
LIPSFUS: A neuromorphic dataset for audio-visual sensory fusion of lip reading
Autor/es | Ríos Navarro, José Antonio
Piñero Fuentes, Enrique Canas Moreno, Salvador Javed, A. Harkin, Jim Linares Barranco, Alejandro |
Departamento | Universidad de Sevilla. Departamento de Arquitectura y Tecnología de Computadores |
Fecha de publicación | 2023-07 |
Fecha de depósito | 2023-09-11 |
Publicado en |
|
ISBN/ISSN | 978-1-6654-5109-3 2158-1525 |
Resumen | This paper presents a sensory fusion neuromorphic
dataset collected with precise temporal synchronization using a
set of Address-Event-Representation sensors and tools. The target
application is the lip reading of several ... This paper presents a sensory fusion neuromorphic dataset collected with precise temporal synchronization using a set of Address-Event-Representation sensors and tools. The target application is the lip reading of several keywords for different machine learning applications, such as digits, robotic commands, and auxiliary rich phonetic short words. The dataset is enlarged with a spiking version of an audio-visual lip reading dataset collected with frame-based cameras. LIPSFUS is publicly available and it has been validated with a deep learning architecture for audio and visual classification. It is intended for sensory fusion architectures based on both artificial and spiking neural network algorithms. |
Cita | Ríos Navarro, J.A., Piñero Fuentes, E., Canas Moreno, S., Javed, A., Harkin, J. y Linares Barranco, A. (2023). LIPSFUS: A neuromorphic dataset for audio-visual sensory fusion of lip reading. https://doi.org/10.1109/ISCAS46773.2023.10181685. |
Ficheros | Tamaño | Formato | Ver | Descripción |
---|---|---|---|---|
IE3_rios-navarro_2023_lipsfus_ ... | 2.134Mb | [PDF] | Ver/ | |