Visuo-auditory sensory substitution systems are augmented reality devices that translate a video stream into an audio stream in order to help the blind in daily tasks requiring visuo-spatial information. In this work, we present both a new mobile device and a transcoding method specifically designed to sonify moving objects. Frame differencing is used to extract spatial features from the video stream and two-dimensional spatial information is converted into audio cues using pitch, interaural time difference, and interaural level difference. Using numerical methods, we attempt to reconstruct visuo-spatial information based on audio signals generated from various video stimuli. We show that despite a contrasted visual background and a highly lossy encoding method, the information in the audio signal is sufficient to allow object localization, object trajectory evaluation, object approach detection, and spatial separation of multiple objects. We also show that this type of audio signal can be interpreted by human users by asking 10 subjects to discriminate trajectories based on generated audio signals.
Publication
Télécharger la publication
Année de publication : 2015
Type :
Article de journal
Article de journal
Auteurs :
Ambard, M.
Benezeth, Y.
& Pfister, P.
Ambard, M.
Benezeth, Y.
& Pfister, P.
Titre du journal :
Frontiers in ICT
Frontiers in ICT
Numéro du journal :
2
2
Volume du journal :
Article 20
Article 20
Mots-clés :
sensory substitution, blind, mobile device, video processing, audio synthesis, motion detection, sonification
sensory substitution, blind, mobile device, video processing, audio synthesis, motion detection, sonification