Our research interests are on speech, audio, image and video processing, specially on automatic content extraction, speech recognition and last generation video coding. We have applied these methods in a variety of research projects like embedded vocal interfaces, speech transcription for hearing-impaired caption generation, automatic image and video annotation or HDTV video coding. Our main areas of research comprise:
Classification, analysis and indexing of images and video
Object tracking in images and video
Speech technologies
Multimedia applications of machine learning
Video coding H.264/AVC, HVC, 3D, SVC and Perceptual Video Coding