david-gimeno

David Gimeno-Gómez david-gimeno

Postdoc researcher in Multimodal Speech Technologies, with interest in affective computing, pathological speech, and healthcare applications

Achievements

interpreting-ssl-parkinson-speech interpreting-ssl-parkinson-speech Public

Official source code of the paper: "Unveiling Interpretability in Self-Supervised Speech Representations for Parkinson’s Diagnosis"

Jupyter Notebook 9 2
cosmaadrian/multimodal-depression-from-video cosmaadrian/multimodal-depression-from-video Public

Official source code for the paper: "Reading Between the Frames Multi-Modal Non-Verbal Depression Detection in Videos"

Python 85 13
joactr/AnnoTheia joactr/AnnoTheia Public

AnnoTheia is a data annotation toolkit that identifies when a person speaks in a scene and transcribes their speech, also offering flexibility to replace modules for different languages.

Python 27 1
LIP-RTVE LIP-RTVE Public

An Audiovisual Database for Continuous Spanish in the Wild

Python 9 1
tailored-avsr tailored-avsr Public

Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"

Python 14