Postdoc researcher in Multimodal Speech Technologies, with interest in affective computing, pathological speech, and healthcare applications
-
PRHLT@Universitat Politècnica de València
- Valencia, Spain
- https://www.prhlt.upv.es/david-gimeno/
- https://orcid.org/0000-0002-7375-9515
- in/david-gimeno-gómez-589a5526b
- https://scholar.google.com/citations?user=DVRSla8AAAAJ&hl=en
Pinned Loading
-
interpreting-ssl-parkinson-speech
interpreting-ssl-parkinson-speech PublicOfficial source code of the paper: "Unveiling Interpretability in Self-Supervised Speech Representations for Parkinson’s Diagnosis"
-
cosmaadrian/multimodal-depression-from-video
cosmaadrian/multimodal-depression-from-video PublicOfficial source code for the paper: "Reading Between the Frames Multi-Modal Non-Verbal Depression Detection in Videos"
-
joactr/AnnoTheia
joactr/AnnoTheia PublicAnnoTheia is a data annotation toolkit that identifies when a person speaks in a scene and transcribes their speech, also offering flexibility to replace modules for different languages.
-
tailored-avsr
tailored-avsr PublicOfficial source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"
Python 14
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.
