I am PhD student in Computer Science at Universitat Politècnica de València. My interests are Speech Technologies, Computer Vision, and Affective Computing.
-
Pattern Recognition and Human Languages Technology, Research Center
- Valencia, Spain
- https://www.prhlt.upv.es/david-gimeno/
- https://orcid.org/0000-0002-7375-9515
- in/david-gimeno-gómez-589a5526b
- https://scholar.google.com/citations?user=DVRSla8AAAAJ&hl=en
Pinned Loading
-
joactr/AnnoTheia
joactr/AnnoTheia PublicAnnoTheia is a data annotation toolkit that identifies when a person speaks in a scene and transcribes their speech, also offering flexibility to replace modules for different languages.
Python 26
-
cosmaadrian/multimodal-depression-from-video
cosmaadrian/multimodal-depression-from-video PublicOfficial source code for the paper: "Reading Between the Frames Multi-Modal Non-Verbal Depression Detection in Videos"
-
tailored-avsr
tailored-avsr PublicOfficial source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"
Python 8
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.