add linguistics #94
Comments
Some datasets already have transcriptions (but I skip that since I don't think it will be needed). A transcription can be added as an additional column in the CSV or audformat. If there is no transcription, we can use a Hugging Face model (such as Whisper) to generate transcripts during pre-processing for each dataset. The "linguistic feature extractor" will then process the transcription. Using speech together with its transcription is useful for detecting degradation such as Alzheimer's.
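A minimal sketch of the pre-processing step described above, assuming each dataset row is a dict with a `file` column (audio path) and an optional `transcript` column. Both column names and both helper functions are hypothetical, not part of Nkululeko. The Whisper factory uses the Hugging Face `transformers` ASR pipeline with the `openai/whisper-tiny` checkpoint as an illustrative choice; it is imported lazily so the fill-in logic can be tried without downloading a model.

```python
from typing import Callable, Dict, Iterable, List


def add_transcripts(
    rows: Iterable[Dict[str, str]],
    transcribe: Callable[[str], str],
    audio_key: str = "file",          # assumed column name for the audio path
    text_key: str = "transcript",     # assumed column name for the transcript
) -> List[Dict[str, str]]:
    """Return copies of the rows, filling empty transcripts via transcribe(path)."""
    filled = []
    for row in rows:
        row = dict(row)  # do not mutate the caller's data
        if not (row.get(text_key) or "").strip():
            row[text_key] = transcribe(row[audio_key])
        filled.append(row)
    return filled


def make_whisper_transcriber(model_name: str = "openai/whisper-tiny") -> Callable[[str], str]:
    """Build a transcriber backed by a Hugging Face ASR pipeline (needs transformers)."""
    from transformers import pipeline  # lazy import: only needed when actually transcribing

    asr = pipeline("automatic-speech-recognition", model=model_name)
    return lambda path: asr(path)["text"].strip()
```

Rows that already carry a transcript are left untouched, so datasets with existing transcriptions and datasets without them can go through the same step. Calling `add_transcripts(rows, make_whisper_transcriber())` would run the actual model.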
If there is a transcript field/column in the CSV file (assuming the format is CSV), this can easily be solved using the speechbrain package: https://speechbrain.readthedocs.io/en/latest/API/speechbrain.wordemb.transformer.html
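To illustrate reading a transcript column and turning it into per-utterance features, here is a rough sketch. It does not use speechbrain: simple lexical statistics stand in for a word-embedding model, and the `file`/`transcript` column names, the function names, and the sample data are all assumptions made for the example.

```python
import csv
import io
import re
from typing import Dict, List, TextIO


def linguistic_features(text: str) -> Dict[str, float]:
    """Compute a few simple lexical statistics for one transcript."""
    tokens = re.findall(r"[a-z']+", text.lower())
    n = len(tokens)
    return {
        "n_tokens": n,
        # share of distinct words, a crude measure of lexical diversity
        "type_token_ratio": len(set(tokens)) / n if n else 0.0,
        "mean_token_length": sum(len(t) for t in tokens) / n if n else 0.0,
    }


def features_from_csv(handle: TextIO, text_key: str = "transcript") -> List[Dict[str, float]]:
    """Read a CSV with a transcript column and return one feature dict per row."""
    reader = csv.DictReader(handle)
    return [linguistic_features(row.get(text_key) or "") for row in reader]


# Hypothetical in-memory CSV; the second row has no transcript.
SAMPLE = "file,transcript\na.wav,the cat sat on the mat\nb.wav,\n"
features = features_from_csv(io.StringIO(SAMPLE))
```

Each returned dict could be stored alongside the acoustic features of the same file. Swapping `linguistic_features` for an embedding model would keep the rest of the flow unchanged.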
to me it seems most intuitive to extend the predict module by
Nkululeko could be multimodal if a transcript field is added to the audio files; linguistic feature extractors could then be added to the feature_sets.