Add Speaker diarisation / speaker detection for interview transcription #492

Open
menelic opened this issue Jun 14, 2023 · 4 comments
Labels
enhancement New feature or request

Comments


menelic commented Jun 14, 2023

Also mentioned in #469. This is implemented in this Whisper GUI built in Streamlit: https://github.com/jojojaeger/whisper-streamlit (the diarisation version is at https://github.com/jojojaeger/whisper-streamlit/tree/master/whisper-streamlit-speaker, but the information on how to use it is in the README at the first link). Because yours is a cross-platform desktop app, it could become a go-to for many journalists, researchers, etc., for whom such a feature would be key.


bfrye26 commented Jun 20, 2023

I would love this. It is such an easy app to use, and if it had this feature it would be something I use daily!

@johnfelipe
Contributor

Pls add this feature


marrie commented Aug 17, 2024

It would actually be wonderful to have this even if the labels were just "speaker 1", "speaker 2", etc. For example:
speaker 1: American 1040 requesting IFR
speaker 2: American 1040, go ahead.

You could then clean up the transcript in VS Code for clarity. Thoughts?
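
For illustration only, here is a minimal sketch of how generic "speaker N" labels could be attached to transcript segments. It assumes the transcription step yields timestamped text segments and that some separate diarisation step yields timestamped speaker turns; the `Segment` and `Turn` types, the `label_segments` helper, and all timestamps are hypothetical and do not come from this app or any particular library. Each segment simply gets the speaker whose turn overlaps it the most.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """A transcribed piece of audio (hypothetical shape)."""
    start: float  # seconds
    end: float
    text: str


@dataclass
class Turn:
    """A diarisation result: who spoke when (hypothetical shape)."""
    start: float  # seconds
    end: float
    speaker: str  # generic label such as "speaker 1"


def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two time intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def label_segments(segments, turns):
    """Prefix each segment's text with the speaker that overlaps it most."""
    lines = []
    for seg in segments:
        best = max(
            turns,
            key=lambda t: overlap(seg.start, seg.end, t.start, t.end),
            default=None,
        )
        # Fall back to "unknown" when no turn overlaps the segment at all.
        if best is None or overlap(seg.start, seg.end, best.start, best.end) == 0:
            speaker = "unknown"
        else:
            speaker = best.speaker
        lines.append(f"{speaker}: {seg.text}")
    return lines


# Made-up timestamps, purely to exercise the function.
segments = [
    Segment(0.0, 2.5, "American 1040 requesting IFR"),
    Segment(3.0, 4.5, "American 1040, go ahead."),
]
turns = [
    Turn(0.0, 2.8, "speaker 1"),
    Turn(2.8, 5.0, "speaker 2"),
]

for line in label_segments(segments, turns):
    print(line)
```

Picking the turn with the largest overlap is only one possible rule. Segments that span a speaker change would still get a single label, which is where the manual cleanup in an editor would come in.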

@raivisdejus
Collaborator

@menelic @bfrye26 @johnfelipe @marrie Please share your intended workflow in this discussion #1043 so we can figure out the best way to implement this.


5 participants