You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm actually very interested in this as well. Just to clarify, are you referring to something like FaceNet for voice? Have you done any more research into this area as of late?
Also there are a few deep embedded clustering implementations around. Also one in Keras: https://github.com/fferroni/DEC-Keras but I don't know if this one is well tested
And perhaps using a keras embedding layer to learn a representation for speakers?
The text was updated successfully, but these errors were encountered: