Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Build up chatCtx for multimodal agent #248

Open
FlorDonnaSanders opened this issue Jan 13, 2025 · 3 comments
Open

Build up chatCtx for multimodal agent #248

FlorDonnaSanders opened this issue Jan 13, 2025 · 3 comments

Comments

@FlorDonnaSanders
Copy link
Contributor

FlorDonnaSanders commented Jan 13, 2025

It would be most useful to be able to access the "chat" history for the multi-modal agent like is possible with the pipeline agent.

To achieve this, it would make sense to me that the messages array in the chatCtx object already present in the MultimodalAgent class would be built up as user/agent transcripts and tool calls come in.

Similar functionality is already available in OpenAI's beta typescript client for the realtime API.

@nbsp
Copy link
Member

nbsp commented Jan 20, 2025

MultimodalAgent and VoicePipelineAgent are undergoing a major change on the Python side that will be ported over in the next few weeks, hopefully. since it's a rebuild from the ground up, i don't see much reason to refactor the existing MMA to sync chat history with the LLM, since that's going to be part of the new agent structure anyway. thank you for your understanding

@FlorDonnaSanders
Copy link
Contributor Author

FlorDonnaSanders commented Jan 20, 2025

Thanks for replying with the additional context.
I have a serviceable workaround on my side where I listen to the emitted transcript events to build the chat history on my side.

Looking forward to what the re-build will bring, though.
(Off-topic: Will the new EOU detector also be ported to the TS release along with these changes?)

@nbsp
Copy link
Member

nbsp commented Jan 20, 2025

yes, the turn detector is planned for 0.7

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants