Implement streaming audio Websocket #30

NeonDaniel · 2024-10-04T00:10:45Z

Description

Implements a WS API for streaming raw audio (input and output) with a per-client listener running in the backend

Issues

Other Notes

Consider implementing streaming response audio

Implemented. Stream socket handles chunked audio input and sends wav audio segments, all as bytes.
Consider WW config and included plugins

Global configuration will be used (at least initially). Plugins will match neon-speech defaults.
Consider configuring max allowed clients to prevent overloading the backend server with listener instances

Implemented in configuration with unit test coverage

Outline handling of client audio stream

Lazy init streaming when clients connect to the endpoint TODO note client cleanup upon disconnection

…dio support Update mocked methods for compat with dinkum 0.1.0+ Add websocket dependencies for streaming client Add apt dependencies to Dockerfile for Python module builds

Separate streaming dependencies from basic WS Refactor streaming client code into a separate module

Handle streaming socket retry if too early Implement streaming audio responses

Remove duplicate docstring not included in OpanAPI pages

Update docstring to note undocumented functionality may change

… tests

neon_hana/app/routers/node_server.py

mikejgray · 2024-10-08T01:22:06Z

neon_hana/streaming_client.py

+                                          stt=Mock(transcribe=Mock(return_value=[])),
+                                          fallback_stt=Mock(transcribe=Mock(return_value=[])),


Consider MagicMock, which should stub out all of the necessary methods without having to be explicit

Looks like the inferred return type is not a list if I use MagicMock (similar issue with transformers return value)

│ Exception in thread Thread-6: │ │ Traceback (most recent call last): │ │ File "/usr/local/lib/python3.9/threading.py", line 980, in _bootstrap_inner │ │ self.run() │ │ File "/usr/local/lib/python3.9/site-packages/neon_hana/streaming_client.py", line 67, in run │ │ self.voice_loop.run() │ │ File "/usr/local/lib/python3.9/site-packages/ovos_dinkum_listener/voice_loop/voice_loop.py", line 269, in run │ │ self._after_cmd(chunk) │ │ File "/usr/local/lib/python3.9/site-packages/ovos_dinkum_listener/voice_loop/voice_loop.py", line 783, in _after_cmd │ │ utts, stt_context = self._get_tx(stt_context) │ │ File "/usr/local/lib/python3.9/site-packages/ovos_dinkum_listener/voice_loop/voice_loop.py", line 731, in _get_tx │ │ filtered = [max(utts, key=lambda k: k[1])] │ │ ValueError: max() arg is an empty sequence

You can set a return type to anything you'd like with a MagicMock, but this is definitely not blocking:

Python 3.12.6 (main, Sep 6 2024, 19:03:47) [Clang 15.0.0 (clang-1500.3.9.4)] on darwin Type "help", "copyright", "credits" or "license" for more information. >>> from unittest.mock import MagicMock >>> >>> # Create a mock transcription result >>> mock_transcription = [("Hello, this is a test", 0.9)] >>> >>> # Set up the STT objects using MagicMock >>> stt = MagicMock() >>> stt.transcribe.return_value = mock_transcription >>> >>> fallback_stt = MagicMock() >>> fallback_stt.transcribe.return_value = mock_transcription >>> stt.transcribe <MagicMock name='mock.transcribe' id='4314005728'> >>> stt.transcribe() [('Hello, this is a test', 0.9)] >>> fallback_stt.transcribe() [('Hello, this is a test', 0.9)] >>> type(fallback_stt.transcribe()) <class 'list'> >>> [max(stt.transcribe(), key=lambda k: k[1])] [('Hello, this is a test', 0.9)]

I see. 6 of one half dozen of the other IMO since we're explicitly specifying those methods and their return values

neon_hana/streaming_client.py

requirements/websocket.txt

Add locking around session changes

NeonDaniel added 4 commits September 20, 2024 19:27

Add a streaming audio input endpoint

8df1130

Outline handling of client audio stream

Implement RemoteStreamHandler to consume input audio chunks

c9254c5

Lazy init streaming when clients connect to the endpoint TODO note client cleanup upon disconnection

Cleanup after clients upon disconnection

e135119

Update docker config to enable webrtcvad for Node server streaming au…

69553e4

…dio support Update mocked methods for compat with dinkum 0.1.0+ Add websocket dependencies for streaming client Add apt dependencies to Dockerfile for Python module builds

NeonDaniel mentioned this pull request Oct 4, 2024

[FEAT] ESP32-Compatible Node NeonGeckoCom/neon-nodes#5

Open

NeonDaniel added 3 commits October 3, 2024 17:18

Add streaming dependencies to unit tests

270479b

Separate streaming dependencies from basic WS Refactor streaming client code into a separate module

Fix typo in extra dependencies

2970e1d

Handle streaming socket retry if too early Implement streaming audio responses

Add docstrings to Node documentation endpoints

832c5d9

Remove duplicate docstring not included in OpanAPI pages

NeonDaniel mentioned this pull request Oct 7, 2024

Implement streaming audio client NeonGeckoCom/neon-nodes#17

Draft

NeonDaniel added 4 commits October 7, 2024 15:24

Update streaming WW plugin dependencies to match Neon speech container

2dd7ba1

Add supported responses to Node endpoint documentation

5fccf65

Update docstring to note undocumented functionality may change

Implement configured maximum node streams with documentation and unit…

c997355

… tests

Add apt dependencies to unit test automation

fffef5c

NeonDaniel requested a review from mikejgray October 7, 2024 23:07

NeonDaniel marked this pull request as ready for review October 7, 2024 23:09

mikejgray approved these changes Oct 8, 2024

View reviewed changes

NeonDaniel added 3 commits October 8, 2024 09:16

Address logging review comments

391a978

Remove newline change

eddc804

Refactor client connection check into MQWebsocketAPI class

c3e533e

Add locking around session changes

NeonDaniel requested a review from mikejgray October 8, 2024 17:01

mikejgray approved these changes Oct 9, 2024

View reviewed changes

NeonDaniel merged commit d851940 into dev Oct 9, 2024
6 checks passed

NeonDaniel deleted the FEAT_StreamInputAudio branch October 9, 2024 21:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement streaming audio Websocket #30

Implement streaming audio Websocket #30

NeonDaniel commented Oct 4, 2024 •

edited

Loading

mikejgray Oct 8, 2024

NeonDaniel Oct 8, 2024

mikejgray Oct 9, 2024 •

edited

Loading

NeonDaniel Oct 9, 2024

		stt=Mock(transcribe=Mock(return_value=[])),
		fallback_stt=Mock(transcribe=Mock(return_value=[])),

Implement streaming audio Websocket #30

Implement streaming audio Websocket #30

Conversation

NeonDaniel commented Oct 4, 2024 • edited Loading

Description

Issues

Other Notes

mikejgray Oct 8, 2024

Choose a reason for hiding this comment

NeonDaniel Oct 8, 2024

Choose a reason for hiding this comment

mikejgray Oct 9, 2024 • edited Loading

Choose a reason for hiding this comment

NeonDaniel Oct 9, 2024

Choose a reason for hiding this comment

NeonDaniel commented Oct 4, 2024 •

edited

Loading

mikejgray Oct 9, 2024 •

edited

Loading