-
Notifications
You must be signed in to change notification settings - Fork 51
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: add marker bit when packetizing opus. #572
base: main
Are you sure you want to change the base?
Conversation
So markerbit should be set at "start" of talkspurt. Is it true that the first packet in a talkspurt is 1 byte? |
c15137c
to
4199108
Compare
I might have read some details wrong, i actually added it at the end of a talk spurt, fixed that |
f41f799
to
719738d
Compare
You can rename this file to test_talk_startspurt: https://github.com/algesten/str0m/blob/main/tests/rtp_to_frame.rs Then validate that marker bit get's added. Ideally test the different combinations of rtp/frame api. |
@algesten should some api be added for this, so in case dtx is set to false marker bit will never be set and it will not be negotiated over sdp. |
Can we really auto-detect talk spurts without some audio analysis? I mean just because there's some noise on the line, doesn't mean there is a talk spurt starting. How does libWebRTC do this? (or pion if they support it) |
Audio analysis is done in Opus so it will handle this. I need to double check the packet pattern that Opus emits not sure this logic will work. |
I don't think Pion supports this, best would be to check how libwebrtc set the marker bit on Opus packages. |
I took the inspiration from rtpopuspayloader
I was thinking of passing down the UseDtx flag from sdp to codec but never found any other codec doing it, so skipped it for now. I did see something like Or probably at the start of initialising the struct codec params can be passed |
719738d
to
097fd42
Compare
I didn't find a way to get access to previous packet inside the opus packetizer, since we don't cache it (something about audio packets not being nackable), so i changed the implementation a bit. I have also added a rtp level test for the same. |
incase of silence opus, rtp payload sets the marker bit after a talking spurt. usually the data in silence packets are empty, it just uses 1-2 bytes for header. introduced a marker state inside the OpusPacketizer since we don't have access to previous packet incase of audio. fixes algesten#125
097fd42
to
1066c4a
Compare
For doing this only when dtx is enabled: we have Line 58 in 5c4c9d5
passing the CodecSpec down from the caller ? |
@@ -23,8 +26,23 @@ impl Packetizer for OpusPacketizer { | |||
} | |||
|
|||
fn is_marker(&mut self, data: &[u8], previous: Option<&[u8]>, last: bool) -> bool { | |||
// TODO: dtx | |||
false | |||
// any non silenced packet would generally have more than 2 byts |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
s/byts/bytes/
That would mean all the packetizers would need to accept it, and its more like a initialising info thats once set would not change, we can pass it down at the start somehow |
@ramyak-mehra when we instantiate the packetizer, the fmtp parameters are available:
The Line 145 in 5c4c9d5
I think all is in place for that already. |
while we can pass it down i want to understand the rational behing passing extra_codec_params in each depacketize call. |
No. I meant it can be done from the caller of is_marker(). Like: |
I don't understand why we are talking about the depacketizer? This PR is about the packetizer, right?
I mean that this has nothing to do with the code calling the packetizer. The |
If we do it via OpusPacketizer constructor I think we have take care of dynamic updates somehow. DTX can be activated and deactivated via js apis and user settings during an ongoing call. Currently payloaders are stored based on |
We can just ignore the sdp use_dtx, if it's off the decoder on the other side should just ignore the marker bit. Thogh I am not sure how do we handle sdp re negotiations in this library will take a look |
I'm not convinced this is an attribute that can be changed runtime. AFAIK, the only thing you can change on an already negotiated m-line is the direction. However even if it can be changed runtime, why would you want that? I think we can just say str0m does not change this dynamically and we could potentially detect dynamic changes and reject them in incoming SDP. |
use_dtx is negotiated with the sdp, we can use tha in the opus packetizer while calculating marker bit
Updated the PR to pass down use_dtx and added tests on the packetizer itself. |
I was reading more on the whole "talk spurt" thing and its not a very well defined thing. Yes you set the marker bit on start of a talk spurt but to detect talk spurt there are various alogorithms which seems out of scope for the project itself. There should ideally be a way to set the marker bit from outside the packetizer which seems like a bigger change |
I don't think it's too bad. Check out the https://docs.rs/str0m/0.6.3/src/str0m/media/writer.rs.html#74-78 |
Fixes #125