Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: add marker bit when packetizing opus. #572

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

ramyak-mehra
Copy link

Fixes #125

@xnorpx
Copy link
Collaborator

xnorpx commented Oct 13, 2024

So markerbit should be set at "start" of talkspurt. Is it true that the first packet in a talkspurt is 1 byte?

@xnorpx
Copy link
Collaborator

xnorpx commented Oct 13, 2024

@ramyak-mehra
Copy link
Author

I might have read some details wrong, i actually added it at the end of a talk spurt, fixed that

@ramyak-mehra ramyak-mehra force-pushed the feat/opus-dtx branch 2 times, most recently from f41f799 to 719738d Compare October 13, 2024 13:09
@xnorpx
Copy link
Collaborator

xnorpx commented Oct 13, 2024

You can rename this file to test_talk_startspurt: https://github.com/algesten/str0m/blob/main/tests/rtp_to_frame.rs

Then validate that marker bit get's added.

Ideally test the different combinations of rtp/frame api.

@xnorpx
Copy link
Collaborator

xnorpx commented Oct 13, 2024

@algesten should some api be added for this, so in case dtx is set to false marker bit will never be set and it will not be negotiated over sdp.

@algesten
Copy link
Owner

Can we really auto-detect talk spurts without some audio analysis? I mean just because there's some noise on the line, doesn't mean there is a talk spurt starting. How does libWebRTC do this? (or pion if they support it)

@xnorpx
Copy link
Collaborator

xnorpx commented Oct 13, 2024

Can we really auto-detect talk spurts without some audio analysis? I mean just because there's some noise on the line, doesn't mean there is a talk spurt starting. How does libWebRTC do this? (or pion if they support it)

Audio analysis is done in Opus so it will handle this.

I need to double check the packet pattern that Opus emits not sure this logic will work.

@xnorpx
Copy link
Collaborator

xnorpx commented Oct 13, 2024

I don't think Pion supports this, best would be to check how libwebrtc set the marker bit on Opus packages.

@ramyak-mehra
Copy link
Author

ramyak-mehra commented Oct 13, 2024

I took the inspiration from rtpopuspayloader

so in case dtx is set to false marker bit will never be set and it will not be negotiated over sdp.

I was thinking of passing down the UseDtx flag from sdp to codec but never found any other codec doing it, so skipped it for now. I did see something like H264CodecExtra to pass down options to de packetizer should something like that exists for packetizer as well?

Or probably at the start of initialising the struct codec params can be passed

@ramyak-mehra
Copy link
Author

I didn't find a way to get access to previous packet inside the opus packetizer, since we don't cache it (something about audio packets not being nackable), so i changed the implementation a bit.

I have also added a rtp level test for the same.

incase of silence opus, rtp payload sets the marker bit after a talking spurt.
usually the data in silence packets are empty, it just uses 1-2 bytes for header.

introduced a marker state inside the OpusPacketizer since we don't have
access to previous packet incase of audio.

fixes algesten#125
@davibe
Copy link
Collaborator

davibe commented Oct 14, 2024

For doing this only when dtx is enabled: we have FormatParams::use_dtx inside CodecSpec, maybe a simple solution would be impl this around here

let marker = self.pack.is_marker(data.as_slice(), previous_data, last);

passing the CodecSpec down from the caller ?

@@ -23,8 +26,23 @@ impl Packetizer for OpusPacketizer {
}

fn is_marker(&mut self, data: &[u8], previous: Option<&[u8]>, last: bool) -> bool {
// TODO: dtx
false
// any non silenced packet would generally have more than 2 byts
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

s/byts/bytes/

@ramyak-mehra
Copy link
Author

For doing this only when dtx is enabled: we have FormatParams::use_dtx inside CodecSpec, maybe a simple solution would be impl this around here

let marker = self.pack.is_marker(data.as_slice(), previous_data, last);

passing the CodecSpec down from the caller ?

That would mean all the packetizers would need to accept it, and its more like a initialising info thats once set would not change, we can pass it down at the start somehow

@lolgesten
Copy link

@ramyak-mehra when we instantiate the packetizer, the fmtp parameters are available:

            Payloader::new(params.spec)

The spec here is CodecSpec, which has a field for FormatParams, which in turn has the use_dtx setting.

pub use_dtx: Option<bool>,

I think all is in place for that already.

@ramyak-mehra
Copy link
Author

while we can pass it down i want to understand the rational behing passing extra_codec_params in each depacketize call.
Are there para meters that change with each packet thats why its there? Do we want to enable both kinds of approach passing down with each call as well as at initilizing?

@davibe
Copy link
Collaborator

davibe commented Oct 14, 2024

That would mean all the packetizers would need to accept

No. I meant it can be done from the caller of is_marker(). Like: marker = if audio && !dtx { false } else { packetizer…

@algesten
Copy link
Owner

while we can pass it down i want to understand the rational behing passing extra_codec_params in each depacketize call. Are there para meters that change with each packet thats why its there? Do we want to enable both kinds of approach passing down with each call as well as at initilizing?

I don't understand why we are talking about the depacketizer? This PR is about the packetizer, right?

No. I meant it can be done from the caller of is_marker(). Like: marker = if audio && !dtx { false } else { packetizer…

I mean that this has nothing to do with the code calling the packetizer. The use_dtx function is dependent on which packetizer we are talking about. This should be passed into creating the OpusPacketizer (as I illustrated above), and be remembered internally for that packetizer only.

@davibe
Copy link
Collaborator

davibe commented Oct 14, 2024

If we do it via OpusPacketizer constructor I think we have take care of dynamic updates somehow. DTX can be activated and deactivated via js apis and user settings during an ongoing call. Currently payloaders are stored based on (pt, rid) so if an SDP renegotiation changes a:fmtp:... adding usedtx I am afraid we would not pick up the change, or do we ?.

@ramyak-mehra
Copy link
Author

ramyak-mehra commented Oct 14, 2024

We can just ignore the sdp use_dtx, if it's off the decoder on the other side should just ignore the marker bit.

Thogh I am not sure how do we handle sdp re negotiations in this library will take a look

@algesten
Copy link
Owner

I think we have take care of dynamic updates somehow. DTX can be activated and deactivated via js apis and user settings during an ongoing call.

Thogh I am not sure how do we handle sdp re negotiations in this library will take a look

I'm not convinced this is an attribute that can be changed runtime. AFAIK, the only thing you can change on an already negotiated m-line is the direction.

However even if it can be changed runtime, why would you want that? I think we can just say str0m does not change this dynamically and we could potentially detect dynamic changes and reject them in incoming SDP.

use_dtx is negotiated with the sdp, we can use tha in the opus
packetizer while calculating marker bit
@ramyak-mehra
Copy link
Author

Updated the PR to pass down use_dtx and added tests on the packetizer itself.
On a side note:
I see we create a new payloader everytime we want to write. Seems a bit redundant though or am i missing something?

@ramyak-mehra
Copy link
Author

I was reading more on the whole "talk spurt" thing and its not a very well defined thing. Yes you set the marker bit on start of a talk spurt but to detect talk spurt there are various alogorithms which seems out of scope for the project itself. There should ideally be a way to set the marker bit from outside the packetizer which seems like a bigger change

@algesten
Copy link
Owner

I was reading more on the whole "talk spurt" thing and its not a very well defined thing. Yes you set the marker bit on start of a talk spurt but to detect talk spurt there are various alogorithms which seems out of scope for the project itself. There should ideally be a way to set the marker bit from outside the packetizer which seems like a bigger change

I don't think it's too bad. Check out the Writer struct. It's a builder pattern and you can see how we implement stuff like audio_level. This is similar, but the data is not transported in an RTP header extension.

https://docs.rs/str0m/0.6.3/src/str0m/media/writer.rs.html#74-78

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Support for Opus DTX
5 participants