
Configure RTX ratio cap #570

Merged: 5 commits, Oct 15, 2024

Conversation

@alexlapa (Contributor) commented Oct 7, 2024

#566 (comment)

Another thing is it would be great to be able to configure that 0.15 RTX cache drop ratio, mind if i do it in another PR? Probably as an additional argument to StreamTx::set_rtx_cache, so it will be fn set_rtx_cache(&mut self, max_packets: usize, max_age: Duration, rtx_cache_drop_ratio: Option<f32>) with None completely disabling this mechanic.

@algesten

When would we want this to be None?

@alexlapa (Contributor, Author) commented Oct 7, 2024

You mean why not just use a value bigger than 1.0? It just seems to be a more obvious off switch. I don't mind changing this if you want.

Or do you mean why anyone would want to disable this functionality? Well, I'm considering disabling it, since it's up to the receiver to decide whether it needs some specific packet or not; however, I'm still testing this.

It is not expected that a receiver will send a NACK for every lost RTP packet; rather, it needs to consider the cost of sending NACK feedback and the importance of the lost packet to make an informed decision on whether it is worth telling the sender about a packet-loss event.

@alexlapa alexlapa marked this pull request as ready for review October 7, 2024 12:49
@algesten (Owner) commented Oct 7, 2024

@davibe can provide more context, but I think this was put into place for really bad connections, where we risk crowding the network with resends rather than real packets. I think there's definitely a case where the client keeps nacking so much that it makes no sense to just try and fulfill that.

@alexlapa (Contributor, Author) commented Oct 8, 2024

I think there's definitely a case where the client keeps nacking so much that it makes no sense to just try and fulfill that.

Sure, I can imagine a situation where the receiver has a good uplink, so NACKs make it to the server, but a bad downlink, so both TXs and RTXs are getting lost. But there are also cases where the receiver can stretch its jitter buffer to a few seconds in non-delay-sensitive scenarios.

And I don't really understand why you would want to forbid this option. If this PR allows configuring the cap, then why not? Do you want to assert that it's in, let's say, the [0.15..0.5] range?
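If a range check were wanted, a tiny validation along the following lines would be enough. This is purely hypothetical and not part of str0m's API; it just illustrates the kind of assert being discussed.

```rust
/// Hypothetical validation for the drop ratio; not str0m's API.
fn validate_drop_ratio(ratio: f32) -> Result<f32, String> {
    // A stricter policy could require e.g. (0.15..=0.5).contains(&ratio);
    // here we only reject values that cannot make sense at all.
    if ratio.is_finite() && ratio > 0.0 {
        Ok(ratio)
    } else {
        Err(format!("invalid rtx_cache_drop_ratio: {ratio}"))
    }
}

fn main() {
    assert!(validate_drop_ratio(0.15).is_ok());
    assert!(validate_drop_ratio(f32::NAN).is_err());
}
```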

@davibe (Collaborator) commented Oct 8, 2024

Packets can be lost due to bandwidth constraints (or arrive too late, which amounts to the same thing). Retransmitting "too much" uses additional bandwidth, which further exceeds the connection's capabilities and causes even more loss. This can also happen in scenarios that are not delay-sensitive, because bandwidth may remain insufficient indefinitely.

In our internal products - without this arbitrary 0.15 limit - we found that varying network conditions would get the peer stuck in a loop where str0m would continuously flood them with retransmissions and big keyframes while they kept nacking and sending PLIs as they were unable to recover.

Maybe what we need is to evolve this 0.15 limit into something more elaborate that takes more things into account (what the peer's bandwidth is, how much we are overusing, whether we are about to send a keyframe, ...). In the absence of more elaborate logic, configurability may be useful to some.

I thought 15% was quite a high limit, but public wifis may surprise. If a network has more than 15% loss but can still sustain high speeds, then 15% is not the best option. This is a dynamic, per-peer consideration. This PR seems to allow changing the limit dynamically via the direct API, but it does not handle the change smoothly (the RTX cache is dropped on reconfiguration). Maybe it's enough for now.
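For context, here is a rough sketch of the kind of ratio cap being discussed, under the assumption that it compares retransmitted bytes against total sent bytes and stops servicing resends once the share exceeds the cap. The type and the accounting below are hypothetical, not str0m's implementation.

```rust
/// Hypothetical model of an RTX ratio cap: track how many bytes went out
/// as retransmissions vs. in total, and drop pending resends once the
/// retransmission share exceeds the configured cap.
struct RtxRatioCap {
    cap: Option<f32>, // `None` disables the check entirely
    rtx_bytes: u64,
    total_bytes: u64,
}

impl RtxRatioCap {
    fn record_send(&mut self, bytes: u64, is_rtx: bool) {
        self.total_bytes += bytes;
        if is_rtx {
            self.rtx_bytes += bytes;
        }
    }

    /// Returns true if a pending resend should be dropped instead of sent.
    fn should_drop_resend(&self) -> bool {
        let Some(cap) = self.cap else {
            return false; // mechanic disabled
        };
        if self.total_bytes == 0 {
            return false;
        }
        (self.rtx_bytes as f32 / self.total_bytes as f32) > cap
    }
}

fn main() {
    let mut cap = RtxRatioCap { cap: Some(0.15), rtx_bytes: 0, total_bytes: 0 };
    cap.record_send(1200, false);
    cap.record_send(1200, false);
    cap.record_send(1200, true); // one resend out of three packets: ratio = 1/3
    assert!(cap.should_drop_resend());
}
```

A more elaborate version, as suggested above, could also consult the peer's estimated bandwidth, current overuse, and whether a keyframe is about to be sent.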

Review thread on src/streams/send.rs (outdated, resolved)
@alexlapa (Contributor, Author) commented Oct 8, 2024

In our internal products - without this arbitrary 0.15 limit - we found that varying network conditions would get the peer stuck in a loop where str0m would continuously flood them with retransmissions and big keyframes while they kept nacking and sending PLIs as they were unable to recover.

Yeah, that looks like a bandwidth issue. Mobile networks maybe? My issue with the 0.15 cap is actually normal-bandwidth clients that still have sporadic bursts of ~80-180 packets lost for a single RTP stream. Increasing the cap means there is no PLI and just a minor ~100ms freeze. And for low bandwidth there is SVC, which does its job pretty well.

I thought 15% was quite a high limit, but public wifis may surprise. If a network has more than 15% loss but can still sustain high speeds, then 15% is not the best option.

Well, it depends on the time frame. From what I see, average packet loss might stay under 1%, but the maximum over 1 second might sometimes exceed 50%.

UPD:

I guess that in the specific case you have described it would make sense to clear not only the resend queue but also the RtxCache, if you really want to completely reset the receiver, leaving it with no option but to request a keyframe.

@alexlapa alexlapa requested a review from davibe October 8, 2024 12:37
@alexlapa (Contributor, Author)
@davibe,

Are there still any issues with this PR?

@algesten (Owner)

Thanks for the reminder. I'll do a final review later today.

@algesten algesten merged commit 7170c3a into algesten:main Oct 15, 2024
22 checks passed
@algesten (Owner)
Thanks!
