Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reduce number of allocations #8

Open
algesten opened this issue Dec 21, 2022 · 7 comments
Open

Reduce number of allocations #8

algesten opened this issue Dec 21, 2022 · 7 comments

Comments

@algesten
Copy link
Owner

algesten commented Dec 21, 2022

The impls has a few Vec<u8> arguments. It would be good to look through the instances of these to see where we can re-use buffers (or use ring buffers), and where we can use lifetimes to avoid internal allocation.

Zero allocations is a non-goal, but we can do better than we currently do.

@VictorBulba
Copy link

It would be nice to add support for rust allocators api. As this implementation works more like state machine, it could benefit a lot from bump allocators which could be reset each polling iteration

@algesten
Copy link
Owner Author

@VictorBulba Cool! I never used those, but sounds interesting!

@k0nserv
Copy link
Collaborator

k0nserv commented Sep 11, 2023

I was thinking some more about this, for both async and sync APIs it's typically to use &[u8] for writing and &mut [u8] for reading. This way the caller is completely in charge of allocation decisions and the reference only lives for the duration of the IO call.

In sans-IO this is not as trivial. In the case of a write the caller needs to pass owned data so progress on writing can be made at a later stage. In str0m this manifests as Writer::write taking a impl Into<Vec<u8>> rather than a &[u8]. For read it similarly manifests as str0m returning Vec<u8>s instead of taking a &mut [u8] and writing into that.

The bump allocator idea is interesting, but it could only be used for temporary allocations during each handle_timeout and poll_output call. For persistent allocations(those returned out from str0m and those given to str0m as input) I'm not sure it would work as well.

@VictorBulba
Copy link

Str0m could use some circular buffer which is a contiguous chunk of memory, and allows append bytes to the end, and consuming then from the start.

Something like:

// BytesBuffer is a special circular buffer for str0m

let buf = BytesBuffer::with_capacity(1024);

// It can be written to
buf.write(&[0x01, 0x02, 0x03, 0x04, 0x05, 0x06]);

// Reading updates the circular buffer's offset,
// so consumed memory can be resused
{
    // Read guard keeps mutable reference to the buffer,
    // so it can't be written to while the guard is alive
    //
    // fn consumed_read<'a>(&'a mut self, len: usize) -> ReadGuard<'a>

    let read_guard = buf.consumed_read(3);

    assert_eq!(read_guard.get_bytes(), &[0x01, 0x02, 0x03]);

    drop(read_guard);
}

// Previous read memory has been consumed
assert_eq!(buf.consumed_read(2).get_bytes(), &[0x04, 0x05]);

str0m could use multiple of such buffers for different purposes. When user does write str0m could copy user bytes into that buffer, so it acts as a "queue". Also ReadGuard can be used as a return type for transmit operations, so it references str0m internal bytes chunk, eliminating all those Vec<u8> and Vec<Vec<u8>>

@thomaseizinger
Copy link
Collaborator

thomaseizinger commented Dec 23, 2023

Parsing a StunMessage currently also allocates because all attributes are collected into a Vec. Given that we only support a constrained set of attributes, this can probably be improved upon by either keeping each attribute in an Option or using something like SmallVec.

Keeping things within an Option or a static array would have the advantage that we likely could make StunMessage Copy! Note that the use of Copy doesn't mean we will copy the payload. The payload is stored by reference so we are only copying references, not the actual data. SmallVec will allocate on the heap once the inline-storage is exceeded so it cannot be Copy.

@algesten
Copy link
Owner Author

Good point!

I favor Option. This is actually exactly equivalent to our RTP header extension values, where do already do it: https://github.com/algesten/str0m/blob/main/src/rtp/ext.rs#L846 – it would be really nice to make StunMessage the same.

@thomaseizinger
Copy link
Collaborator

Good point!

I favor Option. This is actually exactly equivalent to our RTP header extension values, where do already do it: main/src/rtp/ext.rs#L846 – it would be really nice to make StunMessage the same.

I submitted a PR for for that: #425

efer-ms referenced this issue in pthatcher/str0m Nov 25, 2024
…of OpenSSL (#8)

Implement new Str0m feature 'wincrypto'.

When enabled, this cause Str0m to use Windows Cryptographic APIs for crypto.
This feature cannot be used in conjunction with 'sha1' not 'openssl' features, as it provides the same functionality.

Windows APIs are accessed via the windows-rs crate. These calls are `unsafe`, so in order to isolate them from
`safe` str0m code, all `unsafe` code is in a separate `str0m_wincrypto` crate. The code in the core `str0m/crypto/wincrypto` is the glue code between Str0m and this str0m_wincrypto crate.

The `str0m_wincrypto` crate is not intended to be a generic crypto crate, and is very much tailored to match str0m's crypto APIs.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants