Use `O_DIRECT` #51

JackKelly · 2024-02-10T13:47:55Z

todo

Chain <open><read><close> #1
Use a flat crate structure so this git repo can store multiple (interconnected) crates in a single workspace #94
create a new crate called something like aligned_buffer
Benchmark on workstation
Don't copy the aligned_buffer into a Bytes struct. Instead, return an AlignedBuffer. Benchmark again.
Reduce code duplication between get and get_range #90
Ensure that the read operation reads the correct number of bytes #97
Write unit-tests for AlignedBuffer to check behaviour when using start_offset. #98

The text was updated successfully, but these errors were encountered:

JackKelly · 2024-02-14T11:59:16Z

I think the problem is that the buffer needs to be aligned to 512-byte boundaries. And possibly the length also needs to be a multiple of 512 bytes.

My SSD uses 512-byte logical block sizes:

jack@jack-NUC:~/dev/rust/light-speed-io$ sudo blockdev --getss /dev/nvme0n1
512

See the O_DIRECT notes near the bottom of the manual for open.

I should try using statx to find the O_DIRECT support and alignment restrictions for the files.

JackKelly · 2024-02-14T13:41:32Z

Huh! ~~Surprisingly, O_DIRECT seems to be slower on my Intel NUC! Some speeds:~~

~~560 MiB/s: O_DIRECT, and using vec.push(0) in a loop to init the AVec. The vec.push is slow.~~
~~886 MiB/s: O_DIRECT, but not initialising the AVec.~~
~~1011 MiB/s: Not using O_DIRECT. But still using (uninitialised) AVec.~~
~~1200 MiB/s: Not using O_DIRECT. Using a normal vec![].~~

UPDATE: These results are invalid. See comments below

I've run a flamegraph for the second scenario. It's in the associated pull request.

JackKelly · 2024-02-14T13:44:58Z

Because this is so much slower (on my NUC), I'm not going to merge this PR.

But I should re-visit this when I get my new workstation with a PCIe gen5 SSD.

And maybe I should try some other ways to create a vector which is aligned to 512-bytes:

Slow! Only about 560 MiB/s. See #51. Pausing work on this PR for now because it's so slow. I'll re-visit when I get my workstation with a PCIe gen 5 SSD!

JackKelly · 2024-02-14T17:21:31Z

Hopefully DIRECT will come into its own when we're loading huge numbers of small chunks (DIRECT avoids read ahead) and when we recycle buffers (because allocating aligned buffers seems to take a while!)

JackKelly · 2024-02-21T20:42:42Z

I think the high numbers of page-faults might actually be due to copying data from the OS's page cache into the program's memory space. So O_DIRECT should help. And I need to re-run benchmarking now I've fixed the benchmarks! #71

JackKelly · 2024-02-26T16:03:48Z

I recently figured out that my benchmarks weren't correctly clearing the cache before every run. That invalidates my findings above that O_DIRECT is slow. So I should have another go with O_DIRECT!

uring: 100 MiB/s local_file_system: 105 MiB/s Hopefully uring will pull ahead when using O_DIRECT (#51)! Closes #73

uring: 100 MiB/s local_file_system: 105 MiB/s Hopefully uring will pull ahead when we use O_DIRECT! (#51) Closes #73

JackKelly · 2024-02-28T13:50:09Z

How to create an aligned buffer?

In summary: I think I should create a new struct AlignedBuffer<const ALIGNMENT: usize>{vec: Vec<[u8; ALIGNMENT]>, len: usize} struct. Internally, allocate memory using Vec<[u8; ALIGNMENT]>::with_capacity(). impl Deref for AlignedBuffer to allow a view using std::slice::from_raw_parts. Implement a set_len_in_bytes method to set the length of the slice.

Alternatives:

Create a new AlignedBuffer struct which uses alloc::alloc to allocate memory. And uses std::slice::from_raw_parts to provide a view into the buffer. AlignedBuffer will take care of deallocating using alloc::dealloc. Similar to this answer. One disadvantage of this (compared to plan A) is that I need to manually handle what happens when allocation fails.
Allocate using alloc::alloc and then create a Vec with Vec::from_raw_parts(ptr: *mut [u8], length: usize, capacity: usize). This won't deallocate correctly because deallocation also needs to know the alignment, and the type [u8] is unlikely to be aligned correctly.
Vec::align_to should kind of work. But it's unsafe. And allocates more memory than we need. And I'd have to keep the Vec alive while using the middle (aligned) slice.
Creating a Vec<[u8; 512]>, dismantling it to raw parts, and then re-creating a Vec<u8> (like this answer) will also lead to UB, because the Vec<u8> won't deallocate correctly.
aligned_vec::AVec looked good. But I can't see how to create an uninitialsed AVec. AVec doesn't have a set_len method.

JackKelly · 2024-02-28T14:32:00Z

On second thoughts, let's use alloc::alloc, so we can control alignment at runtime (and different filesystems may have different alignment requirements)

JackKelly · 2024-03-07T15:49:00Z

See fio results in this comment for evidence that O_DIRECT appears pretty important to achieve full speed.

JackKelly · 2024-03-11T20:18:43Z

Hmm, so, I've implemented AlignedBuffer. It passes unit tests for AlignedBuffer. But, for some reason, test_get_with_io_uring_local doesn't pass!

JackKelly · 2024-03-12T13:11:43Z

Still TODO:

EDIT: Moved to top of this thread

JackKelly · 2024-03-12T13:59:41Z

Yay! Using O_DIRECT has sped things up!

JackKelly added enhancement New feature or request performance Improvements to runtime performance labels Feb 10, 2024

JackKelly added this to the Perform at least as fast as `fio` milestone Feb 10, 2024

JackKelly added this to light-speed-io Feb 10, 2024

JackKelly moved this to Todo in light-speed-io Feb 10, 2024

JackKelly mentioned this issue Feb 10, 2024

Use polling for completion queue. #53

Open

JackKelly self-assigned this Feb 14, 2024

JackKelly added a commit that referenced this issue Feb 14, 2024

Trying to align buffer by using AVec. Failing to convert to Bytes. #51

b77eca3

JackKelly linked a pull request Feb 14, 2024 that will close this issue

O_DIRECT #58

Closed

JackKelly moved this from Todo to In Progress in light-speed-io Feb 14, 2024

JackKelly added a commit that referenced this issue Feb 14, 2024

O_DIRECT, and using vec.push(0) in a loop to init the AVec.

99dcf67

Slow! Only about 560 MiB/s. See #51. Pausing work on this PR for now because it's so slow. I'll re-visit when I get my workstation with a PCIe gen 5 SSD!

JackKelly moved this from In Progress to Todo in light-speed-io Feb 21, 2024

JackKelly mentioned this issue Feb 26, 2024

Implement IoUringLocal::get_range() (to load one small chunk) #74

Closed

4 tasks

JackKelly added a commit that referenced this issue Feb 28, 2024

Benchmark get_range

57dfcdd

uring: 100 MiB/s local_file_system: 105 MiB/s Hopefully uring will pull ahead when using O_DIRECT (#51)! Closes #73

JackKelly added a commit that referenced this issue Feb 28, 2024

Create benchmark for get_range.

78be51d

uring: 100 MiB/s local_file_system: 105 MiB/s Hopefully uring will pull ahead when we use O_DIRECT! (#51) Closes #73

JackKelly linked a pull request Feb 28, 2024 that will close this issue

O_DIRECT attempt 2! #92

Merged

JackKelly mentioned this issue Feb 28, 2024

Drop ObjectStore & async/await. Use Channels instead. Focus entirely (for now) on io_uring for local file storage. #93

Closed

7 tasks

JackKelly mentioned this issue Mar 8, 2024

Why's the io_uring code so slow?! 🙂 #95

Closed

4 tasks

JackKelly moved this from Todo to In Progress in light-speed-io Mar 11, 2024

JackKelly closed this as completed in #92 Mar 12, 2024

github-project-automation bot moved this from In Progress to Done in light-speed-io Mar 12, 2024

This was referenced Mar 12, 2024

Ensure that the read operation reads the correct number of bytes #97

Closed

Write unit-tests for AlignedBuffer to check behaviour when using start_offset. #98

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use `O_DIRECT` #51

Use `O_DIRECT` #51

JackKelly commented Feb 10, 2024 •

edited

Loading

JackKelly commented Feb 14, 2024

JackKelly commented Feb 14, 2024 •

edited

Loading

JackKelly commented Feb 14, 2024

JackKelly commented Feb 14, 2024

JackKelly commented Feb 21, 2024

JackKelly commented Feb 26, 2024

JackKelly commented Feb 28, 2024 •

edited

Loading

JackKelly commented Feb 28, 2024 •

edited

Loading

JackKelly commented Mar 7, 2024

JackKelly commented Mar 11, 2024

JackKelly commented Mar 12, 2024 •

edited

Loading

JackKelly commented Mar 12, 2024

Use O_DIRECT #51

Use O_DIRECT #51

Comments

JackKelly commented Feb 10, 2024 • edited Loading

todo

JackKelly commented Feb 14, 2024

JackKelly commented Feb 14, 2024 • edited Loading

JackKelly commented Feb 14, 2024

JackKelly commented Feb 14, 2024

JackKelly commented Feb 21, 2024

JackKelly commented Feb 26, 2024

JackKelly commented Feb 28, 2024 • edited Loading

How to create an aligned buffer?

JackKelly commented Feb 28, 2024 • edited Loading

JackKelly commented Mar 7, 2024

JackKelly commented Mar 11, 2024

JackKelly commented Mar 12, 2024 • edited Loading

JackKelly commented Mar 12, 2024

Use `O_DIRECT` #51

Use `O_DIRECT` #51

JackKelly commented Feb 10, 2024 •

edited

Loading

JackKelly commented Feb 14, 2024 •

edited

Loading

JackKelly commented Feb 28, 2024 •

edited

Loading

JackKelly commented Feb 28, 2024 •

edited

Loading

JackKelly commented Mar 12, 2024 •

edited

Loading