Performance of components build with cargo-component #2980

radu-matei · 2025-01-14T10:56:00Z

Spin version (spin --version): spin 3.2.0-pre0 (771282e0 2025-01-13)
Repro repo: https://github.com/radu-matei/spin-yolo (make lib && spin build to build everything)

In the application linked in the repo above, I have two HTTP components:

/rust -- this is a Rust HTTP handler that directly uses a Rust dependency to perform a face detection algorithm on the request body
/... -- this is also a Rust HTTP handler, but imports a Wasm component through component dependencies and executes the same logic

I am seeing a significant performance difference when sending the same request to those two handlers:

# calling the HTTP handler that runs the process directly
$ time curl http://localhost:3000/rust --data-binary @grace_hopper.jpg
0.01s user 0.01s system 0% cpu 2.204 total

# calling the HTTP handler that uses component dependencies
$ time curl http://localhost:3000 --data-binary @grace_hopper.jpg
0.01s user 0.01s system 0% cpu 5.537 total

The performance difference here is pretty big; any thoughts into what might be going on here?

Thanks!

The text was updated successfully, but these errors were encountered:

radu-matei · 2025-01-15T10:52:15Z

cc @alexcrichton who's been doing some digging here.

(TL; DR: the main issue is coming from the optimization level for the component itself, which was the default cargo-component template: https://github.com/radu-matei/spin-yolo/blob/92910318f050a818a4106df9e5acf40adbafb4f8/lib/Cargo.toml#L16)

alexcrichton · 2025-01-15T15:35:14Z

Ah yes the findings I've got so far are:

The original major performance difference is due to the component dependencies version using opt-level = "s" and the "direct-rust" version using opt-level = "3". As @radu-matei mentioned that's due to the default of cargo component new using opt-level = "s".
- Fixing the opt-levels is in Optimize the demo radu-matei/spin-yolo#2 which brings the performance of the two in line with each other
- That PR also enables +simd128 during codegen which improves performance as this is a computation-heavy benchmark that benefits a lot from simd.
- I've also submitted Remove generated [profile.release] from new projects bytecodealliance/cargo-component#369 to change the defaults for cargo component because I feel it's best to not configure [profile.release] from the defaults.
To some extent this is still pretty far off from native performance, so some more performance gaps I've found are:
- Spin enables epoch interruption by default (e.g. for timeouts and time-slicing) and that adds significant overhead on AArch64 macOS M2 (which I think @radu-matei is on). I measured less relative overhead on an x64 machine I have
- Wasmtime has spectre mitigation for tables on-by-default which hurts aarch64 performance way more than x64.

If epochs/spectre-mitigations are disabled then the performance is relatively close to native, or about what you might expect from wasm's overhead. There's still some subpar instruction selection in Cranelift that may be possible to improve on the x64 side of things. Additionally the library in use here, tract-linalg, has wasm simd optimizations but doesn't use the relaxed-simd proposal namely the f32x4.relaxed_madd instruction. I briefly tried that locally though and didn't get much speedup, so I might be wrong in which kernel is being used in that file.

This comment was marked as resolved.

Sign in to view

lann changed the title ~~Performance of request going through component dependency~~ Performance of components build with cargo-component Jan 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance of components build with cargo-component #2980

Performance of components build with cargo-component #2980

radu-matei commented Jan 14, 2025

This comment was marked as resolved.

radu-matei commented Jan 15, 2025

alexcrichton commented Jan 15, 2025

Performance of components build with cargo-component #2980

Performance of components build with cargo-component #2980

Comments

radu-matei commented Jan 14, 2025

This comment was marked as resolved.

radu-matei commented Jan 15, 2025

alexcrichton commented Jan 15, 2025