Regression for `mul!` from 1.9 to 1.10 #469

gdalle · 2023-11-09T18:33:16Z

This is about in-place multiplication mul!(b, A, x) of a sparse matrix A by a vector x.
From 1.9.3 to 1.10.0-rc1, this operation

has gotten significantly slower
has started allocating when x is a view

MWE

using BenchmarkTools, LinearAlgebra, SparseArrays

function testmul(n)
    A = sparse(Float64, I, n, n)
    b = Vector{Float64}(undef, n)
    @btime mul!($b, $A, x) setup=(x=ones($n))
    @btime mul!($b, $A, x) setup=(x=view(ones($n, 1), :, 1))
    return nothing
end

testmul(1000)

Results

	Julia 1.9	Julia 1.10
`x` vector	2.135 μs (0 allocations)	3.298 μs (0 allocations)
`x` view	2.502 μs (0 allocations)	4.087 μs (1 allocation)

The text was updated successfully, but these errors were encountered:

dkarrasch · 2023-11-09T21:46:50Z

Could you please investigate whether this is taking a different path then what it used to take? It could be that we missed some dispatch...

gdalle · 2023-11-10T14:27:36Z

First investigations on Discourse: https://discourse.julialang.org/t/why-does-mul-u-a-v-allocate-when-a-is-sparse-and-u-v-are-views/105995/10?u=gdalle

gdalle · 2023-11-10T20:51:07Z

Investigating the vector case first, and indeed the chain of functions has changed a lot.

In Julia 1.9, the call stack goes like this:

mul!(C, A, B)
mul!(C::StridedVecOrMat, A::AbstractSparseMatrixCSC, B::DenseInputVecOrMat, α::Number, β::Number)

In Julia 1.10, the call stack goes like that:

mul!(C, A, B)
mul!(y::AbstractVector, A::AbstractVecOrMat, x::AbstractVector, alpha::Number, beta::Number)
generic_matvecmul!(C::StridedVecOrMat, tA, A::SparseMatrixCSCUnion, B::DenseInputVector, _add::MulAddMul)
spdensemul!(C, tA, tB, A, B, _add)
_spmatmul!(C, A, B, α, β)

dkarrasch · 2023-11-12T13:06:44Z

I have absolutely no idea what's going on. When I compare, on current master I get

julia> using BenchmarkTools, LinearAlgebra, SparseArrays

julia> n = 1000;

julia> A = sparse(Float64, I, n, n);

julia> b = Vector{Float64}(undef, n);

julia> x = ones(n);

julia> @btime SparseArrays._spmatmul!($b, $A, $x, true, false);
  3.317 μs (0 allocations: 0 bytes)

julia> @btime mul!($b, $A, $x);
  3.311 μs (0 allocations: 0 bytes)
 1.568 μs (0 allocations: 0 bytes) # on v1.9

where _spmatmul! contains the multiplication kernel only (no character and dispatch overhead). So, it doesn't seem to be related to the changes in method dispatch AFAICT.

gdalle · 2023-11-12T13:54:04Z

That's a bad regression, right? Maybe an issue on the Julia repo is in order if SparseArrays is not to blame?

dkarrasch · 2023-11-12T14:41:03Z

Yes. It's also not clear to me why the rewrap works without allocation in the plain vector case, but allocates in the view case. Both run by the same code line.

jishnub · 2023-11-25T06:12:56Z

The allocation is fixed on master now

julia> testmul(1000)
  3.072 μs (0 allocations: 0 bytes)
  3.441 μs (0 allocations: 0 bytes)

julia> VERSION
v"1.11.0-DEV.984"

I suspect this is the fix.

ViralBShah · 2024-04-02T16:09:15Z

Is this resolved now?

jishnub · 2024-04-09T11:42:10Z

The regression is still present. Here's what I see:

julia> testmul(1000)
  2.148 μs (0 allocations: 0 bytes)
  2.178 μs (0 allocations: 0 bytes)

julia> VERSION
v"1.9.4"

vs

julia> testmul(1000)
  2.704 μs (0 allocations: 0 bytes)
  3.025 μs (0 allocations: 0 bytes)

julia> VERSION
v"1.12.0-DEV.301"

ViralBShah · 2024-04-09T13:50:29Z

@vtjnash @oscardssmith Any ideas what is going on here?

dkarrasch · 2024-04-09T16:46:43Z

As JuliaLang/julia#52137 shows, this is unrelated to this package and possibly something upstream.

KristofferC · 2024-05-30T11:41:07Z

If this was only in SparseArrays I would blame JuliaLang/julia#54464 but JuliaLang/julia#52137 (as already have been said) seem to exclude this from being a SparseArray specific issue.

KristofferC · 2024-05-30T12:17:56Z

Ref JuliaLang/julia#52137 (comment) for a bisect.

gdalle mentioned this issue Nov 10, 2023

Aggressive constprop in matvecmul and matmatmul JuliaLang/julia#51961

Merged

dkarrasch mentioned this issue Nov 12, 2023

Regression in sparse-dense multiplication JuliaLang/julia#52137

Open

ViralBShah mentioned this issue Nov 22, 2023

Using @view leads to 100x performance loss #475

Closed

ViralBShah mentioned this issue Feb 3, 2024

irinterp: improve semi-concrete interpretation accuracy JuliaLang/julia#52275

Merged

fjebaker mentioned this issue Feb 22, 2024

mul! is failing with mul!(::Vector, ::Matrix, ::Matrix) fjebaker/SpectralFitting.jl#79

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Regression for `mul!` from 1.9 to 1.10 #469

Regression for `mul!` from 1.9 to 1.10 #469

gdalle commented Nov 9, 2023 •

edited

Loading

dkarrasch commented Nov 9, 2023

gdalle commented Nov 10, 2023

gdalle commented Nov 10, 2023 •

edited

Loading

dkarrasch commented Nov 12, 2023

gdalle commented Nov 12, 2023 •

edited

Loading

dkarrasch commented Nov 12, 2023

jishnub commented Nov 25, 2023 •

edited

Loading

ViralBShah commented Apr 2, 2024

jishnub commented Apr 9, 2024

ViralBShah commented Apr 9, 2024

dkarrasch commented Apr 9, 2024

KristofferC commented May 30, 2024

KristofferC commented May 30, 2024 •

edited

Loading

Regression for mul! from 1.9 to 1.10 #469

Regression for mul! from 1.9 to 1.10 #469

Comments

gdalle commented Nov 9, 2023 • edited Loading

dkarrasch commented Nov 9, 2023

gdalle commented Nov 10, 2023

gdalle commented Nov 10, 2023 • edited Loading

dkarrasch commented Nov 12, 2023

gdalle commented Nov 12, 2023 • edited Loading

dkarrasch commented Nov 12, 2023

jishnub commented Nov 25, 2023 • edited Loading

ViralBShah commented Apr 2, 2024

jishnub commented Apr 9, 2024

ViralBShah commented Apr 9, 2024

dkarrasch commented Apr 9, 2024

KristofferC commented May 30, 2024

KristofferC commented May 30, 2024 • edited Loading

Regression for `mul!` from 1.9 to 1.10 #469

Regression for `mul!` from 1.9 to 1.10 #469

gdalle commented Nov 9, 2023 •

edited

Loading

gdalle commented Nov 10, 2023 •

edited

Loading

gdalle commented Nov 12, 2023 •

edited

Loading

jishnub commented Nov 25, 2023 •

edited

Loading

KristofferC commented May 30, 2024 •

edited

Loading