Implement quadratic forms #62

kalmarek · 2024-12-02T00:00:40Z

@blegat I hacked the form adding cfs kwarg to operate! on UnsafeAddMul;

QuadraticForm(Q) is a simple quadratic form.
QuadraticForm{star}(Q) is a sesqui-linear form (but if star === identity on your basis it will not perform additional starring).

A few gripes:

I don't like all of those args as they bring little value and only obfuscate the code
debugging call stack is a nightmare for anyone not intimately familiar with MA. It's a technical debt that will bite us in our arses;
operate!(op::UnsafeAddMul, res, c) should become operate!(op::UnsafeAdd, res, c)
we're gaining nothing by passing around the args (only a recursive call)

I propose implementing it via loop (of length known to the compiler) via UnsafeLAMul! (or whatever name) with mul!(C, A, B, α, β) performing in-place A·B·α + C·β.
As we don't currently need β, I'd vote for 4-arg mul!(C, A, B, α) until β is needed.

Advantages:

No recursive calls
Easy to un-hack cfs as α
vararg can be implemented with a simple loop on top;

codecov · 2024-12-02T00:02:01Z

Codecov Report

Attention: Patch coverage is 96.29630% with 2 lines in your changes missing coverage. Please review.

Project coverage is 83.31%. Comparing base (bbcd649) to head (cb27cca).

Files with missing lines	Patch %	Lines
src/arithmetic.jl	75.00%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #62      +/-   ##
==========================================
- Coverage   85.93%   83.31%   -2.63%     
==========================================
  Files          14       14              
  Lines         761      791      +30     
==========================================
+ Hits          654      659       +5     
- Misses        107      132      +25

Flag	Coverage Δ
unittests	`83.31% <96.29%> (-2.63%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

kalmarek · 2024-12-02T00:04:15Z

@blegat there are some problems with allocations of simple
@allocated(MA.operate_to!(d, *, 2, a)) == 0 ??

What broke it? MA allocation stuff seems really fragile...

blegat · 2024-12-06T08:00:29Z

I do need the args... in case the user uses the domain argument in SumOfSquares, it's the Putinar certificate.

test/quadratic_form.jl

src/mstructures.jl

blegat · 2024-12-06T08:14:43Z

src/mstructures.jl

+    for (i, b1) in pairs(basis(Q))
+        b1★ = ε(b1)
+        for (j, b2) in pairs(basis(Q))
+            MA.operate!(op, res, coeffs(b1★), coeffs(b2); cfs = Q[i, j])


Why is res already a coefficient ? I would simply call MA.operate!(op, res, b1★, b2; cfs = Q[i, j]) and let it follow the normal call stack that will then unwrap the algebra elements calling coeffs but first checking they are of the same algebra which is safer

b1★ and b2 come from basis(Q), i.e. the assumption is already fulfilled

to go through the regular call stack we'd need to support cfs kwarg throughout the whole stack; maybe it is sensible, or maybe it isn't;

res doesn't have to be in the same algebra as b1 and b2, it's enough that their keys are in the basis of res (e.g. they are defined over some fixed basis, but res is done over implicit one).

But the actual true reason is that it's already too complicated for me to follow the call stack mentally ;D There are too many operate!, operate_to! methods which I don't even know what's the difference and where are they located in the stack. So I took shortcut just to make this work

b1★ and b2 come from basis(Q), i.e. the assumption is already fulfilled

How do we know it is ?

to go through the regular call stack we'd need to support cfs kwarg throughout the whole stack; maybe it is sensible, or maybe it isn't;

Not the whole call stack, only the unsafe add mul one. It should be supported for unsafe add mul with algebra elements and with coefficients.

res doesn't have to be in the same algebra as b1 and b2, it's enough that their keys are in the basis of res (e.g. they are defined over some fixed basis, but res is done over implicit one).

If we have an unsafe add mul call that writes to an implicit, it could indeed support that some of the arguments are in explicit form indeed

But the actual true reason is that it's already too complicated for me to follow the call stack mentally ;D There are too many operate!, operate_to! methods which I don't even know what's the difference and where are they located in the stack. So I took shortcut just to make this work

We could not use the MA API for the unsafe add mul method but for the other one, it's important we still use it because we gain all the optimization that were done with the rewriting for JuMP at the cost of the added complexity ;)

basis here is of a star algebra (considered as a vector space), hence spanned by vectors indexed by elements of the algebra. But this is only a soft assumption, what we need to know is that the multiplicative structure of the basis of res handles the keys of products of basis(Q) well.

Not the whole call stack, only the unsafe add mul one. It should be supported for unsafe add mul with algebra elements and with coefficients.

I went with a 4-argument unsafe add mul with the form MA.opearte_to!(res, ::UnsafeAddMul, A, B, α) which computes α·A·B storing the result in res. Have a look at the changes.

Since we always pretend that res are truly mutable I changed all the methods to MA.operate_to!(res, ...). I still don't understand the difference of operate!(::UnsafeMulAdd, res, ...) and operate_to!(res, ::UnsafeMulAdd, ...).

Well the benefit right now is that I'm afraid of touching those MA.something methods. They feel to me like typesetting latex table :P

It's not operate_to! because operate_to! overwrites the result while we just want to append to it (hence the Add in UnsafeAddMul. So it's really operate! :)

have a look now;

MA would really benefit from proper documentation (some design principles, maybe a tutorial, show-casting the call stack etc.)

blegat · 2024-12-09T09:18:14Z

I propose implementing it via loop (of length known to the compiler) via UnsafeLAMul! (or whatever name) with mul!(C, A, B, α, β) performing in-place A·B·α + C·β.
As we don't currently need β, I'd vote for 4-arg mul!(C, A, B, α) until β is needed.

I'm ok with not using MA for UnsafeAddMul, until there is a canonical! notion in MA, we don't gain much by using the MA interface for it.
mul! will just overwrite and erase C while here we need to add to C, leaving the terms that are already in C. It is also unsafe because there are duplicates so it should be unsafe_add_mul!.
I need at least 3 algebra elements for the putinar certificate so we could have unsafe_add_mul!(*, C, A, B, args...)! and unsafe_add_mul_with_constant!(*, C, α, A, B, args...).

blegat · 2024-12-09T09:21:20Z

I actually already tried adding scalars in the mix in
#34 (comment)
But I agree it's best to stay with at most one scalar and have an explicit place for it (keyword argument or second argument).

kalmarek · 2024-12-10T00:06:17Z

I propose implementing it via loop (of length known to the compiler) via UnsafeLAMul! (or whatever name) with mul!(C, A, B, α, β) performing in-place A·B·α + C·β.
As we don't currently need β, I'd vote for 4-arg mul!(C, A, B, α) until β is needed.

I'm ok with not using MA for UnsafeAddMul, until there is a canonical! notion in MA, we don't gain much by using the MA interface for it. mul! will just overwrite and erase C while here we need to add to C, leaving the terms that are already in C. It is also unsafe because there are duplicates so it should be unsafe_add_mul!. I need at least 3 algebra elements for the putinar certificate so we could have unsafe_add_mul!(*, C, A, B, args...)! and unsafe_add_mul_with_constant!(*, C, α, A, B, args...).

This starts looking very much like C, where it's the best to encode the signature in the name of the function... did you mean unsafe_add_scalar_mul_vararg!?

blegat · 2024-12-10T09:59:09Z

This starts looking very much like C, where it's the best to encode the signature in the name of the function... did you mean unsafe_add_scalar_mul_vararg!?

And the beauty is, you can do C-like code without having to leave the language ;) We can have the same function with and without the constant if we have a convention that the constant appears before the factors but then we need to have a convention on what is the type of the constant or what is the type of the coefficients. Here, the constant could be a JuMP expression so we cannot assume that it's a Number.

blegat · 2024-12-10T10:01:18Z

In the meantime, I'm making progress on jump-dev/SumOfSquares.jl#375 to clarify what the target is (we consume the rope by burning it on both sides)

blegat · 2024-12-15T21:04:18Z

I'm going to make a PR on SumOfSquares using this to double check

blegat · 2024-12-15T21:05:02Z

Maybe drop Julia v1.6 and add a v1.10 as minimum requirement, it's the new LTS anyway

blegat · 2024-12-16T08:25:33Z

src/quadratic_form.jl

+basis(qf::QuadraticForm) = basis(qf.Q)
+Base.getindex(qf::QuadraticForm, i::T, j::T) where {T} = qf.Q[i, j]
+
+function MA.operate_to!(


This should be operate!(ms, res, Q, args...). For Putinar, I need to add several quadratic form so it won't work if you zero. And I need the extra args 😇

Do I read correctly, that you need to add several quadratic forms together?
E.g.

function MA.operate_to!(res, ms::MultiplicativeStructure, Qs::Vararg{<:QuadraticForm}) MA.operate!(zero, res) for Q in Qs MA.operate!(ms, res, Q) end MA.operate!(canonical, res) return res end function MA.operate!(ms::MultiplicativeStructure, res, Q::QuadraticForm{T,ε}) where {T,ε} op = UnsafeAddMul(ms) for (i, b1) in pairs(basis(Q)) b1★ = ε(b1) for (j, b2) in pairs(basis(Q)) MA.operate!(op, res, coeffs(b1★), coeffs(b2), Q[i, j]) end end return res end

It would be much easier to push this over the finish line had you provided the use-cases you have in mind for SoS as I asked many a times... :D

I'm as lost in the design space as you are ^^

MA.operate_to!(res, ms::MultiplicativeStructure, Qs::Vararg{<:QuadraticForm}) doesn't work because it's a sum, MA.operate_to!(res, ::typeof(+), Qs::Vararg{<:QuadraticForm}) is correct but I won't use it because I will multiply each graph by a poynomial to do putinar

So the use case is

p = zero(...) for (q, g) in zip(...) MA.operate!(UnsafeAddMul(...), p, QuadraticForm(q), g) end

blegat · 2024-12-17T14:04:00Z

So the use case is

p = zero(...)
for (q, g) in zip(...)
    MA.operate!(UnsafeAddMul(...), p, SA.QuadraticForm(q), g)
end

The other use case when it's not putinar is:

p = zero(...)
MA.operate!(UnsafeAddMul(...), p, SA.QuadraticForm(q))

kalmarek · 2024-12-17T17:24:28Z

p = zero(...)
MA.operate!(UnsafeAddMul(...), p, SA.QuadraticForm(q))

what is multiplied here? do you mean p += AlgebraElement(qf, parent(p)?

p = zero(...)
for (q, g) in zip(...)
    MA.operate!(UnsafeAddMul(...), p, SA.QuadraticForm(q), g)
end

I think doing this will be a huge performance bottleneck.
you will end up in nterms(q)^2*nterms(g) terms in p.
How many terms will there be after canonicalization of qf?

I honestly doubt it will be beneficial for large scale examples. In one of my cases (which is worse by a lot than any commutative case) nterms(q)^2 is ~2e7, while nterms(qf) is <1e7 (after canonical). Do you really want to pay double (times nterms(g)) the price?

kalmarek · 2024-12-20T11:27:49Z

@blegat I added a 3-argument MA.operate! with quadratic form. Something like this is now possible:

    A = SA.StarAlgebra(...)
    Q = SA.QuadraticForm(...)
    res = zero(eltype(Q), A) 
    p = sum(basis(Q)) # a 'random' element from A 
    MA.operate_to!(coeffs(res), SA.mstructure(A), Q, p)
    @test res == A(Q) * p

MA.operate_to! dispatches through MA.operate!(SA.UnsafeAddMul(...), res, Q, p). Does it suit your needs?

blegat · 2024-12-24T11:06:22Z

p = zero(...)
MA.operate!(UnsafeAddMul(...), p, SA.QuadraticForm(q))
what is multiplied here? do you mean p += AlgebraElement(qf, parent(p))?

Yes

I think doing this will be a huge performance bottleneck. you will end up in nterms(q)^2*nterms(g) terms in p. How many terms will there be after canonicalization of qf?

I honestly doubt it will be beneficial for large scale examples. In one of my cases (which is worse by a lot than any commutative case) nterms(q)^2 is ~2e7, while nterms(qf) is <1e7 (after canonical). Do you really want to pay double (times nterms(g)) the price?

It may be, the user can canonicalize after adding each term if needed. We could also have a SparseCoefficient operating as a dictionary like JuMP.AffExpr that could optionally be used so that there is no duplicate. For small number of variables, large degree, you indeed expect to have a lot of terms that are the same so a dictionary would be better. SumOfSquares could check and use the appropriate one.

Does it suit your needs?

Just tested, it almost works. I have the issue that I do operate!(UnsafeAddMul(*), a, b) somewhere in SumOfSquares. Now, it redirects to operate!(UnsafeAddMul(...), coeffs(a), coeffs(b), true) which has no method.

src/mstructures.jl

blegat · 2024-12-24T12:02:08Z

src/quadratic_form.jl

+    for (i, b1) in pairs(basis(Q))
+        b1★ = ε(b1)
+        for (j, b2) in pairs(basis(Q))
+            MA.operate!(op, res, coeffs(b1★), coeffs(b2), Q[i, j])


So every basis element should implement coeffs ? It's weird because it's not really an AlgebraElement

What if the elements of the basis are just group elements

If we see AlgebraElement, this implementation is correct. Otherwise we should multiply b1 and b2 using the multiplicative structure

It can just return coeffs(x) = (x, true) if you like;

They are not just group elements, they are Diracs at group elements. The proper formalism is to talk about functions (on groups or other objects) instead of formal sums.

So far it seems that you can place in a basis something sortable (unless you canonicalize! yourself) that implements coeffs.

If you use arbitrary SA.ExplicitBasis it won't work right ? For example you place group elements in the basis and coeffs is not part of GroupsCore API.

I would just call some new internal function and distinguish two cases with multiple dispatch

Two AlgebraElement -> then do what is currently done

Otherwise, just multiply them with the multiplicative structure

You don't just place group elements in basis, you need to pick a multiplicative structure. It could be e.g. DiracMStructure and then Diracs are in your basis (they aren't AlgebraElements either).

Does your element behave like coefficients? then we could define SA.coeffs(x::YourTypeWhichBehavesLikeAbstractCoeffs) = x to push x down the call?

but in general, I'm not convinced that this quadratic form is a good design. Let's brainstorm for a bit where should all coeffs calls go.

Can you formulate a concrete example specifying:

What will be in the basis

what is the mstructure
?

Ok let me formulate a use case in the main thread

Co-authored-by: Benoît Legat <[email protected]>

kalmarek · 2024-12-26T00:33:01Z

It may be, the user can canonicalize after adding each term if needed. We could also have a SparseCoefficient operating as a dictionary like JuMP.AffExpr that could optionally be used so that there is no duplicate. For small number of variables, large degree, you indeed expect to have a lot of terms that are the same so a dictionary would be better. SumOfSquares could check and use the appropriate one.

What I meant is with the current code the user cannot escape creating all of those redundant terms even if the storage is canonicalizing on the spot (via dictionary or something).

I'm not convinced that the triple product (Putinar) is even beneficial to have. Small example will run fast either with or without allocations; large examples will suffer from algorithmic complexity.

blegat · 2024-12-28T08:52:30Z

I see, you mean that we should first transform the gram matrix into an AlgebraElement, then canonicalize and then multiply with the multiplier. That's a good point. In that case I would mostly need operate_to! ^^
Having a better worst case complexity is indeed better than saving some allocations so you convinced me.

kalmarek · 2025-01-03T23:19:41Z

@blegat What is left here?

Remove quadratic form with coefficient
Solve the coeffs placement

blegat · 2025-01-05T15:39:18Z

src/arithmetic.jl

+)
+    # ABC::Vararg{AlgebraElement,N},
+    # ) where {N}
+    MA.operate!(mul, coeffs(res), coeffs(A), coeffs(B), α)


Why don't we check that they have the same parent here ?

blegat · 2025-01-05T15:40:35Z

src/coefficients.jl

@@ -115,7 +115,7 @@ function LinearAlgebra.dot(w::AbstractVector, ac::AbstractCoefficients)
    if isempty(values(ac))
        return zero(MA.promote_sum_mul(eltype(w), value_type(ac)))
    else
-        return sum(w[i] * star(v) for (i, v) in nonzero_pairs(ac))
+        return sum(w[i] * v for (i, v) in nonzero_pairs(ac))


Why don't we take the conjugation?

blegat · 2025-01-05T15:42:45Z

src/mstructures.jl

    end
    MA.operate!(zero, res)
-    MA.operate!(UnsafeAddMul(ms), res, args...)
+    res = MA.operate!(UnsafeAddMul(ms), res, args...)


With operate! you are guaranteed that the argument res is the same as what is returned so no need to get the result

blegat · 2025-01-05T16:02:43Z

src/quadratic_form.jl

+end
+
+function MA.operate!(
+    op::UnsafeAddMul,


Now that we have UnsafeAdd, we should use UnsafeAdd here

blegat · 2025-01-05T16:06:59Z

src/quadratic_form.jl

+
+function MA.operate_to!(
+    res,
+    ms::MultiplicativeStructure,


What's the plan for this ms ? Is the plan to allow res to have a different basis compared to Q and ms is the rule to do the mapping ? If yes, this should be commented on. Otherwise, just use copy

yeah, this signature needs to change. What we want here is to say:
Take this quadratic form (which comes with its own basis) and realize it algebraically storing the result in res.

Is copy the correct convention? store of copyto could also work;
how about MA.operate_to!(res, Q) where the quadratic form itself "operates"?

(this ms is an artifact of the past.)

There is precedent for copy in
https://github.com/jump-dev/MutableArithmetics.jl/blob/master/src/implementations/BigInt.jl#L20
It's weird that it's not the same type though so it might not be the right one.
Having Q be the operator is weird also. We could use UnsafeAdd here, it's like the unary +.

blegat · 2025-01-05T16:07:33Z

src/quadratic_form.jl

+
+function MA.operate_to!(
+    res,
+    ms::MultiplicativeStructure,


Suggested change

ms::MultiplicativeStructure,

::typeof(copy),

blegat · 2025-01-05T16:08:39Z

src/quadratic_form.jl

+    Q::QuadraticForm,
+)
+    MA.operate!(zero, res)
+    MA.operate!(UnsafeAddMul(ms), res, Q)


Suggested change

MA.operate!(UnsafeAddMul(ms), res, Q)

MA.operate!(UnsafeAdd(), res, Q)

blegat · 2025-01-05T16:21:23Z

Let's focus on making this work, this is what is needed for SumOfSquares.

b1 = # explicit basis containing monomials or group elements
A = # algebra over the implicit version of b1
b2 = # explicit basis containing `AlgebraElement`s over A
Q1 = # QuadraticForm with b1
Q2 = # QuadraticForm with b2
AlgebraElement(Q1, A) # should be over A
AlgebraElement(Q2, A) # should be over A

kalmarek added 3 commits December 1, 2024 23:09

fixes to dot

c5c1cfa

add implementation of QuadraticForm

41b20a4

allow efficient both quadratic and sesqui-linear forms

82d0cc8

kalmarek requested a review from blegat December 2, 2024 00:04

blegat reviewed Dec 6, 2024

View reviewed changes

test/quadratic_form.jl Outdated Show resolved Hide resolved

blegat reviewed Dec 6, 2024

View reviewed changes

src/mstructures.jl Outdated Show resolved Hide resolved

blegat reviewed Dec 6, 2024

View reviewed changes

kalmarek added 3 commits December 9, 2024 17:21

move QuadraticForm to a separate file

feb307c

replace args::Vararg multiplication by 4-argument one

0ea0688

fix allocation tests on julia-1.11

5ca08fb

kalmarek added 3 commits December 10, 2024 14:01

bring back a 3-arg MA.opearate_to!(res, ::UnsafeAddMul, ...)

ab37477

Base.in(x, object) is needed for fast DiracBasis indexing

f35b2a9

fix: rename MA.operate_to! to MA.operate! when appropriate

51d67fe

blegat mentioned this pull request Dec 15, 2024

Release v0.3 #63

Open

blegat reviewed Dec 16, 2024

View reviewed changes

blegat mentioned this pull request Dec 16, 2024

Use StarAlgebras.QuadraticForm jump-dev/SumOfSquares.jl#391

Open

1 task

kalmarek closed this Dec 17, 2024

kalmarek reopened this Dec 17, 2024

kalmarek added 3 commits December 20, 2024 12:10

rename Gramm → Gram

3323cb8

add 3-argument operate! for QuadraticForm

6020a6d

use UnsafeAddMul with operate!s on QuadraticForm

1eed904

blegat reviewed Dec 24, 2024

View reviewed changes

src/mstructures.jl Show resolved Hide resolved

blegat reviewed Dec 24, 2024

View reviewed changes

src/mstructures.jl Show resolved Hide resolved

blegat reviewed Dec 24, 2024

View reviewed changes

src/mstructures.jl Outdated Show resolved Hide resolved

Fix

8c9a842

blegat reviewed Dec 24, 2024

View reviewed changes

Update docstring

e9e408a

Co-authored-by: Benoît Legat <[email protected]>

kalmarek added 2 commits January 4, 2025 00:56

get rid of the vararg method for UnsafeAddMul

62b9555

make quadratic forms coeffs-oblivious

cb27cca

blegat reviewed Jan 5, 2025

View reviewed changes

src/quadratic_form.jl

end

function MA.operate!(

op::UnsafeAddMul,

Copy link

Member

blegat Jan 5, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Now that we have UnsafeAdd, we should use UnsafeAdd here

blegat reviewed Jan 5, 2025

View reviewed changes

	MA.operate!(UnsafeAddMul(ms), res, Q)
	MA.operate!(UnsafeAdd(), res, Q)

Implement quadratic forms #62

Are you sure you want to change the base?

Implement quadratic forms #62

Conversation

kalmarek commented Dec 2, 2024 • edited Loading

codecov bot commented Dec 2, 2024 • edited Loading

Codecov Report

kalmarek commented Dec 2, 2024 • edited Loading

blegat commented Dec 6, 2024

blegat Dec 6, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

blegat commented Dec 9, 2024 • edited Loading

blegat commented Dec 9, 2024

kalmarek commented Dec 10, 2024

blegat commented Dec 10, 2024

blegat commented Dec 10, 2024

blegat commented Dec 15, 2024

blegat commented Dec 15, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

blegat commented Dec 17, 2024

kalmarek commented Dec 17, 2024 • edited Loading

kalmarek commented Dec 20, 2024

blegat commented Dec 24, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kalmarek commented Dec 26, 2024

blegat commented Dec 28, 2024 • edited Loading

kalmarek commented Jan 3, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

blegat commented Jan 5, 2025 • edited Loading

kalmarek commented Dec 2, 2024 •

edited

Loading

codecov bot commented Dec 2, 2024 •

edited

Loading

kalmarek commented Dec 2, 2024 •

edited

Loading

blegat Dec 6, 2024 •

edited

Loading

blegat commented Dec 9, 2024 •

edited

Loading

kalmarek commented Dec 17, 2024 •

edited

Loading

blegat commented Dec 28, 2024 •

edited

Loading

kalmarek commented Jan 3, 2025 •

edited

Loading

blegat commented Jan 5, 2025 •

edited

Loading