Optimization for multi-valued LDAP attributes such as groups uniqueMember #284

intmgroupe · 2024-05-10T12:47:42Z

See #255.
This pull request replaces #259 as per the discussion in its thread.

The goal is to make FORCE operations on large datasets faster by issuing a few small ADD and DELETE modifications instead of one consistently huge REPLACE, whenever it is beneficial to do so.
Testing so far is far from extensive, and no test cases have been updated to reflect the change in behaviour.

These changes were developed by INTM on behalf of EDF, who noticed the slowdown in real use; the fix is working successfully for them.

Please let us know if something is wrong or doesn't fit the guidelines adopted by the LSC project, so we can make any required modifications before this is merged.

Use smaller ADD and DELETE modifications instead of a huge REPLACE, at worst equivalent
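
To make the idea above concrete, here is a minimal sketch of the decision, assuming plain `java.util` sets rather than the actual LSC classes. The class `ValueDiffSketch`, its methods, and the string form of the operations are hypothetical and only illustrate the comparison; they are not the code in `BeanComparator.java`. The early return for unchanged values is also an assumption of this sketch.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration only; this is not the LSC BeanComparator code.
class ValueDiffSketch {

    /** Returns the values that are in `wanted` but not in `present`. */
    static <T> Set<T> difference(Set<T> wanted, Set<T> present) {
        Set<T> result = new LinkedHashSet<>(wanted);
        result.removeAll(present);
        return result;
    }

    /** Describes the modifications this sketch would send for one attribute. */
    static <T> List<String> plan(Set<T> toSet, Set<T> current) {
        Set<T> missing = difference(toSet, current); // values to add
        Set<T> extra = difference(current, toSet);   // values to delete

        // Assumption of this sketch: nothing to do when the sets already match.
        if (missing.isEmpty() && extra.isEmpty()) {
            return List.of();
        }
        // If ADD + DELETE would carry at least as many values as the full
        // target set, a single REPLACE is no worse, so use it.
        if (missing.size() + extra.size() >= toSet.size()) {
            return List.of("REPLACE " + toSet);
        }
        List<String> ops = new ArrayList<>();
        if (!missing.isEmpty()) {
            ops.add("ADD " + missing);
        }
        if (!extra.isEmpty()) {
            ops.add("DELETE " + extra);
        }
        return ops;
    }

    public static void main(String[] args) {
        Set<String> current = new LinkedHashSet<>(List.of("a", "b", "c", "d"));
        Set<String> wanted = new LinkedHashSet<>(List.of("a", "b", "c", "e"));
        System.out.println(plan(wanted, current));
    }
}
```

With a group of several thousand members where only a handful change, the ADD and DELETE branch sends only the changed values, which is the case this pull request targets.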

davidcoutadeur · 2024-05-14T14:54:43Z

Taking a look at this PR.

davidcoutadeur

I like this PR, it is quite elegant in my opinion.

I have only 2 general comments:

* it would be really nice to have a corresponding test. Maybe we could grab the LSC log showing that we are in the condition: `(missingValues.size() + extraValues.size()) >= toSetAttrValues.size()`?

* as it is a behavior change for everyone, I think we must document this (in main documentation + in release notes)

davidcoutadeur · 2024-05-14T17:21:20Z

src/main/java/org/lsc/beans/BeanComparator.java

+						// check if there are any extra values to be removed
+						Set<Object> extraValues = SetUtils.findMissingNeedles(toSetAttrValues, dstAttrValues);
+
+						if((missingValues.size() + extraValues.size()) >= toSetAttrValues.size()) {


I am not sure at which point one replace is more efficient than 1 add + 1 delete.

Maybe it can be a configuration parameter? The condition would be:

(missingValues.size() + extraValues.size()) >= max_value_modifications

It would make the LSC modifications more predictable.

However, I understand that making the threshold depend on the entry size also has its benefits. For example, with the previous condition, if we had an entry with 1000 values and max_value_modifications = 100, then 50 adds and 51 deletes would lead to a big replace, which may be less efficient than adding 50 values + removing 51 values.

Without further efficiency analysis, I think we can keep the initial proposition in this PR.
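
The two candidate conditions discussed above can be compared on the 1000-value example with a small sketch. The class `ThresholdSketch` and its method names are hypothetical, and `maxValueModifications` stands for the suggested `max_value_modifications` parameter, which does not exist in this PR.

```java
// Hypothetical illustration of the two thresholds discussed in this review.
class ThresholdSketch {

    /** Condition used in this PR: compare the change count to the target size. */
    static boolean replaceBySize(int missing, int extra, int toSetSize) {
        return missing + extra >= toSetSize;
    }

    /** Alternative from the review: compare the change count to a fixed limit. */
    static boolean replaceByFixedLimit(int missing, int extra, int maxValueModifications) {
        return missing + extra >= maxValueModifications;
    }

    public static void main(String[] args) {
        int missing = 50;                        // values to add
        int extra = 51;                          // values to delete
        int toSetSize = 1000 - extra + missing;  // 999 values after the update

        // Fixed limit of 100: 101 >= 100, so a full REPLACE of 999 values is sent.
        System.out.println(replaceByFixedLimit(missing, extra, 100)); // true

        // Size-based condition: 101 >= 999 is false, so ADD 50 + DELETE 51 is sent.
        System.out.println(replaceBySize(missing, extra, toSetSize)); // false
    }
}
```

A fixed limit is more predictable, while the size-based condition never sends more values than a full replace would; which one is faster in practice depends on the directory server and has not been measured here.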

davidcoutadeur · 2024-05-15T15:29:56Z

> I like this PR, it is quite elegant in my opinion.
>
> I have only 2 general comments:
>
> * it would be really nice to have a corresponding test. Maybe we could grab the LSC log showing that we are in the condition: `(missingValues.size() + extraValues.size()) >= toSetAttrValues.size()`?
>
> * as it is a behavior change for everyone, I think we must document this (in main documentation + in release notes)

I have just written the corresponding unit test demonstrating the correct selection of 1 big replace / 1 add + 1 delete.

See this PR: #285

davidcoutadeur · 2024-05-15T17:01:28Z

> I like this PR, it is quite elegant in my opinion.
>
> I have only 2 general comments:
>
> * it would be really nice to have a corresponding test. Maybe we could grab the LSC log showing that we are in the condition: `(missingValues.size() + extraValues.size()) >= toSetAttrValues.size()`?
>
> * as it is a behavior change for everyone, I think we must document this (in main documentation + in release notes)

I have also written the documentation and upgrade notes in corresponding project, see: lsc-project/documentation#5

davidcoutadeur · 2024-05-15T17:04:37Z

For me, everything is ready for merging. @rouazana and @soisik, I don't know if you have any more remarks?

davidcoutadeur · 2024-05-15T17:10:15Z

This PR is replaced by #285

LiquidFenrir added 2 commits December 18, 2023 10:25

Optimize FORCE policyType for REPLACE_VALUES operation

8b3bfc9

Use smaller ADD and DELETE modifications instead of a huge REPLACE, at worst equivalent

Optimize more: allow replace when better to do so

ca52c16

coudot added this to the 2.2 milestone May 14, 2024

coudot requested review from rouazana, davidcoutadeur and soisik May 14, 2024 06:36

coudot added the enhancement label May 14, 2024

coudot linked an issue May 14, 2024 that may be closed by this pull request

Optimization for multi-valued LDAP attributes such as groups uniqueMember #255

Closed

davidcoutadeur reviewed May 14, 2024

davidcoutadeur mentioned this pull request May 15, 2024

255 optimization multi valued attributes #285

Merged

davidcoutadeur closed this May 15, 2024
