Add DocValuesProducers for releasing memory when close index #1946

luyuncheng · 2024-08-09T13:55:28Z

Description

In #1885 we discussed the need to release memory when a producer is closed, so I opened this PR and added KNN80DocValuesProducer. This producer can release memory when a reader closes a segment.

I also added a refCount, as the review comments there asked for.

I think this PR is the first step. It only adds the producer because, as discussed in #1885, we still need to get binaryDocValues in the DocValuesProducer from the native engine.

I see that #1853 is ongoing, so I would prefer to read binaryDocValues from native engines in a follow-up step.
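To illustrate the refCount idea mentioned in the description, here is a minimal sketch of a reference-counted allocation that frees its memory only after the owner has closed it and all in-flight readers have released it. `RefCountedAllocation`, `tryIncRef`, `decRef`, and the `Runnable` release hook are hypothetical names chosen for this example; they are not the plugin's `NativeMemoryAllocation` API.

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical stand-in for a cached native allocation (assumption, not the k-NN plugin class).
final class RefCountedAllocation implements AutoCloseable {
    // The owner (for example, a cache entry) holds the initial reference.
    private final AtomicInteger refCount = new AtomicInteger(1);
    private final AtomicBoolean closed = new AtomicBoolean(false);
    private final Runnable freeNativeMemory;
    private volatile boolean freed = false;

    RefCountedAllocation(Runnable freeNativeMemory) {
        this.freeNativeMemory = freeNativeMemory;
    }

    // A reader calls this before using the memory; returns false once the count has reached zero.
    boolean tryIncRef() {
        int current;
        do {
            current = refCount.get();
            if (current <= 0) {
                return false;
            }
        } while (!refCount.compareAndSet(current, current + 1));
        return true;
    }

    // Drops one reference; the last release frees the memory exactly once.
    void decRef() {
        int remaining = refCount.decrementAndGet();
        if (remaining == 0) {
            freed = true;
            freeNativeMemory.run();
        } else if (remaining < 0) {
            throw new IllegalStateException("decRef called more often than incRef");
        }
    }

    // Drops the owner's reference; calling close() again has no effect.
    @Override
    public void close() {
        if (closed.compareAndSet(false, true)) {
            decRef();
        }
    }

    boolean isFreed() {
        return freed;
    }
}
```

With this shape, a producer's close path can call `close()` on the allocation while a search is still running; the memory is released only when that search calls `decRef()`.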

Related Issues

#1885

Check List

New functionality includes testing.
New functionality has been documented.
API changes companion pull request created.
Commits are signed per the DCO using --signoff.
Public documentation issue/PR created.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

navneet1v · 2024-08-09T20:18:30Z

@luyuncheng can we fix the build CIs? The BWC CIs are failing across other PRs; I will check what's happening there.

jmazanec15

Thanks for making this @luyuncheng. Overall looks pretty good. A few comments. I think it is okay to initially focus on DocValues and then add functionality for KnnVectorsFormat in another PR (and get rid of file watcher in this PR).

jmazanec15 · 2024-08-09T20:37:48Z

src/main/java/org/opensearch/knn/index/codec/KNN80Codec/KNN80CompoundDirectory.java

+
+    @Override
+    public IndexInput openInput(String name, IOContext context) throws IOException {
+        return delegate.openInput(name, context);


For future extensibility, could we check if this is a k-NN file and open it from the directory? I know we typically don't open k-NN files via openInput, but if we do, this might fail:

if (KNNEngine.getEnginesThatCreateCustomSegmentFiles().stream().anyMatch(engine -> name.endsWith(engine.getCompoundExtension()))) {
    return directory.openInput(name, context);
}
return delegate.openInput(name, context);

@jmazanec15 why do we want to do this? It can lead to unnecessary mapping of the file into memory. My view is that we should not do this.

I don't think anyone should call openInput on k-NN files. But if they do, return delegate.openInput(name, context); will throw an exception because the file isn't actually in the compound file.

I am not sure an exception will be thrown. @luyuncheng have you seen an exception here? I think it will open the input.

But even if there is no exception, we should not open the input; it is just unnecessary mapping of the file. Once we start to use IndexInput to read the graph file (ref: #1951), we can start opening the files here.

I did not find any exception, but I do want to implement a native engine IndexInput as #1951 describes.
How about:

we return delegate.openInput(name, context);

and add an assert for the case where the name belongs to a native engine file,

and also add a TODO comment here: TODO: use the native engine IndexInput.

@navneet1v @jmazanec15
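The proposal above (delegate, assert on native engine files, leave a TODO) can be sketched without Lucene types as follows. `NativeEngineFiles`, `isNativeEngineFile`, and `resolveSource` are hypothetical names for this example, and the extension list is an assumption for illustration (only .hnswc is mentioned in this thread); the real check would go through KNNEngine.getEnginesThatCreateCustomSegmentFiles() as in the snippet earlier in this thread.

```java
import java.util.List;

// Hypothetical helper (assumption): models only the file-name check, not Lucene's openInput.
final class NativeEngineFiles {
    // Assumed compound extensions for illustration; the plugin derives these from KNNEngine.
    private static final List<String> ASSUMED_COMPOUND_EXTENSIONS = List.of(".hnswc", ".faissc");

    // True if the file name ends with one of the assumed native engine compound extensions.
    static boolean isNativeEngineFile(String name) {
        return ASSUMED_COMPOUND_EXTENSIONS.stream().anyMatch(name::endsWith);
    }

    // Stand-in for openInput: returns the name of the source that would serve the file.
    static String resolveSource(String name) {
        // TODO: use the native engine IndexInput once it exists.
        // The assert only fires when the JVM runs with -ea; otherwise the call falls through.
        assert !isNativeEngineFile(name) : "openInput should not be called on native engine file " + name;
        return "delegate";
    }
}
```

Because the guard is an `assert`, production behavior stays unchanged (the call is still delegated), while test runs with assertions enabled would surface any caller that opens a native engine file this way.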

@navneet1v See how we write: https://github.com/luyuncheng/k-NN-1/blob/DocValuesProducers/src/main/java/org/opensearch/knn/index/codec/KNN80Codec/KNN80CompoundFormat.java#L48-L53. We don't use the delegate to write the file; we use the directory directly.

@jmazanec15 I understand that we write the file directly. I tested a small code change in which the KNNWeight class opened the .hnsw file using IndexInput, and it worked. I didn't get an exception. So when you say there will be an exception, which exception do you mean?

Nevertheless, I believe we should not even open the native index files in the producer, as it will unnecessarily map the file into RAM.

In your test, was it an .hnsw file or an .hnswc file? I think .hnswc should fail with this line.

src/main/java/org/opensearch/knn/index/memory/NativeMemoryAllocation.java

src/main/java/org/opensearch/knn/index/codec/KNN80Codec/KNN80DocValuesProducer.java

src/main/java/org/opensearch/knn/index/codec/KNN80Codec/KNN80CompoundDirectory.java

src/main/java/org/opensearch/knn/index/codec/KNN80Codec/KNN80DocValuesProducer.java

kotwanikunal · 2024-08-12T21:52:24Z

Added some thoughts related to the cache free up: #1885 (comment)

jmazanec15

One minor comment - other than that looks good!

src/main/java/org/opensearch/knn/index/codec/KNN80Codec/KNN80DocValuesProducer.java

luyuncheng · 2024-08-15T13:12:29Z

Can we check if FilterDirectory unwraps to FSDirectory before casting? If it does not, can we log a warning and NO-OP? My main concern here is around remote directory implementations.

@jmazanec15 at 2bcc139, how about adding a try/catch for ClassCastException?
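For comparison with the try/catch approach, the check-before-cast variant requested in the review could look roughly like the sketch below. To keep it self-contained, `Dir`, `FilterDir`, `FsDir`, and `RemoteDir` are hypothetical stand-ins for Lucene's Directory, FilterDirectory, FSDirectory, and a non-filesystem directory; `DirectoryPaths.localPath` is likewise an invented helper, not code from this PR.

```java
import java.util.Optional;

// Hypothetical stand-ins for the Lucene directory types (assumptions for this sketch).
interface Dir {}

final class FsDir implements Dir {
    final String path;
    FsDir(String path) { this.path = path; }
}

final class RemoteDir implements Dir {}

class FilterDir implements Dir {
    final Dir in;
    FilterDir(Dir in) { this.in = in; }
}

final class DirectoryPaths {
    // Peels off every wrapper until a non-filter directory is reached.
    static Dir unwrap(Dir dir) {
        while (dir instanceof FilterDir) {
            dir = ((FilterDir) dir).in;
        }
        return dir;
    }

    // Returns the local path if the innermost directory is filesystem-backed; otherwise warns and no-ops.
    static Optional<String> localPath(Dir dir) {
        Dir unwrapped = unwrap(dir);
        if (unwrapped instanceof FsDir) {
            return Optional.of(((FsDir) unwrapped).path);
        }
        System.err.println("Directory is not filesystem-backed; skipping cache path lookup");
        return Optional.empty();
    }
}
```

An `instanceof` check makes the unsupported case an explicit branch, whereas catching ClassCastException reaches the same no-op result through exception handling.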

jmazanec15

That looks good. Thanks, approving!

src/main/java/org/opensearch/knn/index/codec/KNN80Codec/KNN80DocValuesProducer.java

src/main/java/org/opensearch/knn/index/codec/util/KNNCodecUtil.java

src/main/java/org/opensearch/knn/index/memory/NativeMemoryAllocation.java

navneet1v · 2024-09-03T00:27:44Z

@luyuncheng can you please look at the comments added and reply to them?

luyuncheng · 2024-09-06T15:47:23Z

@luyuncheng can you please look at the comments added and reply to them?

@navneet1v Sorry for the late reply, I just got back from vacation. I'll do it ASAP.

Signed-off-by: luyuncheng <[email protected]>

jmazanec15

LGTM

Add DocValuesProducers for releasing memory when close index #1946 (cherry picked from commit 004fcc0)

luyuncheng · 2024-09-14T02:46:52Z

@jmazanec15 @navneet1v Thanks guys for reviewing

…2109) Add DocValuesProducers for releasing memory when close index #1946 (cherry picked from commit 004fcc0) Co-authored-by: luyuncheng <[email protected]>

0ctopus13prime · 2024-10-16T20:06:47Z

Hi @luyuncheng
Could you share why a field that has 'model_id' in its field attributes is not included in indexPathMap?
I'm wondering whether having 'model_id' in the field attributes means something else.

Link : https://github.com/opensearch-project/k-NN/blob/main/src/main/java/org/opensearch/knn/index/codec/KNN80Codec/KNN80DocValuesProducer.java#L140

// Only Native Engine put into indexPathMap
KNNEngine knnEngine = getNativeKNNEngine(field);
if (knnEngine == null) {
    continue;
}

private KNNEngine getNativeKNNEngine(@NonNull FieldInfo field) {
    final String modelId = field.attributes().get(MODEL_ID);
    if (modelId != null) {
        return null; // <-- this
    }
    ...

luyuncheng · 2024-10-17T09:11:31Z

Could you share why a field that has 'model_id' in its field attributes is not included in indexPathMap?
I'm wondering whether having 'model_id' in the field attributes means something else.

Because the model engine does not need a docValuesProducer for merge, I filtered it out.

0ctopus13prime · 2024-10-17T22:48:06Z

Could you share why a field that has 'model_id' in its field attributes is not included in indexPathMap?
I'm wondering whether having 'model_id' in the field attributes means something else.

Because the model engine does not need a docValuesProducer for merge, I filtered it out.

@luyuncheng
Thank you for the response.
Could you provide more info about it? It is still not clear to me, and I find it hard to see the connection between the two.
Specifically,

What is a model engine?
What do we do with a model engine?
How do we differentiate a model engine from a plain vector engine (e.g. HNSW)?
Why does a model engine not need resource cleanup in the cache manager when the DVReader is closed?

Thank you!

luyuncheng · 2024-10-21T02:58:51Z

@0ctopus13prime

What is a model engine?

I think only faiss can have a model engine?

What do we do with a model engine?
How do we differentiate a model engine from a plain vector engine (e.g. HNSW)?

Sorry, I do not understand this question.

Why does a model engine not need resource cleanup in the cache manager when the DVReader is closed?

I see there are three memory allocations: IndexAllocation, TrainingDataAllocation, and AnonymousAllocation. Does the cache manager not manage all of these?

0ctopus13prime · 2024-10-21T21:52:12Z

@luyuncheng
Thank you, but that still does not answer the questions. :)
Overall, I'm trying to understand what a 'model engine' is, and how it differs from a plain graph-based vector engine like HNSW.

Could you give more context on why you added the logic that excludes these 'model engines' from cache invalidation?

    private KNNEngine getNativeKNNEngine(@NonNull FieldInfo field) {
        final String modelId = field.attributes().get(MODEL_ID);
        if (modelId != null) { // <-- This one. Why did you add it?
            return null;
        }
        KNNEngine engine = FieldInfoExtractor.extractKNNEngine(field);
        if (KNNEngine.getEnginesThatCreateCustomSegmentFiles().contains(engine)) {
            return engine;
        }
        return null;
    }

0ctopus13prime · 2024-11-01T17:45:45Z

Hi @luyuncheng,
could you answer the questions above when you have time?

My understanding is that modelId comes from the cluster state; I also confirmed that OpenSearch builds its index internally.
I recently removed this code so that allocated memory can be released now that we have deprecated FileWatcher.
I think it is safe to do that, but I'm hoping to learn your motivation for the if-branch, just in case.

Thank you

luyuncheng requested review from heemin32, navneet1v, VijayanB, vamshin, jmazanec15, naveentatikonda, junqiu-lei, martin-gaievski and ryanbogan as code owners August 9, 2024 13:55

jmazanec15 added the Bug Fixes and backport 2.x labels Aug 9, 2024

jmazanec15 reviewed Aug 9, 2024

navneet1v reviewed Aug 11, 2024

jmazanec15 reviewed Aug 14, 2024

src/main/java/org/opensearch/knn/index/codec/KNN80Codec/KNN80DocValuesProducer.java

jmazanec15 previously approved these changes Aug 15, 2024

navneet1v reviewed Aug 15, 2024

src/main/java/org/opensearch/knn/index/codec/KNN80Codec/KNN80DocValuesProducer.java

navneet1v reviewed Aug 15, 2024

src/main/java/org/opensearch/knn/index/codec/KNN80Codec/KNN80DocValuesProducer.java

navneet1v reviewed Aug 15, 2024

src/main/java/org/opensearch/knn/index/codec/KNN80Codec/KNN80DocValuesProducer.java

src/main/java/org/opensearch/knn/index/codec/util/KNNCodecUtil.java

navneet1v reviewed Aug 15, 2024

src/main/java/org/opensearch/knn/index/memory/NativeMemoryAllocation.java

luyuncheng dismissed jmazanec15’s stale review via 7932452 August 16, 2024 10:29

luyuncheng force-pushed the DocValuesProducers branch from 12b96e4 to 7932452 Compare August 16, 2024 10:29

luyuncheng force-pushed the DocValuesProducers branch from f043fc4 to 439c5b7 Compare September 6, 2024 15:39

luyuncheng requested review from navneet1v and jmazanec15 September 9, 2024 13:46

Add DocValuesProducer for KNNNativeEngine

e2c8420

Signed-off-by: luyuncheng <[email protected]>

luyuncheng added 13 commits September 13, 2024 22:00

FIXED conflicts interface

2a1b39d

Signed-off-by: luyuncheng <[email protected]>

FIXED conflicts interface and spotlessapply

5e0a5b6

Signed-off-by: luyuncheng <[email protected]>

Check directory casting state

46df02f

Signed-off-by: luyuncheng <[email protected]>

Check directory casting state

6d693ac

Signed-off-by: luyuncheng <[email protected]>

FIX Comments

4e599a3

Signed-off-by: luyuncheng <[email protected]>

FIX Comments

d0ab6c9

Signed-off-by: luyuncheng <[email protected]>

FIX rebase error

cab6ae3

Signed-off-by: luyuncheng <[email protected]>

FIX rebase error

5609c6d

Signed-off-by: luyuncheng <[email protected]>

Rebase and FIX error

18d8de5

Signed-off-by: luyuncheng <[email protected]>

Rebase and FIX error

8140452

Signed-off-by: luyuncheng <[email protected]>

Rebase and FIX error

04e4005

Signed-off-by: luyuncheng <[email protected]>

Rebase and FIX error

e88f40e

Signed-off-by: luyuncheng <[email protected]>

Rebase and FIX error

1edcdc8

Signed-off-by: luyuncheng <[email protected]>

luyuncheng force-pushed the DocValuesProducers branch from e8310c2 to 1edcdc8 Compare September 13, 2024 14:01

jmazanec15 approved these changes Sep 13, 2024

navneet1v approved these changes Sep 13, 2024

luyuncheng merged commit 004fcc0 into opensearch-project:main Sep 14, 2024
28 of 29 checks passed

opensearch-trigger-bot bot pushed a commit that referenced this pull request Sep 14, 2024

Add DocValuesProducers for releasing memory when close index (#1946)

c165756

Add DocValuesProducers for releasing memory when close index #1946 (cherry picked from commit 004fcc0)

opensearch-trigger-bot bot mentioned this pull request Sep 14, 2024

[Backport 2.x] Add DocValuesProducers for releasing memory when close index #2109

Merged

VijayanB mentioned this pull request Sep 25, 2024

[FEATURE] Evict graphs from cache when index is closed or delete for indices that are created using NativeEngines990KnnVectorsFormat #2148

Closed

0ctopus13prime mentioned this pull request Sep 26, 2024

Move away from file watcher for releasing memory #1885

Closed

luyuncheng mentioned this pull request Oct 17, 2024

[BUG] KNN doesn't release memory when close index #1012

Closed
