
Trace refactored #116

Merged
merged 27 commits into from
Feb 22, 2024

Conversation

VladoLavor
Collaborator

@VladoLavor VladoLavor commented Mar 17, 2023

The way the API trace is initialized has changed: the trace is no longer initialized (and immediately disabled) by the connection. Instead, the user creates a new trace object and binds it to a connection.

API changes:

  • trace is no longer available from the connection object via the Trace() method.
  • Enable(bool) was removed from the Trace API. Instead, the method NewTrace(connection, size) is added.
  • Added method Close() to release resources used by the tracer.

Workflow changes:

  • tracer has a maximum size
  • tracing happens after vppClient.SendMsg, and the succeeded/failed status of the message is recorded
  • instead of appending to a list, the connection sends records to a buffered channel
  • GetRecords waits until all incoming messages are processed before returning the list
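The workflow above (a tracer with a maximum size, records sent over a buffered channel, GetRecords waiting for pending records) can be sketched independently of the govpp codebase roughly as follows. The type and method names mirror the PR description, but the bodies are an illustrative reconstruction, not the actual implementation:

```go
package main

import (
	"fmt"
	"sync"
)

// Record is a simplified stand-in for the tracer's record type.
type Record struct {
	Name      string
	Succeeded bool
}

// Trace buffers records sent by the connection on a channel with a
// fixed maximum size, mirroring the workflow described in the PR.
type Trace struct {
	records chan Record
	wg      sync.WaitGroup
	mu      sync.Mutex
	stored  []Record
}

// NewTrace creates a tracer with a maximum capacity of size records.
func NewTrace(size int) *Trace {
	t := &Trace{records: make(chan Record, size)}
	go func() {
		for r := range t.records {
			t.mu.Lock()
			t.stored = append(t.stored, r)
			t.mu.Unlock()
			t.wg.Done()
		}
	}()
	return t
}

// Send is what the connection would call after vppClient.SendMsg,
// recording the succeeded/failed status of the message.
func (t *Trace) Send(name string, ok bool) {
	t.wg.Add(1)
	t.records <- Record{Name: name, Succeeded: ok}
}

// GetRecords waits until all incoming messages are processed
// before returning the list.
func (t *Trace) GetRecords() []Record {
	t.wg.Wait()
	t.mu.Lock()
	defer t.mu.Unlock()
	out := make([]Record, len(t.stored))
	copy(out, t.stored)
	return out
}

// Close releases the resources used by the tracer.
func (t *Trace) Close() { close(t.records) }

func main() {
	tr := NewTrace(10)
	tr.Send("sw_interface_dump", true)
	tr.Send("control_ping", false)
	recs := tr.GetRecords()
	fmt.Println(len(recs), recs[0].Succeeded, recs[1].Succeeded)
	tr.Close()
}
```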

Signed-off-by: Vladimir Lavor [email protected]

Signed-off-by: Vladimir Lavor <[email protected]>
@ondrej-fabry
Member

ondrej-fabry commented Jun 8, 2023

I have tested this locally repeatedly (-count=10000) with race checker enabled (-race) and it passed OK, so I assume this will resolve the issue with intermittent unit test failures.

However I have a few comments:

  1. The Tracer usage makes me a bit uneasy: it is referenced from the trace field of Connection, which is accessed without any lock and could potentially run into races while trace records are being created.
  2. The API of Tracer seems a bit limiting. For example, what about overwriting records? It may be more important for the user to have the last N trace records when running into an issue. Otherwise the user must continually call Clear or re-create the tracer, while having no reliable way to know when the record storage is full.
  3. The Tracer always stores all of the requests occurring on the connection, which can easily pollute the trace with records uninteresting to the user (e.g. keepalive pings, other unimportant channels, ...), since the filtering of records for a specific channel happens when the user retrieves the records, not when they are stored.
  4. The Record is missing some info about the requests/replies which might be important to the user, to be specific:
    • request context - used for multiplexing multiple parallel requests on connection,
      it actually consists of these attributes: sequenceNumber + channelId + isMultiRequest
    • message ID - identifies the message (might be different on each VPP run)
    • data length - how much data is being sent/received
    • data - the actual data sent/received (might be useful for debugging a bug in encoding/decoding)

@sknat could you take a look and do a quick review as well?

@VladoLavor
Collaborator Author

@ondrej-fabry here are some of my thoughts:

  1. Do you mean the issue that should be fixed by this patch? The trace manages all locks by itself.
  2. I agree with this point. The initial tracer was not implemented with all the bells and whistles and its API could be much more versatile.
  3. Valid point. The per-channel filter was based on a specific scenario where all records were needed first and later sorted by channel. But the sorting can easily be done by the user if needed. Filtering should happen before messages are stored.
  4. +1
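Filtering before storage, as agreed in point 3, could look roughly like attaching a predicate to the tracer so that uninteresting messages (e.g. keepalive pings) never enter the buffer. The names here are hypothetical, for illustration only:

```go
package main

import "fmt"

// filteredTrace stores only records accepted by the filter,
// so uninteresting messages never enter the buffer.
type filteredTrace struct {
	filter  func(msgName string) bool
	records []string
}

func newFilteredTrace(filter func(string) bool) *filteredTrace {
	return &filteredTrace{filter: filter}
}

func (t *filteredTrace) add(msgName string) {
	if t.filter != nil && !t.filter(msgName) {
		return // dropped at store time, not at retrieval time
	}
	t.records = append(t.records, msgName)
}

func main() {
	// Drop keepalive pings before they are stored.
	tr := newFilteredTrace(func(name string) bool {
		return name != "control_ping"
	})
	for _, m := range []string{"sw_interface_dump", "control_ping", "create_loopback"} {
		tr.add(m)
	}
	fmt.Println(tr.records)
}
```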

Signed-off-by: Vladimir Lavor <[email protected]>
@ondrej-fabry ondrej-fabry mentioned this pull request Jul 10, 2023
@sknat
Contributor

sknat commented Dec 18, 2023

Following up on the discussion we had last week (sorry for the really long review cycle), and after having played with it quite a bit, my position so far aligns with the above:

  1. I can also see edge cases where you would register a tracer with API messages in flight, i.e. receive a message for which we didn't Add() to the WaitGroup.
  2. Also aligned here, although it is still unclear to me whether we should aim at having a tracer implementation with bells and whistles in this repository or if we should just export a channel and let users implement their own traces.
  3. Filtering might not be required in all cases, although I am under the impression the current implementation is rather specific (i.e. asserting a certain number of messages in advance).

That said, I think we can move ahead with this patch as is and evolve it when adding other use cases.
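The alternative mentioned in point 2, exporting a channel and letting users implement their own tracing, could be sketched as follows. This is an illustrative API shape, not govpp's actual interface:

```go
package main

import "fmt"

// traceEvents simulates a connection exporting its trace stream:
// the library only sends events and closes the channel when done;
// consumers decide what to keep and how to store it.
func traceEvents(out chan<- string, msgs []string) {
	for _, m := range msgs {
		out <- m
	}
	close(out)
}

func main() {
	events := make(chan string, 8)
	go traceEvents(events, []string{"sw_interface_dump", "control_ping"})

	// A user-supplied tracer: collect everything into a slice.
	var collected []string
	for e := range events {
		collected = append(collected, e)
	}
	fmt.Println(collected)
}
```

The trade-off is that the library stays small (no built-in storage, sizing, or filtering policy), at the cost of every user writing their own consumer.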

sknat
sknat previously approved these changes Feb 8, 2024
Contributor

@sknat sknat left a comment


lgtm

sknat
sknat previously approved these changes Feb 22, 2024
Contributor

@sknat sknat left a comment


lgtm, thanks

Signed-off-by: Vladimir Lavor <[email protected]>
@ondrej-fabry ondrej-fabry merged commit 553f5ca into master Feb 22, 2024
9 checks passed
@ondrej-fabry ondrej-fabry deleted the trace-fix branch February 22, 2024 15:32

Successfully merging this pull request may close these issues.

Investigate intermittent failure of TestTraceEnabled
3 participants