Handling plugin extraction errors #125

jasondellaluce · 2021-11-09T16:10:59Z

Motivation
Currently, errors are ignored during field extraction in the plugin framework. In theory, a plugin might fail extracting a field for two main reasons:

The field is not present in the given event, for which the ss_plugin_extract_field.field_present flag is set to false
The extract_fields exported plugin function encounters some error and returns a code different than SS_PLUGIN_SUCCESS.
In the current implementation, in both cases the filtercheck returns a NULL pointer, which is interpreted as a not-available field. This is visible here 👇🏼

libs/userspace/libsinsp/plugin.cpp

Line 630 in 83f460c

return false;

libs/userspace/libsinsp/plugin.cpp

Line 304 in 83f460c

return NULL;

Although this is semantically correct, the two failure paths have a quite different meaning. In the second case, the plugin returns a failure code and the framework silently ignores it to maintain a non-blocking extraction flow. This is makes error handling efforts useless for plugin developers, and generally makes it harder to debug plugins at runtime.

Feature
I propose to catch the error and make it visible somehow.

I agree that maintaining field extraction non-blocking might be a priority here, so maybe throwing an exception might not be a viable option. We can consider some weaker error propagation methods, or maybe logging to stderr. To the bare minimum, we might log the error if a debug mode is enabled.

Alternatives
Keep things as they are, and just ignore plugin failures for extract_fields.

The text was updated successfully, but these errors were encountered:

jasondellaluce · 2021-11-09T16:11:14Z

@leogr @mstemm

leogr · 2021-11-10T08:01:37Z

Good catch. Not sure what's the best option atm, for sure I will take a look.

poiana · 2022-02-08T10:14:31Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

leogr · 2022-02-08T11:30:38Z

/remove-lifecycle stale

poiana · 2022-05-09T17:26:10Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

FedeDP · 2022-05-11T11:53:37Z

/remove-lifecycle stale

poiana · 2022-08-09T15:43:13Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

jasondellaluce · 2022-08-10T12:32:42Z

/remove-lifecycle stale

poiana · 2022-11-08T15:29:06Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

jasondellaluce · 2022-11-08T16:03:17Z

/remove-lifecycle stale

falcosecurity#125) Various sinsp logic sets m_tid_to_remove, to record the fact that a thread has been identified as ready-to-be-removed from the threadtable. And if automatic threadtable purging is configured, sinsp::next() takes care of removing these threads by calling remove_thread(), then clearing m_tid_to_remove. But remove_thread() may itself set m_tid_to_remove in certain situations. The current sinsp::next() logic loses track of this request; as a result, these threads will languish in the threadtable until the next remove_inactive_threads() interval, default value 20 minutes. This fix changes the sinsp::next() logic to recognize and handle the case where remove_thread() records a thread for removal. Signed-off-by: Joseph Pittman <[email protected]> Signed-off-by: Joseph Pittman <[email protected]>

poiana · 2023-02-06T21:49:35Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

jasondellaluce · 2023-02-07T08:36:55Z

/remove-lifecycle stale

jasondellaluce · 2023-03-20T10:26:48Z

/milestone 0.11.0

FedeDP · 2023-04-27T09:15:57Z

/milestone 0.12.0

incertum · 2023-08-23T18:47:32Z

We have had lots of plugins refactors, is this still relevant?

leogr · 2023-08-24T12:49:55Z

I believe this is still relevant. @jasondellaluce to confirm.

However, I think it would be best to extend this discussion to the plugin domain and all field extraction mechanisms, including those built-in sinsp. I know Jason has some thoughts in this regard, so I'm eager to hear from him.

That being said, I don't think this is a top priority. However, it is still an improvement that is worth tackling.

Just my 2 cents

cc @Andreagit97 @FedeDP

poiana · 2023-11-22T15:46:38Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

leogr · 2023-11-22T15:48:43Z

/remove-lifecycle stale

poiana · 2024-02-20T15:49:07Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

leogr · 2024-02-20T16:49:45Z

/remove-lifecycle stale

leogr · 2024-02-20T16:49:57Z

/help

poiana · 2024-02-20T16:50:00Z

@leogr:
This request has been marked as needing help from a contributor.

Please ensure the request meets the requirements listed here.

If this request no longer meets these requirements, the label can be removed
by commenting with the /remove-help command.

In response to this:

/help

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

poiana · 2024-05-20T21:52:59Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

leogr · 2024-05-21T09:00:29Z

/remove-lifecycle stale

poiana · 2024-08-19T10:09:36Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

Andreagit97 · 2024-08-19T15:50:48Z

/remove-lifecycle stale

poiana · 2024-11-17T16:12:11Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

leogr · 2024-11-18T16:23:45Z

/remove-lifecycle stale

jasondellaluce added the kind/feature New feature or request label Nov 9, 2021

poiana added the lifecycle/stale label Feb 8, 2022

poiana removed the lifecycle/stale label Feb 8, 2022

poiana added the lifecycle/stale label May 9, 2022

poiana removed the lifecycle/stale label May 11, 2022

poiana added the lifecycle/stale label Aug 9, 2022

poiana removed the lifecycle/stale label Aug 10, 2022

poiana added the lifecycle/stale label Nov 8, 2022

poiana removed the lifecycle/stale label Nov 8, 2022

poiana added the lifecycle/stale label Feb 6, 2023

poiana removed the lifecycle/stale label Feb 7, 2023

jasondellaluce mentioned this issue Mar 20, 2023

[New Feature] Standard logging in Plugin API #989

Closed

poiana added this to the 0.11.0 milestone Mar 20, 2023

poiana modified the milestones: 0.11.0, 0.12.0 Apr 27, 2023

leogr added this to the 0.13.0 milestone May 3, 2023

Andreagit97 modified the milestones: 0.13.0, 0.12.0, libs-backlog Jun 7, 2023

poiana added the lifecycle/stale label Nov 22, 2023

poiana removed the lifecycle/stale label Nov 22, 2023

poiana added the lifecycle/stale label Feb 20, 2024

poiana removed the lifecycle/stale label Feb 20, 2024

poiana added the help wanted Extra attention is needed label Feb 20, 2024

poiana added the lifecycle/stale label May 20, 2024

poiana removed the lifecycle/stale label May 21, 2024

poiana added the lifecycle/stale label Aug 19, 2024

poiana removed the lifecycle/stale label Aug 19, 2024

poiana added the lifecycle/stale label Nov 17, 2024

poiana removed the lifecycle/stale label Nov 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handling plugin extraction errors #125

Handling plugin extraction errors #125

jasondellaluce commented Nov 9, 2021

jasondellaluce commented Nov 9, 2021

leogr commented Nov 10, 2021

poiana commented Feb 8, 2022

leogr commented Feb 8, 2022

poiana commented May 9, 2022

FedeDP commented May 11, 2022

poiana commented Aug 9, 2022

jasondellaluce commented Aug 10, 2022

poiana commented Nov 8, 2022

jasondellaluce commented Nov 8, 2022

poiana commented Feb 6, 2023

jasondellaluce commented Feb 7, 2023

jasondellaluce commented Mar 20, 2023

FedeDP commented Apr 27, 2023

incertum commented Aug 23, 2023

leogr commented Aug 24, 2023

poiana commented Nov 22, 2023

leogr commented Nov 22, 2023

poiana commented Feb 20, 2024

leogr commented Feb 20, 2024

leogr commented Feb 20, 2024

poiana commented Feb 20, 2024

poiana commented May 20, 2024

leogr commented May 21, 2024

poiana commented Aug 19, 2024

Andreagit97 commented Aug 19, 2024

poiana commented Nov 17, 2024

leogr commented Nov 18, 2024

Handling plugin extraction errors #125

Handling plugin extraction errors #125

Comments

jasondellaluce commented Nov 9, 2021

jasondellaluce commented Nov 9, 2021

leogr commented Nov 10, 2021

poiana commented Feb 8, 2022

leogr commented Feb 8, 2022

poiana commented May 9, 2022

FedeDP commented May 11, 2022

poiana commented Aug 9, 2022

jasondellaluce commented Aug 10, 2022

poiana commented Nov 8, 2022

jasondellaluce commented Nov 8, 2022

poiana commented Feb 6, 2023

jasondellaluce commented Feb 7, 2023

jasondellaluce commented Mar 20, 2023

FedeDP commented Apr 27, 2023

incertum commented Aug 23, 2023

leogr commented Aug 24, 2023

poiana commented Nov 22, 2023

leogr commented Nov 22, 2023

poiana commented Feb 20, 2024

leogr commented Feb 20, 2024

leogr commented Feb 20, 2024

poiana commented Feb 20, 2024

poiana commented May 20, 2024

leogr commented May 21, 2024

poiana commented Aug 19, 2024

Andreagit97 commented Aug 19, 2024

poiana commented Nov 17, 2024

leogr commented Nov 18, 2024