fix: panic index out of range for invalid series keys #24565

jdockerty · 2024-01-10T17:35:29Z

Panics can be avoided by checking the validity of the value returned from ReadSeriesKeyLen; at the moment, the size returned by this function is entirely discarded. The call to binary.Uvarint inside it reports errors implicitly through its return values:

If an error occurred, the value is 0 and the number of bytes n is <= 0
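As a minimal sketch of that rule (readLen is a hypothetical helper written for illustration, not a function from the InfluxDB codebase), a length prefix can be validated before any slicing happens:

```go
package main

import (
	"encoding/binary"
	"fmt"
)

// readLen is a hypothetical helper: it decodes a uvarint length prefix and
// reports whether the prefix is valid and the payload is fully present.
func readLen(buf []byte) (sz int, rest []byte, ok bool) {
	v, n := binary.Uvarint(buf)
	if n <= 0 {
		// n == 0: buffer too small; n < 0: value overflows 64 bits.
		return 0, buf, false
	}
	rest = buf[n:]
	if v > uint64(len(rest)) {
		// The prefix claims more bytes than actually remain.
		return 0, buf, false
	}
	return int(v), rest, true
}

func main() {
	_, _, ok := readLen(nil)
	fmt.Println(ok) // false

	sz, rest, ok := readLen([]byte{3, 'c', 'p', 'u'})
	fmt.Println(sz, string(rest[:sz]), ok) // 3 cpu true
}
```

Returning an explicit ok flag is one possible design; the point is only that the byte count from binary.Uvarint is inspected rather than thrown away.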

ReadSeriesKeyLen is called before every occurrence of ReadSeriesKeyMeasurement, as determined by glancing over the output of rg "ReadSeriesKeyMeasurement" -B 5 across the full codebase.

This PR refactors the flow slightly to check the value returned by ReadSeriesKeyLen and take the appropriate action, which avoids panicking when trying to access memory out of bounds. As a knock-on effect, the ParseSeriesKey function can now return nil at another point; a nil result indicates an invalid key, and callers should take the appropriate action.
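The caller-side pattern can be sketched as follows. The layout here is a simplified assumption made for illustration (a uvarint total length, then a 2-byte big-endian name length, then the name), and parseName and measurements are hypothetical names, not the real ParseSeriesKey:

```go
package main

import (
	"encoding/binary"
	"fmt"
)

// parseName is a hypothetical, simplified parser. It returns nil whenever
// the key cannot be read without going out of bounds.
func parseName(key []byte) []byte {
	sz, n := binary.Uvarint(key)
	if n <= 0 {
		return nil // missing or overflowing length prefix
	}
	body := key[n:]
	if sz < 2 || sz > uint64(len(body)) {
		return nil // prefix too small or larger than the data present
	}
	body = body[:int(sz)]
	nameLen := int(binary.BigEndian.Uint16(body))
	if nameLen == 0 || nameLen > len(body)-2 {
		return nil // empty name, or name length exceeds the key
	}
	return body[2 : 2+nameLen]
}

// measurements collects names from valid keys and skips invalid ones
// instead of panicking on them.
func measurements(keys [][]byte) []string {
	var out []string
	for _, k := range keys {
		name := parseName(k)
		if name == nil {
			continue // invalid key: take the safe path
		}
		out = append(out, string(name))
	}
	return out
}

func main() {
	keys := [][]byte{
		{5, 0, 3, 'c', 'p', 'u'}, // valid
		{},                       // empty buffer
		{9, 0, 3, 'c'},           // length prefix exceeds buffer
	}
	fmt.Println(measurements(keys)) // [cpu]
}
```

Whether an invalid key should be skipped, logged, or returned as an error depends on the call site; skipping is used here only to keep the sketch short.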

Closes: #24454, #24469, #24432, #17409, #20245, #23266

I've read the contributing section of the project README.
Signed CLA (if not already signed).

Because the Len function is used within parseSeriesKey, a nil return from parseSeriesKey also needs to be handled at its call sites, since it is used in different contexts.

davidby-influx

The overall comment I have is that the tests for when ReadSeriesKeyLen returns a zero size or an empty buffer should be stricter in preventing the next operation, but you need to be sure they are correct. For instance, can we have a key with no tags? What does that produce out of the various parsing functions? I believe it is possible, so what should happen after the parsing depends on what's being done: are we processing the tags, or only the key name?

So, take a look at each test you've added and see what comes next, then make the test as simple and robust as possible.

tsdb/series_file.go

tsdb/index/tsi1/log_file.go

tsdb/index.go

tsdb/series_partition.go

tsdb/index.go

davidby-influx

Only one question remaining on the conditionals and early abandonment.

tsdb/index.go

In both sections of index.go there is a pre-existing length check against the series key which should catch invalid values; perhaps this explains why it hasn't cropped up in the reported panics. For even more safety, we can also skip a nil key, because we know that subsequent calls will panic when they attempt to use it.

A key with no tags is valid, so we should not check for BOTH a nil key and nil tags: a key could be nil, which is invalid, yet still have tags, in which case the combined check would let the invalid key through, which we do not want.
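The difference between the two checks can be illustrated with a small sketch. Both predicates and the map-based tag type are hypothetical stand-ins chosen for brevity; the real code uses its own types:

```go
package main

import "fmt"

// skipBoth is the flawed variant: it skips only when BOTH name and tags
// are nil, so a nil name that still carries tags slips through.
func skipBoth(name []byte, tags map[string]string) bool {
	return name == nil && tags == nil
}

// skipName looks at the name alone. Using len(name) == 0 also covers an
// empty but non-nil slice.
func skipName(name []byte) bool {
	return len(name) == 0
}

func main() {
	tags := map[string]string{"host": "a"}

	// Invalid: nil name with tags present.
	fmt.Println(skipBoth(nil, tags), skipName(nil)) // false true

	// Valid: a name with no tags must not be skipped.
	fmt.Println(skipBoth([]byte("cpu"), nil), skipName([]byte("cpu"))) // false false
}
```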

davidby-influx

A few more tests requested.

tsdb/series_file_test.go

tsdb/index.go

davidby-influx

One unresolved issue we already discussed.

tsdb/series_file.go

Prior to this, execution always fell through to the else at the end of the conditional branch, which caused unexpected behaviour and the failure of a bunch of tests.

davidby-influx

LGTM - pass on to @gwossum for a second clean review and prepare for all the ports (1.11, main-2.x, 2.7)

In a recent change to this, we agreed on a simple name == nil check for the actual data. As a follow-on, I just realised that we don't actually want to nil out the returned tags, even if they're not checked: having no tags is a valid input, so we can simply return whatever we were passed unchanged.

davidby-influx

LGTM! Thanks for contributing. Please get a review from @gwossum and if he approves, create issues for ports to main-2.x, 2.7, and 1.11 branches, then cherry-pick into those and put up the PRs.

cmd/influx_inspect/dumptsi/dumptsi.go

gwossum

Assuming name == nil is the check we need to perform, I don't see any issues.

davidby-influx

LGTM

gwossum

LGTM

* chore: add scaffolding for naive solution
* feat: test case scaffolding
* fix: implement check for series key before proceeding
* fix: add validation for ReadSeriesKeyMeasurement usage
* refactor: explicit use of series key len
* feat: add remaining check to index
* feat: add check to remaining files
  As the Len function is used as part of the parseSeriesKey, this also needs to be accounted for on the nil return from this function as it is used in different contexts
* feat: expand test cases
* chore: go fmt
* chore: update test failure message
* chore: impl feedback on unnecessary sz checks
* feat: expand test cases
* fix: nil series key check
  In both sections for index.go there is a pre-existing length check against the series key which should catch invalid values, perhaps this explains why it hasn't cropped up in the reported panics. For even more safety, we can also skip a nil key because we know that subsequent calls will cause a panic where this key is attempted to be used
* fix: remove nil tags check
  A key with no tags is valid, so we should not check for BOTH nil key and tags as a key could be nil, which is invalid, yet still have tags and therefore cause the check to pass which we do not want
* feat: extend test cases from feedback
* fix: extend checks for CompareSeriesKeys
* feat: add nilKeyHandler for shared key checking logic
* fix: logical error in nilKeyHandler
  Prior to this, the else was always defaulted to at the end of the conditional branch, which causes unexpected behaviour and a failure of a bunch of tests.
* fix: return tags keep nil data
  In a recent change to this, we agreed on a simple name == nil check for the actual data. As a follow on to this, I just realised that we don't actually want to nil back the tags, even if they're not checked, because having no tags is a valid input so we can simply return whatever we were passed unchanged.
* fix: use len == 0 for extra safety
* feat: extra test for blank series key

jdockerty added 2 commits January 10, 2024 17:34

chore: add scaffolding for naive solution

69f7e61

feat: test case scaffolding

eb802aa

jdockerty self-assigned this Jan 10, 2024

jdockerty added 9 commits January 11, 2024 09:54

chore: merge branch 'master-1.x' of github.com:influxdata/influxdb in…

bc3bb84

…to fix/tsm-out-of-range-index

fix: implement check for series key before proceeding

7c33dd9

fix: add validation for ReadSeriesKeyMeasurement usage

32592d3

refactor: explicit use of series key len

0e5318c

feat: add remaining check to index

7092d1b

feat: add check to remaining files

cf5717a

Because the Len function is used within parseSeriesKey, a nil return from parseSeriesKey also needs to be handled at its call sites, since it is used in different contexts.

feat: expand test cases

e904943

chore: go fmt

390a9c6

chore: update test failure message

2556e38

jdockerty force-pushed the fix/tsm-out-of-range-index branch from 1f3fb89 to 2556e38 January 12, 2024 16:58

davidby-influx self-requested a review January 12, 2024 17:41

davidby-influx reviewed Jan 12, 2024

chore: impl feedback on unnecessary sz checks

de044d4

davidby-influx reviewed Jan 16, 2024

tsdb/index.go (outdated, resolved)

jdockerty added 3 commits January 17, 2024 10:11

feat: expand test cases

4680200

fix: remove nil tags check

362468d

A key with no tags is valid, so we should not check for BOTH a nil key and nil tags: a key could be nil, which is invalid, yet still have tags, in which case the combined check would let the invalid key through, which we do not want.

jdockerty force-pushed the fix/tsm-out-of-range-index branch from 52b83b9 to 362468d January 17, 2024 10:53

jdockerty changed the title ~~fix: panic index out of range for series key measurement~~ fix: panic index out of range for invalid series keys Jan 17, 2024

davidby-influx requested changes Jan 17, 2024

tsdb/series_file_test.go (resolved)

tsdb/index.go (outdated, resolved)

feat: extend test cases from feedback

86d8dee

davidby-influx requested changes Jan 17, 2024

tsdb/series_file.go (resolved)

jdockerty added 3 commits January 17, 2024 21:21

fix: extend checks for CompareSeriesKeys

24cd179

feat: add nilKeyHandler for shared key checking logic

39821d6

fix: logical error in nilKeyHandler

74b3f3a

Prior to this, execution always fell through to the else at the end of the conditional branch, which caused unexpected behaviour and the failure of a bunch of tests.

davidby-influx previously approved these changes Jan 19, 2024

chore: merge branch 'fix/tsm-out-of-range-index' of github.com:influx…

db50933

…data/influxdb into fix/tsm-out-of-range-index

jdockerty dismissed davidby-influx’s stale review via db50933 January 19, 2024 11:06

jdockerty marked this pull request as ready for review January 19, 2024 16:33

jdockerty requested a review from gwossum January 19, 2024 16:33

davidby-influx previously approved these changes Jan 19, 2024

gwossum reviewed Jan 22, 2024

cmd/influx_inspect/dumptsi/dumptsi.go (outdated, resolved)

gwossum reviewed Jan 22, 2024

jdockerty added 3 commits January 22, 2024 21:15

fix: use len == 0 for extra safety

566e0b0

feat: extra test for blank series key

43d1ea3

chore: merge branch 'fix/tsm-out-of-range-index' of github.com:influx…

42759c8

…data/influxdb into fix/tsm-out-of-range-index

jdockerty dismissed davidby-influx’s stale review via 42759c8 January 22, 2024 21:16

davidby-influx approved these changes Jan 22, 2024

gwossum approved these changes Jan 22, 2024

jdockerty merged commit 6af0be9 into master-1.x Jan 23, 2024
9 checks passed

jdockerty mentioned this pull request Jan 23, 2024

Panic index out of range for invalid series keys [Port to 1.11] #24593

Closed

jdockerty deleted the fix/tsm-out-of-range-index branch January 23, 2024 17:10

davidby-influx mentioned this pull request Feb 6, 2024

Crash every day after compacting TSM and TSI #24432

Closed
