[DOC] Document which algorithms expect Fortran vs. C contiguous data #5929

beckernick · 2024-06-13T15:10:24Z

For many algorithms, whether the input data is C or Fortran contiguous determines whether an expensive memory copy needs to be made. While this seems innocuous, it can have significant UX implications because it's not well understood by most users and, when it rears its head, it's not obvious based on errors.

We should document this.

viclafargue · 2024-07-24T12:48:11Z

Opened a PR that should inform users when a possibly useless copy is performed. As stated here, data on host (Numpy arrays and Pandas dataframes) will be copied over to device anyways, cuDF dataframes are deepcopied too and cuDF series are 1D and thus not affected by the issue. Then only cuda array interface compliant arrays (and numba arrays) can be copied only because of data order/contiguousness change. This change should allow the user to be informed.

If the user is informed through logging, is it necessary to also document it? If so, should we add the expected data order/contiguousness on the documentation of each function parameter providing data everywhere in the entire library? What should we do when function parameters are left undocumented (many occurrences)?

beckernick added feature request New feature or request ? - Needs Triage Need team to review and classify doc Documentation and removed feature request New feature or request labels Jun 13, 2024

beckernick changed the title ~~[FEA] Document which algorithms expect Fortran vs. C contiguous data~~ [DOC] Document which algorithms expect Fortran vs. C contiguous data Jun 13, 2024

beckernick mentioned this issue Jun 13, 2024

[FEA] cuML estimators should warn users when a copy is being made due to C vs. Fortran contiguousness (and explain the impact) #5930

Open

beckernick removed the ? - Needs Triage Need team to review and classify label Jun 13, 2024

viclafargue mentioned this issue Jul 24, 2024

Better communicate expectations of data order/contiguousness #5975

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DOC] Document which algorithms expect Fortran vs. C contiguous data #5929

[DOC] Document which algorithms expect Fortran vs. C contiguous data #5929

beckernick commented Jun 13, 2024

viclafargue commented Jul 24, 2024

[DOC] Document which algorithms expect Fortran vs. C contiguous data #5929

[DOC] Document which algorithms expect Fortran vs. C contiguous data #5929

Comments

beckernick commented Jun 13, 2024

viclafargue commented Jul 24, 2024