Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Gridded Data Selectors #799

Open
1 of 2 tasks
hellkite500 opened this issue Apr 19, 2024 · 3 comments
Open
1 of 2 tasks

Gridded Data Selectors #799

hellkite500 opened this issue Apr 19, 2024 · 3 comments
Assignees

Comments

@hellkite500
Copy link
Contributor

hellkite500 commented Apr 19, 2024

Following the intergration of the Gridded Forcing Engine (#705), implement gridded data selectors for DataProviders to allow connecting gridded formulations to gridded input data.

  • Structured Grid Data Selector
  • Mesh Data Selector
@program-- program-- self-assigned this Apr 20, 2024
@program--
Copy link
Contributor

program-- commented Apr 20, 2024

Some notes:

Initial Design

Currently, I'm envisioning a gridded data selector as a struct holding cell indices, where Cell is defined as a simple POD struct:

struct Cell {
    std::uint64_t x, y, z;
    double value;
}; // 32 bytes

Then, for selection, we'd just need a GridSpecification:

struct Extent {
    double xmin, xmax, ymin, ymax;
};

struct GridSpecification {
  //! Total number of rows (aka y)
  std::uint64_t rows;

  //! Total number of columns (aka x)
  std::uint64_t columns;

  //! Extent of the grid region (aka min-max corner points)
  Extent extent;
};

Using a GridDataSelector, we then define constructors that take in a grid specification and the type to extract against. For example, a point selector may have a constructor like:

struct SelectorConfig {
  //! Initial time for query.
  //! @todo Refactor to use std::chrono
  time_t init_time;

  //! Duration for query, in seconds.
  //! @todo Refactor to use std::chrono
  long duration_seconds;
  
  //! Variable to return from query.
  std::string variable_name;

  //! Units for output variable.
  std::string variable_units;
};

struct GridDataSelector;

GridDataSelector::GridDataSelector(
    SelectorConfig config,
    const GridSpecification& grid,
    boost::span<const geojson::coordinate_t> points
);

Where the implementation simply encodes the point coordinates against the grid specification. I'll probably assume planar coordinates, as that simplifies the implementation and the numerical error will most likely be negligible.

Encoding Coordinates

Coordinates can be encoded to a grid axis by taking the size of that axis (upper_bound), computing the spacing (diff), and flooring the relative position. For example, a simple generic implementation is:

std::uint64_t coordinate_to_axis_index(
    double position,
    double min,
    double max,
    std::uint64_t upper_bound
) {
    const double diff = static_cast<double>(upper_bound) / (max - min);

    // Coordinate position is within or on the grid axis bounds
    if (position >= min && position <= max) {
      return std::floor((position - min) * diff);
    }

    // Coordinate position is out of bounds, so
    // return a sentinel or throw an exception
    return static_cast<std::uint64_t>(-1);
}

Providers

Unlike the current data providers, a gridded data provider may look like:

class SomeGridDataProvider : public data_access::DataProvider<Cell, GridDataSelector>
{ ... };

The main difference is in the base class. This will imply the return value

SomeGridDataProvider::get_values(...) -> std::vector<Cell>

So, a GridDataSelector can hold a std::vector<Cell>, populate it with selected cells, pass it to the provider, the provider can fill the Cell::value members, and return the resulting vector of Cells to the consumer. Alternatively, if we want to implement resampling/regridding within this, a new vector corresponding to the target grid could be returned instead.

@PhilMiller
Copy link
Contributor

Just to make sure I don't confuse things with irrelevant comments, this issue's notes are all aimed at models like Noah-OWP-Modular, which will run over (a masked subset of) a regular grid, right? Unstructured mesh nodes/elements will be addressed separately?

@program--
Copy link
Contributor

program-- commented Apr 22, 2024

@PhilMiller For the first pass of gridded data selectors, yes, I'm just tackling (uniform) structured grids.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants