-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Challenge 21 - Polly: A Natural Language Processing Interface to Extract Complex Features from Weather Datacubes #3
Comments
Hello @mathleur and @awarde96, We are a team of six MRes students from the Centre for Geospatial Science CDT, and we're currently preparing our submission for this challenge. To help polish the proposal, we have a couple of questions to ensure our approach aligns with the expectations and best uses the Polytope infrastructure:
|
Hi @cdrowley, Thank you for the questions and we look forward to receiving your proposal! To answer your questions:
I hope this answers your questions, but let us know if something is unclear or you have more questions! |
Hello @mathleur and @awarde96, @mathleur based on my understanding of your previous response, the goal of this challenge is to integrate an LLM layer (chatbot) with your high-level API to extract crucial information (features) from users' complex queries. These features are then forwarded to MARS to formulate MARS requests, which are subsequently integrated with Polytope. Polytope will utilize the MARS request to retrieve custom data required by the user. Could you confirm if this is the correct workflow, or if there are any additional steps I may have overlooked? Thank you, as I look forward to your response! |
Thanks for the question! Indeed, the goal of this challenge is to create a LLM layer on top of our high-level Polytope API. However, the user requests (or "features") are not forwarded to MARS. Polytope is actually a parallel alternative service we are building to MARS so that we can extract non-box shapes of data. This is currently not supported by MARS and the MARS language so Polytope will enable this capability. Because Polytope will enable such more complex requests, we need to build more specialized APIs to help users request data. For this challenge, we would like to build an LLM layer as we believe this has the potential to be a great API to help users access the data that they need. The real focus of this challenge however is to translate user queries and prompts directly to the shapes implemented in the mid-level Polytope API that already exists. Note that the main Polytope extraction algorithm already exists so the challenge is not to reimplement this, but rather build a new layer that uses this algorithm/service. Time-permitting, another part of the challenge could be to then create a nice web user interface which users can easily access! I hope this helps and would be happy to answer any other questions! |
Challenge 21 - Polly: A Natural Language Processing Interface to Extract Complex Features from Weather Datacubes
Goal
Develop a machine learning “chat” user interface to retrieve arbitrary user-specified queries from the Destination Earth Weather Extremes digital twin using Polytope. Time-permitting, a secondary goal would be to design a web user interface for this chatbot.
Mentors and skills
Challenge description
As part of the Destination Earth initiative, ECMWF is implementing a new data access mechanism called Polytope. This method allows users to extract complex shapes of data, such as 2D country cut-outs or 4D flight paths, from NWP datacubes instead of whole fields.
Within the algorithm itself, the Polytope software extracts convex polytopes from datacubes, but the software also comes with a mid-level interface which supports primitive shapes, like boxes or disks, as well as constructive geometry operations, such as taking unions of several shapes.
Building on top of this, we are currently developing a higher-level interface to Polytope which will support a range of more intricate domain-specific shapes like countries, timeseries or vertical profiles for meteorology applications. In order to do this, we are extending the MARS language to include a “feature” keyword which represents the shape we want to retrieve instead of the whole field. Whilst this higher-level interface is much more usable than the Polytope-native shapes, for non-technical stakeholders who would like to use Destination Earth data, defining the exact feature that they want to access might still be challenging.
In this project, we would thus like to go a step further and leverage ML techniques to build a chatbot which transforms more complex user queries, such as “Find the wind speed over tomorrow’s A150 flight from London to Paris”, directly into extended MARS requests with features. These MARS requests will then be called from Polytope to seamlessly retrieve custom data user requests.
The Polytope feature extraction library already exists, and can handle different levels of requests, so the main aim of the project would be to add the final ML layer. This layer could for example be implemented using chatGPT plugins, Poe (a tool for creating chatbots with various LLMs as backends) or any other LLM approach.
The text was updated successfully, but these errors were encountered: