-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Observations as RDF Statements - (re)usability of observations/measurements #5
Comments
Hey, I'm not the author of this repository but I think the "semantically whacky statements" you're talking about are due modelling errors rather than issues with the cube vocabulary. A So
I understand the desire to reuse these things, and perhaps associate a measurement with the person or river which was measured, however you could always attach the measurement to the measured thing with a
|
@RickMoynihan, thanks for your remarks.
I agree, and I don't believe I stated anywhere that this is the case. What is key, is to be clear what the subject of the statistical population/observation is.
In this example, some observation is being stated to have a life expectancy. I maintain that this is whacky. I would like to state that the subject of this observation has a life expectancy.
I believe I would like to model the example as follows:
My problem with the data cube model is that IMO it creates a silo of statistical information (models), and I believe that with some minor tweaks this can remedied. Curious to hear your thoughts. |
Whether it's whacky or not depends entirely on the Fair point about Essentially you're battling battling the distinction between the real world river avon; and the RDF description of some aspects of it. See also the distinction between id and doc URI's. All things in computer science are solved by another level indirection :-), So |
Depends on the statistic. For life expectancy, agreed. But one could also have measurements for singular subjects. Like height or weight.
I don't see how this has to do with http-range-14, which is about the subtle difference between an IRI identifying an object vs identifying a web document about said object and whether or not that distinction is important. In this case the difference between an object and one or more observations made about that object, is much less subtle. But yes, it is about modeling philosophy. But so is this rdf cube schema, right?
What I'm trying to do is to make observations useable in a data graph outside of a cube context in a natural way. In other words, in terms of SPARQL I would like to
And find interesting statements, like
And hopefully be able to use that in a generic way. It's about (IMO) reusable / non-siloed information. |
What I've always thought was a missed opportunity in the RDF data cube recommendation is that RDF data cube observation statements are not easily transferrable and directly (semantically) useful in a broader context. You need to "understand" data cube language to make sense of these statements.
One of the key reasons for this is that a instances of qb:MeasureProperty are expected to be used on instances of qb:Observation. This leads to some semantically whacky statements like
<some-observation> <has-life-expectancy> 80 .
, and it also means that I can basically only use measurement statements in a data cube observation context.For example, let's say were measuring Alice's height. Consider the following RDF data cube based example:
So here we've created a data cube with an observation of Alices height. But, consider we also have some other data on Alice.
Now to find out Alice's height at some point in time, I have to find and interpret the observation, which is somehow connected to Alice.
IMO it would be much more natural to be able to state:
And, we could also have that be the core of an observation, if we define
We could then state something like:
Note that, because we now use a more semantically sense making statement we can easily make use of existing vocabularies for our observation. In this case we can replace
ex:height
with the already definedschema:height
.This also fits the RDF* approach (and possibly also property graphs). We can combine this like so:
This also plays nice with the schema.org vocab for Observations.
One downside of this approach is that it doesn't nicely fit multi-measurement observations. I don't know how much of a dealbreaker that would be. Personally I'd place greater weight on having observations statements be semantically "reusable" in different contexts, than having the ability to have multi-measurement observations.
I'm curious what your viewpoint on this is and what kind of issues you see with an approach like this @ktk
The text was updated successfully, but these errors were encountered: